When computers first came into use, every character was stored in a single byte. One byte holds 8 bits, and each bit can only represent on or off, so a byte can only cover the codes 0000 0000 through 1111 1111. That gives just 256 code positions, which is not enough for Chinese.
On a computer, a Chinese character is encoded as two bytes. The most common encodings are Big5, which is prevalent in Taiwan, and GB, which is used in mainland China. In both, the lead byte is almost always 128 or greater, which places it in the so-called non-ASCII range (ASCII refers to codes below 128).
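As a small illustration of the two-byte claim, the following Python sketch encodes the word 中文 with the standard-library `big5` and `gb2312` codecs and inspects the lead bytes. The sample string and the helper name `lead_bytes` are chosen only for this example.

```python
def lead_bytes(text, encoding):
    """Return the first byte of each character's encoded form."""
    return [ch.encode(encoding)[0] for ch in text]

sample = "中文"
big5_bytes = sample.encode("big5")    # two bytes per character
gb_bytes = sample.encode("gb2312")    # two bytes per character

print(big5_bytes.hex(), gb_bytes.hex())  # a4a4a4e5 d6d0cec4

# Every lead byte lies outside the ASCII range (128 or greater).
print(all(b >= 128 for b in lead_bytes(sample, "big5")))
print(all(b >= 128 for b in lead_bytes(sample, "gb2312")))
```

The same text produces different byte sequences under the two encodings, which is why data must be labelled with its encoding before it can be read back correctly.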
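What followed was a stampede of competing code sets, and with it the problem of too many Chinese standards. The Chinese encodings currently used in Taiwan have a number of problems. First, there are several of them: Big5, CCCII, CNS11643, Big5E, Big5+, ISO 10646, and CP950. Second, the commonly used Big5 encoding does not contain enough characters.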
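The commonly used Big5 already spends 2 bytes on each Chinese character, and 2 bytes = 16 bits = 2^16 = 65,536 code positions. To stay compatible with ASCII, however, the Big5 standard can use only a little over twenty thousand of them. At least seventy thousand Chinese characters exist, so many characters simply cannot be used on a Big5 system. On top of that, Chinese standards abound but no final standard specification has been settled, so the products that different vendors implement are not necessarily compatible with one another.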
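The shortage can be observed directly. The sketch below, which assumes Python's built-in `big5` codec as the reference mapping, checks whether a character has a Big5 code at all. The helper name `fits_in_big5` and the two sample characters are illustrative choices. U+20000 is a CJK ideograph that lies outside the 16-bit range.

```python
print(2 ** 16)  # size of a raw two-byte code space: 65536

def fits_in_big5(ch):
    """Return True if Python's big5 codec can encode the character."""
    try:
        ch.encode("big5")
        return True
    except UnicodeEncodeError:
        return False

print(fits_in_big5("中"))            # a frequently used character
print(fits_in_big5("\U00020000"))    # a rare ideograph with no Big5 code
```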
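Leaving aside the special symbols and the seven extension characters added later, TW-BIG5 is ordered as follows:
All characters are divided into two large groups: the frequently used character zone and the less frequently used character zone.
Within each zone, characters are sorted by stroke count.
Characters with the same stroke count are sorted by radical.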
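The ordering rules above imply that, inside one zone, sorting by Big5 code should give the same result as sorting by stroke count. The following sketch checks this for four frequently used characters with Python's `big5` codec. The stroke counts are entered by hand for this illustration and are not read from any library.

```python
# Hand-entered stroke counts for four frequently used characters.
strokes = {"中": 4, "一": 1, "大": 3, "人": 2}

# Sort by the two-byte Big5 code, then by stroke count, and compare.
by_code = sorted(strokes, key=lambda ch: ch.encode("big5"))
by_strokes = sorted(strokes, key=lambda ch: strokes[ch])

print(by_code)
print(by_code == by_strokes)
```

The comparison only holds within a single zone. A less frequently used character with few strokes still sorts after every frequently used character, because its zone starts at a higher lead byte.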
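Each TW-BIG5 character consists of two bytes. The first byte ranges over 0xA1 ... 0xF9, and the second byte ranges over 0x40 ... 0x7E and 0xA1 ... 0xFE. In total 13,868 characters are included (5,401 frequently used characters, 7,652 less frequently used characters, 7 extension characters, and 808 assorted symbols). They can be roughly divided into the following zones: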
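The figures in this paragraph can be cross-checked with a few lines of arithmetic. The sketch below counts the code positions spanned by the stated byte ranges and compares them with the stated character total. The variable names are illustrative.

```python
lead = range(0xA1, 0xF9 + 1)
trail = list(range(0x40, 0x7E + 1)) + list(range(0xA1, 0xFE + 1))

positions = len(lead) * len(trail)      # positions spanned by the ranges
assigned = 5401 + 7652 + 7 + 808        # characters listed in the text

print(len(lead), len(trail), positions)  # 89 157 13973
print(assigned, positions - assigned)    # 13868 105
```

The stated parts do add up to 13,868, and the byte ranges leave 105 positions that carry no character.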
Table 3-1. TW-BIG5 zones and code ranges
| First byte | Second byte | Zone | Defined by |
|---|---|---|---|
| A1..A2 | 40..7E, A1..FE | Symbols | 1984 |
| A3 | 40..7E, A1..BF | Symbols (including punctuation, full-width ASCII symbols, Zhuyin phonetic symbols, etc.) | 1984 |
| A3 | E1 | Euro sign | CP950 |
| A4..C5 | 40..7E, A1..FE | Frequently used characters | 1984 |
| C6 | 40..7E | Frequently used characters | 1984 |
| C6 | A1..FE | Rarely used symbols | ETen |
| C7 | 40..7E, A1..FE | Rarely used symbols (including Japanese, Russian, etc.) | ETen |
| C8 | 40..7E, A1..D3 | Rarely used symbols (including Russian, special input-method symbols, etc.) | ETen |
| C9..F8 | 40..7E, A1..FE | Less frequently used characters | 1984 |
| F9 | 40..7E, A1..D5 | Less frequently used characters | 1984 |
| F9 | D6..DC | Seven extension characters | ETen |
| F9 | DD..FE | Box-drawing symbols | ETen |
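The table can be turned into a small lookup routine. The sketch below transcribes the rows of Table 3-1 into a list and returns the zone for a given pair of bytes. The function name `classify`, the data layout, and the English zone labels are assumptions made for this example, not part of any standard API.

```python
TRAIL_FULL = [(0x40, 0x7E), (0xA1, 0xFE)]

# (lead_low, lead_high, trail_ranges, zone), transcribed from Table 3-1.
ZONES = [
    (0xA1, 0xA2, TRAIL_FULL, "symbols"),
    (0xA3, 0xA3, [(0x40, 0x7E), (0xA1, 0xBF)], "symbols"),
    (0xA3, 0xA3, [(0xE1, 0xE1)], "euro sign"),
    (0xA4, 0xC5, TRAIL_FULL, "frequently used characters"),
    (0xC6, 0xC6, [(0x40, 0x7E)], "frequently used characters"),
    (0xC6, 0xC6, [(0xA1, 0xFE)], "rarely used symbols"),
    (0xC7, 0xC7, TRAIL_FULL, "rarely used symbols"),
    (0xC8, 0xC8, [(0x40, 0x7E), (0xA1, 0xD3)], "rarely used symbols"),
    (0xC9, 0xF8, TRAIL_FULL, "less frequently used characters"),
    (0xF9, 0xF9, [(0x40, 0x7E), (0xA1, 0xD5)],
     "less frequently used characters"),
    (0xF9, 0xF9, [(0xD6, 0xDC)], "seven extension characters"),
    (0xF9, 0xF9, [(0xDD, 0xFE)], "box-drawing symbols"),
]

def classify(lead, trail):
    """Return the Table 3-1 zone for a byte pair, or None if unassigned."""
    for low, high, trails, zone in ZONES:
        if low <= lead <= high and any(a <= trail <= b for a, b in trails):
            return zone
    return None

print(classify(0xA4, 0xA4))  # the byte pair that encodes 中
print(classify(0xA3, 0xE1))
print(classify(0xA3, 0xC0))  # inside the byte ranges but not in the table
```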
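CNS11643, the Chinese Standard Interchange Code (中文標準交換碼), is usually referred to as the CNS code or simply CNS11643. It exists in two versions because the National Bureau of Standards first published it and later revised it. They are generally called the CNS11643-1986 version (published in 1986) and the CNS11643-1992 version (revised in 1992). The only difference between the two is the number of characters they define. CNS11643 as published in 1986 (ROC year 75) defines 13,051 Han characters, and the 1992 revision (ROC year 81) expands this to 48,027 Han characters. The 1986 edition was titled 通用漢字標準交換碼 (Standard Interchange Code for Generally Used Chinese Characters). After the 1992 expansion the title was changed to 中文標準交換碼 (Chinese Standard Interchange Code).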
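When two computer systems exchange data, they transmit it in a code that both sides recognize. This is called an interchange code, and its design must conform to international standards. A system therefore generally uses an internal code for processing and converts to the standard interchange code only for transmission.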
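The split between internal code and interchange code can be sketched in Python, where `str` plays the role of the internal representation and an agreed byte encoding plays the role of the interchange code. The function names `receive` and `send` are hypothetical, and the use of Big5 and UTF-8 as wire encodings is an assumption made for the example.

```python
def receive(payload: bytes, wire_encoding: str) -> str:
    """Convert bytes in the agreed interchange code to the internal form."""
    return payload.decode(wire_encoding)

def send(text: str, wire_encoding: str) -> bytes:
    """Convert internal text to the agreed interchange code."""
    return text.encode(wire_encoding)

# Data arrives from a peer that agreed on Big5.
big5_payload = b"\xa4\xa4\xa4\xe5"
text = receive(big5_payload, "big5")

# The same text is forwarded to a peer that agreed on UTF-8.
utf8_payload = send(text, "utf-8")

print(text, utf8_payload.hex())
```

All processing happens on `text`. Only the two boundary functions need to know which encoding each peer expects.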
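Unicode (Universal Multiple Octet Coded Character Set), http://www.unicode.org. In April 1984 the International Organization for Standardization set up the working group ISO / IEC JTC1 / SC2 / WG2 to create a unified encoding for the scripts and symbols of all countries. In 1991 a group of US multinational companies founded the Unicode Consortium, which reached an agreement with WG2 in October 1991 to adopt the same coded character set. In the version described here, Unicode uses a 16-bit encoding architecture, and its repertoire is identical to the BMP (Basic Multilingual Plane) of ISO 10646.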
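The DIS (Draft International Standard) was approved in June 1992. The version described here is Unicode v2.0, published in 1996. It contains 6,811 symbols, 20,902 Han characters, 11,172 Hangul syllables, and 6,400 private-use positions, with 20,249 positions reserved, for a total of 65,534 code positions.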
The code space now runs from U+0000 to U+10FFFF, a total of 1,114,112 code positions.
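The size of this range can be verified with simple arithmetic. The sketch below also divides it into blocks of 65,536 positions, the size of one 16-bit plane such as the BMP, and compares the upper bound with the `sys.maxunicode` constant of a current Python 3 interpreter. The variable names are illustrative.

```python
import sys

first, last = 0x0000, 0x10FFFF
code_points = last - first + 1
planes = code_points // 0x10000   # number of 65,536-position planes

print(code_points, planes)        # 1114112 17
print(sys.maxunicode == last)     # True on Python 3.3 and later
```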
Unihan 3.2 contains more than seventy thousand CJK characters.
ISO 10646 HK WWW: http://www.iso10646hk.net/