  UTF-8 and Unicode FAQ for Unix/Linux

  н/ ڸ  UTF-8  ڵ  FAQ
  Markus Kuhn(Markus.Kuhn@cl.cam.ac.uk
  <mailto:Markus.Kuhn@cl.cam.ac.uk>)
  16 March 2001
   (kook@hanyang.co.kr <mailto:kook@hanyang.co.kr>)
  2001 3 29

    н Ȥ  ȯ濡 UTF-8 ڵ带 ϴ
      ֽϴ.   Ǽ Ȥ Ÿ ߰Ͻ
   Ϸ ֽñ ٶϴ.
  ______________________________________________________________________

  

  1. UCS ISO 10646 ΰ?
  2.  (Combining Characters) ΰ?
  3. UCS  (UCS implementation levels) ΰ?
  4. UCS   äõǾ°?
  5. ڵ ΰ?
  6. ڵ ISO 10646   ΰ?
  7. UTF-8 ΰ?
  8. ڵ带 ϴ α׷  ΰ?
  9.  ڵ带  ϴ°?
  10. Ʈ  ؾ߸ ϴ°?
  11. ڵ UTF-8  C 
  12. UTF-8   ȰȭǾ ϴ°?
  13. UTF-8 ϴ X-term   ϴ°?
  14. xterm 󸶳  ڵ带 ϴ°?
  15. ISO-10646 X11 Ʈ    ִ°?
  16. UTF-8 ͹̳ ķͿ õ ̽ ΰ?
  17. UTF-8     ø̼   ִ°?
  18. UTF-8 ϱ ؼ 밡 ġ ΰ?
  19. ڵ带 ٷ 밡  (free) ̺귯 ִ°?
  20. UTF-8 ϱ   Ű   ΰ?
  21. ֶ󸮽 󿡼 UTF-8    ϴ°?
  22. Ʈ ũƮ glyph Ī(Postscript glyph names)>  UCS ڵ õǾ ִ°?
  23.  ǵ UCS subset ִ°?
  24. X11 R6.4  ڵ忡   ִ°?
  25. ̷    ϸ Ʈ ִ°?
  26.    ڷ

  ______________________________________________________________________

  1.  UCS ISO 10646 ΰ?

  ISO 10646 ǥ Universal Character Set(UCS) ϰ ִ.
  UCS  ٸ  ڼ ǥ(character set standards) 
  ϴ ڼ̴. ̰ ٸ ڼ° ȣ ȣȯ Ѵ.
    ؽƮ ڿ UCS ȯϰ ٽ  ڵ ȯ
     սǵ  ̴.

  ISO 10646  31Ʈ ڼ ϰ ִ. ׷ ݱ
  ڵ ̷ ū ڵ (of this huge code space)߿ 
  ó 65534° ġ(0x0000  0xFFFD)
  ġ߾. ̷ UCS 16Ʈ  ⺻ ٱ
  (Bagic Multilingual Plane : BMP) Ȥ  0(Plane 0) θ.
  BMP   ڵ  Ǵ ڵ  Ȥ 
    鸸 ϴ ణ ٸ  
  Եȴ(: ).  ȹ 0x000000  0x10FFFF
  21Ʈ ڵ  ܺο ҴǴ ڵ  ̶ ϰ
  ִ. ̰ 鸸  Ѵ 缺ִ ̷ ڵ 
  ̴. ISO 10646-1  1993⿡ ʷ ȵ,  
  BMP   ϰ ִ. BMP  ܺο ڵǴ ڵ
  ϰ ִ ι° Ʈ ISO 10646-2 غ ߿ , װ
  ϼǱ  ɸ 𸥴.  ڿ ̾ Ӿ
  ο ڵ BMP  Եǰ ,  ϰ ִ
  ڵ    ̸  Ȯϰ ִ.

  UCS  ڿ ڵ ȣ Ӹ ƴ϶  Ī Ҵϰ ִ.
  UCS Ȥڵ  Ÿ 16 Ϲ "ƾ 빮 A"
  Ÿ U+0041ó տ "U+" λ簡 ٴ´. UCS  U+0000
   U+007F US-ASCII(ISO 646 IRV)  ǹ̸ ´. ׸
  U+0000  U+00FF  ISO 8859-1(Latin-1) .
  U+E000 U+F8FF  BMP  ܺ  ū 
   뵵  ȴ.

  UCS   Ī  .

  International Standard ISO/IEC 10646-1, Information technology --
  Universal Multiple-Octet Coded Character Set (UCS) -- Part 1:
  Architecture and Basic Multilingual Plane. Second edition,
  International Organization for Standardization, Geneva, 2000-09-15.

  ̰ PDF Ϸ  CD-ROM Ʈ 80 ( 54 ȭ,  45
  ̱޷,  32 Ŀ) ISOκ ¶ ֹ
  <http://www.iso.ch/cate/d29819.html>   ִ.

  2.   (Combining Characters) ΰ?

  UCS  code point  (combining characters) ҴǾ
  . ̰͵ Ÿڱ⿡   ʴ ׼Ʈ Ű . 
  ڴ   üδ ϳ  ڰ ƴϴ. װ ռ ڿ
  ϴ ׼Ʈų Ȥ  ũ̴. ̷,  ڿ 
  ׼Ʈ   ϴ. Ϲ  öڹ ϴ
  ó  ߿ ׼Ʈ  ڵ     
   ȣȯ Ȯϱ ؼ UCS ׵ ڽŸ ڵ带 ´.
  ̸  (precomposed characters) ˷ ׼Ʈ 
  ڵ ڽŸ ڵ ġ ,   ڿ ڵ ѽ
  ٸ ڷν Ÿ  ִ. ̸  ڵ  
  ڵ  ʴ ISO 8859    ڵ ȣȯ ؼ
  UCS  ϴ. չ ī  ڿ ׼Ʈ ٸ
   ȣ ̴  ϴµ,   Ư ⺻ ڿ Ѱ
  Ȥ   ȣ  ʿ   İ  ǥ
  ĺ   ǥ ؼ ߿ϴ.

  չڴ ׵ ϴ ڸ .  ,  umlaut 
  Ĵ ̸  UCS ڵ U+00C4 Ÿų  " 
  ȣ"(combin ing diaeresis) ڸ մ Ϲ "ƾ 빮 A"
   Ÿ  ִµ, U+0041 U+0308 .   ڴ
  ټ ׼Ʈ  ų ⺻  Ʒ ο  ũ 
   ʿ䰡     ִ. Ÿ ڸ   , ϳ ⺻
     ڰ ִ 2 ʿϴ.

  3.  UCS  (UCS implementation levels) ΰ?

   ý۵  ڿ  UCS   ī
  ϸ    . ׷Ƿ, ISO 10646   
   ϰ ִ.

  o   1 :  ڿ ѱ ڸ(ΰ Ȥ   ڵ ѱ
      ̷ Ưϸ   ѱ ǥ  ڵ)
      ʴ´.

  o   2 :  1    ü(script) ־ չ
      (fixed list) Ѵ( , ¾, ƶ,
     ε, ۶󵥽þ, Ⱦ, ε-ƸȾ, -ε, Ÿо,
     ȵ, īŸī, ̽þ , ±   ִ).
     ̷ ڵ ּ  ڵ   ̴ UCS
     ϰ Ÿ  .

  o   3 :  UCS ڵ Ѵ.   ڵ  
     ڻ ƿ ȭǥ(Ȥ ʴ) Ÿ  ִ.

  4.  UCS   äõǾ°?

   ̴.   1993⿡ ISO 10646-1:1993 
  ä  ǥ δ     װ
  ؿ  ȣ  ׿ ΰ   Ŀ ǥߴ.

  o  ߱: GB 13000.1-93

  o  Ϻ: JIS X 0221-1995

  o  ѱ: KS X 1005-1:1995 (ISO 10646-1:1993  1-7 )

  5.  ڵ ΰ?

  ,  սŲ ڼ  ΰ  õ
  ־.    ǥ ⱸ(ISO) <http://www.iso.ch/> ISO 10646
  Ʈ, ٸ ϳ   Ʈ (ʱ⿡
  ̱ȸ簡 κ̾ ) ҽÿ  ڵ Ʈ
  <http://www.unicode.org/>.  Ե,  Ʈ ߴ
  ȸ  1991濡 ΰ  ٸ  ڼ 谡
  ϴ ٰ ƴ϶  ޾Ҵ. ׵ Բ   ڵ
  ̺   Բ ۾ߴ.  Ʈ   ϸ
  ׵    ǥѴ. ׷ ڵ ҽÿ
  ISO/IEC JTC1/SC2 ȣȯ ڵ ISO 10646  ڵ
  ̺ ϱ ߴ. ׸ ׵   Ȯ 
    ϰ ִ. ڵ 1.1  ISO 10646-1:1993
  ߰, ڵ 3.0  ISO 10646-1:2000 Ѵ.

  ڵ ǥ  Ϲ åó amazon.com
  <http://www.amazon.com/exec/obidos/ASIN/0201616335/mgk25>κ 
  50 ޷ ֹ  ִ.

       The Unicode Consortium: The Unicode Standard, Version 3.0
       <http://www.amazon.com/exec/obidos/ASIN/0201616335/mgk25>,
       Reading, MA, Addison-Wesley Developers Press, 2000, ISBN
       0-201-61633-5.

  ؽƮ μ̰ ڼ   ۾ Ѵٸ, е
  ݵ  ī Ǹ ؾ߸ Ѵ.

  6.  ڵ ISO 10646   ΰ?

  ڵ ҽÿ ǥ ڵ ǥ
  <http://www.unicode.org/unicode/standard/s tandard.html>  
   3 ⺻ ߾ (BMP) Ѵ.  ǥ  
  ڵ  ġ   Ī Ѵ.

  ڵ ǥ ΰ  ڿ õ ξ   ü踦
  ϰ  Ϲ  μ  ý    
   ڷᰡ ȴ.  ڵ   ƾ ¾ ȥϴ 
   ؽƮ ϹǷν,   ̼  ϱ
   ˰ ڿ 񱳸        
  ϰ ִ.

  ٸ  ISO 10646 ǥ  ˷ ISO 8859 ǥذ  
   ڼ ̺  ̻ ƴϴ. ̰ ǥذ õ 
   ϰ,   ڵ ȵ ϸ, ISO 6429 ISO
  2022  ٸ ISO ǥذ õ UCS ϴ   
   Ѵ. ISO ǥذ ϰ õ ٸ ͵  ִ. 
  , UCS ڿ Ŀ  ISO 14651 ִ. ISO 10646-1 ǥ
  Ǹ Ư¡δ װ ټ ٸ ŸϷ   
  glyph  Ѵٴ ̴. ݸ ڵ ǥ  
  ڸ  ߱  θ ش.

  7.  UTF-8 ΰ?

    UCS ڵ   ڿ Ҵϴ ڵ
  ̺   ̴. ׷  Ȥ    
    Ʈ  Ÿ  ִ    
  ȵ Ѵ.  ϱ    ڵ ڵ
  ؽƮ 2 Ȥ 4Ʈ  (sequences of eit her 2 or 4
  bytes sequences)ν Ѵ. ̷    Ī 
   UCS-2 UCS-4̴. ٸ  õ ʴ´ٸ,  ߿
  Ʈ ̵ ù° ´(Bigendian convention). ASCII Ǵ
  Latin-1   ASCII Ʈ տ 0x00 Ʈ ϹǷν,
  UCS-2 Ϸ ȯų  ִ. UCS-4  Ѵٸ,  ASCII
  Ʈ տ  ſ  0x00 Ʈ ؾ߸ Ѵ.

  н ȯ濡 UCS-2(Ǵ UCS-4) ϴ  ſ ɰ 
  ҷ . ̷ ڵ  ڿ ϸ C ̺귯 Լ
  ĶͿ Ư  ǹ̸  '\0' Ȥ '/'  ſ 
   Ʈ κ   ִ. ̿ , ټ н
   ASCII  ϸ, ū  ̴ 16Ʈ ܾ ڷ
    . ̷   UCS-2 ϸ ؽƮ   ȯ
     ڵ ܺ ڵ(suitable external encoding of
  Unicode) ƴϴ.

  ISO 10646-1  Annex R
  <http://www.cl.cam.ac.uk/~mgk25/ucs/ISO-10646-UTF-8.html> RFC 2279
  <ftp://sunsite.doc.ic.ac.uk/packages/rfc/rfc2279.txt> ǵ UTF-8
  ڵ ̷  . ̰ н Ÿ 
  üϿ ڵ带 ϱ  ǽ   ̴.

  UTF-8    ִ:

  o  U+0000 U+007F UCS ڵ 0x00 0x7f Ʈ 
     ڵȴ(ASCII ȣȯ). ̰  7Ʈ ASCII ڵ
     ϴ   ڿ ASCII UTF-8  ο 
     ڵ ´ٴ  ǹѴ.

  o  U+007F ū  UCS ڵ   Ʈ ν
     ڵǸ, ̰͵   ߿ Ʈ(bit set) .
     ׷Ƿ ٸ  κп  ASCII Ʈ(0x00-0x7f) Ÿ
      .

  o  ASCII ƴ ڸ Ÿ ƼƮ  ù° Ʈ
     ׻ 0xC0 0xFD  , װ ̷ ڸ 
     󸶳  Ʈ ʿ  Ų. ƼƮ  
      Ʈ 0x80 0xBF  ִ.  
     resynchronization   ְ  ֹ ʰ ڵ 
      Ʈ Ҿ ʰ ȴ.

  o    2 sup {31} UCS ڵ带 ڵ  ִ.

  o  UTF-8 ڵ ڵ ̷ 6Ʈ ̱ ,
     16Ʈ BMP  ڵ  3Ʈ ̱ ϴ.

  o  Bigendian UCS-4 Ʈ ڿ   ȴ.

  o  0xFE  0xFF Ʈ  UTF-8 ڵ  ʴ´.

   Ʈ   ڸ Ÿ  Ѵ. Ǵ
     ڵ ȣ  ޶.

  xxxƮ ġ  ǥ   ڵ ȣ Ʈ
  ä.  xƮ  ߿ ʴ.   ڵ ȣ
  Ÿ   ª ƼƮ    ִ. ƼƮ
   ù ° Ʈ  1Ʈ  ü  Ʈ
   ٴ  ϶.

  : "ڵ  U+00A9 = 1010 1001"(۱ ȣ)  
  UTF-8  ڵȴ.

  11000010 10101001 = 0xC2 0xA9

  ׸  U+2260 = 0010 0010 0110 0000(۱ ȣ)  
  UTF-8  ڵȴ.

  11100010 10001001 10100000 = 0xE2 0x89 0xA0

  ̷ ڵ  Ī Ȯ ǥ UTF-8̸, UTF UCS
  Transformation Format ǹѴ. utf8Ȥ UTF_8  ٸ 
  UTF-8  .  ڵ ü  ʰ 
   쿡±.

  UTF-8 ڵ ó  ־ ߿   : Ȼ
   , UTF-8 ڴ  ڸ ڵϱ ؼ ʿ ̻
   UTF-8  ޾Ƶ鿩¾ ȴ
  <http://www.unicode.org/unicode/uni2errata/UTF-8_Corrigendum.html>.
    U+000A( ǵ) ڴ  0x0A  UTF-8
  Ʈκ ޾Ƶ鿩߸ ϸ,  ټ  ϰ
  (overlong)  ޾Ƶ鿩 ȵȴ.

    0xc0 0x8A
    0xe0 0x80 0x8A
    0xf0 0x80 0x80 0x8A
    0xf8 0x80 0x80 0x80 0x8A
    0xfc 0x80 0x80 0x80 0x80 0x8A

   ª ڵ ã  UTF-8 꽺Ʈ ׽Ʈ ϱ 
  ϰ  UTF-8    ִ.  ϰ  
  UTF-8   Ʈ     Ѵ.

   UTF-8 Ȥ UCS-4 ͻ󿡼 ڵ ġ U+FFFE U+FFFF Ӹ
  ƴ϶ ڵ ġ U+D800  U+DFFF(UTF-16 ) ؼ 
  ȴ. UTF-8 ڴ ̷ ͵  , ߸ 
  Ȥ ʹ   ؾ  Ѵ.

  Markus Kuhn UTF-8 decoder stress test file
  <http://www.cl.cam.ac.uk/~mgk25/ucs/examples/UTF-8-test.txt> ߸
    ϰ  UTF-8  ü  ϰ 
  ڴ  ش.

  8.  ڵ带 ϴ α׷  ΰ?

  1993 Ŀ ߵ ֱ α׷  ̹ ڵ/ISO
  10646-1 ڵ  Ư    ִ. ̰ Ada95
   Wide_Character̸ ڹ  Char̴.

  ISO C  ƼƮ ڵ ̵ (wide character)
  ϱ  Ŀ ϰ , Amendment 1 to ISO C
  <http://www.lysator.liu.se/c/na1.html> 1994 9 Ǿ 
     ͵ ߰Ǿ. ̷   ַ 
  ƽþ ڵ ڵϱ ؼ Ǿ UCS ϱ ؼ
  ʿ ͺ ξ  . UTF-8 ISO C ǥ ϳ
  ƼƮ ڿ wchar_t  ȣϱ  ϳ ڵ ε,
  ̰  ȯ濡  32Ʈ ȣִ ̸, ڵ
  ڸ ϱ ؼ   ִ. C Ϸ yyyymmL ¸
     __STDC_ISO_10646__  ũ ϹǷν,
  wchar_t   Ͽ ģ ڵ  Ѵٴ 
  ϴ ȣ ø̼ǿ   ִ(199712L  ,
     ISO/IEC 10646 װ   
  Ǿ Ÿ  Ѵٴ    ִ).

  9.   ڵ带  ϴ°?

  UTF-8 ǥǱ  ٸ   ִ   
  ASCII Ȯ ڸ Ͽ.  ISO 8859-1 ISO 8859-2,
  ׸ ISO 8859-7, þƿ KOI-8, Ϻ EUC
  Shift-JIS  θ Ͽ. ̷ Ͽ  ȯϴ 
  , ø̼ Ʈ ϱ ؼ ̷ ڵ
    ؾ߸ ߴ.

  ڵ ᱹ ̷  ڵ  ̸, UTF-8 
  ָ ̷ ̴. UTF-8 Ʒ  쿡  ̴.

  o  ؽƮ (ҽ ڵ, html , ̸ ޽ )

  o  ϸ

  o  ǥ , 

  o  ȯ 

  o     ڸ ̱

  o  ڳ, , ͹̳ ķͿ  ø Ʈ

  o  Ʈ  ASCII ڵ ؼǴ ̸  ̵
     ȴ.

  UTF-8 忡 xterm̳  ܼ ̹  ͹̳
  ķʹ  ϴ UTF-8   ŰԷ ϸ, ŰԷ
  ׶ μ ǥ Է .   μ
  ǥ  ͹̳ ķͷ   װ  UTF-8 ڴ
  óϰ 16Ʈ Ʈ Ͽ ÷Ѵ .

     ڵ   - -μ
  Ű     ִ.  Ʋ ASCII ڸ
  ϱ        ̸, ٸ 8Ʈ ڼ
  ξ ܼ ̴.  ͹̳ ķͿ   
  ó ܰ迡 ٷ UTF-8 ȯ   ̴. ̰ ISO
  10646-1   1 (  ڵ  ʰ)
   ǹϸ,  μ  ʿ  ʴ ƾ,
  ׸, Ű    ȣ  ü(script)鸸 
  ǹѴ. ̷  UCS  ISO 8859    ϸ
   ߿     ٸ  ڵ ,
  ڵ ƼƮ  Ÿ  ִٴ ̴.

   ᱹ  ڰ ǰ ̸ (precomposed)
  ڵ        Ǿ߸ Ѵ.
   ϰ ڸ, 󿡼 ڵ  ؽƮ
  ڵϴ    Unicode  Technical Report #15
  <http://www.unicode.org/unicode/reports/tr15/>  ǥ  C
  (Normalization Form C) Ǿ߸ Ѵ.

  ִ POSIX ȣȯ PC ü Ǹ ȸ  ϳ(⼭ ̸
   ʰڴ)  Ͽ, Ǵ ڵ Ʈ 
  ĺϱ ؼ  ڵ    극ũ ̽
  (ZERO WIDTH NOBREAK SPACE: U+FEFF) ϰ ڰ ߴµ,
  ̷ Ģ   Ͽ Ǵ ڵ Ʈ (byte-
  order) ĺϱ ؼ   극ũ ̽ ڴ
  signature Ȥ "Ʈ- ũ(byte-order mark: BOM)"ν  
   .  BOM̳ signature   ʴ´. BOM̳
  signature    ϰ ִ ASCII-  Ģ
  ߸ ̴. POSIX ýۿ,    μ
   Ͽ ʿ ϴ ڵ ̸ ȮѴ.  signature
  "UTF-8N"   UTF-8 ϵ ȣ ϱ ؼ װ  ߾.
  ׷ ̷  ǥ   POSIX 迡 
  ʴ´.

  10.  Ʈ  ؾ߸ ϴ°?

  UTF-8       ִ. ̰ Ʈ  ϵ
  ȯ̶ θ ϰڴ. Ʈ ȯ ʹ UTF-8  
   Ǹ  ſ   Ʈ ȭų ʿ䰡 ִ.
  ϵ ȯ α׷ о̴ UTF-8 ʹ  ū(wide) 
  迭 ȯ  ̸,  ø̼    ó
  ̴.

  κ ø̼ǵ  Ʈ ȯε  Ѵ.
  Ʈ ȯ н 󿡼 UTF-8 ޾Ƶ  ְ ϴ ̴. 
  , cat echo  α׷   ʿ䰡 . װ͵
   ISO 8859-2 ̵ UTF-8̵  Ϻϰ Ѵ.
  ֳϸ, װ͵  μȭ ʰ   Ʈ Ʈ
  ϱ ̴. װ͵  UTF-8 󿡼   ȭ Ͼ
  ʴ '\n'   ڵ ASCII  ڸ νѴ. ׷Ƿ UTF-8
  ڵ  ڵ ͹̳ ķͿ ϴ ̷
  ø̼ǿ ؼ Ϻϰ ̷.

  Ʈ  Ͽ  ڿ    ϴ  α
   ؼ ణ  ʿ ̴. UTF-8 忡, α׷
  0x80 0xBF    Ʈ  ؼ ȵȴ. ֳϸ
  ̷ Ʈ   Ʈ(continuation bytes)̸ ϰ ִ
  ڴ ƴϱ ̴. UTF-8 Բ C strlen(s)  
  ڿ  ڼ ٸ   ̴.  ,   UTF-8
   õǾٸ ڼ ϱ Ͽ mbstowcs (NULL,s,0)
  Լ   ִ.

   , ls α׷ ؾ߸ Ѵ. ֳϸ   丮
   ֱ  ̺ ̾ƿ ü踦 ߱ ؼ ڼ
  ϸ ϱ ̴. ̿ ϰ     Ʈ
  ϰ   ϴ α׷ װͿ ˸° UTF-8 ؽƮ
     ϱ   ˾ƾ߸ Ѵ.   ڸ 
  Ͱ   Լ  ڿ ϴ  Ʈ   ؼ
  ణ ؾ߸ Ѵ.  , ncurses  귯 ϴ
  α׷ Ӹ ƴ϶ vi emacs    ̿  
  ޴´.

   Ŀ  Ʈ ȯε    , UTF-8
  Ϻϰ ϱ   ణ  ʿϴ. ڿ( 
  ϸ, ȯ  ) ϴ κ Ŀ Լ   
  ´.   쿡  ʿ  ִ.

  o  ܼ Ŭ̿ Ű ̹(ϳ ڸ VT100 ķ)
      UTF-8 ڵ  ڵؾ ϸ  ڵ ڼ 
      (subset) ؾ߸ Ѵ.

  o  VFAT WinNT  ܺ  ý ̹ ϸ Ÿ
      ڵ ȯѾ߸ Ѵ.  ȿߴ ȯ ɼ Ͽ
     UTF- 8 ؾ߸ ϸ, mount   Ų μ 
     UTF-8 ϸ   ֵ Ŀ ̹ ˷߸ Ѵ. VFAT
      WinNT  ý ̹ ڵ带 ϰ ֱ ,
     UTF-8 ȯ ߿ ս Ͼ ʴ´ٴ  Ѵ.

  o   POSIX ý tty ̹      
       ְ ϴ  "cooked" 带 Ѵ.   Լ
       ۿǵ ϱ ؼ stty tty UTF-8 尡
     0x80 0xBF   ȿ  迭 ڷ 
     ʵ UTF-8 带  ؾ߸ Ѵ. Bruno Haible
     <http://clisp.cons.org/~haible/> ϴ stty Ŀ tty
     ̹    ġ
     <ftp://ftp.ilog.fr/pub/Users/haible/utf8/> Ѵ.

  11.  ڵ UTF-8  C 

  GNU glibc 2.2 캸,  ϴ ϰ , wchar_t
     32Ʈ ISO 10646  Ǵ  ִ. ISO
  C99 ʿ ϱ  __STDC_ISO_10646__ ũ ǿ ؼ
  ø̼ǿ ̰ ñ׳η . ISO C Ƽ-Ʈ ȯ
  Լ(wprintf(), mbstowcs() ) glibc 2.2 Ȥ  ̻󿡼 Ϻϰ
  Ǹ, UTF-8  ϸ鼭 wchar_t   ƼƮ
  ڵ  ̿ ȯϱ ؼ   ִ.

    Ʒ    ִ.

  wprintf(L"Schone Grue!\n");

  ׷ ڿ Ʈ ̷ ؽƮ ڰ ȯ  LC_CTY
  PE( , en_US.UTF-8 Ȥ de_DE.ISO_885 9-1)  Ͽ
  õ ڵ  ̴.  Ϸ C ҽ Ͽ ϴ
  ڵ  Ͽ ؾ߸ Ѵ. ׸   ڿ
  ڵ wchar_t ڿ ؼ  Ʈ Ͽ Ȯϰ 
  ̴. ½ÿ -Ÿ ̺귯 wc har_t ڿ α׷
  Ǵ ȯ Ͽ  ´ ڵ ٽ ȯų ̴.

  12.  UTF-8   ȰȭǾ ϴ°?

   ϴ ø̼ 8Ʈ ڼ(ISO 8859-*, KOI-8 ) 
  UTF-8 θ Ѵٸ װ UTF-8 带 Ѵٰ    
       ˾Ƴ߸ Ѵ. ٶǴ,   ȿ
     UTF-8 ϰ  ̸, е װ
  ⺻    ̴. ׷  8Ʈ ° UTF-8 
   Ʋ Ǳ  ƴϴ.

   Ǵ ø̼ǵ ׵  UTF-8 带 ȰȭŰ 
   ü ٸ   ġ Ѵ.  :

  o  xterm   ɼ "-u8" X ҽ "XTerm*utf8: 1"

  o  gnat/gcc   ɼ "-gnatW8"

  o  stty   ɼ "iutf8"

  o  mined   ɼ "-U"

  o  xemacs UTF-8  ϴ MULE ڵ ̿ ȯ
     Ű  elisp Ű ִ.

  o  vim 'fileencoding' ɼ

  o  less ȯ  LESSCHARSET=utf-8

  Ư   ɼ̳  ø̼    ī
   ϴ    ̴. ׷Ƿ   ǥ  
   ʿ䰡 ִ.

  13.  UTF-8 ϴ X-term   ϴ°?

  (Thomas Dickey <http://dickey.his.com/> ϴ XFree86
  <http://www.xfree86.org/> 4.0 Ȥ    ϴ xterm
  <http://dickey.his.com/xterm/xterm.html>  ̹ UTF-8 
  ϰ ִ.   XFree86 4.0  ʴ´ٸ, ν
   ֽ xterm   <ftp://dickey.his.com/xterm/
  xterm.tar.gz>  ٿε     "./configure
  --enable-wide-chars ; make"  Ͽ    ִ.

  xterm  带 UTF-8 ȯŰ ؼ   ɼ -u8
  ϰ UTF-8   *-ISO10646-1 Ʈ ϶. ISO 10646-1
  Ʈ ISO 8859-1 Ʈ   Ϻϰ ȣȯǱ  ISO
  8859-1   *-ISO10646-1 Ʈ    ִ.

  14.  xterm 󸶳  ڵ带 ϴ°?

  XFree86 4.0.1 Ե Xterm      ʿ
    ϴ ISO 10646-1  1 Ѵ. ٽ 
  , ͹̳ ǹ  ü(terminal semantic)  UTF-8 ڵ
   ְ 16Ʈ ڿ ׼   ִٴ  ϸ, ͹̳ 
    ü ISO 8859-1  ⺻ .

   ֽ xterm   
  <ftp://dickey.his.com/xterm/xterm.tar.gz> ̿   ο
  Լ(Robert Brady <http://www.zepler .org/~rwb197/xterm/> )
  ߰Ѵ.

  o  Ͼ ǥǹڸ  ι   Ʈ  ڵ ȯ

  o     ľ(simple overstriking combining characters)

  õ Ϲ ڰ XY ȼ ũ⸦ ´ٸ , xterm  ΰ
  2XY ȼ ũ  Ʈ εϷ õ ̴(AVERAGE_WIDTH Ӽ
     ´ٴ  ϸ XLFD ). Unicode Technical
  Report #11 <http://www.unicode. org/unicode/reports/tr11/> 
  East Asian Wide(W) Ȥ East Asian FullWidth(F)  Ư(Width
  property)   ڵ ڵ Ÿ ؼ xte rm
  ̷ Ʈ  ̴.

  ̽ ų(nonspacing) ѷΰ ִ(enclosing)  ڵ(
   , ڵ ͺ̽
  <ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData.t xt>ȿ Ϲ
  īװ ڵ <ftp://ftp.unicode.org/Publi
  c/UNIDATA/UnicodeData.html#GeneralCategory> Mn ̳ Me  ִ
  ڵ)    ϸ ̰ ⺻ - glyph(base-character
  glyph)     glyph  ľǷν(  OR-
  ing)   ִ. ̰ ⺻  Ʒ  ׼Ʈ ޾Ƶ 
  ְ ϰ,    ׼Ʈ ޾Ƶ  ְ Ѵ.  
  ľ Բ ϱ ؼ Ư    Ʈ ؼ
    Ѵ. ׷ Ư "ǰ" ȭ Ʈ  
  Ʈ ȿ Ű ū   ׼Ʈ ϴ   
    𸥴. ׷Ƿ ̸ (precompose d) ڵ 
   쿡 װ     ̴.

  ؽƮ  ø̼ α׷ӵ ؾ  :

  xterm  Ͼ ڿ  ڸ  ν  
   ó  ̴. ֳϸ ƾ/׸/Ű  ڴ
  ϳ ο ġ ʿ ϴ ݸ, Ͼ ڴ 2 
   ʿ ϸ  ڴ ϳ ʿ  ʱ ̴.

   ҽ ׷  н  <http://www.UNIX-
  systems.org/online.html> ϳ ڰ 󸶳  ο ϰ
    ø̼ ׽Ʈ   ֵ ϴ   C Լ
  wcwidth() <http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.html>
  wcswidth() <http://www.cl.cam.ac.uk/~mgk25/ucs/wcswidth.html> 
  ϰ ִ.

  #include <wchar.h>
  int wcwidth(wchar_t wc);
  int wcswidth(const wchar_t *pwcs, size_t n);

  Markus Kuhn   (free) ϴ wcwidth() Լ
  <http://www.cl.cam.ac.uk/~mgk25/ucs/wcwidth.c> C ̺귯 
   Լ  ʴ ÷ ø̼ǿ    
  .

  xterm  ٰ ̷    ڵ  
        ɵ Ƹ   
  ̴.

  o  ¾ ƶ ڵ   

  o  ƶ ̼   

  o  ε ἱ(ligature)  

  o  ѱ ڸ

  o   ڸ   

  ׷Ƿ ¾ ƶ ڵ ¾ ƶ ڿ ͹̳η
    ڿ  Ű  ä ø̼ 
  α׷ ؾ߸ Ѵ. ٽ ,  μ xterm ؼ
   ƴ϶ ø̼ǿ ؼ ̷߸ Ѵ. ¾  ƶ
  õ Ȳ ̸  glyph ̼  ȿ ־
    ISO 8859  ǰ ִ. ν   
   xterm   ̰ 󸶳 Ȯϰ ۵  Ȯ
  ʴ. ISO 6429  = ECMA-48
  <http://www.ecma.ch/ecma1/STAND/ECMA-048.HTM> ڵ bidi ˰
  <http://www.unicode.org/unicode/reports/tr9/>   
  Ʈ(starting point)  Ѵ. E CMA Technical Report TR/53
  <http://www.ecma.ch/ecma1/TECHREP/E-TR-053.HTM>  ϶.

    ø̼ǿ  ؽƮ   ȹ̶ ,
  ڵ Bidi ˰  (free)  Dov Grobgeld
  FriBidi <http://imagic.weizmann.ac.il/~dov/freesw/FriBidi/> Ȥ Mark
  Leisher PretBidi ˰
  <http://crl.nmsu.edu/~mleisher/ucdata.html> .

  ֱٿ  Robert B rady <http://www.zepler.org/~rwb197/xterm/>
  bidi    ʱ  ġ ǥϱ  xterm
   ƶ, ѱ ڸ  ε ؽƮ  ˰  
  ʴ´. VT100 ֹķͿ ̷ ͵ ϴ  ˸ >
  Ȥ ٶ  ҸȮϴ. ø̼ǵ ƶ ѱ 
  ϴ ˰  ø̼    ִ. ֳϸ xt
  erm ø̼ Ͽ  ʿ ̼  ϴ
   ϱ ̴. ε ü ؼ X Ʈ ī ʼ
  ἱ(ligature) ü  ٸ  ἱ  ڵ >
  ν   ʰ ִ. ׷ Ҹ  xterm  
  . ε  ʿ ø̼ xterm  VT100 ķ 
  ſ Pango <http://www.pango.org/>   ڵ X11 
  ̺귯 ϴ   .

  15.  ISO-10646 X11 Ʈ    ִ°?

        ڵ Ʈ X11  ϰ
  Ǿ,   Ʈ   ӵ ϰ ִ.

  o  Markus Kuhn  ٸ ڵ Բ X11   -misc-
     FIXED-*-Iso8859-1 Ʈ   ڵ(ƾ, ׸ ,
     Ű,  ǥ ,    ȣ Ƹ޴Ͼƾ, 
     ƾ, īŸī, ±, ׿ ٸ ڿ   Ʈ鿡
      ϴ) ϴ  ϶ Ȯ״.   
      X11   ڵ Ʈ 
     <http://www.cl.cam.ac.uk/~mgk25/ucs-fonts.html>  캸.
     ̷ Ʈ  XFree86 <ht tp://www.xfree86.org/> 4.0.1 Ȥ
      ̻  Բ  ȴ.

  o  Markus  X11R6.4   Ե  Adobe Bmp;H BDF
     Ʈ ISO 10646-1 
     <http://www.cl.cam.ac.uk/~mgk25/download/ucs
     -fonts-75dpi100dpi.tar.gz> غߴ. ̷ Ʈ ̹ 
     ƮũƮ Ʈ  ϶(뷫 30 ڵ ߰ Ǿ
     κ CP1252 MS-  ȴ.  ,  
     οȣ  ȣ(-)  ִ) Ѵ. ׷ ̷ ͵
     ISO 8859-1 ڵ Ʒ   .  ISO 10646-1
        ׼   ִ.

  o  XFree86 4.0 ISO 10646-1 ڵ  /ũμƮ Ʈ
      X ø̼ǿ   ְ ϴ յ ƮŸ Ʈ 
     <http://www.dcs.ed.a c.uk/home/jec/programs/xfsft/> Բ 
     ȴ.

  o   XFree86  κ  BDF Ʈ 
      ϰ װ͵ ISO 10646-1 ڵ  ü 
     . X   8Ʈ Ʈ  Ʈ û 
     绡 ISO 106 46-1 Ʈ Ϸκ ISO 8859-*  ٸ Ʈ
     ڵ ϴ   ڵ ȯ Ȯ ̴. 
     Ʈ ISO 10646-1 Ʈ ڵ   ؾ߸ Ѵ.

  o  ClearlyU (cu12) <ftp://crl.nmsu.edu/CLR/multiling/unicode/fonts/>
     Mark Leisher <mailto:mleisher@crl.nmsu.edu>  3700 ̻
     ڵ  X11  12  100dpi( ġ 
     ) ϴ ũ ſ  ISO 10646-1 BDF Ʈ
     .(̹   <http://crl.nmsu.edu/~mleisher/cu-examples.html>)

  o  NEW: Dmitry Yu. Bolkhovityanov ؽ Ʈ  IBM PC
     ķͿ ϱ ؼ BDF  ȿ   VGA Ʈ
     <http://www.inp.nsk.su/~bolkhov/files/fonts/univga/index.html>
     .

  o  Roman Czyborra GNU   Ʈ <http://czyborra.com/unifont/>
     Ʈ  (free) Ϻ 816/1616   ڵ Ʈ
       ϰ ִ.

  o  etl-unicode <ftp://ftp.x.org/contrib/fonts/etl-unicode.tar.gz>
     Primoz Peterlin <mailto:primoz.peterlin@biofiz.mf.uni-lj.si>
     غ ISO 10646-1 BDF Ʈ̴.

  o  George Williams Type1 ڵ Ʈ йи
     <http://bibliofile.mc.duke.edu/gww/fonts /Unicode.html>
     µ, ̰   BDF ϴ. ״  PfaEdit
     <http://bibliofile.mc.duke.ed u/gww/FreeWare/PfaEdit/>
     ƮũƮ Ʈ Ʈ  ͸ ߴ.

  ڵ X11 Ʈ Ī -ISO10646-1̶ ܾ . ̰
    ڵ ISO 10646-1 16Ʈ Ʈ  X  Ʈ
  (X Logical Font Descriptor: XLF D)
  <ftp://sunsite.doc.ic.ac.uk/packages/X11/pub/R6.4/xc/doc/hardco
  py/XLFD/xlfd.PS.gz>  CHARSET_REGISTRY CHARSET_ENCODING  
   ϵ <ftp://sunsite.doc.ic.ac.uk/packages/X11
  /pub/R6.4/xc/registry> ̴. *-ISO10646-1  Ʈ ü ڵ
  ڼ õ    ϰ  . ׸ ڵ
  ׵  Ʈ ̵  ʿ   
  ϴ Ȯؾ߸ Ѵ.

  *-IS010646-1 Ʈ Ϲ Ʈ ȿ    ڸ
  Ÿ ؼ ڵ尡 ƴ(non-Unicode) Ư gly ph Ű
  DEFAULT_CHAR  ϰ ִ(Ϲ 0x00 ġϰ H ũ⸦
   (-)  box̴). ̰    
  ʴ ڰ ִٴ  ϰ ˾  ֵ  Ѵ. xterm
   6x13  ũ  - Ʈ(the small er fixed-width
  fonts)  ڵ带    ̴. ֳ 
  (Kanji)   ü(scripts)   а
  ϴ ȼ   ξ  ū ȼ  Ÿ ̴.
   ϱ   ڵ Ʈ CEN MES-3 丮
  <http://www. egt.ie/standards/iso10646/pdf/cwa13873.pdf>  
  1000 3000 ڵ   ̴.

  е *-ISO10646-1 Ʈ ASCII  ο ȣ 
  <http://www.cl.cam.ac.uk/~ mgk25/ucs/quotes.html> ǥ   
  ο ȣ ġϰ ٸ ÷ ϱ ؼ ణ ٲ
  ٴ  ˾    ̴.

  16.  UTF-8 ͹̳ ķͿ õ ̽ ΰ?

  VT100 ͹̳ ķ͵ ٸ ڼµ ̸ ȯϱ ؼ IS O
  2022(=ECMA-35  <http://www.ecma.ch/ecma1/STAND/ECMA-035.HTM>) ESC
   ޾Ƶδ.

  UTF-8 ISO 2022   "ٸ ڵ ý(other coding sys
  tem)"̴(ECMA 35  15.4 ). UTF-8 ISO 2022 SS2/SS3/G0/G1/
  G2/G3 ϴ  ܺο ִ. ׷Ƿ  ISO 2022 UTF-8
  ȯϸ,  SS2/SS3/G0/G1/G2/G3  UTF-8  ٽ ISO 2022
  ư  ǹ̸ Ұ ȴ. UTF-8   ڵ̹Ƿ,
   Ű(self-terminating) ª  Ʈ  (
  short byte sequence) ȯϴ    ڰ ǹ̰
  ִ Ϻϰ Ѵ. ISO 10646-1  G0 G1 ISO 8859-1
  װ͵ . ׸ G2/G3 ISO 10646   ʴ´. ֳϸ
   ڴ  ġ   浵 Ͼ ʱ ̴
  . 쿬 ̳ʸ  ͹̳ο  Ŀ ͹̳ ̻ ׷-
    ȯ ä ִ  UTF-8  ʴ. ̰
  UTF-8 忡 ִ  ͹̳ ISO 2022  ϶ ξ  
   ϵ Ѵ. ׷Ƿ ͹̳ 쿬 ISO 2022  ư 
   װ UTF-8     ȿ̴.

  ISO 2022 ǥ ISO 2022 忡   ̽  %
    ϰ ִ(ٸ ڵ ý , DOCS). ׸ ׷
     UTF-8 <ftp://ftp.informatik.uni-erlangen.de/pub
  /doc/ISO/charsets/ISO-10646-UTF-8.html> ؼ ISO 2375  ڵ 
   Ϻ(I nternational Register of Coded Character Sets) <ht
  tp://www.itscj.ipsj.or.jp/ISO-IR/>  2.8 ϵ .

  o  ESC %G ISO 2022 ٽ ư ϴ  ISO 2022
     õ    UTF-8 ȰȭŲ.

  o  ESC %@ ESC %G ļ UTF-8  쿡 UTF-8 ISO 2022
     ǵư Ѵ.

  o  ESC %/G UTF-8  1 ȯ  ȯŲ.

  o  ESC %/H UTF-8  2 ȯ  ȯŲ.

  o  ESC %/I UTF-8  3 ȯ  ȯŲ.

  ͹̳ ķͰ UTF-8 忡 ִ ȿ G2/G3 ȯŰ 
      ISO 2022 ̽  õȴ. UTF
  -8 忡 ϴ ͹̳ ķ   ISO 2022 ,
  UTF-8 ISO 2022 ü ٽ ȯŰ ESC %@̴.

   UTF-8 尡 0x80 0x9F   Ʈ  
  ,  CSI  C1  ڵ ϴ  Ѵ. U
  TF-8 忡 ִ ͹̳ ķʹ   ڸ ؼϱ 
  UTF-8 ڴ ԷµǴ Ʈ Ʈ ؾ߸ Ѵ  
  ϴ  ߿ϴ. C1 ڵ U+007F Ѵ ٸ ڵ ó
  UTF-8  ڵȴ.

  17.  UTF-8     ø̼   ִ°?

  o  XFree86 4.0 Ȥ  ̻  Բ Ǵ xterm <http://d
     ickey.his.com/xterm/xterm.html>("./configure --enable- wide-chars;
     make"  ϰ xterm      ɼ -u8
     ϶).

  o  NEW: Yudit 2.0 <http://www.yudit.org/> Gaspar Sinai  
      X11 ڵ ̴.

  o  Thomas Wolff <http://www.inf.fu-berlin.de/~wolff/>  Mined 9
     8 <http://www.inf.fu-berlin.de/~wolff/mined.html> UTF-8 
      ؽƮ ̴.

  o  Cooledit <http://cooledit.org/> 3.15.0 ķ UT F-8 UCS
     Ѵ.

  o  NEW: QEmacs <http://www-stud.enst.fr /~bellard/qemacs/> UTF-8
     ͹̳ο ϱ    ̴.

  o  346  less  <http://www.flash.net/~marknu/less/> UTF-8
     Ѵ.

  o  C-Kermit 7.0  <http://www.columbia.edu/kermit/ckermit.html>
     (transfer), ͹̳   ڼ¿  UTF-8 Ѵ.

  o  Perl <http://www.perl.org/> "use utf8;" ɼ  û 쿡
      5.6  ٽ UTF-8 (core UTF-8 support)
     <http://rf.net/~james/perli18n .html>Ѵ. ̰   UTF-8
     ǰ(׸ UTF-8 ±ȭ Ǹ) length() Լ  Ʈ 
     ſ ڵ ȯ ǹѴ. UTF-8  ȭϱ   
          ̴(perl-unicode@perl.org
     <mailto:perl-unic ode-help@perl.org> ϸ Ʈ ).  
      ɰ ѵǴ   perldoc perlunicode  perldoc
     utf8 о.

  o  Python 1.6 <http://www.python.org/1.6/>  ڵ 
     <http://starship.python.net/crew/lemburg/unicode-proposal.txt>
     ϼϰ ִ.

  o  Tcl/Tk <http://dev.scriptics.com/>  8.1 ڵ带 ⺻
     ڼ <http://dev.scriptics.com/doc/howto/i18n.html> Ͽ
     ȴ. ׷ ϰԵ Tk Ʈ ó ڵ   
     <http://dev.scriptics.com/ticket/issue-view.tcl?m
     sg_id=2349&return_url=%2fticket%2findex%2etcl%3fmsg%5fid%3d%26proj
     ect%5fid%3d11182%26domain%5fid%3d13586%26query%5fstring%3dunicode%26ca
     tegory%5fid%3d23828%26orderby%3dscore%252a%252cmsg%255fid%252a%252c%26
     submitby%3dmine%26assign%3dmine%26status%3dactive%26create26view%3dtab
     le> ־ 16Ʈ *-iso10646-1 Ʈ ڵ ڿ 
     ÷ϱ ؼ   .

  o  Exmh <http://www.beedub.com/exmh/> MH  ý  GUI
     Ʈ̸  Tcl/Tk 8.1 ̻  Ǵ   
     2.1.1 ĺ ڵ带 Ѵ.

  o  2000-03-06  ĺ CLISP <http://clisp.cons.org> UTF-8
       Ƽ-Ʈ ڵ wcwidth()  wcswidth()Լ
       API  char-wid th  string-width Լ Բ
       ִ.

  o  Sam <http://hawkwind.utcs.utoronto.ca:8001/mlists/sam.html> vi
      Plan9 UTF-8 ̸,  Win32  
     ִ.(Plan9 <http://plan9.bell-labs.com/plan9/>  ڵν
     UTF-8 Ϻϰ ȯǴ(switchedc ompletely to UTF-8 as its
     character encoding) <ftp://ftp.informatik.uni-
     erlangen.de/pub/doc/ISO/charsets/UTF-8-Pl an9-paper.ps.gz> 
     ü)

  o  Matty Farrow <http://www.gh.cs.usyd.edu.au/~matty/>  9term
     <http://www.gh.cs.usyd.edu.au/~matty/9term/>  Plan9 ü
     ڵ/UTF-8 ͹̳ ķ н Ʈ ̴.

  o  Wily <http://www.cs.su.oz.au/~gary/hobby/wily/auug.html>  Plan9
     Acme ͸ н  ̴.

  o  ucm-0.1 <ftp://ftp.dcs.ed.ac.uk/pub/jec/programs/> Juliusz
     Chroboczek <http://www.dcs.ed.ac.uk/home/jec/>Juliu sz Chroboczek
      ڵ  ̴. ̰ ڵ ڸ ϰ
     ø̼ ٿ  ְ ϴ  ̴.

  o  Serge Winitzki <http://www.linuxstart.com/~winitzki/>  
     txtbdf2ps <http://www.linuxstart.com/~winitzki/txtbdf2ps.html>
     BDF ȼ Ʈ Ͽ UTF-8 plaintext Ʈ ũƮ
     ϴ  Perl ũƮ̴.

  o  FIGlet 2.2 <http://st-www.cs.uiuc.edu/users/chai/figlet.html> Ǵ
         ׷ (block graphics elements)ν
     뽺̽ (monospaced characters) Ͽ ū ڷ  
     ؽƮ(banner text) ϴ  ̴.

  o  Edmund Grimley Evans <http://www.rano.org/> UCS Ʈ 
     Ͽ BOGL <http://www.msu.edu/user/pfaffben/>  ӹ
     ׷ ȮϿ.  UCS Ʈ  ̿Ͽ bterm̶ Ҹ
      UTF-8 ܼ ͹̳ ķ͸  Ͽ.

  18.  UTF-8 ϱ ؼ 밡 ġ ΰ?

  o  Bruno Haible <http://clisp.cons.org/~haible/> stty  Ŀ
     tty  groff    ġ <ftp://ftp.ilog.fr/pub/Use
     rs/haible/utf8/> غߴ.

  o  Miyashita Hisashi Emacs 20.6  ̻   ڼ 
     Ű MULE-UCS <ftp://ftp.m17n.org/pub/mule/Mule-UCS/> 
     ۼߴ. ̰ Mule ڵ(Emacs  ϴ) ISO 1
     0646 ̸ ȯų  ִ.

  o  Otfried Cheong GNU Emacs  ڵ ڵ
     <http://www.cs.uu.nl/~otfried/Mule/> , Ǵٸ Emacs
     ڼ utf-8 ν  BMP ϴ MULE-UCS 
       Ȯ Ѵ. װ    MULE-UCS
      ª ġ ȳ Ѵ.

  o  Tomohiko Morioka UTF-8 xemacs  ġ
     <http://turnbull.sk.tsukuba.ac.jp/Tools /XEmacs/> Ҵ.

  o  Edmund Grimley Evans <http://www.rano.org/> ̸ α׷
     Mutt curses library  Slang  UTF-8 ġ <
     http://www.rano.org/mutt.html> غߴ.

  19.  ڵ带 ٷ 밡  (free) ̺귯 ִ°?

  o  NEW: Ulrich Drepper  GNU C ̺귯 glibc 2.2.1
     <http://sourceware.cygnus.com/glibc/>  UTF-8  Ϻ
     Ƽ-Ʈ  ϱ ؼ ڵ  
     ˰(sorting order algorithm) ϸ, ̰ ٸ  
     ڵ ٽ ڵ  ִ. κ  ǵ 
     ̷ glibc 2.2.1 ׷̵  ,  
      ڵ glibc 2.2.1 ҽ
     <ftp://sourceware.cygnus.com/pub/glibc/releases>  ġϷ
     õ  ִ(̷  ٷ Ⱑ ư ϴ). Bruno
     Haible glibc 2.2 ġ ħ <http://clisp.cons.org/%
     7ehaible/glibc22-HOWTO.html> Ӹ ƴ϶     
     Ȳ  Ulrich TODO list <http://www.cygnus.com/
     ~drepper/TODO.html> CVS ī̺긦  ϶.

  o  ڵ带    Ʈ
     <http://oss.software.ibm.com/icu/>(  ڵ带  IBM
     Class <http://www.alphaworks.ibm.com/tech/icu/>).

  o  Mark Leisher <http://crl.nmsu.edu/~mleisher/> wchar_ t 
     ׽Ʈ ڵ(wchar_t support test code)  UCData  ڵ 
     Ӽ(UCData Unicode character property) bidi ̺귯

  o  Bruno Haible <http://clisp.cons.org/~haible/> libiconv
     <http://clisp.cons.org/~haible/packages-libiconv.html> ڼ ȯ
     ̺귯  iconv()
     <http://www.opengroup.org/onlinepubs/007908799/xsh/iconv.h.html>
     Լ ϴµ ̰  Լ ϳ   ʰų
     Լ ص ڵκ Ȥ ڵ ȯ ʴ
     ý  ̴.

  o  Bruno Haible libutf8 <http://clisp.cons.org/~haible/packages-
     libutf8.html> UTF-8 ڿ óϱ , Ư   
     UTF-8   ʴ ÷   Լ
     Ѵ.

  o  Tom Tromey <mailto:tromey@cygnus.com> libunicode <http:
     //people.redhat.com/otaylor/pango-
     mirror/download/libunicode-0.4.tar.g z> ̺귯 Gnome ũž
     Ʈ ȯ, G nome    ִ. ̰
       Ŭ ȯ   Ѵ.( VS
     <http://cvs.gnome.org/lxr/source/libunicode/>)

  o  FriBidi <http://imagic.weizmann.ac.il/~dov/freesw/FriBidi/>
     Unicode bidi ˰    ̸ Dov Grobgeld 
     ߴ.

  o  Arabjoin <http://czyborra.com/arabjoin/> ƶ ڿ UTF-8
     ؽƮ( δ U+06xx ƶ Ͽ ڵȴ ) Է
     ޾Ƶ̰, ƶ glyph  ϸ,  ̴  
     ĵǴ 8Ʈ  UTF-8 Ʈ ϴ Roman Cryborra 
      Perl ̴. ̰ ƶ ڸ ٸ ϴ  ƴ 
     ܼ  glyph ʿ   ϴ xterm Ȥ yudi
     t   ڵ  (renderer)   
      ִ  ش.

  o  NEW: Charlint <http://www.w3.org/Inter national/charlint/> W3C
       <http://www.w3.org/TR/charm od/>   ǥȭ
     ̴.

  o  Markus Kuhn   wcwidth()  Լ <ucs/wcwidth.c> , C
     ̺귯  ڳ ڿ UTF-8 ͹̳ ķ
     ũ 󸶳  (column) ġ ϰ ִ ˾Ƴ
      Լ  ʴ ÷ ø̼ǿ  
     ִ.

  o  Markus Kuhn ȯ(transtab) <download/transtab.tar.gz> 
     ڵ忡 ASCII  8Ʈ ڼ ȯϱ  ø
     ̼   ȯ ̴̺. ̰ ڵ ڵ 
     ġȯ ڿ    ϰ  Ұ
     ڵ Ÿ ؼ  ̸̳ Ÿڱ⿡ ϴ
     ü ǥ(fa llback notation) ϴ.  ̺ POSIX
       (POSIX locale definition file) ԽŰ ؼ
     ISO/IEC TR 14652>ISO/IEC TR 14652  Ǿ ִ. </itemize>
     <!--  20   --> <sect> X  ̺귯 
     ڵ   Ȳ  Ѱ? <p> <itemize> <item><url
     url= <volatile/ISO-1465 2.pdf> name="Pango - Unicode and Complex
     Text P rocessing"> GTK+ <http://www.gtk.org/> Ϻ Ư
      ڵ  ߰ϱ  Ʈ ȯ̴.

  o  Qt 2.0 <http://www.troll.no/announce/qt-200.html>  
     *-ISO10646-1 Ʈ  ϰ ִ.

  20.  UTF-8 ϱ   Ű   ΰ?

  o  NEW:  vi  αִ Ŭ  Vim ֽ   ׽Ʈ
      6.0s <ftp://ftp.vim.org/pub/vim/unreleased/>  ̵
     ڿ 2  ڸ   ÿ UTF-8 Ѵ.
     ڼ  Bram Moolenaar announcement <http
     ://mail.nl.linux.org/linux-utf8/2000-07/msg00036.html>  о.

  21.  ֶ󸮽 󿡼 UTF-8    ϴ°?

  Solaris 2.8  ķ UTF-8  κ ȴ.
  UTF-8  ϱ ؼ UTF-8   ϳ ϶.  
    C  ԷѴ.

  setenv LANG en_US.UTF-8

   UTF-8 ؽƮ Է  ϱ ؼ dtterm ͹̳ ķ
  ͸   , mp print filter ƮũƮ Ϳ 
  UTF-8   ̴. en_US.UTF-8  ν Mo tif
  CDE ũž ø̼  ̺귯 ؼ ,
  OpenWindows, XView  OPENLOOK DeskSet ø̼ǰ ̺귯
  ؼ  ʴ´.

     Ѵٸ, en_US.UTF-8     Sun's
  Overview <h
  ttp://docs.sun.com:80/ab2/coll.45.13/I18NDG/@Ab2PageView/10821?Ab2Lang=C&Ab2
  Enc=iso-8859-1> о.

  22.  Ʈ ũƮ glyph Ī(Postscript glyph names)>  UCS
  ڵ õǾ ִ°?

  Adobe Unicode and Glyph Names
  <http://partners.adobe.com/asn/developer/typeforum/unicodeg n.html>
  ̵带 о.

  23.   ǵ UCS subset ִ°?

  40000 ڸ  ڵ带 Ϻϰ ϴ  Ŵ
  Ʈ̴ . ׷     Ǵ õ ڸ ϴ
  Ͱ ڵȭ ģ   ϳ ڵӿ  ʿ ڿ
  ϴ ܼ  ͵  ߿ϴ(Ư   ؼ).
   ٸ UCS µ ̹ ȮǾ.

  o  Windows Glyph List 4.0 (WGL4)
     <http://partners.adobe.com/asn/developer/opentype/wgl4.htm> 8Ʈ
     MS-DOS, Windows, Mac  ũμƮ    ִ
     ISO ڵ   ϴ 650 ڷ  ̴. 
     Windows Ʈ   WGL4   Ѵ.  WGL4 CEN
     MES-1( <http://www.cl.cam.ac.uk/~mgk25/ucs/wgl4.txt>= WGL4 ׽Ʈ
     ">)  ϴ ̴.

  o    UCS  MES-1, MES-2  MES-3
     <http://www.egt.ie/standards/iso10646/pdf/cwa13873.pdf>  ǥ
     ȸ CEN/TC304 ؼ CWA 13873 ȿ  Ǿ.

  o  MES-1  335 ڸ  ſ  ƾ  ڼ̴.
     ̰ Ȯϰ ISO 6937   ִ  ڿ ̿  EURO
     SIGN Ѵ. ̰ ISO 8859 1,2,3,4,9,10,15 κ 
     ڸ MES-1 Ѵٴ  ǹѴ. :   
           ո ߾  UCS 
     ڼ ϴ ̶,  MES-1 MES-1  Windows
     ڵ  1252ʿ   ִ  ߿ 14 ΰ
     ڵ ؼ  ̴: U+0192, U+02C6, U+02DC, U+2013,
     U+2014, U+201A, U+201E, U+2020, U+2021, U+2022, U+2026, U+2030,
     U+2039, U+203A.]

  o  MES-2 1052 ڵ 
     ƾ/׸/Ű/̱/׷ƾ   ڼ̴.
     ̰ ( EU 鸸 ƴ)   ϴ
     󿡼 Ǵ    8Ʈ ڵ  Ѵ.
     ̰    ϴ    ȣ 
     ϰ ִ.  MES-2 MES-1 ϴ ڼ̴.  
       Ȥ   ؼ ϰ ִٸ, MES-2 õ
      ڼ̴. [:  ȸ-ġ , MES-2
      8 WGL4ڵ  ʰ ִ: U+2113, U+212E,
     U+2215, U+25A1, U+25AA, U+25AB, U+25CF, U+25E6.   MES-2
     Ѵٸ,   8 WGL4ڵ ߰ؾ߸ϸ, ׷
     Ŀ ڼ WGL4 ġų  ִ.

  o  MES-3 2819ڸ  ſ  UCS ̴. ̰ ܼ
      ڵ鿡Դ  ִ ſ  UCS
     (collection)  Ѵ. ̰   ڵ
      ̴. MES-3 MES-2 WGL4 ϴ ڼ̴.

  o  JIS X 0221-1995 Ϻ ڵ  7 ġ ʴ UCS
      ϰ ִ.

  o  ⺻ Ϻ(6884 ): JIS X 0208-1997, JIS X 0201-1997

  o  Ϻ -ǥ (Non-ideographic) (1913 ): JIS X
     0212-1990 - (non-kanji)   ٸ - 

  o  Ϻ ǥ   1(918 ):  JIS X 0212-1990  

  o  Ϻ ǥ   2(4883 ):  JIS X 0212-1990 
     

  o  Ϻ ǥ   3(8745 ):  ߱ 

  o     Alphanumeric(94 ): ȣȯ ؼ

  o     īŸī (63 ): ȣȯ ؼ

  o  ISO 10646 ǥ װ ü ,  ϰ ϱ
     ؼ ϴ  (collections)
     <http://www.egt.ie/standards/iso10646/ ucs-collections.html>
     . ڵ嵵 , Ȱ  ڵ ǥ 
     ǿ ϴ ڵ 
     <ftp://ftp.unicode.org/Public/UNIDATA/Blocks.txt> (blocks of
     characters) ϰ ִ.

  o  RFC 1815 <ftp://sunsite.doc.ic.ac.uk/packages/rfc/rfc1815.txt>
     ISO 10646    JIS X 0221-1995  𸣴
      ؼ 1995⿡   ޸̴. װ 14 UCS
       "ISO-10646-J-1"̶ Ҹ  UCS ¿ 
     ϰ , 14 UCS   JIS X 0208 .
     ̰  1995 Ϻ Windows NT  Ե  Ư
     Ʈ 쿬 ٴ ̴.  RFC 1815 ó  ô뿡
     ڶ  , ϴ  ̴ּ.

  o  Markus Kuhn ucs-fonts.tar.gz
     <http://www.cl.cam.ac.uk/~mgk25/download/ ucs-fonts.tar.gz>
     README Ͽ  UCS  TARGET1, TARGET2  TARGET3
     ϰ ִµ ̵ ϴ MES  ˸° Ȯ ̸,
     xterm Ʈ Ű ϼϱ  ٰ Ǿ.

  Markus Kuhn ϼ(uniset)
  <http://www.cl.cam.ac.uk/~mgk25/download/uniset.tar.gz> Perl
  ũƮ  α׷  ϴ  üũϱ⸦ ϰų
  ο α׷  ;ϴ  Ͽ UCS  
     (set) ϴ  ϰ ִ.

  24.  X11 R6.4  ڵ忡   ִ°?

  X11 R6.4  <ftp://ftp.x.org/pub/R6.4/>(1998) X ҽÿ
   X11  ý ǥؿ ´  α׷ ֽ ̴.
  κ  X11 ǥ <ftp://ftp.x.org/pub/R6.4/xc/doc/hardcopy/>
   α׷ н ȯ濡 ڵ忡   
   ҷŰ ִ.

  o

     UTF-8 ߶󳻱 ̱: ICCCM <ftp://ftp.x.org/pub/R6.4/
     xc/doc/hardcopy/ICCCM/icccm.PS.gz> ǥ  UCS ڿ
     ȯϴ   ʰ ִ.   ϴ
     COMPOUND_TEXT <ftp://
     ftp.x.org/pub/R6.4/xc/doc/hardcopy/CTEXT/ctext.PS.gz> ī
     (CTEXT) UTF-8  ϳ ڵν ߰Ͽ. ̰ 
         ذå ƴϴ.

  o  CTEXT ټ  ISO 2022 ī̴. ׷ ڵ
     CTEXT  ٸ ߰ ׸ ϴ  ƴ϶  ī
     ü ξ ϰ  ϸ  ɷ  𰡷
     ü ȸ Ѵ.

  o  ϴ  ø̼ǵ CTEXT   Ͱ  
     , Ӱ ߰ UTF-8 ɼ  ʴ´. CTEXT ڴ
      ISO 2022 ڵ   ο UTF-8 ڵ 
      ؾ߸ Ѵ.  ׷ ΰ ÿ   .
     ٽ ؼ, CTEXT UTF-8  ߰Ѵٰ  ϴ
     CTEXT ø̼   ȣȯ .

  o   CTEXT   6 Ȯϰ   UTF-8 ߰
     ϰ ִ. "'ٸ ڵ ý' ϵ ISO Ŀ
     ؽƮ  ʴ´; Ȯ ׸Ʈ ISO 2022 ƴ
     ڵ(non-2022 encodings)   ī̴."

     Juliusz Chroboczek <http://www.dcs.ed.ac.uk/home/jec/> Ӽ
     (property type)  ǥ(selection target)   ִ
     ο UTF8_STRING Ҹ ̿Ͽ UTF-8  θ ٷ 
     ICCM Ȯ忡 ؼ, "ڵ ؽƮ -Ŭ̾Ʈ ȯ
     <http://www.dcs.ed.ac.uk/home/jec/programs/xfsft/
     UTF8-selections.text>   ʾ(Inter-Client Exchange of
     Unicode Text draft proposal)" ۼϿ.

  o    Ʈ  : Ʈ   ̿  
     Ÿ  ϴ Xlib API X11    幰
     ϴ Ʈ(sparsely populated fonts) ó  
      .  X Ŭ̾Ʈ  Ʈ ׼ϴ  Ϲ
      XLoadQueryFont() Լ ȣϴ ̸, ̰
     XFontStruct  ޸𸮸 Ҵϰ κ װ 
     ҷ´. XFontStruct  12Ʈ XCharStruct Ʈ(entry)
     迭 Ѵ. ̷ 迭 ũ   ڵ ġ - ù
     °  ڵ ġ + 1 ̴. ׷Ƿ U+0020 U+FFFD 
     ϴ  "*-iso10646-1" Ʈ 65502 Ҹ  XCharStruct
     迭 Ҵǵ  ̴( CharCell Ʈ鿡 ؼ
       ߻ ̴).  ̰  Ʈ  1000
     ڵ ϰ ִ   786 ųιƮ Ŭ̾Ʈ-̵
     ޸(client-side memory)   ʿ  ǹѴ.

     ݱ ŷؿԴ ϰ õ  ֺ ϵ:

  o  XFree86 4.0 Բ Ǵ -ƽþƱǿ -misc-fixed-*-iso10646-1
     Ʈ U+31FF ̻   ڵ  ʴ´.
     ̰ ʿ  ޸𸮷 153 ųιƮ Ѵ. ̰
        , ׷  ϵ ƴϴ.(BDF Ͽ
     Ÿ U+31FF ̻    ڵ  ϸ,
     ̷  ذ   ٸ ִ. ׷   
     -1 ڵǸ,  X  ؼ õȴ.)

  o  Bruno Haible  Ŭ̾Ʈ  XCharStruct
     ϰ,  Ʈ εϴ ټ ڸ  Xlib 
     ޸𸮸 ϴ, XFree86 4.0  BIGFONT 
     Ȯ(extension) ۼϿ.

     ̷ ֺ ϵ XFontStruct 幰 ϴ Ʈ鿡
      ʴٴ   ذ  ʴ´. ׷ ׵
     API Ȥ Ŭ̾Ʈ ҽ ڵ   ʿ  ߿ ȿ
      Ѵ. Ѱ  ذå XFontStr uct 迭
      ڸ ϴ  Ʈ ؽ ̺ ϴ
     ξ  𰡷 Ȯ Ȥ üϴ   ̴. ̷
     XFontStruct     ڿ ε  ἱ
     ÿ ϱ   ʿ  å  ̴.

  o  Keysyms: ݱ ǵ ٷδ keysyms    Unicode
       Ѵ. Markus Kuhn U-00000000  U-00FFFFFF
       UCS ڴ 0x01000000  0x01ffffff 
     keysym  Ÿ  ִٰ Ͽ(׸ xterm ̰
     Ͽ).  ̰ θ ǰ ִ ó ü 31Ʈ UCS
       ʴ´.  ׷ ̰ UTF-16  ̻ 忡
     Ÿ  ִ U-0010FFFF  ڵ ϸ, ISO  ū
      UCS ڵ Ҵ   ʴ( ̷ ISO
     10646κ U-0010FFFF Ѵ ڵ  ڴ  ִ).
      ڵ  U+ABCD  ؼ keysyms 0x0100abcd
        ִ.   keysyms UCS(X11 ǥؿ
     ߸    ϳ̴) ̸ ȯŰ  ȵ
     ̺  ҽ ڵ xterm keysym2ucs.c
     <http://www.cl.cam.ac.uk/~mgk25/ucs/keysym2ucs.c>  .
     Markus  X  ǥ η A: KEYSYM ڵ
     <http://www.cl.cam.ac.uk/ ~mgk25/ucs/X11.keysyms>  ʾ 
     ۼߴ. װ UCS ȣ  ̺ ߰ KEYSYM ڵ(PDF
     <http: //www.cl.cam.ac.uk/~mgk25/ucs/keysyms.pdf>)̴.

  o   : X11     ڸ  ε
      ʴ´ٴ    ִ. Ʈ  ׼Ʈ ڵ
     ̴µ ʿ Ͱ ϴ( , TeX  Ʈ
     ׼Ʈ ڵ ̴  ִ). پ  ũ
          (zero-width characters)
     ϴ     ľ(simplest overstriking
     combining characters) ϱ ؼ  ؿԴ. ׷
     ̰ ϴ      õ ʰ ִ.
     ( , CharCell 뽺̽ Ʈ(monospaced fonts) 
      ڸ  ʴ´.) ׷Ƿ ̰  а Ȯ
      ƴϴ.

  o  ἱ: ε ü ἱ ġȯ ϴ Ʈ   ʿ
     Ѵ. ̰ ν  ó X11  ׿ ԵǾ
      ʴ.

  o  UTF-8 : X11 R6.4  ν UTF-8  
        ʰ ִ.  UTF  ,
     װ ҿϸ 翡  UTF-1 ڵ Ѵ. UTF-8
      ϱ ؼ Ϲ ڵ ȯ  ʿ Ӹ
     ƴ϶  Ű (entry) , ϴ ISO 8859 keysym
     Ű带 UCS Ͽ ġϱ, ռ Ű (compose key) 
     ص Ȯ   ѱ۰   (entry support)
     Էϱ    ISO 14755 <http://www.cl.cam.ac.
     uk/~mgk25/volatile/ISO-14755.pdf> 16  ʿϴ.

  o   (sample implementation): xterm, xfontsel, 
     Ŵ   ǥ   ڵ  Ӹ ƴ϶
       ڵ ǥ Ʈ   ߰Ǿ߸
     Ѵ. ̷ κп    ̹ XFree86 ο
     ̷    ϵ  ߿ ȵ  ذ
     ʾҴٴ Ƿ   ǰ ִ.

  ̷ ̽ ϴ ۾  ΰ? 𸣰ڴ.   Ͽ
   X  ҽÿ   ü X11 ǥذ  
    ϴ Op engroup X.Org <http://www.x.org/> Ϸ
  õ, ׵κ   ִ  䵵 
  ߴ(X.Org   XFree86 ,   
  ߴ).

  25.  ̷    ϸ Ʈ ִ°?

  ݵ unicode@unicode.org
  <http://www.unicode.org/unicode/consortium/distlist.html> ϸ
  Ʈ ؾ߸ Ѵ. ̰ ǥ  ڿ  ٸ 
  ǰ   ּ ̴. α  Ѵ , unicode-
  request@ unicode.org <mailto:unicode-request@unicode.org>
  "subscribe"  Բ "subscribe YOUR@EMAIL.ADDRESS unicode"
    ޽  ȴ.

   GNU/Linux ýۿ Ϲ ϴ ø̼  
   UTF-8 å ϱ ؼ ̴ linux-utf8@nl.linux.org
    Ʈ ִ. α  Ѵٸ, "subscribe linux-
  utf8"̶     majordomo@nl.linux. org
  <mailto:majordomo@nl.linux.org> ޽ .  linux-utf8
  archive <http://www.linux.eu.org/lists/linux- utf8/> ٿ 
  ִ.

  Xlib X  ڵ    ϸ Ʈ
  fonts@xfree86.org <http ://XFree86.Org/mailman/listinfo/fonts>
  i18n@xfree86.org <http://XFree86.Org/mailman/listinfo/i18n> ִ.

  26.     ڷ

  o  Bruno Haible Unicode HOWTO
     <ftp://ftp.ilog.fr/pub/Users/haible/utf8/ Unicode-HOWTO.html>.

  o  The Unicode Standard, Version 3.0
     <http://www.amazon.com/exec/obidos/ASIN/0201616335/mgk25>, Addison-
     Wesley, 2000. Ʈ ڿ   ϰ Ѵٸ ݵ 
      纻 ؾ Ѵ.

  o  Ken Lunde CJKV Information Processing
     <http://www.amazon.com/exec/obidos/ASIN/1565922247/ mgk25>,
     O'Reilly & Associates, 1999.   ƽþ ڼ¿  ִٸ
     Ʋ   å̴.

  o  Unicode  Technical Reports
     <http://www.unicode.org/unicode/reports/>

  o  Mark Davis  Unicode FAQ <http://www.unicode.org/unicode/faq/>

  o  ISO/IEC 10646-1:1993 <http://www.iso.ch/cate/d18741.html>

  o  Frank Tang  Internationalization Secrets
     <http://people.netscape.com/ftang/i18n.html>

  o  IBM's Unicode  Zone <http://www.ibm.com/developer/unicode/>

  o  Unicode Support in the Solaris 7 Operating Environment
     <http://www.sun.com/software/white-papers/wp-unicode/>

  o  introduction of UTF-8 under Plan9 <ftp://ftp.informatik.uni-
     erlangen.de/pub/doc/ISO/charsets/ UTF-8-Plan9-paper.ps.gz> Rob
     Pike Ken Thompson ۼ The USENIX paper 1992⿡ ̹
     UTF-8( ñ⿡ UTF-2 ҷȴ) Ϻϰ  ù °
     ü ؼ ϰ ִ.

  o  Li18nux <http://www.li18nux.net/>   ڵ 
     ȭϱ ؼ   ڵ鿡 ؼ ۵ Ʈ
     ̴.  Ʈ ֱٿ   ؼ Li18nux 2000
     Globalization Specification <http:
     //www.li18nux.net/root/LI18NUX2000/li18nux2k_draft.html>
     ߴ.

  o  Online Single  Unix Specification <http://www.UNIX-
     systems.org/online.html> wcwidth()  Ȯ Լ  
     ISO C Amendment 1 Լ ϰ ִ.

  o  Open Group ISO C Amendment 1 <http://www.unix-
     systems.org/version2/whatsnew/ login_mse.html> ༭.

  o  GNU libc <http://sourceware.cygnus.com/glibc/>

  o  The Linux Console Tools <http://lct.sourceforge.net/>

  o  ڵ ҽþ character database
     <ftp://ftp.unicode.org/Public/UNIDATA/> character set conversion
     tables <ftp://ftp.unicode.org/Public/MAPPINGS/> ڵ 
      Ϸ Դ ʼ ڷ̴.

  o  Microsoft
     <http://www.microsoft.com/globaldev/reference/WinCP.asp> Keld
     <ftp://dkuug.dk/i18n/WG15-collection/charmaps/>s WG15 archive">
     ȯ ̺  ϴ.

  o  Michael Everson ISO10646-1 archive
     <http://www.indigo.ie/egt/standards/iso10646/ pdf/>  ֽ
     ISO 10646-1 amendments ¶      
      ϰ ִ.   Roadmaps to the Universal Character
     Set <http://www.indigo.ie/egt/standards/iso10646/ucs-
     roadmap.html>   ִ.

  o  The Universal Character Set (UCS)
     <http://www.stri.is/TC304/guidecharactersets/guideannexb.html>
      Ұ.

  o  Otfried Cheong's essay on Han Unification in Unicode
     <http://www.cs.uu.nl/~otfried/Mule/ unihan.html>

  o  AMS STIX <http://www.ams.org/STIX/> Ʈ Unicode 4.0 ISO
     10646-2   ڸ   Ȯϴ  ϰ ִ.

  o  Jukka Korpela Soft hyphen (SHY) - a hard problem?
     <http://www.hut.fi/~jkorpela/shy.html> U+00AD ѷ ￡
        ڷ̴.

  o  James Brigg  Perl, Unicode and I18N FAQ
     <http://rf.net/~james/perli18n.html>.

  o  Mark Davis Forms of Unicode
     <http://www-4.ibm.com/software/developer/library/
     utfencodingforms/> UTF-8, UTF-16  UCS-4( ġ 
     UTF-32ε Ҹ)    ϰ ִ.

  o  Alan Wood Unicode and Multilingual Support in Web Browsers and
     HTML <http://www.hclrss.demon.co.uk/unicode/>   Ǿ.

  o  ISO/JTC1/SC22/WG20
     <http://anubis.dkuug.dk/jtc1/sc22/WG20/docs/projects>
     International String Ordering (ISO 14651)
     <http://anubis.dkuug.dk/jtc1/sc22/WG20/docs/
     projects/n731-fdis14651.pdf>  Cultural Convention Specification
     TR (ISO TR 14652)
     <http://anubis.dkuug.dk/jtc1/sc22/WG20/docs/n690.pdf> ( ,
     ̵    ȯ ϴ POSIX   Ȯ)
      پ ڵ  ǥ  ´.

  o  ISO/JTC1/SC2/WG2/IRG <http://www.cse.cuhk.edu.hk/~irg/>
     (Ideographic Rapporteur Group)

  o  Letter Database <http://www.eki.ee/letter/>  ڼ 
     Ī   亯 Ѵ.

  o  ο ߱ ڵ ǥ  <ftp://ftp.oreilly.com/pub/examples/
     nutshell/cjkv/pdf/GB18030_Summary.pdf>GB 18030 UCS 
     ϴ ܼȭ ߱  а Ǵ GB 2312 ڵ 
      ȣȯ  Ȯ ̴.

     ο  ſ  ߰ ̴. ׷Ƿ
  Ģ   üũϰų ſ ̸Ϸ  ִ
  Netminder <http://www.netmind. com/URL-minder/new/register.html>
  ̿Ͽ  ȭ üũ϶.   UTF-8  
  freeware community  Ӹ ƴ϶ 
  <mailto:Markus.Kuhn@cl.cam.ac.uk> ſ ȯѴ.   UTF-8
  ϴ  ſ  ̴. ׷Ƿ      
     ̴.

  Ulrich Drepper, Bruno Haible, Robert Brady, Shuhei Amakawa ġ ִ
    ٸ  ̵   SuSE GmbH, Nrnberg
  Ư  ǥѴ.

  Markus Kuhn <http://www.cl.cam.ac.uk/~mgk25/>
  <Markus.Kuhn@cl.cam.ac.uk> created 1999-06-04 -- last modified
  2001-02-06 --http://www.cl.cam.ac.uk/ ~mgk25/unicode.html

