Saturday, June 20, 2009

Unicode

Unicode uses 4 bytes. There are 3 ways to store unicode character. UTF-32 uses 4 bytes for each character. UTF-16 stores most frequently used character in 2-byte values and less frequently used in 4-bytes value. UTF-8 uses 1-, 2- or 4-bytes values. UTF-8 and UTF-16 uses more processing comparing to UTG-32.

No comments: