From 85fb192d93d983525d4a8fc8dc963e47db70ff8d Mon Sep 17 00:00:00 2001 From: "Edward Z. Yang" Date: Tue, 24 Jun 2008 22:12:56 -0400 Subject: [PATCH] Remove incorrect information about bit-size UTF-8 is a variable-width encoding that uses octets, UTF-16 is a variable-width encoding that uses 16-bit words, and UCS-2 is an obsolete fixed-width encoding that doesn't not support characters beyond the BMP. Explaining this would be unwieldly, so we just removed the information. See also: http://www.reddit.com/info/6mlqc/comments/c04aold Signed-off-by: Edward Z. Yang --- docs/enduser-utf8.html | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/docs/enduser-utf8.html b/docs/enduser-utf8.html index 88a48a70..00dcdca9 100644 --- a/docs/enduser-utf8.html +++ b/docs/enduser-utf8.html @@ -118,9 +118,8 @@ there are now many character encodings floating around.

see a page on the web, chances are it's encoded in one of these encodings.
  • Unicode-based encodings implement the - Unicode standard and include UTF-8, UCS-2 and UTF-16. - They go beyond 8-bits (the first two are variable length, - while the second one uses 16-bits), and support almost + Unicode standard and include UTF-8, UTF-16 and UTF-32/UCS-4. + They go beyond 8-bits and support almost every language in the world. UTF-8 is gaining traction as the dominant international encoding of the web.