From 85fb192d93d983525d4a8fc8dc963e47db70ff8d Mon Sep 17 00:00:00 2001
From: "Edward Z. Yang"
Date: Tue, 24 Jun 2008 22:12:56 -0400
Subject: [PATCH] Remove incorrect information about bit-size
UTF-8 is a variable-width encoding that uses octets, UTF-16
is a variable-width encoding that uses 16-bit words, and
UCS-2 is an obsolete fixed-width encoding that doesn't not
support characters beyond the BMP. Explaining this would be
unwieldly, so we just removed the information.
See also: http://www.reddit.com/info/6mlqc/comments/c04aold
Signed-off-by: Edward Z. Yang
---
docs/enduser-utf8.html | 5 ++---
1 file changed, 2 insertions(+), 3 deletions(-)
diff --git a/docs/enduser-utf8.html b/docs/enduser-utf8.html
index 88a48a70..00dcdca9 100644
--- a/docs/enduser-utf8.html
+++ b/docs/enduser-utf8.html
@@ -118,9 +118,8 @@ there are now many character encodings floating around.
see a page on the web, chances are it's encoded in one
of these encodings.
Unicode-based encodings implement the
- Unicode standard and include UTF-8, UCS-2 and UTF-16.
- They go beyond 8-bits (the first two are variable length,
- while the second one uses 16-bits), and support almost
+ Unicode standard and include UTF-8, UTF-16 and UTF-32/UCS-4.
+ They go beyond 8-bits and support almost
every language in the world. UTF-8 is gaining traction
as the dominant international encoding of the web.