[Solar-talk] UTF-8
Paul M Jones
pmjones at solarphp.com
Tue Jul 25 14:54:55 PDT 2006
On Jul 25, 2006, at 4:33 PM, Rodrigo Moraes wrote:
> On 7/25/06, Paul M Jones wrote:
>> I do not like the sound of that.
>
> lol. yup, it sounds terrible. my dumb guess was that UCS-2 meant that
> they were "more advanced" than UTF-8 (UCS-2 = UTF-16, right?), and
> then UTF-8 would not be an issue for SQL Server. which is totally
> wrong, it is.
IIRC, UCS-2 is fixed-byte; it always uses the same number of bytes to
represent characters (6 I think). UTF-8 is variable-byte; low-range
characters are 1 byte, mid-range are 2-4, and high are 5-6. So yeah
it looks like a problem. I wonder if this means MS-SQL will be
unsuitable for PHP and multi-language data.
--
Paul M. Jones <http://paul-m-jones.com>
Solar: Simple Object Library and Application Repository
for PHP5. <http://solarphp.com>
Savant: The simple, elegant, and powerful solution for
templates in PHP. <http://phpsavant.com>
More information about the solar-talk
mailing list