[Solar-talk] UTF-8

Paul M Jones pmjones at solarphp.com
Tue Jul 25 14:54:55 PDT 2006


On Jul 25, 2006, at 4:33 PM, Rodrigo Moraes wrote:

> On 7/25/06, Paul M Jones wrote:
>> I do not like the sound of that.
>
> lol. yup, it sounds terrible. my dumb guess was that UCS-2 meant that
> they were "more advanced" than UTF-8 (UCS-2 = UTF-16, right?), and
> then UTF-8 would not be an issue for SQL Server. which is totally
> wrong; it is an issue.

IIRC, UCS-2 is fixed-width; it always uses the same number of bytes to
represent a character (2), so it only covers the Basic Multilingual
Plane. (UTF-16 is UCS-2 plus surrogate pairs for everything above
that.) UTF-8 is variable-width; ASCII characters are 1 byte, the rest
of the BMP takes 2-3, and characters above the BMP take 4. So yeah,
it looks like a problem. I wonder if this means MS-SQL will be
unsuitable for PHP and multi-language data.
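
To make the byte counts concrete, here is a rough sketch (in Python rather
than PHP, purely for illustration) that encodes a few sample characters and
reports how many bytes each one takes. The helper name encoded_lengths and
the choice of sample characters are assumptions made up for this example;
UTF-16 (little-endian, no BOM) stands in for UCS-2, since the two match for
BMP characters.

```python
def encoded_lengths(ch):
    """Return (UTF-8 byte count, UTF-16 byte count) for one character.

    Hypothetical helper for illustration; "utf-16-le" is used so that no
    byte-order mark is added to the count.
    """
    return len(ch.encode("utf-8")), len(ch.encode("utf-16-le"))


# Sample characters: ASCII, Latin-1 range, rest of the BMP, above the BMP.
samples = ["A", "\u00e9", "\u20ac", "\U0001d11e"]

for ch in samples:
    utf8_len, utf16_len = encoded_lengths(ch)
    print(f"U+{ord(ch):04X}: UTF-8 {utf8_len} byte(s), UTF-16 {utf16_len} byte(s)")
```

The last sample sits above the BMP, so UTF-16 needs a surrogate pair
(4 bytes) for it, and plain UCS-2 cannot represent it at all.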



--

Paul M. Jones  <http://paul-m-jones.com>

Solar: Simple Object Library and Application Repository
for PHP5.   <http://solarphp.com>

Savant: The simple, elegant, and powerful solution for
templates in PHP.   <http://phpsavant.com>



