D - Unicode type should be 32 bits, not 16

reply "psheer AT icon DOT co DOT za" <nospam nospam.com> writes:
On http://www.digitalmars.com/d/ctod.html it says:

"Although char is an unsigned 8 bit type, and
unicode is an unsigned 16 bit type, they have
their own separate types in order to aid
overloading and type safety"

Unicode should really be 32 bits. The Unicode
standard is now many times larger than the
first 64K block.

Making 16 bits the default would become a
major limitation in the near future.

glibc's Unicode wide character (wchar_t) is 32 bits.
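
To make the 64K point concrete, here is a rough C sketch (not taken from the D spec or from glibc; the names fits_in_16_bits and utf16_encode are made up for illustration). It assumes the standard UTF-16 surrogate scheme and shows that a code point above U+FFFF cannot be held in one 16-bit unit and has to be split across two, whereas a 32-bit type holds any code point directly.

```c
#include <assert.h>
#include <stdint.h>

/* Highest code point in the Unicode code space. */
#define UNICODE_MAX 0x10FFFFu

/* Hypothetical helper: 1 if cp fits in a single 16-bit unit, else 0. */
int fits_in_16_bits(uint32_t cp)
{
    return cp <= 0xFFFFu;
}

/*
 * Hypothetical helper: encode cp as UTF-16 into out[] and return the
 * number of 16-bit units written (1 or 2).
 */
int utf16_encode(uint32_t cp, uint16_t out[2])
{
    assert(cp <= UNICODE_MAX);
    /* Surrogate code points are reserved and cannot be encoded. */
    assert(cp < 0xD800u || cp > 0xDFFFu);

    if (fits_in_16_bits(cp)) {
        out[0] = (uint16_t)cp;
        return 1;
    }

    /* Beyond the first 64K block: split 20 bits into a surrogate pair. */
    cp -= 0x10000u;
    out[0] = (uint16_t)(0xD800u + (cp >> 10));    /* high surrogate */
    out[1] = (uint16_t)(0xDC00u + (cp & 0x3FFu)); /* low surrogate  */
    return 2;
}
```

For example, U+20AC (the euro sign) takes one unit, while U+1D11E (a musical symbol outside the first 64K block) needs two. With a 16-bit character type, "one array element is one character" stops being true for that second case.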

best

-paul


-- 
Paul Sheer Consulting IT Services . . . Tel . . . +27 21 761 7224
Linux development, cryptography,  recuitment,  support,  training
http://www.icon.co.za/~psheer . . . . http://rute.sourceforge.net
L I N U X . . . . . . . . . . . .  The Choice of a GNU Generation
Aug 18 2001
parent "Walter" <walter digitalmars.com> writes:
You're right, and I'll make that change.
Aug 18 2001