From: Hans Aberg (haberg@math.su.se)
Date: Mon Feb 04 2008 - 07:49:15 CST
On 3 Feb 2008, at 22:45, Jeroen Ruigrok van der Werven wrote:
> http://factor-language.blogspot.com/2008/02/24-bit-strings-are-in.html
>
> Personally I'd wonder about this. I can understand the desire to
> shave bytes
> off in-memory, but given a lot of platforms having issues with
> non-32 bit
> boundaries and the resulting performance or alignment issues I
> seriously
> wonder if it is worth the trade off of not just using UCS4 internally.
I think that 32-bit is probably best for internal use in programs for
speed, avoiding alignment problems; the best way to actually know is
to do some profiling. Externally, for distributed files, UTF-8 seems
best, because most agree on how to sort out the bits the bytes.
Hans Åberg
This archive was generated by hypermail 2.1.5 : Mon Feb 04 2008 - 07:52:47 CST