Neil Hodgson | 8 Oct 13:49
Picon

Re: Adding ZWSP to whitespace.characters

Roger Sperberg:

> Whether it be a dot below the baseline or above the ascenders or a
> line of some sort, do you think it would be possible to draw in
> something (anything) over the text for a ZWSP, the way the dor is
> drawn over the regular space?

   Its possible but you really should take a close look at the code to
see if you can think of a reasonable mechanism.

> However, I don't grasp the significance of what you're saying: that
> is, I don't get how regular spaces are recognized if this distinction
> disqualifies the ZWSP. (Is it that only characters in the first 256
> positions can be whitespace since they can be represented in a single
> byte?)

   Most common encodings (apart from EBCDIC and escape based
encodings) are supersets of ASCII so any byte <= 127 is identical in
meaning to ASCII. Therefore byte value 0x20 is always a space. If
encoded in UTF-8 a ZWSP is the three byte sequence [e2, 80, 8b]. You
could include all three bytes in the set of space bytes but then other
characters that use these bytes such as Ƌ [c6, 8b] would also be
treated as at least partially spaces for word movement purposes.

   Neil

--~--~---------~--~----~------------~-------~--~----~
You received this message because you are subscribed to the Google Groups "scite-interest" group.
To post to this group, send email to scite-interest <at> googlegroups.com
To unsubscribe from this group, send email to scite-interest+unsubscribe <at> googlegroups.com
(Continue reading)


Gmane