8 Oct 13:49
Re: Adding ZWSP to whitespace.characters
Neil Hodgson <scintilladotorg <at> gmail.com>
2008-10-08 11:49:10 GMT
2008-10-08 11:49:10 GMT
Roger Sperberg: > Whether it be a dot below the baseline or above the ascenders or a > line of some sort, do you think it would be possible to draw in > something (anything) over the text for a ZWSP, the way the dor is > drawn over the regular space? Its possible but you really should take a close look at the code to see if you can think of a reasonable mechanism. > However, I don't grasp the significance of what you're saying: that > is, I don't get how regular spaces are recognized if this distinction > disqualifies the ZWSP. (Is it that only characters in the first 256 > positions can be whitespace since they can be represented in a single > byte?) Most common encodings (apart from EBCDIC and escape based encodings) are supersets of ASCII so any byte <= 127 is identical in meaning to ASCII. Therefore byte value 0x20 is always a space. If encoded in UTF-8 a ZWSP is the three byte sequence [e2, 80, 8b]. You could include all three bytes in the set of space bytes but then other characters that use these bytes such as Ƌ [c6, 8b] would also be treated as at least partially spaces for word movement purposes. Neil --~--~---------~--~----~------------~-------~--~----~ You received this message because you are subscribed to the Google Groups "scite-interest" group. To post to this group, send email to scite-interest <at> googlegroups.com To unsubscribe from this group, send email to scite-interest+unsubscribe <at> googlegroups.com(Continue reading)
RSS Feed