Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

We can make it more fair by pricing UTF-8 bytes.

??? = 9 bytes

how are you = 11 bytes



view as:

per-byte pricing opens the pathway to posting images too!

not really. according to https://en.wikipedia.org/wiki/UTF-8 and https://en.wikipedia.org/wiki/File:Roadmap_to_Unicode_BMP.sv..., any codepoint bigger 0x80 (second half of the first box) is 2 byte per character, and codepoint bigger than 0x800 (anything past the 8th box) is 3 bytes charater. so while it might be fair for CJK languages, it's even less fair for languages that don't mostly use the latin alphabet.

That's true, considering the name of the website they would have the privilege of an even more expensive chat.

Legal | privacy