It also wastes valuable code space that could be used to have more characters encode as 3 bytes instead of 4 in UTF-8.
It also wastes valuable code space that could be used to have more characters encode as 3 bytes instead of 4 in UTF-8.