James Kingston Clarke

TIL About UTF-8 variable width encoding

UTF-8 encoded strings use variable width encoding; they are anywhere from 1-4 bytes to accomodate for all unicode characters.