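How the same bit sequence is interpreted as characters under different character encodings
Character encoding | Byte sequence (hex) | Displayed characters |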
---|---|---|
ISO-8859-1 | c3a7c2a2c2a95cc3a7c2a2c2a95c40c3a7c2a2c2a95cc3a7c2a2c2a95c534d | Ã§Â¢Â©\Ã§Â¢Â©\@Ã§Â¢Â©\Ã§Â¢Â©\SM |
SJIS-WIN | c3a7c2a2c2a95cc3a7c2a2c2a95c40c3a7c2a2c2a95cc3a7c2a2c2a95c534d | ﾃｧﾂ｢ﾂｩ\ﾃｧﾂ｢ﾂｩ\@ﾃｧﾂ｢ﾂｩ\ﾃｧﾂ｢ﾂｩ\SM |
EUC-JP | c3a7c2a2c2a95cc3a7c2a2c2a95c40c3a7c2a2c2a95cc3a7c2a2c2a95c534d | 巽蔵息\巽蔵息\@巽蔵息\巽蔵息\SM |
UTF-8 | c3a7c2a2c2a95cc3a7c2a2c2a95c40c3a7c2a2c2a95cc3a7c2a2c2a95c534d | ç¢©\ç¢©\@ç¢©\ç¢©\SM |
UHC | c3a7c2a2c2a95cc3a7c2a2c2a95c40c3a7c2a2c2a95cc3a7c2a2c2a95c534d | 챌짖짤\챌짖짤\@챌짖짤\챌짖짤\SM |
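
A comparison like the one in the table can be sketched in a few lines of Python: take the hex string, turn it into raw bytes, and decode those bytes once per encoding. The codec names below are assumptions about the closest Python equivalents of the table labels (`cp932` for SJIS-WIN, `cp949` for UHC), and `decode_all` is a hypothetical helper name, not part of any tool referenced here.

```python
# Hex dump taken from the table above.
HEX = "c3a7c2a2c2a95cc3a7c2a2c2a95c40c3a7c2a2c2a95cc3a7c2a2c2a95c534d"

# Table label -> Python codec name (assumed closest match for each label).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Windows variant of Shift_JIS
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Unified Hangul Code
}


def decode_all(hex_string):
    """Decode the same bytes under every codec in CODECS.

    Bytes that are invalid in a given codec are replaced with U+FFFD
    instead of raising, so every row always produces some output.
    """
    raw = bytes.fromhex(hex_string)
    return {
        label: raw.decode(codec, errors="replace")
        for label, codec in CODECS.items()
    }


if __name__ == "__main__":
    for label, text in decode_all(HEX).items():
        print(f"{label:<10} | {text}")
```

Using `errors="replace"` is a design choice for display purposes; with the default `errors="strict"`, a byte sequence that is not valid in one of the encodings would raise `UnicodeDecodeError` instead of producing a row.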