同じビット列が文字コードによってどんな文字に解釈されるか
文字コード | ビット列 | 表示される文字 |
---|---|---|
ISO-8859-1 | c3a4c3a2c3a4c39ac3a4c39843c3a4c3a2c3a4c39ac3a4c398434d | äâäÃäÃCäâäÃäÃCM |
SJIS-WIN | c3a4c3a2c3a4c39ac3a4c39843c3a4c3a2c3a4c39ac3a4c398434d | テ、テ「テ、テ堙、テ呂テ、テ「テ、テ堙、テ呂M |
EUC-JP | c3a4c3a2c3a4c39ac3a4c39843c3a4c3a2c3a4c39ac3a4c398434d | 辰但辰?辰?C辰但辰?辰?CM |
UTF-8 | c3a4c3a2c3a4c39ac3a4c39843c3a4c3a2c3a4c39ac3a4c398434d | äâäÚäØCäâäÚäØCM |
UHC | c3a4c3a2c3a4c39ac3a4c39843c3a4c3a2c3a4c39ac3a4c398434d | 채창채횣채횠C채창채횣채횠CM |