同じビット列が文字コードによってどんな文字に解釈されるか
文字コード | ビット列 | 表示される文字 |
---|---|---|
ISO-8859-1 | c3a5c2a7c2b8c3a5c2a7c2b840c3a5c2a7c2b8c3a5c2a7c2b8534d | 姸姸@姸姸SM |
SJIS-WIN | c3a5c2a7c2b8c3a5c2a7c2b840c3a5c2a7c2b8c3a5c2a7c2b8534d | テ・ツァツクテ・ツァツク@テ・ツァツクテ・ツァツクSM |
EUC-JP | c3a5c2a7c2b8c3a5c2a7c2b840c3a5c2a7c2b8c3a5c2a7c2b8534d | 奪則存奪則存@奪則存奪則存SM |
UTF-8 | c3a5c2a7c2b8c3a5c2a7c2b840c3a5c2a7c2b8c3a5c2a7c2b8534d | 姸姸@姸姸SM |
UHC | c3a5c2a7c2b8c3a5c2a7c2b840c3a5c2a7c2b8c3a5c2a7c2b8534d | 책짠쨍책짠쨍@책짠쨍책짠쨍SM |