How the same bit sequence is interpreted as different characters depending on the character encoding
Character encoding | Bit sequence (hex) | Displayed characters |
---|---|---|
ISO-8859-1 | c3a9c2adc2afc3aac2b0c2a45c40c3a9c2adc2afc3aac2b0c2a45c534d | Ã©Â­Â¯ÃªÂ°Â¤\@Ã©Â­Â¯ÃªÂ°Â¤\SM |
SJIS-WIN | c3a9c2adc2afc3aac2b0c2a45c40c3a9c2adc2afc3aac2b0c2a45c534d | ﾃｩﾂｭﾂｯﾃｪﾂｰﾂ､\@ﾃｩﾂｭﾂｯﾃｪﾂｰﾂ､\SM |
EUC-JP | c3a9c2adc2afc3aac2b0c2a45c40c3a9c2adc2afc3aac2b0c2a45c534d | 辿足俗棚属造\@辿足俗棚属造\SM |
UTF-8 | c3a9c2adc2afc3aac2b0c2a45c40c3a9c2adc2afc3aac2b0c2a45c534d | é¯ê°¤\@é¯ê°¤\SM |
UHC | c3a9c2adc2afc3aac2b0c2a45c40c3a9c2adc2afc3aac2b0c2a45c534d | 챕짯짱챗째짚\@챕짯짱챗째짚\SM |
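
The table can be reproduced with a minimal Python sketch that decodes the same bytes once per encoding. The mapping from the table's labels to Python codec names is an assumption (`cp932` for SJIS-WIN and `cp949` for UHC are the closest standard-library codecs), and `CODECS` and `decode_all` are illustrative names, not part of any tool referenced here.

```python
# Hex string copied from the table; every row uses exactly these bytes.
HEX = "c3a9c2adc2afc3aac2b0c2a45c40c3a9c2adc2afc3aac2b0c2a45c534d"
raw = bytes.fromhex(HEX)

# Table label -> Python codec name.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def decode_all(data: bytes) -> dict:
    """Decode the same byte string once per codec in CODECS."""
    return {label: data.decode(codec) for label, codec in CODECS.items()}


decoded = decode_all(raw)
for label, text in decoded.items():
    # The bytes never change; only the interpretation does.
    print(f"{label:<10} | {raw.hex()} | {text}")
```

The byte pair `c2 ad` decodes to U+00AD (soft hyphen) in UTF-8, and the single byte `ad` is also a soft hyphen in ISO-8859-1. That character is normally invisible, so the printed text can look shorter than the decoded string actually is.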