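How the same bit sequence is interpreted as different characters depending on the character encoding

Character encoding | Bit sequence (hex) | Displayed characters |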
---|---|---|
ISO-8859-1 | c3a8c2abc2a2c3a5c2a3c2b1c3afc2bdc2bac3afc2bdc2a2c3a5c2a5 | è«¢å£±ï½ºï½¢å¥ |
SJIS-WIN | c3a8c2abc2a2c3a5c2a3c2b1c3afc2bdc2bac3afc2bdc2a2c3a5c2a5 | ティツォツ「テ・ツ」ツアテッツスツコテッツスツ「テ・ツ・ |
EUC-JP | c3a8c2abc2a2c3a5c2a3c2b1c3afc2bdc2bac3afc2bdc2a2c3a5c2a5 | 竪束蔵奪贈賊誰遜尊誰遜蔵奪促 |
UTF-8 | c3a8c2abc2a2c3a5c2a3c2b1c3afc2bdc2bac3afc2bdc2a2c3a5c2a5 | è«¢å£±ï½ºï½¢å¥ |
UHC | c3a8c2abc2a2c3a5c2a3c2b1c3afc2bdc2bac3afc2bdc2a2c3a5c2a5 | 챔짬짖책짙짹챦쩍쨘챦쩍짖책짜 |
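
The comparison in the table can be reproduced with a short script. The following is a minimal sketch, not the tool that generated the table: it assumes that the table's labels correspond to Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs, and the names `HEX_BYTES`, `CODECS`, and `decode_all` are hypothetical helpers introduced only for illustration. Because that mapping is an assumption, the output may not match the table character for character (for example, Python's `cp932` codec yields half-width katakana where the table shows full-width forms).

```python
# The hex string from the table above: one byte sequence, many readings.
HEX_BYTES = "c3a8c2abc2a2c3a5c2a3c2b1c3afc2bdc2bac3afc2bdc2a2c3a5c2a5"

# Table label -> Python codec name. This mapping is an assumption;
# the source does not say which converter produced the table.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def decode_all(hex_string: str) -> dict[str, str]:
    """Decode the same bytes once per codec and return label -> text."""
    raw = bytes.fromhex(hex_string)
    # errors="replace" substitutes U+FFFD for any byte a codec rejects,
    # so one invalid sequence does not abort the whole comparison.
    return {
        label: raw.decode(codec, errors="replace")
        for label, codec in CODECS.items()
    }


if __name__ == "__main__":
    for label, text in decode_all(HEX_BYTES).items():
        print(f"{label:<10} | {text}")
```

The bytes never change between rows; only the decoding rule does. Single-byte encodings such as ISO-8859-1 produce one character per byte, while multi-byte encodings group the same bytes into fewer characters.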