To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?B?v?B?vB | 001111110100001000111111011101100011111101000010001111110111011001000010 | 3f423f763f423f7642 |
SJIS-WIN | ?B乘v?B乘vB | 0011111101000010100110001010100101110110001111110100001010011000101010010111011001000010 | 3f4298a9763f4298a97642 |
EUC-JP | ?B乘v?B乘vB | 0011111101000010110100001010101101110110001111110100001011010000101010110111011001000010 | 3f42d0ab763f42d0ab7642 |
UTF-8 | 쀄B乘v쀄B乘vB | 1110110010000000100001000100001011100100101110011001100001110110111011001000000010000100010000101110010010111001100110000111011001000010 | ec808442e4b99876ec808442e4b9987642 |
UHC | 쀄B乘v쀄B乘vB | 10010111110001000100001011100011101010110111011010010111110001000100001011100011101010110111011001000010 | 97c442e3ab7697c442e3ab7642 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)