To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??}i??}iB | 001111110011111101111101011010010011111100111111011111010110100101000010 | 3f3f7d693f3f7d6942 |
SJIS-WIN | 竺辞}i竺辞}iB | 10001110101100011000111010101011011111010110100110001110101100011000111010101011011111010110100101000010 | 8eb18eab7d698eb18eab7d6942 |
EUC-JP | 竺辞}i竺辞}iB | 10111100101100111011110010101101011111010110100110111100101100111011110010101101011111010110100101000010 | bcb3bcad7d69bcb3bcad7d6942 |
UTF-8 | 竺辞}i竺辞}iB | 1110011110101011101110101110100010111110100111100111110101101001111001111010101110111010111010001011111010011110011111010110100101000010 | e7abbae8be9e7d69e7abbae8be9e7d6942 |
UHC | 竺?}i竺?}iB | 1111010111100111001111110111110101101001111101011110011100111111011111010110100101000010 | f5e73f7d69f5e73f7d6942 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)