What bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??}i??}iB | 001111110011111101111101011010010011111100111111011111010110100101000010 | 3f3f7d693f3f7d6942 |
SJIS-WIN | ?鴦}i?鴦}iB | 0011111111101001111100010111110101101001001111111110100111110001011111010110100101000010 | 3fe9f17d693fe9f17d6942 |
EUC-JP | ?鴦}i?鴦}iB | 0011111111110010111100110111110101101001001111111111001011110011011111010110100101000010 | 3ff2f37d693ff2f37d6942 |
UTF-8 | 맦鴦}i맦鴦}iB | 1110101110100111101001101110100110110100101001100111110101101001111010111010011110100110111010011011010010100110011111010110100101000010 | eba7a6e9b4a67d69eba7a6e9b4a67d6942 |
UHC | 맦鴦}i맦鴦}iB | 10010000101011111110010011101100011111010110100110010000101011111110010011101100011111010110100101000010 | 90afe4ec7d6990afe4ec7d6942 |
SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
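
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation, which is not shown here. It assumes that Python's standard codecs `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed, and that unencodable characters are replaced with `?`. The names `CODECS` and `to_bit_strings` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the
    # charset cannot represent, instead of raising UnicodeEncodeError.
    data = text.encode(codec, errors="replace")
    # Render each byte as exactly eight binary digits, zero-padded.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("맦鴦}i맦鴦}iB", codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

With ISO-8859-1, neither 맦 nor 鴦 can be represented, so each becomes `?` (0x3f), which is consistent with the `3f3f7d693f3f7d6942` row in the table.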