To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ???o???c | 0011111100111111001111110110111100111111001111110011111101100011 | 3f3f3f6f3f3f3f63 |
SJIS-WIN | 曜??o?⑤?c | 10010111011010100011111100111111011011110011111110000111010001000011111101100011 | 976a3f3f6f3f87443f63 |
EUC-JP | 曜??o???c | 110011011100101100111111001111110110111100111111001111110011111101100011 | cdcb3f3f6f3f3f3f63 |
UTF-8 | 曜쒕젡o曆⑤젨c | 1110011010011011100111001110110010010010100101011110110010100000101000010110111111101111101001101000101111100010100100011010010011101100101000001010100001100011 | e69b9cec9295eca0a16fefa68be291a4eca0a863 |
UHC | 曜쒕젡o曆⑤젨c | 1110100011111000100111001110101110100000100110100110111111100110101101111010100011101011101000001010000001100011 | e8f89ceba09a6fe6b7a8eba0a063 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)