To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ????g | 0011111100111111001111110011111101100111 | 3f3f3f3f67 |
SJIS-WIN | ◆?日オg | 1000000110011111001111111001001111111010100000110100100101100111 | 819f3f93fa834967 |
EUC-JP | ◆?日オg | 1010001010100001001111111100011011111100101001011010101001100111 | a2a13fc6fca5aa67 |
UTF-8 | ◆룫日オg | 11100010100101111000011011101011101000111010101111100110100101111010010111100011100000101010101001100111 | e29786eba3abe697a5e382aa67 |
UHC | ◆룫日オg | 101000011101111110001111101000101110110011101101101010111010101001100111 | a1df8fa2ecedabaa67 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)