To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??I???G | 00111111001111110100100100111111001111110011111101000111 | 3f3f493f3f3f47 |
SJIS-WIN | ?}I?ぁ粗G | 00111111100000010111000001001001001111111000001010011111100100010110010101000111 | 3f8170493f829f916547 |
EUC-JP | ?}I?ぁ粗G | 00111111101000011101000101001001001111111010010010100001110000011100011001000111 | 3fa1d1493fa4a1c1c647 |
UTF-8 | 룵}I룵ぁ粗G | 1110101110100011101101011110111110111101100111010100100111101011101000111011010111100011100000011000000111100111101100101001011101000111 | eba3b5efbd9d49eba3b5e38181e7b29747 |
UHC | 룵}I룵ぁ粗G | 100011111010101010100011111111010100100110001111101010101010101010100001111100001101100001000111 | 8faaa3fd498faaaaa1f0d847 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)