To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ???h??? | 00111111001111110011111101101000001111110011111100111111 | 3f3f3f683f3f3f |
SJIS-WIN | 馭?Шh馭?Ш | 1110100101100110001111111000010001011001011010001110100101100110001111111000010001011001 | e9663f845968e9663f8459 |
EUC-JP | 馭?Шh馭?Ш | 1111000111000111001111111010011110111010011010001111000111000111001111111010011110111010 | f1c73fa7ba68f1c73fa7ba |
UTF-8 | 馭곸Шh馭곸Ш | 1110100110100110101011011110101010110011101110001101000010101000011010001110100110100110101011011110101010110011101110001101000010101000 | e9a6adeab3b8d0a868e9a6adeab3b8d0a8 |
UHC | 馭곸Шh馭곸Ш | 11100101110111111000000111101100101011001011101001101000111001011101111110000001111011001010110010111010 | e5df81ecacba68e5df81ecacba |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)