To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ???Lh???L | 001111110011111100111111010011000110100000111111001111110011111101001100 | 3f3f3f4c683f3f3f4c |
SJIS-WIN | ?ν?Lh?ν?L | 0011111110000011110010110011111101001100011010000011111110000011110010110011111101001100 | 3f83cb3f4c683f83cb3f4c |
EUC-JP | ?ν?Lh?ν?L | 0011111110100110110011010011111101001100011010000011111110100110110011010011111101001100 | 3fa6cd3f4c683fa6cd3f4c |
UTF-8 | 䮤ν릢Lh䮤ν릢L | 11100100101011101010010011001110101111011110101110100110101000100100110001101000111001001010111010100100110011101011110111101011101001101010001001001100 | e4aea4cebdeba6a24c68e4aea4cebdeba6a24c |
UHC | ?ν릢Lh?ν릢L | 00111111101001011110110110010000100001000100110001101000001111111010010111101101100100001000010001001100 | 3fa5ed90844c683fa5ed90844c |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)