Which bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??i??iB | 00111111001111110110100100111111001111110110100101000010 | 3f3f693f3f6942 |
SJIS-WIN | 捨蛇i捨蛇iB | 1000111011001100100011101101011001101001100011101100110010001110110101100110100101000010 | 8ecc8ed6698ecc8ed66942 |
EUC-JP | 捨蛇i捨蛇iB | 1011110011001110101111001101100001101001101111001100111010111100110110000110100101000010 | bccebcd869bccebcd86942 |
UTF-8 | 捨蛇i捨蛇iB | 111001101000110110101000111010001001101110000111011010011110011010001101101010001110100010011011100001110110100101000010 | e68da8e89b8769e68da8e89b876942 |
UHC | 捨蛇i捨蛇iB | 1101111011010111110111101110111101101001110111101101011111011110111011110110100101000010 | ded7deef69ded7deef6942 |
SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP), respectively.
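
The conversion shown in the table can be reproduced offline with a short script. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names is an assumption (in particular `cp932` for SJIS-WIN and `cp949` for UHC), and `CODECS` and `encode_row` are hypothetical names chosen for illustration. It also assumes that characters a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1 row appears to show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's "cp932" codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's "cp949"
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight bits, zero-padded on the left.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "捨蛇i捨蛇iB"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(f"{label} | {sample} | {bits} | {hex_string} |")
```

The bit string is just the hexadecimal string written out four bits per digit, so the two columns always carry the same information.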