To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??i??iB | 00111111001111110110100100111111001111110110100101000010 | 3f3f693f3f6942 |
SJIS-WIN | 遇?i遇?iB | 100010111111011000111111011010011000101111110110001111110110100101000010 | 8bf63f698bf63f6942 |
EUC-JP | 遇?i遇?iB | 101101101111100000111111011010011011011011111000001111110110100101000010 | b6f83f69b6f83f6942 |
UTF-8 | 遇遼i遇遼iB | 111010011000000110000111111011111010011110000011011010011110100110000001100001111110111110100111100000110110100101000010 | e98187efa78369e98187efa7836942 |
UHC | 遇遼i遇遼iB | 1110100111100111111010011010110001101001111010011110011111101001101011000110100101000010 | e9e7e9ac69e9e7e9ac6942 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)