To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
| Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??i??iB | 00111111001111110110100100111111001111110110100101000010 | 3f3f693f3f6942 |
| SJIS-WIN | 祀蜴i祀蜴iB | 1110001001001010111001011000111001101001111000100100101011100101100011100110100101000010 | e24ae58e69e24ae58e6942 |
| EUC-JP | 祀蜴i祀蜴iB | 1110001110101011111010011110111001101001111000111010101111101001111011100110100101000010 | e3abe9ee69e3abe9ee6942 |
| UTF-8 | 祀蜴i祀蜴iB | 111001111010010110000000111010001001110010110100011010011110011110100101100000001110100010011100101101000110100101000010 | e7a580e89cb469e7a580e89cb46942 |
| UHC | 祀?i祀?iB | 110111101110010100111111011010011101111011100101001111110110100101000010 | dee53f69dee53f6942 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)