To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
| Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????O | 001111110011111100111111001111110011111101001111 | 3f3f3f3f3f4f |
| SJIS-WIN | 鉗キ螢甫O | 11100111111001111011011111100101101000111001010111100001111100101000001101001111 | e7e7b7e5a395e1f2834f |
| EUC-JP | 鉗キ螢甫?O | 11101110111010011000111010110111111010101010010111001010111000110011111101001111 | eee98eb7eaa5cae33f4f |
| UTF-8 | 鉗キ螢甫O | 11101001100010011001011111101111101111011011011111101000100111101010001011100111100101001010101111101110100001101011101001001111 | e98997efbdb7e89ea2e794abee86ba4f |
| UHC | 鉗?螢甫?O | 110011001100011000111111111110111010101111011100110010110011111101001111 | ccc63ffbabdccb3f4f |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)