To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
| Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
| SJIS-WIN | 鬲夥棘塚ォ豈 | 1110100110101101100110101110110010011110100110011111101010011100101010111110011010101111 | e9ad9aec9e99fa9cabe6af |
| EUC-JP | 鬲夥棘?ォ豈 | 1111001010101111110101001110111011011011111110010011111110001110101010111110110010110001 | f2afd4eedbf93f8eabecb1 |
| UTF-8 | 鬲夥棘塚ォ豈 | 111010011010110010110010111001011010010010100101111001101010001110011000111011111010100010010000111011111011110110101011111010001011000110001000 | e9acb2e5a4a5e6a398efa890efbdabe8b188 |
| UHC | ??棘??豈 | 0011111100111111110100001011111000111111001111111101000111000010 | 3f3fd0be3f3fd1c2 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)