To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
| Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
| SJIS-WIN | 寃?際?終孟 | 10011011100000110011111110001101110110110011111110001111010010011001011011010000 | 9b833f8ddb3f8f4996d0 |
| EUC-JP | 寃?際?終孟 | 11010101111000110011111110111010110111010011111110111101101010101100110011010010 | d5e33fbadd3fbdaaccd2 |
| UTF-8 | 寃렭際렑終孟 | 111001011010111110000011111010111010000010101101111010011001101010011011111010111010000010010001111001111011010110000010111001011010110110011111 | e5af83eba0ade99a9beba091e7b582e5ad9f |
| UHC | 寃렭際렑終孟 | 111010101011001010001110101110101111000010110111100011101010011011110000111110111101100011101011 | eab28ebaf0b78ea6f0fbd8eb |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)