To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
SJIS-WIN | ???塵鼎菁 | 001111110011111100111111100100000110111110010011010000111110010011000101 | 3f3f3f906f9343e4c5 |
EUC-JP | ???塵鼎菁 | 001111110011111100111111101111111101000011000101101001001110100011000111 | 3f3f3fbfd0c5a4e8c7 |
UTF-8 | 뤯헤울塵鼎菁 | 111010111010010010101111111011011001011110100100111011001001101010111000111001011010000110110101111010011011110010001110111010001000111110000001 | eba4afed97a4ec9ab8e5a1b5e9bc8ee88f81 |
UHC | 뤯헤울塵鼎菁 | 100011111101110111000111111011001011111111101111111100101100100011110000101000111111010011101010 | 8fddc7ecbfeff2c8f0a3f4ea |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)