To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????^ | 001111110011111100111111001111110011111101011110 | 3f3f3f3f3f5e |
SJIS-WIN | 驅?翼健德^ | 11101001011110110011111110010111100000111000110010010010111110101011101001011110 | e97b3f97838c92faba5e |
EUC-JP | 驅?翼健?^ | 111100011101110000111111110011011110001110110111111100100011111101011110 | f1dc3fcde3b7f23f5e |
UTF-8 | 驅뤥翼健德^ | 11101001101010011000010111101011101001001010010111100111101111111011110011100101100000011010010111100101101111101011011101011110 | e9a985eba4a5e7bfbce581a5e5beb75e |
UHC | 驅뤥翼健德^ | 1100111111001100100011111101001111101100110011111100101111101101110100111110110001011110 | cfcc8fd3eccfcbedd3ec5e |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)