To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ????v | 0011111100111111001111110011111101110110 | 3f3f3f3f76 |
SJIS-WIN | 畯?弔?v | 11111011011011110011111110010010101000100011111101110110 | fb6f3f92a23f76 |
EUC-JP | 畯?弔?v | 1000111111001101101110110011111111000100101001000011111101110110 | 8fcdbb3fc4a43f76 |
UTF-8 | 畯품弔렟v | 11100111100101011010111111101101100100101000100011100101101111001001010011101011101000001001111101110110 | e795afed9288e5bc94eba09f76 |
UHC | 畯품弔렟v | 111100011110000111000111101100001111000011000000100011101011000001110110 | f1e1c7b0f0c08eb076 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)