To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ????? | 0011111100111111001111110011111100111111 | 3f3f3f3f3f |
SJIS-WIN | ?鴨?善洩 | 0011111110001010100110110011111110010001010100001000100101101011 | 3f8a9b3f9150896b |
EUC-JP | ?鴨侁善洩 | 00111111101100111111101110001111101100001111110011000001101100011011000111001100 | 3fb3fb8fb0fcc1b1b1cc |
UTF-8 | 卨鴨侁善洩 | 111001011000110110101000111010011011010010101000111001001011111010000001111001011001011010000100111001101011010010101001 | e58da8e9b4a8e4be81e59684e6b4a9 |
UHC | 卨鴨侁善洩 | 11100000110110011110010011100101111000111110000011100000101111001110000011011101 | e0d9e4e5e3e0e0bce0dd |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)