To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
SJIS-WIN | 際?池逗?池 | 10001101110110110011111110010010011100101001000010000000001111111001001001110010 | 8ddb3f927290803f9272 |
EUC-JP | 際?池逗?池 | 10111010110111010011111111000011110100111011111111100000001111111100001111010011 | badd3fc3d3bfe03fc3d3 |
UTF-8 | 際렑池逗벳池 | 111010011001101010011011111010111010000010010001111001101011000110100000111010011000000010010111111010111011001010110011111001101011000110100000 | e99a9beba091e6b1a0e98097ebb2b3e6b1a0 |
UHC | 際렑池逗벳池 | 111100001011011110001110101001101111001010101110110101001110100010111010101010101111001010101110 | f0b78ea6f2aed4e8baaaf2ae |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)