Which bit string is a character (or short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??W??z | 001111110011111101010111001111110011111101111010 | 3f3f573f3f7a |
SJIS-WIN | ﾂ額Wﾂ額z | 1100001010001010011110100101011111000010100010100111101001111010 | c28a7a57c28a7a7a |
EUC-JP | ﾂ額Wﾂ額z | 10001110110000101011001111011011010101111000111011000010101100111101101101111010 | 8ec2b3db578ec2b3db7a |
UTF-8 | ﾂ額Wﾂ額z | 1110111110111110100000101110100110100001100011010101011111101111101111101000001011101001101000011000110101111010 | efbe82e9a18d57efbe82e9a18d7a |
UHC | ?額W?額z | 0011111111100100111111100101011100111111111001001111111001111010 | 3fe4fe573fe4fe7a |
SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
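
The table rows can be approximated offline with a short Python sketch. This is not the converter behind this page. The helper name `encode_row` and the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. The input is assumed to start with half-width katakana U+FF82, which is what the UTF-8 bytes `efbe82` decode to. Characters a charset cannot represent are replaced with `?` (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    # U+FF82 (half-width katakana) and U+984D, written as escapes so the
    # source file's own encoding does not matter.
    text = "\uff82\u984dW\uff82\u984dz"
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(text, codec)
        print(f"{label} | {bits} | {hex_string} |")
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.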