To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 竪棚造竪族其竪短 10010010010001111001001001001001100100011010001010010010010001111001000110110000100100011011010010010010010001111001001001011010 9247924991a2924791b091b49247925a
EUC-JP 竪棚造竪族其竪短 11000011101010001100001110101010110000101010010011000011101010001100001010110010110000101011011011000011101010001100001110111011 c3a8c3aac2a4c3a8c2b2c2b6c3a8c3bb
UTF-8 竪棚造竪族其竪短 111001111010101110101010111001101010001110011010111010011000000010100000111001111010101110101010111001101001011110001111111001011000010110110110111001111010101110101010111001111001111110101101 e7abaae6a39ae980a0e7abaae6978fe585b6e7abaae79fad
UHC 竪棚造竪族其竪短 11100010101101011101110111011100111100001110001111100010101101011111000011101001110100001110110011100010101101011101001110101101 e2b5dddcf0e3e2b5f0e9d0ece2b5d3ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)