To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 脱袖俗脱尊揃竪賊 10010010010001011001000110110011100100011010110110010010010001011001000110111000100100011011010110010010010001111001000110101111 924591b391ad924591b891b5924791af
EUC-JP 脱袖俗脱尊揃竪賊 11000011101001101100001010110101110000101010111111000011101001101100001010111010110000101011011111000011101010001100001010110001 c3a6c2b5c2afc3a6c2bac2b7c3a8c2b1
UTF-8 脱袖俗脱尊揃竪賊 111010001000010010110001111010001010001010010110111001001011111110010111111010001000010010110001111001011011000010001010111001101000111110000011111001111010101110101010111010001011001110001010 e884b1e8a296e4bf97e884b1e5b08ae68f83e7abaae8b38a
UHC ?袖俗?尊?竪賊 00111111111000101100000011100001110101000011111111110000111011100011111111100010101101011110111011100100 3fe2c0e1d43ff0ee3fe2b5eee4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)