What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 額??屍??藥??B 10001010011110100011111100111111100011100111001000111111001111111110010101011010001111110011111101000010 8a7a3f3f8e723f3fe55a3f3f42
EUC-JP 額??屍??藥??B 10110011110110110011111100111111101110111101001100111111001111111110100110111011001111110011111101000010 b3db3f3fbbd33f3fe9bb3f3f42
UTF-8 額뗨뛝屍뜈괮藥럤뤇B 11101001101000011000110111101011100101111010100011101011100110111001110111100101101100011000110111101011100111001000100011101010101101001010111011101000100101111010010111101011100111111010010011101011101001001000011101000010 e9a18deb97a8eb9b9de5b18deb9c88eab4aee897a5eb9fa4eba48742
UHC 額뗨뛝屍뜈괮藥럤뤇B 11100100111111101000101111101000100011010101101011100011101110111000110110001011100000100101010111100101101101111000111010000111100011111011011101000010 e4fe8be88d5ae3bb8d8b8255e5b78e878fb742

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
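
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch and not the code behind this page: the function name to_bit_strings is hypothetical, the Python codec names chosen for each charset label are assumptions (cp932 for SJIS-WIN, cp949 for UHC), and unencodable characters are replaced with "?" (3f) on the assumption that this matches the behavior seen in the table.

```python
def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Characters the charset cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "額B"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Using errors="replace" keeps the output the same length in characters as the input, so a row full of 3f bytes signals that the charset has no mapping for those characters.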