To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鶯??艤??臆 11101001111100100011111100111111111001000111111000111111001111111000100110110000 e9f23f3fe47e3f3f89b0
EUC-JP 鶯??艤??臆 11110010111101000011111100111111111001111101111100111111001111111011001010110010 f2f43f3fe7df3f3fb2b2
UTF-8 鶯뺞찇艤꾢짆臆 111010011011011010101111111010111011101010011110111011001011000010000111111010001000100110100100111010101011111010100010111011001010011110000110111010001000011110000110 e9b6afebba9eecb087e889a4eabea2eca786e88786
UHC 鶯뺞찇艤꾢짆臆 1110010110100011100101011110011010101001100010111110101111111010100001001110010110100011100101011110010111100110 e5a395e6a98bebfa84e5a395e5e6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)