To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 倭??饒??鼇??倭↑ 10011000011000000011111100111111111010010110000000111111001111111110101010000111001111110011111110011000011000001000000110101010 98603f3fe9603f3fea873f3f986081aa
EUC-JP 倭??饒??鼇??倭↑ 11001111110000010011111100111111111100011100000100111111001111111111001111100111001111110011111111001111110000011010001010101100 cfc13f3ff1c13f3ff3e73f3fcfc1a2ac
UTF-8 倭랃슭饒뽳슴鼇묋콖倭↑ 111001011000000010101101111010111001111010000011111011001000101010101101111010011010010110010010111010111011110110110011111011001000101010110100111010011011110010000111111010111010110010001011111011001011110110010110111001011000000010101101111000101000011010010001 e580adeb9e83ec8aade9a592ebbdb3ec8ab4e9bc87ebac8becbd96e580ade28691
UHC 倭랃슭饒뽳슴鼇묋콖倭↑ 11101000110111101000110111101111101111011011111011101001101011101001011011101111101111011011111111101000101010001001000111101000101100011001000011101000110111101010000111101000 e8de8defbdbee9ae96efbdbfe8a891e8b190e8dea1e8

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
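
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the codec names are Python's own (`cp932` for SJIS-Win, `cp949` for UHC). It assumes that characters a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1 row above suggests.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters become '?' (0x3f); this is an assumption
    about the tool's behavior, not something the page documents.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "倭↑"
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the input `倭↑`, this sketch yields `986081aa` for SJIS-WIN, `cfc1a2ac` for EUC-JP, and `e580ade28691` for UTF-8, matching the first and last characters of the corresponding rows above.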