What bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??愉??乙щ? 11100001100111110011111100111111100101101111100100111111001111111000100110110011100001001000101100111111 e19f3f3f96f93f3f89b3848b3f
EUC-JP 癲??愉??乙щ? 11100010101000010011111100111111110011001111101100111111001111111011001010110101101001111110101100111111 e2a13f3fccfb3f3fb2b5a7eb3f
UTF-8 癲싳뇴愉뺧쬆乙щ쐱 1110011110011001101100101110110010001011101100111110101110000111101101001110011010000100100010011110101110111010101001111110110010101100100001101110010010111001100110011101000110001001111011001001000010110001 e799b2ec8bb3eb87b4e68489ebbaa7ecac86e4b999d189ec90b1
UHC 癲싳뇴愉뺧쬆乙щ쐱 111011111010011010011010111011001000011110011000111010101111000010010101111011111010011010011101111010111110000010101100111010111001110010010100 efa69aec8798eaf095efa69debe0aceb9c94

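SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a character set cannot represent is replaced with "?" (hexadecimal 3f), as in the ISO-8859-1 row.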
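
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name `encode_row` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may differ from this page's converter for some characters. Unencodable characters are replaced with "?", which matches the 3f bytes seen above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: characters the codec cannot represent
    are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "癲"  # first character of the UTF-8 row in the table
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For example, `encode_row("癲", "utf-8")` yields the hex string `e799b2`, the first three bytes of the UTF-8 row above.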