To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 癲???熱??乙 1110000110011111001111110011111100111111100101000100110100111111001111111000100110110011 e19f3f3f3f944d3f3f89b3
EUC-JP 癲???熱??乙 1110001010100001001111110011111100111111110001111010111000111111001111111011001010110101 e2a13f3f3fc7ae3f3fb2b5
UTF-8 癲ㅡ꾨퓘熱듬떻乙 111001111001100110110010111000111000010110100001111010101011111010101000111011011001001110011000111001111000011010110001111010111001001110101100111010111001011010111011111001001011100110011001 e799b2e385a1eabea8ed9398e786b1eb93aceb96bbe4b999
UHC 癲ㅡ꾨퓘熱듬떻乙 11101111101001101010010011010001100001001110101110111111100000111110011011110000101101011110101110110110101110111110101111100000 efa6a4d184ebbf83e6f0b5ebb6bbebe0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)