To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂???ュ?喩?? 111001010100000100111111001111110011111110000011100001010011111110011010011001110011111100111111 e5413f3f3f83853f9a673f3f
EUC-JP 蘂??嫄ュ?喩?? 1110100110100010001111110011111110001111101110101010000110100101111001010011111111010011110010000011111100111111 e9a23f3f8fbaa1a5e53fd3c83f3f
UTF-8 蘂띠꼳嫄ュㄾ喩볦뵫 111010001001100010000010111010111001110110100000111010101011110010110011111001011010101110000100111000111000001110100101111000111000010010111110111001011001011010101001111010111011001110100110111010111011010110101011 e89882eb9da0eabcb3e5ab84e383a5e384bee596a9ebb3a6ebb5ab
UHC 蘂띠꼳嫄ュㄾ喩볦뵫 111001111101111010110110111011001000010010001100111010101011000110101011111001011010010010101110111010101110011110010011111011001001010010101001 e7deb6ec848ceab1abe5a4aeeae793ec94a9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)