To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???毅??靭??幼??娃??B 0011111100111111001111111000101101000010001111110011111110010000011110000011111100111111100101110110001100111111001111111000100010100001001111110011111101000010 3f3f3f8b423f3f90783f3f97633f3f88a13f3f42
EUC-JP ???毅??靭??幼??娃??B 0011111100111111001111111011010110100011001111110011111110111111110110010011111100111111110011011100010000111111001111111011000010100011001111110011111101000010 3f3f3fb5a33f3fbfd93f3fcdc43f3fb0a33f3f42
UTF-8 閱묐똻毅껈걬靭붺븨幼곸쁼娃뺥뇯B 11101001100101101011000111101011101011001001000011101011100110001011101111100110101011111000010111101010101110111000100011101010101100011010110011101001100111011010110111101011101101101011101011101011101110001010100011100101101110011011110011101010101100111011100011101100100000011011110011100101101010001000001111101011101110101010010111101011100001111010111101000010 e996b1ebac90eb98bbe6af85eabb88eab1ace99dadebb6baebb8a8e5b9bceab3b8ec81bce5a883ebbaa5eb87af42
UHC 閱묐똻毅껈걬靭붺븨幼곸쁼娃뺥뇯B 11100110111100111001000111101011100011001000000111101011111101101000001111101001100000011001010111101100111001011001010011100111100101011001000111101010111010101000000111101100100110001000001111101000110111111001010111101101100001111001010001000010 e6f391eb8c81ebf683e98195ece594e79591eaea81ec9883e8df95ed879442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)