To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????B 0011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f42
SJIS-WIN 邪糅頸贓糘遮B 100011101101011111100010111100001110100011110010111001101101100111100010111100101111001011100010100011101101010101000010 8ed7e2f0e8f2e6d9e2f2f2e28ed542
EUC-JP 邪糅頸贓糘?遮B 1011110011011001111001001111001011110000111101001110110011011011111001001111010000111111101111001101011101000010 bcd9e4f2f0f4ecdbe4f43fbcd742
UTF-8 邪糅頸贓糘遮B 11101001100000101010101011100111101100111000010111101001101000001011100011101000101101001001001111100111101100111001100011101110100010001001100111101001100000011010111001000010 e982aae7b385e9a0b8e8b493e7b398ee8899e981ae42
UHC 邪?頸贓??遮B 110111101111011100111111110011001111001011101101111111000011111100111111111100111011010001000010 def73fccf2edfc3f3ff3b442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)