To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 儀??鴉?儀??鴉?B 100010110101011000111111001111111110100111101011001111111000101101010110001111110011111111101001111010110011111101000010 8b563f3fe9eb3f8b563f3fe9eb3f42
EUC-JP 儀??鴉?儀??鴉?B 101101011011011100111111001111111111001011101101001111111011010110110111001111110011111111110010111011010011111101000010 b5b73f3ff2ed3fb5b73f3ff2ed3f42
UTF-8 儀쒒죲鴉둰儀쒒죲鴉둰B 11100101100001001000000011101100100100101001001011101100101000111011001011101001101101001000100111101011100100011011000011100101100001001000000011101100100100101001001011101100101000111011001011101001101101001000100111101011100100011011000001000010 e58480ec9292eca3b2e9b489eb91b0e58480ec9292eca3b2e9b489eb91b042
UHC 儀쒒죲鴉둰儀쒒죲鴉둰B 111010111111000010011100111010011010000110001101111001001011110010001010011010011110101111110000100111001110100110100001100011011110010010111100100010100110100101000010 ebf09ce9a18de4bc8a69ebf09ce9a18de4bc8a6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)