To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 懿????????B 1001110011110010001111110011111100111111001111110011111100111111001111110011111101000010 9cf23f3f3f3f3f3f3f3f42
EUC-JP 懿????????B 1101100011110100001111110011111100111111001111110011111100111111001111110011111101000010 d8f43f3f3f3f3f3f3f3f42
UTF-8 懿롫쨸殮㏃븥琉믥썻B 11100110100001111011111111101011101000011010101111101100101010001011100011101111101001101010010111100011100011111000001111101011101110001010010111101111101001111000110011101011101011111010010111101100100011011011101101000010 e687bfeba1abeca8b8efa6a5e38f83ebb8a5efa78cebafa5ec8dbb42
UHC 懿롫쨸殮㏃븥琉믥썻B 11101011111100111000111011101011101001001001001011100110111110011010011111101100100101011000111011101011101001001001001011100111100110111010011101000010 ebf38eeba492e6f9a7ec958eeba492e79ba742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)