To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??c??????c 00111111001111110110001100111111001111110011111100111111001111110011111101100011 3f3f633f3f3f3f3f3f63
SJIS-WIN ??c?寶匡賻??c 00111111001111110110001100111111100110111000111110001011101001111110011011010000001111110011111101100011 3f3f633f9b8f8ba7e6d03f3f63
EUC-JP ??c?寶匡賻??c 00111111001111110110001100111111110101011110111110110110101010011110110011010010001111110011111101100011 3f3f633fd5efb6a9ecd23f3f63
UTF-8 렻렋c렺寶匡賻렻렋c 1110101110100000101110111110101110100000100010110110001111101011101000001011101011100101101011111011011011100101100011001010000111101000101100111011101111101011101000001011101111101011101000001000101101100011 eba0bbeba08b63eba0bae5afb6e58ca1e8b3bbeba0bbeba08b63
UHC 렻렋c렺寶匡賻렻렋c 100011101100001110001110101000100110001110001110110000101101110011000100110011101100010011011101101110001000111011000011100011101010001001100011 8ec38ea2638ec2dcc4cec4ddb88ec38ea263

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)