Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN 儼??揖??楡??B 10011001010101100011111100111111100101110100101100111111001111111001111010111110001111110011111101000010 99563f3f974b3f3f9ebe3f3f42
EUC-JP 儼??揖??楡??B 11010001101101110011111100111111110011011010110000111111001111111101110011000000001111110011111101000010 d1b73f3fcdac3f3fdcc03f3f42
UTF-8 儼볥슪揖썸뮄楡녹돺B 11100101100001001011110011101011101100111010010111101100100010101010101011100110100011111001011011101100100011011011100011101011101011101000010011100110101001011010000111101011100001011011100111101011100011111011101001000010 e584bcebb3a5ec8aaae68f96ec8db8ebae84e6a5a1eb85b9eb8fba42
UHC 儼볥슪揖썸뮄楡녹돺B 11100101111100001001001111101011100110101011001111101011111001111011110111100110100100101001001111101010111110001011001111101100100010011011110101000010 e5f093eb9ab3ebe7bde69293eaf8b3ec89bd42
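
A table like the one above can be reproduced with a short script. The following is a minimal sketch, not the converter's actual implementation: the Python codec names (for example cp932 for SJIS-WIN and cp949 for UHC) are assumptions about which codecs correspond to each charset, and the function names are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the pattern seen in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("楡B").items():
        print(name, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.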

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).