To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鴦??應?鴦??應?B 111010011111000100111111001111111001110011100100001111111110100111110001001111110011111110011100111001000011111101000010 e9f13f3f9ce43fe9f13f3f9ce43f42
EUC-JP 鴦??應?鴦??應?B 111100101111001100111111001111111101100011100110001111111111001011110011001111110011111111011000111001100011111101000010 f2f33f3fd8e63ff2f33f3fd8e63f42
UTF-8 鴦꾨끆應퍉鴦꾨끆應퍉B 11101001101101001010011011101010101111101010100011101011100000011000011011100110100001111000100111101101100011011000100111101001101101001010011011101010101111101010100011101011100000011000011011100110100001111000100111101101100011011000100101000010 e9b4a6eabea8eb8186e68789ed8d89e9b4a6eabea8eb8186e68789ed8d8942
UHC 鴦꾨끆應퍉鴦꾨끆應퍉B 111001001110110010000100111010111000010110111010111010111110101110111011011110101110010011101100100001001110101110000101101110101110101111101011101110110111101001000010 e4ec84eb85baebebbb7ae4ec84eb85baebebbb7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)