To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 陞溷サ幄棔蟒嫂 11101000100111101001111111100101101110111001101111101000100111101001111111100101101110111001101101011110 e89e9fe5bb9be89e9fe5bb9b5e
EUC-JP 陞溷サ幄棔蟒嫂 1110111111111110110111101110011110001110101110111101011011101010110111001010000111101010101111011101010110111111 effedee78ebbd6eadca1eabdd5bf
UTF-8 陞溷サ幄棔蟒嫂 111010011001100110011110111001101011101010110111111011111011110110111011111001011011100110000100111001101010001110010100111010001001111110010010111001011010101110000010 e9999ee6bab7efbdbbe5b984e6a394e89f92e5ab82
UHC 陞??幄??嫂 11100011101100110011111100111111111001001100000100111111001111111110000111111001 e3b33f3fe4c13f3fe1f9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)