To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ?莖?宏?莖?槐^ 00111111111001001011000100111111100011010100011100111111111001001011000100111111100111101100010101011110 3fe4b13f8d473fe4b13f9ec55e
EUC-JP ?莖芚宏?莖芚槐^ 0011111111101000101100111000111111010111101110111011100110101000001111111110100010110011100011111101011110111011110111001100011101011110 3fe8b38fd7bbb9a83fe8b38fd7bbdcc75e
UTF-8 뤾莖芚宏뤾莖芚槐^ 11101011101001001011111011101000100011101001011011101000100010101001101011100101101011101000111111101011101001001011111011101000100011101001011011101000100010101001101011100110101001111001000001011110 eba4bee88e96e88a9ae5ae8feba4bee88e96e88a9ae6a7905e
UHC 뤾莖芚宏뤾莖芚槐^ 1000111111101010110011001110110011010100111011001100111011011011100011111110101011001100111011001101010011101100110011101101100101011110 8feaccecd4eccedb8feaccecd4ecced95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)