What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????^ | 001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f5e |
| SJIS-WIN | ?害?宏?害?槐^ | 00111111100010100101000100111111100011010100011100111111100010100101000100111111100111101100010101011110 | 3f8a513f8d473f8a513f9ec55e |
| EUC-JP | ?害?宏?害?槐^ | 00111111101100111011001000111111101110011010100000111111101100111011001000111111110111001100011101011110 | 3fb3b23fb9a83fb3b23fdcc75e |
| UTF-8 | 뤒害녑宏뤒害녑槐^ | 11101011101001001001001011100101101011101011001111101011100001011001000111100101101011101000111111101011101001001001001011100101101011101011001111101011100001011001000111100110101001111001000001011110 | eba492e5aeb3eb8591e5ae8feba492e5aeb3eb8591e6a7905e |
| UHC | 뤒害녑宏뤒害녑槐^ | 1000111111000010111110101010101010110011111001011100111011011011100011111100001011111010101010101011001111100101110011101101100101011110 | 8fc2faaab3e5cedb8fc2faaab3e5ced95e |
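The conversion shown in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from charset names to Python codecs (`cp932` for SJIS-WIN, `cp949` for UHC, `euc_jp` for EUC-JP) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the question marks in the table; individual byte values may still differ from the page if its converter uses a slightly different variant of a charset.

```python
# Display name -> Python codec name. These pairings are assumptions;
# the page's own converter may use slightly different charset variants.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text: str, codec: str) -> tuple[str, str, str]:
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = data.decode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


def encode_table(text: str) -> dict[str, tuple[str, str, str]]:
    """Encode the same text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexa) in encode_table("害^").items():
        print(name, shown, bits, hexa)
```

Passing `errors="replace"` is what keeps the row lengths comparable across charsets: every input character yields at least one output byte, so a lossy charset such as ISO-8859-1 still produces a full row.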

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).