What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ?曉???曉??^ 0011111110011101111110100011111100111111001111111001110111111010001111110011111101011110 3f9dfa3f3f3f9dfa3f3f5e
EUC-JP ?曉???曉??^ 0011111111011010111111000011111100111111001111111101101011111100001111110011111101011110 3fdafc3f3f3fdafc3f3f5e
UTF-8 뤌曉ㄿ벤뤌曉ㄿ뱐^ 11101011101001001000110011100110100110111000100111100011100001001011111111101011101100101010010011101011101001001000110011100110100110111000100111100011100001001011111111101011101100011001000001011110 eba48ce69b89e384bfebb2a4eba48ce69b89e384bfebb1905e
UHC 뤌曉ㄿ벤뤌曉ㄿ뱐^ 1000111110111100111111001111101110100100101011111011101010100101100011111011110011111100111110111010010010101111101110011111010001011110 8fbcfcfba4afbaa58fbcfcfba4afb9f45e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (hex 3f) in the table above.
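
The conversion shown in the table can be roughly reproduced with a short Python sketch. This is not the code behind this page. The codec names in CODECS are an assumed mapping from the table's charset labels to Python's built-in codecs, and encode_row is a hypothetical helper name. Unencodable characters are replaced with "?", which matches the 3f bytes in the table, though individual byte values may differ between implementations for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(CODECS[charset], errors="replace")
    # Each byte becomes eight binary digits, zero-padded on the left.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    for name in CODECS:
        bits, hex_string = encode_row("曉^", name)
        print(name, bits, hex_string)
```

For example, encode_row("^", "UTF-8") returns ("01011110", "5e"), which is the final byte of every row in the table.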