To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN ?曉???曉??^ 0011111110011101111110100011111100111111001111111001110111111010001111110011111101011110 3f9dfa3f3f3f9dfa3f3f5e
EUC-JP ?曉???曉??^ 0011111111011010111111000011111100111111001111111101101011111100001111110011111101011110 3fdafc3f3f3fdafc3f3f5e
UTF-8 뤌曉ㄿ낚뤌曉ㄿ낚^ 11101011101001001000110011100110100110111000100111100011100001001011111111101011100000101001101011101011101001001000110011100110100110111000100111100011100001001011111111101011100000101001101001011110 eba48ce69b89e384bfeb829aeba48ce69b89e384bfeb829a5e
UHC 뤌曉ㄿ낚뤌曉ㄿ낚^ 1000111110111100111111001111101110100100101011111011001110101100100011111011110011111100111110111010010010101111101100111010110001011110 8fbcfcfba4afb3ac8fbcfcfba4afb3ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)