Which bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 箏??麥??? 111000101011010100111111001111111110101001101101001111110011111100111111 e2b53f3fea6d3f3f3f
EUC-JP 箏?薏麥?醬? 11100100101101110011111110001111110110011101111011110011110011100011111110001111111000101111001100111111 e4b73f8fd9def3ce3f8fe2f33f
UTF-8 箏렖薏麥삯醬렊 111001111010111010001111111010111010000010010110111010001001011010001111111010011011101010100101111011001000001010101111111010011000011010101100111010111010000010001010 e7ae8feba096e8968fe9baa5ec82afe986aceba08a
UHC 箏렖薏麥삯醬렊 1110111010110100100011101010101111101011111110111101100011101010101110111110100111101101111111011000111010100001 eeb48eabebfbd8eabbe9edfd8ea1

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
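
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the `CODECS` mapping and the function names `to_bit_strings` and `convert` are hypothetical, and the choice of Python codec for each label (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f) here, which may differ from how this page handles them, so results for unmappable characters are not guaranteed to match the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("A").items():
        print(label, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.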