What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??載麥??嫉??融 0011111100111111100011011101101011101010011011010011111100111111100011101011100100111111001111111001011101011010 3f3f8ddaea6d3f3f8eb93f3f975a
EUC-JP 塡?載麥??嫉??融 10001111101110001011010000111111101110101101110011110011110011100011111100111111101111001011101100111111001111111100110110111011 8fb8b43fbadcf3ce3f3fbcbb3f3fcdbb
UTF-8 塡렯載麥뤂듸嫉쇼겝融 111001011010000110100001111010111010000010101111111010001011110010001001111010011011101010100101111010111010010010000010111010111001001110111000111001011010101110001001111011001000011110111100111010101011001010011101111010001001111010001101 e5a1a1eba0afe8bc89e9baa5eba482eb93b8e5ab89ec87bceab29de89e8d
UHC 塡렯載麥뤂듸嫉쇼겝融 1110111011110011100011101011110011101110101100001101100011101010100011111011001110110101111011111111001011101100101111001110111010110000110110001110101111010111 eef38ebceeb0d8ea8fb3b5eff2ecbceeb0d8ebd7

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively. A character that a charset cannot represent is shown as "?" (byte 3f).
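
The table above can be approximated offline with a few lines of Python. This is only a sketch, not the converter's actual implementation: the function name encode_table is hypothetical, and the codec names are assumptions about which Python codecs correspond to the page's charsets (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC). Python's mapping tables may differ from the converter's for some characters, so the bytes are not guaranteed to match the table in every case.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return one (label, shown_text, binary, hex) row per charset.

    Characters a charset cannot represent are replaced with "?" (byte 0x3f),
    mirroring the "?" entries in the table above.
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        # Decode again to show what survived the round trip.
        shown = raw.decode(codec)
        rows.append((label, shown, bits, raw.hex()))
    return rows
```

Calling encode_table with a short string returns the rows in the same column order as the table: charset, character as it survives encoding, binary bit string, hexadecimal bit string.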