To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | ??載麥??嫉?? | 001111110011111110001101110110101110101001101101001111110011111110001110101110010011111100111111 | 3f3f8ddaea6d3f3f8eb93f3f
EUC-JP | 塡?載麥??嫉?? | 1000111110111000101101000011111110111010110111001111001111001110001111110011111110111100101110110011111100111111 | 8fb8b43fbadcf3ce3f3fbcbb3f3f
UTF-8 | 塡렯載麥뤂듸嫉쇼겝 | 111001011010000110100001111010111010000010101111111010001011110010001001111010011011101010100101111010111010010010000010111010111001001110111000111001011010101110001001111011001000011110111100111010101011001010011101 | e5a1a1eba0afe8bc89e9baa5eba482eb93b8e5ab89ec87bceab29d
UHC | 塡렯載麥뤂듸嫉쇼겝 | 111011101111001110001110101111001110111010110000110110001110101010001111101100111011010111101111111100101110110010111100111011101011000011011000 | eef38ebceeb0d8ea8fb3b5eff2ecbceeb0d8

SJIS-WIN, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
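
The kind of conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation: the codec names in `CHARSETS` (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumed to be Python's closest equivalents, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which mirrors the `3f` bytes in the table, although the exact mapping tables may differ from those of the original tool.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are Python's closest equivalents, not necessarily the tool's tables.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and collect the results by label."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("あ").items():
        print(name, bits, hex_string)
```

Each byte is written as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.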