What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??憶?????押 0011111100111111100010011010111100111111001111110011111100111111001111111000100110011111 3f3f89af3f3f3f3f3f899f
EUC-JP 邕?憶?沅?邕?押 1000111111100001111011010011111110110010101100010011111110001111110001101110100100111111100011111110000111101101001111111011001010100001 8fe1ed3fb2b13f8fc6e93f8fe1ed3fb2a1
UTF-8 邕렋憶렰沅렋邕렋押 111010011000001010010101111010111010000010001011111001101000011010110110111010111010000010110000111001101011001010000101111010111010000010001011111010011000001010010101111010111010000010001011111001101000101010111100 e98295eba08be686b6eba0b0e6b285eba08be98295eba08be68abc
UHC 邕렋憶렰沅렋邕렋押 111010001011101110001110101000101110010111100011100011101011110111101010101101101000111010100010111010001011101110001110101000101110010011100011 e8bb8ea2e5e38ebdeab68ea2e8bb8ea2e4e3

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
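
A conversion like the one in the table can be sketched in Python as follows. This is a minimal illustration, not the code behind this page: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and the Python codecs may differ from this page's converter for some rarely used characters. Characters that a charset cannot represent are replaced with "?" (3f), matching the behaviour seen in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For the single character "あ", this sketch yields e38182 in UTF-8, 82a0 in SJIS-WIN, and a4a2 in EUC-JP, while ISO-8859-1 falls back to 3f because it has no code for that character.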