Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????j????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111011010100011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f6a3f3f3f3f3f
SJIS-WIN 嗚??揖?嗚???↑????j嗚???? 10011010011010100011111100111111100101110100101100111111100110100110101000111111001111110011111110000001101010100011111100111111001111110011111101101010100110100110101000111111001111110011111100111111 9a6a3f3f974b3f9a6a3f3f3f81aa3f3f3f3f6a9a6a3f3f3f3f
EUC-JP 嗚??揖?嗚?ł?↑????j嗚?ł?? 1101001111001011001111110011111111001101101011000011111111010011110010110011111110001111101010011100100000111111101000101010110000111111001111110011111100111111011010101101001111001011001111111000111110101001110010000011111100111111 d3cb3f3fcdac3fd3cb3f8fa9c83fa2ac3f3f3f3f6ad3cb3f8fa9c83f3f
UTF-8 嗚삳챿揖퓂嗚삳ł璘↑뮲紐뚯돽j嗚삳ł璘좦 1110010110010111100110101110110010000010101100111110110010110001101111111110011010001111100101101110110110010011100000101110010110010111100110101110110010000010101100111100010110000010111011111010011110101111111000101000011010010001111010111010111010110010111011111010011110001111111010111001101010101111111010111000111110111101011010101110010110010111100110101110110010000010101100111100010110000010111011111010011110101111111011001010001010100110 e5979aec82b3ecb1bfe68f96ed9382e5979aec82b3c582efa7afe28691ebaeb2efa78feb9aafeb8fbd6ae5979aec82b3c582efa7afeca2a6
UHC 嗚삳챿揖퓂嗚삳ł璘↑뮲紐뚯돽j嗚삳ł璘좦 111001111111000010111011111010111010101010001100111010111110011110111111011010101110011111110000101110111110101110101001101010011110110011011110101000011110100010010010101110111110101110101010100011001110110010001001101111110110101011100111111100001011101111101011101010011010100111101100110111101010000101000010 e7f0bbebaa8cebe7bf6ae7f0bbeba9a9ecdea1e892bbebaa8cec89bf6ae7f0bbeba9a9ecdea142

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
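
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page: the codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) are assumed mappings onto Python's standard codecs, the helper names `encode_row` and `convert` are made up for illustration, and replacing unmappable characters with "?" (0x3f) is an assumption chosen to match the 3f bytes visible in the table.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes "?" (byte 0x3f) for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in convert("あ").items():
        print(name, bits, hexstr)
```

With these assumptions, the letter "j" comes out as 6a in all five charsets, while the hiragana "あ" becomes 82a0 in CP932, a4a2 in EUC-JP, e38182 in UTF-8, and 3f in ISO-8859-1, which has no code for it.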