To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 閼ア蟆雁、夊ェー驕懆鳩遶ェ螟壼ュォ隱ー驕懆エ 1110100010000100101100011110010110110000100010101110010110100100100110101110100010101010101100001110100110000001100111001110100010010100101101011110011110101011101010101110010110100100100110101110010110101101101010111110100010101010101100001110100110000001100111001110100010110100 e884b1e5b08ae5a49ae8aab0e9819ce894b5e7abaae5a49ae5adabe8aab0e9819ce8b4
EUC-JP 閼ア蟆雁、夊ェー驕懆鳩遶ェ螟壼ュォ隱ー驕懆エ 1110111111100100100011101011000111101010101100101011010011100111100011101010010011010100111010101000111010101010100011101011000011110001111000011101100011101010110010001011011111101110101011011000111010101010111010101010011011010100111001111000111010101101100011101010101111110000101011001000111010110000111100011110000111011000111010101000111010110100 efe48eb1eab2b4e78ea4d4ea8eaa8eb0f1e1d8eac8b7eead8eaaeaa6d4e78ead8eabf0ac8eb0f1e1d8ea8eb4
UTF-8 閼ア蟆雁、夊ェー驕懆鳩遶ェ螟壼ュォ隱ー驕懆エ 111010011001011010111100111011111011110110110001111010001001111110000110111010011001101110000001111011111011110110100100111001011010010010001010111011111011110110101010111011111011110110110000111010011010100110010101111001101000011110000110111010011011001110101001111010011000000110110110111011111011110110101010111010001001111010011111111001011010001110111100111011111011110110101101111011111011110110101011111010011001101010110001111011111011110110110000111010011010100110010101111001101000011110000110111011111011110110110100 e996bcefbdb1e89f86e99b81efbda4e5a48aefbdaaefbdb0e9a995e68786e9b3a9e981b6efbdaae89e9fe5a3bcefbdadefbdabe99ab1efbdb0e9a995e68786efbdb4
UHC 閼??雁????驕?鳩??螟???隱?驕?? 1110010011011001001111110011111111100100110100100011111100111111001111110011111111001110111101100011111111001111110011010011111100111111110110011010110100111111001111110011111111101011110111110011111111001110111101100011111100111111 e4d93f3fe4d23f3f3f3fcef63fcfcd3f3fd9ad3f3f3febdf3fcef63f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)