To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈔ア蟆雁、夊ェー鞜懆鳩遶ェ螟壼ュォ隱ー鞜懆エ 1110011111100010101100011110010110110000100010101110010110100100100110101110100010101010101100001110100011011111100111001110100010010100101101011110011110101011101010101110010110100100100110101110010110101101101010111110100010101010101100001110100011011111100111001110100010110100 e7e2b1e5b08ae5a49ae8aab0e8df9ce894b5e7abaae5a49ae5adabe8aab0e8df9ce8b4
EUC-JP 鈔ア蟆雁、夊ェー鞜懆鳩遶ェ螟壼ュォ隱ー鞜懆エ 1110111011100100100011101011000111101010101100101011010011100111100011101010010011010100111010101000111010101010100011101011000011110000111000011101100011101010110010001011011111101110101011011000111010101010111010101010011011010100111001111000111010101101100011101010101111110000101011001000111010110000111100001110000111011000111010101000111010110100 eee48eb1eab2b4e78ea4d4ea8eaa8eb0f0e1d8eac8b7eead8eaaeaa6d4e78ead8eabf0ac8eb0f0e1d8ea8eb4
UTF-8 鈔ア蟆雁、夊ェー鞜懆鳩遶ェ螟壼ュォ隱ー鞜懆エ 111010011000100010010100111011111011110110110001111010001001111110000110111010011001101110000001111011111011110110100100111001011010010010001010111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110000110111010011011001110101001111010011000000110110110111011111011110110101010111010001001111010011111111001011010001110111100111011111011110110101101111011111011110110101011111010011001101010110001111011111011110110110000111010011001111010011100111001101000011110000110111011111011110110110100 e98894efbdb1e89f86e99b81efbda4e5a48aefbdaaefbdb0e99e9ce68786e9b3a9e981b6efbdaae89e9fe5a3bcefbdadefbdabe99ab1efbdb0e99e9ce68786efbdb4
UHC ???雁??????鳩??螟???隱???? 0011111100111111001111111110010011010010001111110011111100111111001111110011111100111111110011111100110100111111001111111101100110101101001111110011111100111111111010111101111100111111001111110011111100111111 3f3f3fe4d23f3f3f3f3f3fcfcd3f3fd9ad3f3f3febdf3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)