To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 遶ェ豺。蓁ウ隱ー鞜懈クャ遶ェ驕懆エ郁ェー鞜懷 11100111101010111010101011100110101101111010000111100100111010111011001111101000101010101011000011101000110111111001110011100110101110001010110011100111101010111010101011101001100000011001110011101000101101001000100011101000101010101011000011101000110111111001110011100101 e7abaae6b7a1e4ebb3e8aab0e8df9ce6b8ace7abaae9819ce8b488e8aab0e8df9ce5
EUC-JP 遶ェ豺。蓁ウ隱ー鞜懈クャ遶ェ驕懆エ郁ェー鞜懷 1110111010101101100011101010101011101100101110011000111010100001111010001110110110001110101100111111000010101100100011101011000011110000111000011101100011101000100011101011100010001110101011001110111010101101100011101010101011110001111000011101100011101010100011101011010010110000111010101000111010101010100011101011000011110000111000011101100011100111 eead8eaaecb98ea1e8ed8eb3f0ac8eb0f0e1d8e88eb88eaceead8eaaf1e1d8ea8eb4b0ea8eaa8eb0f0e1d8e7
UTF-8 遶ェ豺。蓁ウ隱ー鞜懈クャ遶ェ驕懆エ郁ェー鞜懷 111010011000000110110110111011111011110110101010111010001011000110111010111011111011110110100001111010001001001110000001111011111011110110110011111010011001101010110001111011111011110110110000111010011001111010011100111001101000011110001000111011111011110110111000111011111011110110101100111010011000000110110110111011111011110110101010111010011010100110010101111001101000011110000110111011111011110110110100111010011000001110000001111011111011110110101010111011111011110110110000111010011001111010011100111001101000011110110111 e981b6efbdaae8b1baefbda1e89381efbdb3e99ab1efbdb0e99e9ce68788efbdb8efbdace981b6efbdaae9a995e68786efbdb4e98381efbdaaefbdb0e99e9ce687b7
UHC ??豺???隱??懈????驕??郁???懷 00111111001111111110001111001111001111110011111100111111111010111101111100111111001111111111101010101011001111110011111100111111001111111100111011110110001111110011111111101001111101000011111100111111001111111111110011100011 3f3fe3cf3f3f3febdf3f3ffaab3f3f3f3fcef63f3fe9f43f3f3ffce3

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
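
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the charset labels above to Python codec names is an assumption (in particular, UHC is taken to be `cp949`), and the function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table above.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",  # assumption: UHC = Unified Hangul Code = CP949
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become b"?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.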