To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??孺??柔??筌??竊??惟??億??愉 111010100101111100111111001111111001101101111101001111110011111110001111010111110011111100111111111000101010001100111111001111111110001010000110001111110011111110001000110100100011111100111111100010011010110100111111001111111001011011111001 ea5f3f3f9b7d3f3f8f5f3f3fe2a33f3fe2863f3f88d23f3f89ad3f3f96f9
EUC-JP 鸚??孺??柔??筌??竊??惟??億??愉 111100111100000000111111001111111101010111011110001111110011111110111101110000000011111100111111111001001010010100111111001111111110001111100110001111110011111110110000110101000011111100111111101100101010111100111111001111111100110011111011 f3c03f3fd5de3f3fbdc03f3fe4a53f3fe3e63f3fb0d43f3fb2af3f3fccfb
UTF-8 鸚쒓퍔孺얏젔柔㏃낄筌뚯궡竊믥뙠惟깅튂億됰뀥愉 111010011011100010011010111011001001001010010011111011011000110110010100111001011010110110111010111011001001011010001111111011001010000010010100111001101001111110010100111000111000111110000011111010111000001010000100111001111010110110001100111010111001101010101111111010101011011010100001111001111010101110001010111010111010111110100101111010111001100110100000111001101000001110011111111010101011100110000101111011011000101010000010111001011000010010000100111010111001000010110000111010111000000010100101111001101000010010001001 e9b89aec9293ed8d94e5adbaec968feca094e69f94e38f83eb8284e7ad8ceb9aafeab6a1e7ab8aebafa5eb99a0e6839feab985ed8a82e58484eb90b0eb80a5e68489
UHC 鸚쒓퍔孺얏젔柔㏃낄筌뚯궡竊믥뙠惟깅튂億됰뀥愉 1110010110100100100111001110101010111011100010111110101011101000101111101110011010100000100100101110101011110101101001111110110010110011101001011110111110100111100011001110110010000010101101001110111110111100100100101110011110001100101001011110101011101110101100011110101110111001100110001110010111100010100010011110101110000101100111001110101011110000 e5a49ceabb8beae8bee6a092eaf5a7ecb3a5efa78cec82b4efbc92e78ca5eaeeb1ebb998e5e289eb859ceaf0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)