To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 霓??違??矣??嶸→?揄??巍ル?愉 11101000101111010011111100111111100010001110000100111111001111111110000111100001001111110011111111111010101101001000000110101000001111111001110110001001001111110011111110011011110110011000001110001011001111111001011011111001 e8bd3f3f88e13f3fe1e13f3ffab481a83f9d893f3f9bd9838b3f96f9
EUC-JP 霓??違??矣??嶸→?揄??巍ル?愉 1111000010111111001111110011111110110000111000110011111100111111111000101110001100111111001111111000111110111011111101001010001010101010001111111101100111101001001111110011111111010110110110111010010111101011001111111100110011111011 f0bf3f3fb0e33f3fe2e33f3f8fbbf4a2aa3fd9e93f3fd6dba5eb3fccfb
UTF-8 霓낅뜄違울㎠矣섍콟嶸→꽴揄졼렍巍ル쵐愉 111010011001110010010011111010111000001010000101111010111001110010000100111010011000000110010101111011001001101010111000111000111000111010100000111001111001111110100011111011001000010010001101111011001011110110011111111001011011011010111000111000101000011010010010111010101011110110110100111001101000111110000100111011001010000110111100111010111010000010001101111001011011011110001101111000111000001110101011111011001011010110010000111001101000010010001001 e99c93eb8285eb9c84e98195ec9ab8e38ea0e79fa3ec848decbd9fe5b6b8e28692eabdb4e68f84eca1bceba08de5b78de383abecb590e68489
UHC 霓낅뜄違울㎠矣섍콟嶸→꽴揄졼렍巍ル쵐愉 1110011111100111100001011110101110001101100010001110101011011110101111111110111110100111101100101110101111111000100110001110101010110001100101111110011110101110101000011110011010000100101111111110101011110001101000001110001110001110101000111110100011100100101010111110101110101100100100101110101011110000 e7e785eb8d88eadebfefa7b2ebf898eab197e7aea1e684bfeaf1a0e38ea3e8e4abebac92eaf0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)