To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 陰??陰??歟?陰??陰??歟?陰??陰??歟? 100010010100000100111111001111111000100101000001001111110011111110011111011000100011111110001001010000010011111100111111100010010100000100111111001111111001111101100010001111111000100101000001001111110011111110001001010000010011111100111111100111110110001000111111 89413f3f89413f3f9f623f89413f3f89413f3f9f623f89413f3f89413f3f9f623f
EUC-JP 陰??陰??歟?陰??陰??歟?陰??陰??歟? 101100011010001000111111001111111011000110100010001111110011111111011101110000110011111110110001101000100011111100111111101100011010001000111111001111111101110111000011001111111011000110100010001111110011111110110001101000100011111100111111110111011100001100111111 b1a23f3fb1a23f3fddc33fb1a23f3fb1a23f3fddc33fb1a23f3fb1a23f3fddc33f
UTF-8 陰잛뀱陰쎌뀯歟칔陰잛뀱陰쎌뀯歟칈陰잛뀱陰쎌뀯歟칔 111010011001100110110000111011001001111010011011111010111000000010110001111010011001100110110000111011001000111010001100111010111000000010101111111001101010110110011111111011001011100110010100111010011001100110110000111011001001111010011011111010111000000010110001111010011001100110110000111011001000111010001100111010111000000010101111111001101010110110011111111011001011100110001000111010011001100110110000111011001001111010011011111010111000000010110001111010011001100110110000111011001000111010001100111010111000000010101111111001101010110110011111111011001011100110010100 e999b0ec9e9beb80b1e999b0ec8e8ceb80afe6ad9fecb994e999b0ec9e9beb80b1e999b0ec8e8ceb80afe6ad9fecb988e999b0ec9e9beb80b1e999b0ec8e8ceb80afe6ad9fecb994
UHC 陰잛뀱陰쎌뀯歟칔陰잛뀱陰쎌뀯歟칈陰잛뀱陰쎌뀯歟칔 111010111110010010011111111011001000010110100111111010111110010010111101111011001000010110100101111001101010001010101111011010111110101111100100100111111110110010000101101001111110101111100100101111011110110010000101101001011110011010100010101011110101100111101011111001001001111111101100100001011010011111101011111001001011110111101100100001011010010111100110101000101010111101101011 ebe49fec85a7ebe4bdec85a5e6a2af6bebe49fec85a7ebe4bdec85a5e6a2af59ebe49fec85a7ebe4bdec85a5e6a2af6b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)