To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??意??擬??如??異??受??鈺?И 1110010101000001001111110011111110001000110100110011111100111111100010110101101100111111001111111001010001000000001111110011111110001000110110010011111100111111100011101111001100111111001111111111101111000100001111111000010001001001 e5413f3f88d33f3f8b5b3f3f94403f3f88d93f3f8ef33f3ffbc43f8449
EUC-JP 蘂??意??擬??如??異??受??鈺?И 111010011010001000111111001111111011000011010101001111110011111110110101101111000011111100111111110001111010000100111111001111111011000011011011001111110011111110111100111101010011111100111111100011111110001111010101001111111010011110101010 e9a23f3fb0d53f3fb5bc3f3fc7a13f3fb0db3f3fbcf53f3f8fe3d53fa7aa
UTF-8 蘂띠눖意덄뙳擬뺣솾如붽퀣異룟슖受꿸틕鈺곕И 1110100010011000100000101110101110011101101000001110101110001000100101101110011010000100100011111110101110001101100001001110101110011001101100111110011010010011101011001110101110111010101000111110110010000110101111101110010110100110100000101110101110110110101111011110110110000000101000111110011110010101101100001110101110100011100111111110110010001010100101101110010110001111100101111110101010111111101110001110110110001011100101011110100110001000101110101110101010110011100101011101000010011000 e89882eb9da0eb8896e6848feb8d84eb99b3e693acebbaa3ec86bee5a682ebb6bded80a3e795b0eba39fec8a96e58f97eabfb8ed8b95e988baeab395d098
UHC 蘂띠눖意덄뙳擬뺣솾如붽퀣異룟슖受꿸틕鈺곕И 111001111101111010110110111011001000011110110000111010111111001010001000111001111000110010110110111010111111010010010101111010111001100110110010111001011111110110010100111010101011001110010111111011001011011010110111111001011001101010100101111000011111010010110010111010101011101010000011111010001010110110110000111010111010110010101010 e7deb6ec87b0ebf288e78cb6ebf495eb99b2e5fd94eab397ecb6b7e59aa5e1f4b2eaba83e8adb0ebacaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)