To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??ビ⊂??咐??濯????ダ?瞼杷ダ◇? 001111110011111110000011011100101000000110111100001111110011111110011001111100110011111100111111100100011111001100111111001111110011111100111111100000110101111100111111111000011101100110010100011001101000001101011111100000011001111000111111 3f3f837281bc3f3f99f33f3f91f33f3f3f3f835f3fe1d99466835f819e3f
EUC-JP ??ビ⊂??咐??濯????ダ?瞼杷ダ◇? 001111110011111110100101110100111010001010111110001111110011111111010010111101010011111100111111110000101111010100111111001111110011111100111111101001011100000000111111111000101101101111000111110001111010010111000000101000011111111000111111 3f3fa5d3a2be3f3fd2f53f3fc2f53f3f3f3fa5c03fe2dbc7c7a5c0a1fe3f
UTF-8 룶핊ビ⊂룶웩咐룶웩濯룶엌룫횕ダ룴瞼杷ダ◇룫 111010111010001110110110111011011001010110001010111000111000001110010011111000101000101010000010111010111010001110110110111011001001101110101001111001011001001010010000111010111010001110110110111011001001101110101001111001101011111110101111111010111010001110110110111011001001011110001100111010111010001110101011111011011001101010010101111000111000001110000000111010111010001110110100111001111001111010111100111001101001110110110111111000111000001110000000111000101001011110000111111010111010001110101011 eba3b6ed958ae38393e28a82eba3b6ec9ba9e59290eba3b6ec9ba9e6bfafeba3b6ec978ceba3abed9a95e38380eba3b4e79ebce69db7e38380e29787eba3ab
UHC 룶핊ビ⊂룶웩咐룶웩濯룶엌룫횕ダ룴瞼杷ダ◇룫 100011111010101111000000100011111010101111010011101000011111100010001111101010111100000010100001110111001111101110001111101010111100000010100001111101101111101110001111101010111011111011111101100011111010001011000011100011111010101111000000100011111010100111001100101000011111011111101101101010111100000010100001110111101000111110100010 8fabc08fabd3a1f88fabc0a1dcfb8fabc0a1f6fb8fabbefd8fa2c38fabc08fa9cca1f7edabc0a1de8fa2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)