What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮??議??齬??異??碎??沃?? 001111110011111100111111111010000100101000111111001111111000101101100011001111110011111111101010100101110011111100111111100010001101100100111111001111111110000111101010001111110011111110010111100000000011111100111111 3f3f3fe84a3f3f8b633f3fea973f3f88d93f3fe1ea3f3f97803f3f
EUC-JP ???鍮??議??齬??異??碎??沃?? 001111110011111100111111111011111010101100111111001111111011010111000100001111110011111111110011111101110011111100111111101100001101101100111111001111111110001011101100001111110011111111001101111000000011111100111111 3f3f3fefab3f3fb5c43f3ff3f73f3fb0db3f3fe2ec3f3fcde03f3f
UTF-8 捻뀀맩鍮뽬린議얩뫛齬잙벊異룩첑碎ㅻ깹沃쇱쉰 111011111010011010100100111010111000000010000000111010111010011110101001111010011000110110101110111010111011110110101100111010111010011010110000111010001010110110110000111011001001011010101001111010111010101110011011111010011011110110101100111011001001111010011001111010111011001010001010111001111001010110110000111010111010001110101001111011001011001010010001111001111010001010001110111000111000010110111011111010101011100110111001111001101011001010000011111011001000011110110001111011001000100110110000 efa6a4eb8080eba7a9e98daeebbdaceba6b0e8adb0ec96a9ebab9be9bdacec9e99ebb28ae795b0eba3a9ecb291e7a28ee385bbeab9b9e6b283ec87b1ec89b0
UHC 捻뀀맩鍮뽬린議얩뫛齬잙벊異룩첑碎ㅻ깹沃쇱쉰 111001101111011110110010111010111001000010110001111010111011100110010110111010001011100010110000111011001010000110111110111011011001000110111011111001011110000110011111111010111001001110101101111011001011011010110111111010001010101010011110111000011110111110100100111010111011001010100001111010001010101010111100111011001011110110101110 e6f7b2eb90b1ebb996e8b8b0eca1beed91bbe5e19feb93adecb6b7e8aa9ee1efa4ebb2a1e8aabcecbdae

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
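
The same kind of table can be reproduced offline. The following is a minimal sketch, not the code behind this page: the function name `encode_table` and the mapping from the charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which is consistent with the runs of `3f` in the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

For the single character あ, this sketch gives `3f` for ISO-8859-1, `82a0` for SJIS-WIN, `a4a2` for EUC-JP, and `e38182` for UTF-8.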