To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 筌??認??筌??游??喩???ル?諭??喩??^ 111000101010001100111111001111111001010001000110001111110011111111100010101000110011111100111111100111111110000000111111001111111001101001100111001111110011111100111111100000111000101100111111100101110100000000111111001111111001101001100111001111110011111101011110 e2a33f3f94463f3fe2a33f3f9fe03f3f9a673f3f3f838b3f97403f3f9a673f3f5e
EUC-JP 筌??認??筌??游??喩???ル?諭??喩??^ 111001001010010100111111001111111100011110100111001111110011111111100100101001010011111100111111110111101110001000111111001111111101001111001000001111110011111100111111101001011110101100111111110011011010000100111111001111111101001111001000001111110011111101011110 e4a53f3fc7a73f3fe4a53f3fdee23f3fd3c83f3f3fa5eb3fcda13f3fd3c83f3f5e
UTF-8 筌뚮뱷認뗥끀筌듬끼游뤄쫨喩륁뿉曆ル틷諭㎪에喩뽰쨦^ 11100111101011011000110011101011100110101010111011101011101100011011011111101000101010101000110111101011100101111010010111101011100000011000000011100111101011011000110011101011100100111010110011101011100000011011110011100110101110001011100011101011101001001000010011101100101010111010100011100101100101101010100111101011101001011000000111101011101111111000100111101111101001101000101111100011100000111010101111101101100010111011011111101000101010111010110111100011100011101010101011101100100101111001000011100101100101101010100111101011101111011011000011101100101010001010011001011110 e7ad8ceb9aaeebb1b7e8aa8deb97a5eb8180e7ad8ceb93aceb81bce6b8b8eba484ecaba8e596a9eba581ebbf89efa68be383abed8bb7e8abade38eaaec9790e596a9ebbdb0eca8a65e
UHC 筌뚮뱷認뗥끀筌듬끼游뤄쫨喩륁뿉曆ル틷諭㎪에喩뽰쨦^ 11101111101001111000110011101011100100111001110111101100111000111000101111100101100001011011011011101111101001111011010111101011101100111010001011101010111111011011011111101111101001101000000111101010111001111000111111101100100101111001000011100110101101111010101111101011101110101001111011101011101100011010011111100110101111111010000111101010111001111001011011101100101001001000000101011110 efa78ceb939dece38be585b6efa7b5ebb3a2eafdb7efa681eae78fec9790e6b7abebba9eebb1a7e6bfa1eae796eca4815e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)