To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌?????癒??????繹??????踰? 1110001010100011001111110011111100111111001111110011111110010110111111000011111100111111001111110011111100111111001111111110001110001000001111110011111100111111001111110011111100111111111001101111101000111111 e2a33f3f3f3f3f96fc3f3f3f3f3f3fe3883f3f3f3f3f3fe6fa3f
EUC-JP 筌?????癒??????繹??????踰? 1110010010100101001111110011111100111111001111110011111111001100111111100011111100111111001111110011111100111111001111111110010111101000001111110011111100111111001111110011111100111111111011001111110000111111 e4a53f3f3f3f3fccfe3f3f3f3f3f3fe5e83f3f3f3f3f3fecfc3f
UTF-8 筌뚯룈留볟킊癒꿔렒樂끧됱쯽繹☏듬츃嶺뚯뜲踰킖 111001111010110110001100111010111001101010101111111010111010001110001000111011111010011110001101111010111011001110011111111011011000001010001010111001111001100110010010111010101011111110010100111010111010000010010010111011111010011010111111111010111000000110100111111010111001000010110001111011001010111110111101111001111011100110111001111000101001100010001111111010111001001110101100111011001011100010000011111011111010011010101011111010111001101010101111111010111001110010110010111010001011100010110000111011011000001010010110 e7ad8ceb9aafeba388efa78debb39fed828ae79992eabf94eba092efa6bfeb81a7eb90b1ecafbde7b9b9e2988feb93acecb883efa6abeb9aafeb9cb2e8b8b0ed8296
UHC 筌뚯룈留볟킊癒꿔렒樂끧됱쯽繹☏듬츃嶺뚯뜲踰킖 1110111110100111100011001110110010001111100001111110101110100111100100111110010110110100100101101110101110101000101100101110001110001110101001111110100011111001100001011101000110001001111011001010100110000001111001101011101010100010110011101011010111101011101011101000000111100111101011011000110011101100100011011011000011101011101100101011010101000010 efa78cec8f87eba793e5b496eba8b2e38ea7e8f985d189eca981e6baa2ceb5ebae81e7ad8cec8db0ebb2b542

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
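
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The helper name `encode_table` and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions. The page does not say how it handles characters a charset cannot represent; the sketch replaces them with `?` (byte 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: characters that a charset cannot represent
    are replaced with '?' (0x3f) instead of raising an error.
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        # Each byte becomes 8 binary digits, zero-padded on the left.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each row shows the same bytes twice: the binary column is the hexadecimal column written out at eight bits per byte, so the two can be checked against each other.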