To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 要??節????????嚴??節??言??^ 100101110111011000111111001111111001000011011111001111110011111100111111001111110011111100111111001111110011111110011010100011100011111100111111100100001101111100111111001111111000110010111110001111110011111101011110 97763f3f90df3f3f3f3f3f3f3f3f9a8e3f3f90df3f3f8cbe3f3f5e
EUC-JP 要??節????????嚴??節??言??^ 110011011101011100111111001111111100000011100001001111110011111100111111001111110011111100111111001111110011111111010011111011100011111100111111110000001110000100111111001111111011100011000000001111110011111101011110 cdd73f3fc0e13f3f3f3f3f3f3f3fd3ee3f3fc0e13f3fb8c03f3f5e
UTF-8 要띷벦節쏙쉼料쒏뜿濾뚪찉嚴억슬節곈솢言뷴뫝^ 11101000101001101000000111101011100111011011011111101011101100101010011011100111101011111000000011101100100011111001100111101100100010011011110011101111101001101011111011101100100100101000111111101011100111001011111111101111101001101000010011101011100110101010101011101100101100001000100111100101100110101011010011101100100101101011010111101100100010101010110011100111101011111000000011101010101100111000100011101100100001101010001011101000101010001000000011101011101101111011010011101011101010111001110101011110 e8a681eb9db7ebb2a6e7af80ec8f99ec89bcefa6beec928feb9cbfefa684eb9aaaecb089e59ab4ec96b5ec8aace7af80eab388ec86a2e8a880ebb7b4ebab9d5e
UHC 要띷벦節쏙쉼料쒏뜿濾뚪찉嚴억슬節곈솢言뷴뫝^ 11101001101010011000110111100110100100111011111011101111101111011011110111101111101111011011000011101000111101111001110011100110100011011011101011100110101001001000110011101001101010011000110111100101111100011011111011101111101111011011110111101111101111011011000011101001100110011001110011100101111010111011101011100101100100011011110101011110 e9a98de693beefbdbdefbdb0e8f79ce68dbae6a48ce9a98de5f1beefbdbdefbdb0e9999ce5ebbae591bd5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
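
A conversion like the one in the table can be sketched in Python as follows. This is only an illustration, not the page's actual implementation: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the run of question marks in the ISO-8859-1 row.

```python
# Hypothetical sketch: encode a string in several charsets and show the
# resulting bytes as a binary string and a hexadecimal string.

# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' instead of raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("要^").items():
        print(label, bits, hexstr)
```

For the two-character input "要^", this sketch yields 97765e for SJIS-WIN, cdd75e for EUC-JP, and e8a6815e for UTF-8, which agrees with the leading and trailing bytes of the corresponding table rows.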