To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟?????儒??繹??異??酉??椰??楢 1000110011100101001111110011111100111111001111110011111110001110111100100011111100111111111000111000100000111111001111111000100011011001001111110011111110010011110100010011111100111111100111101011110100111111001111111001001111101000 8ce53f3f3f3f3f8ef23f3fe3883f3f88d93f3f93d13f3f9ebd3f3f93e8
EUC-JP 悟??沅??儒??繹??異??酉??椰??楢 10111000111001110011111100111111100011111100011011101001001111110011111110111100111101000011111100111111111001011110100000111111001111111011000011011011001111110011111111000110110100110011111100111111110111001011111100111111001111111100011011101010 b8e73f3f8fc6e93f3fbcf43f3fe5e83f3fb0db3f3fc6d33f3fdcbf3f3fc6ea
UTF-8 悟뽯쉼沅졿궇儒띠젂繹먮씮異룟쫩酉귦맊椰꾠끁楢 111001101000001010011111111010111011110110101111111011001000100110111100111001101011001010000101111011001010000110111111111010101011011010000111111001011000010010010010111010111001110110100000111011001010000010000010111001111011100110111001111010111010100010101110111011001001010010101110111001111001010110110000111010111010001110011111111011001010101110101001111010011000010110001001111010101011011110100110111010111010011110001010111001101010010010110000111010101011111010100000111010111000000110000001111001101010010110100010 e6829febbdafec89bce6b285eca1bfeab687e58492eb9da0eca082e7b9b9eba8aeec94aee795b0eba39fecaba9e98589eab7a6eba78ae6a4b0eabea0eb8181e6a5a2
UHC 悟뽯쉼沅졿궇儒띠젂繹먮씮異룟쫩酉귦맊椰꾠끁楢 1110011111110110100101101110101110111101101100001110101010110110101000001110011010000010101000001110101011100011101101101110110010100000100001101110011010111010100100001110101110011101101111111110110010110110101101111110010110100110100000101110101110110111100000101110110110010000101000101110010110101011100001001110001110000101101101111110101011111001 e7f696ebbdb0eab6a0e682a0eae3b6eca086e6ba90eb9dbfecb6b7e5a682ebb782ed90a2e5ab84e385b7eaf9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)