To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 甕?????鸚??焉?????梧??節??^ 111000010101000000111111001111110011111100111111001111111110101001011111001111110011111111100000100000010011111100111111001111110011111100111111100011001110011000111111001111111001000011011111001111110011111101011110 e1503f3f3f3f3fea5f3f3fe0813f3f3f3f3f8ce63f3f90df3f3f5e
EUC-JP 甕?????鸚??焉?????梧??節??^ 111000011011000100111111001111110011111100111111001111111111001111000000001111110011111111011111111000010011111100111111001111110011111100111111101110001110100000111111001111111100000011100001001111110011111101011110 e1b13f3f3f3f3ff3c03f3fdfe13f3f3f3f3fb8e83f3fc0e13f3f5e
UTF-8 甕앭츍掠볢왊鸚㏆쉽焉욤툦若덂콚梧삣ㄷ節길뜙^ 11100111100101001001010111101100100101011010110111101100101110001000110111101111101001011011010111101011101100111010001011101100100110011000101011101001101110001001101011100011100011111000011011101100100010011011110111100111100001001000100111101100100110101010010011101101100010001010011011101111101001011011010011101011100011011000001011101100101111011001101011100110101000101010011111101100100000101010001111100011100001001011011111100111101011111000000011101010101110001011100011101011100111001001100101011110 e79495ec95adecb88defa5b5ebb3a2ec998ae9b89ae38f86ec89bde78489ec9aa4ed88a6efa5b4eb8d82ecbd9ae6a2a7ec82a3e384b7e7af80eab8b8eb9c995e
UHC 甕앭츍掠볢왊鸚㏆쉽焉욤툦若덂콚梧삣ㄷ節길뜙^ 11101000101110001001110111100101101011101000100011100101101100011001001111101000100111101011101111100101101001001010011111101111101111011011000111100101111010101011111111101000101110001001110111100101101011101000100011100101101100011001001111100111111111001011101111100101101001001010011111101111101111011011000111100110100011011001110001011110 e8b89de5ae88e5b193e89ebbe5a4a7efbdb1e5eabfe8b89de5ae88e5b193e7fcbbe5a4a7efbdb1e68d9c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)