To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 瑤??與??節??節ら?窈??易?????B 1110101010100010001111110011111111100100011011110011111100111111100100001101111100111111001111111001000011011111100000101110011100111111111000100111011100111111001111111000100011010101001111110011111100111111001111110011111101000010 eaa23f3fe46f3f3f90df3f3f90df82e73fe2773f3f88d53f3f3f3f3f42
EUC-JP 瑤??與??節??節ら?窈??易?????B 1111010010100100001111110011111111100111110100000011111100111111110000001110000100111111001111111100000011100001101001001110100100111111111000111101100000111111001111111011000011010111001111110011111100111111001111110011111101000010 f4a43f3fe7d03f3fc0e13f3fc0e1a4e93fe3d83f3fb0d73f3f3f3f3f42
UTF-8 瑤덆죱與쀦텊節욥썖節ら웼窈뚧쮻易뉛슴樂됵쉠B 11100111100100011010010011101011100011011000011011101100101000111011000111101000100010001000011111101100100000001010011011101101100001011000101011100111101011111000000011101100100110101010010111101100100011011001011011100111101011111000000011100011100000101000100111101100100110111011110011100111101010101000100011101011100110101010011111101100101011101011101111100110100110001001001111101011100010011001101111101100100010101011010011101111101001101011111111101011100100001011010111101100100010011010000001000010 e791a4eb8d86eca3b1e88887ec80a6ed858ae7af80ec9aa5ec8d96e7af80e38289ec9bbce7aa88eb9aa7ecaebbe69893eb899bec8ab4efa6bfeb90b5ec89a042
UHC 瑤덆죱與쀦텊節욥썖節ら웼窈뚧쮻易뉛슴樂됵쉠B 11101000111111011000100011101001101000011000110011100110101010001001011111100110101101101000011111101111101111011011111111101001100110111000100111101111101111011010101011101001100111111000100011101001101000011000110011100110101010001001011111100110101101101000011111101111101111011011111111101000111110011000100111101111101111011010101001000010 e8fd88e9a18ce6a897e6b687efbdbfe99b89efbdaae99f88e9a18ce6a897e6b687efbdbfe8f989efbdaa42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)