To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??瑤??弛?????茹???? 00111111001111110011111111100010100001100011111100111111100101110100100000111111001111111110101010100010001111110011111110010010011011110011111100111111001111110011111100111111111001001010010100111111001111110011111100111111 3f3f3fe2863f3f97483f3feaa23f3f926f3f3f3f3f3fe4a53f3f3f3f
EUC-JP ???竊??幽??瑤??弛?????茹??嫄? 001111110011111100111111111000111110011000111111001111111100110110101001001111110011111111110100101001000011111100111111110000111101000000111111001111110011111100111111001111111110100010100111001111110011111110001111101110101010000100111111 3f3f3fe3e63f3fcda93f3ff4a43f3fc3d03f3f3f3f3fe8a73f3f8fbaa13f
UTF-8 捻뀁뮆竊섉꼷幽껊눀瑤뗭닂弛끿뙴紐꾨탞茹띾맧嫄콮 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010101011101110001010111010111000100010000000111001111001000110100100111010111001011110101101111010111000101110000010111001011011110010011011111010111000000110111111111010111001100110110100111011111010011110001111111010101011111010101000111011011000001110011110111010001000110010111001111010111001110110111110111010111010011110100111111001011010101110000100111011001011110110101110 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeabb8aeb8880e791a4eb97adeb8b82e5bc9beb81bfeb99b4efa78feabea8ed839ee88cb9eb9dbeeba7a7e5ab84ecbdae
UHC 捻뀁뮆竊섉꼷幽껊눀瑤뗭닂弛끿뙴紐꾨탞茹띾맧嫄콮 11100110111101111011001011101100100100101001010111101111101111001001100011100110100001001000111111101010111010111000001111101011100001111010000111101000111111011000101111101100100010001000101111101100101011001000010111100111100011001011011111101011101010101000010011101011101101011000001011100110101010101000110111101011100100001011000011101010101100011011001001000010 e6f7b2ec9295efbc98e6848feaeb83eb87a1e8fd8bec888becac85e78cb7ebaa84ebb582e6aa8deb90b0eab1b242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)