To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟?????猿????????怨??悠?┼ 1000110011100101001111110011111100111111001111110011111110001001100011100011111100111111001111110011111100111111001111110011111100111111100010011000010100111111001111111001011101001001001111111000010010101001 8ce53f3f3f3f3f898e3f3f3f3f3f3f3f3f89853f3f97493f84a9
EUC-JP 悟??沅??猿??饔??彛??怨??悠?┼ 1011100011100111001111110011111110001111110001101110100100111111001111111011000111101110001111110011111110001111111010001110111100111111001111111000111110111100111110100011111100111111101100011110010100111111001111111100110110101010001111111010100010101011 b8e73f3f8fc6e93f3fb1ee3f3f8fe8ef3f3f8fbcfa3f3fb1e53f3fcdaa3fa8ab
UTF-8 悟뽯쉼沅졿궇猿뉎걶饔낃퉮彛뗧넭怨딆뇠悠잞┼ 111001101000001010011111111010111011110110101111111011001000100110111100111001101011001010000101111011001010000110111111111010101011011010000111111001111000110010111111111010111000100110001110111010101011000110110110111010011010010110010100111010111000001010000011111011011000100110101110111001011011110110011011111010111001011110100111111010111000010010101101111001101000000010101000111010111001010010000110111010111000011110100000111001101000001010100000111011001001111010011110111000101001010010111100 e6829febbdafec89bce6b285eca1bfeab687e78cbfeb898eeab1b6e9a594eb8283ed89aee5bd9beb97a7eb84ade680a8eb9486eb87a0e682a0ec9e9ee294bc
UHC 悟뽯쉼沅졿궇猿뉎걶饔낃퉮彛뗧넭怨딆뇠悠잞┼ 111001111111011010010110111010111011110110110000111010101011011010100000111001101000001010100000111010101011101110000111111000111000000110011100111010001011110110000101111010101011100110000110111011001010110110001011111001111000011010101100111010101011001110001010111011001000011110001000111010101110110110011111111011111010011010101011 e7f696ebbdb0eab6a0e682a0eabb87e3819ce8bd85eab986ecad8be786aceab38aec8788eaed9fefa6ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)