What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 擾??援?????檍??逾??猿??蘂?? 100011111110111100111111001111111000100110000111001111110011111100111111001111110011111110011110111110000011111100111111111001111010010100111111001111111000100110001110001111110011111111100101010000010011111100111111 8fef3f3f89873f3f3f3f3f9ef83f3fe7a53f3f898e3f3fe5413f3f
EUC-JP 擾??援?????檍??逾??猿??蘂?? 101111101111000100111111001111111011000111100111001111110011111100111111001111110011111111011100111110100011111100111111111011101010011100111111001111111011000111101110001111110011111111101001101000100011111100111111 bef13f3fb1e73f3f3f3f3fdcfa3f3feea73f3fb1ee3f3fe9a23f3f
UTF-8 擾우엱援졽펺栒몃듋檍됰챷逾곦벧猿됯펷蘂뚯쎅 111001101001001110111110111011001001101010110000111011001001011110110001111001101000111110110100111011001010000110111101111011011000111010111010111001101010000010010010111010111010101010000011111010111001001110001011111001101010101010001101111010111001000010110000111011001011000110110111111010011000000010111110111010101011001110100110111010111011001010100111111001111000110010111111111010111001000010101111111011011000111010110111111010001001100010000010111010111001101010101111111011001000111010000101 e693beec9ab0ec97b1e68fb4eca1bded8ebae6a092ebaa83eb938be6aa8deb90b0ecb1b7e980beeab3a6ebb2a7e78cbfeb90afed8eb7e89882eb9aafec8e85
UHC 擾우엱援졽펺栒몃듋檍됰챷逾곦벧猿됯펷蘂뚯쎅 111010001111011010111111111011001001111010000110111010101011010110100000111001001011110010001010111000101110001110111000111010111000101010111110111001011110010110001001111010111010101010000100111010111011010110000001111001001011101010100110111010101011101110001001111010101011110010001000111001111101111010001100111011001001101110101110 e8f6bfec9e86eab5a0e4bc8ae2e3b8eb8abee5e589ebaa84ebb581e4baa6eabb89eabc88e7de8cec9bae

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
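
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced with "?" (0x3f), as the table suggests. The function name `encode_table` is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary bit string, hexadecimal string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("あ").items():
        print(label, bits, hexa)
```

Under these assumptions, an ASCII letter such as "A" yields the same single byte (0x41) in every row, while a character missing from a charset shows up as 3f, which is why the ISO-8859-1 row above consists entirely of question marks.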