To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖?????松??夭???ら┃???泣 10011101100010100011111100111111001111110011111100111111100011111011110000111111001111111001101011101110001111110011111100111111100000101110011110000100101010110011111100111111001111111000101110000011 9d8a3f3f3f3f3f8fbc3f3f9aee3f3f3f82e784ab3f3f3f8b83
EUC-JP 搖??靷??松??夭???ら┃沅??泣 1101100111101010001111110011111110001111111001111011110100111111001111111011111010111110001111110011111111010100111100000011111100111111001111111010010011101001101010001010110110001111110001101110100100111111001111111011010111100011 d9ea3f3f8fe7bd3f3fbebe3f3fd4f03f3f3fa4e9a8ad8fc6e93f3fb5e3
UTF-8 搖깅ㅏ靷숁만松썬렆夭뽯똻璘ら┃沅쎈뀪泣 111001101001000010010110111010101011100110000101111000111000010110001111111010011001110110110111111011001000100010000001111010111010011110001100111001101001110110111110111011001000110110101100111010111010000010000110111001011010010010101101111010111011110110101111111010111001100010111011111011111010011110101111111000111000001010001001111000101001010010000011111001101011001010000101111011001000111010001000111010111000000010101010111001101011001110100011 e69096eab985e3858fe99db7ec8881eba78ce69dbeec8daceba086e5a4adebbdafeb98bbefa7afe38289e29483e6b285ec8e88eb80aae6b3a3
UHC 搖깅ㅏ靷숁만松썬렆夭뽯똻璘ら┃沅쎈뀪泣 1110100011110100101100011110101110100100101111111110110011100110100110011110011010111000101110001110000111100110101111011110001110001110101000001110100011101100100101101110101110001100100000011110110011011110101010101110100110100110101011011110101010110110101111011110101110000101101000001110101111101000 e8f4b1eba4bfece699e6b8b8e1e6bde38ea0e8ec96eb8c81ecdeaae9a6adeab6bdeb85a0ebe8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)