Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8諭?????熬??日??揄ы?疫 111000011001111100111111100000100101011110010111010000000011111100111111001111110011111100111111111000001001001000111111001111111001001111111010001111110011111110011101100010011000010010001101001111111000100101110101 e19f3f825797403f3f3f3f3fe0923f3f93fa3f3f9d89848d3f8975
EUC-JP 癲?8諭??洹??熬??日??揄ы?疫 1110001010100001001111111010001110111000110011011010000100111111001111111000111111000111101110100011111100111111110111111111001000111111001111111100011011111100001111110011111111011001111010011010011111101101001111111011000111010110 e2a13fa3b8cda13f3f8fc7ba3f3fdff23f3fc6fc3f3fd9e9a7ed3fb1d6
UTF-8 癲쒕8諭뜻뇻洹쏆뫓熬곣뫅日뉒춯揄ы떝疫 1110011110011001101100101110110010010010100101011110111110111100100110001110100010101011101011011110101110011100101110111110101110000111101110111110011010110100101110011110110010001111100001101110101110101011100100111110011110000110101011001110101010110011101000111110101110101011100001011110011010010111101001011110101110001001100100101110110010110110101011111110011010001111100001001101000110001011111010111001011010011101111001111001011010101011 e799b2ec9295efbc98e8abadeb9cbbeb87bbe6b4b9ec8f86ebab93e786aceab3a3ebab85e697a5eb8992ecb6afe68f84d18beb969de796ab
UHC 癲쒕8諭뜻뇻洹쏆뫓熬곣뫅日뉒춯揄ы떝疫 1110111110100110100111001110101110100011101110001110101110110001101101101110011010110100101001111110101010110111100110111110110010010001101101011110100010100010100000011110001010010001101010001110110011101101100001111110011110101101100011001110101011110001101011001110110110001011101100111110011010111001 efa69ceba3b8ebb1b6e6b4a7eab79bec91b5e8a281e291a8eced87e7ad8ceaf1aced8bb3e6b9

SJIS-Win, EUC-JP: legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
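
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is how runs of 3f like those in the ISO-8859-1 row arise.

```python
# Assumed mapping from the table's labels to Python codec names.
# These are Python's codecs, not necessarily the ones this page uses.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text):
    """Return {label: (binary string, hex string)} for each charset.

    Hypothetical helper for illustration only.
    """
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Eight zero-padded bits per byte, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (binary, data.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in to_bit_strings("日").items():
        print(label, binary, hexadecimal)
```

For the single character 日, this sketch yields e697a5 in UTF-8, 93fa in SJIS-WIN, and c6fc in EUC-JP, byte sequences that also appear inside the corresponding rows of the table above.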