To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 潁??泣?┸惟??潁??泣??衰⑥?永??揖? 100111111111000100111111001111111000101110000011001111111000010010111101100010001101001000111111001111111001111111110001001111110011111110001011100000110011111100111111100100001000101010000111010001010011111110001001011010010011111100111111100101110100101100111111 9ff13f3f8b833f84bd88d23f3f9ff13f3f8b833f3f908a87453f89693f3f974b3f
EUC-JP 潁??泣?┸惟??潁??泣??衰??永??揖? 1101111011110011001111110011111110110101111000110011111110101000101111111011000011010100001111110011111111011110111100110011111100111111101101011110001100111111001111111011111111101010001111110011111110110001110010100011111100111111110011011010110000111111 def33f3fb5e33fa8bfb0d43f3fdef33f3fb5e33f3fbfea3f3fb1ca3f3fcdac3f
UTF-8 潁뺣맮泣댐┸惟듭돖潁뺣맮泣닸벦衰⑥넶永띠옚揖쁥 111001101011110110000001111010111011101010100011111010111010011110101110111001101011001110100011111010111000110010010000111000101001010010111000111001101000001110011111111010111001001110101101111010111000111110010110111001101011110110000001111010111011101010100011111010111010011110101110111001101011001110100011111010111000101110111000111010111011001010100110111010001010000110110000111000101001000110100101111010111000010010110110111001101011000010111000111010111001110110100000111011001001100010011010111001101000111110010110111011001000000110100101 e6bd81ebbaa3eba7aee6b3a3eb8c90e294b8e6839feb93adeb8f96e6bd81ebbaa3eba7aee6b3a3eb8bb8ebb2a6e8a1b0e291a5eb84b6e6b0b8eb9da0ec989ae68f96ec81a5
UHC 潁뺣맮泣댐┸惟듭돖潁뺣맮泣닸벦衰⑥넶永띠옚揖쁥 11100111101110001001010111101011100100001011010111101011111010001011010011101111101001101011111111101010111011101011010111101100100010011010000011100111101110001001010111101011100100001011010111101011111010001011010011100110100100111011111011100001111100011010100011101100100001101011001111100111101101011011011011101100100111101001111011101011111001111001100001101000 e7b895eb90b5ebe8b4efa6bfeaeeb5ec89a0e7b895eb90b5ebe8b4e693bee1f1a8ec86b3e7b5b6ec9e9eebe79868

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)