To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???C?????????C??????B 001111110011111100111111010000110011111100111111001111110011111100111111001111110011111100111111001111110100001100111111001111110011111100111111001111110011111101000010 3f3f3f433f3f3f3f3f3f3f3f3f433f3f3f3f3f3f42
SJIS-WIN 劾??C劾??劾?????C劾??劾??B 1000101001001110001111110011111101000011100010100100111000111111001111111000101001001110001111110011111100111111001111110011111101000011100010100100111000111111001111111000101001001110001111110011111101000010 8a4e3f3f438a4e3f3f8a4e3f3f3f3f3f438a4e3f3f8a4e3f3f42
EUC-JP 劾??C劾??劾?????C劾??劾??B 1011001110101111001111110011111101000011101100111010111100111111001111111011001110101111001111110011111100111111001111110011111101000011101100111010111100111111001111111011001110101111001111110011111101000010 b3af3f3f43b3af3f3fb3af3f3f3f3f3f43b3af3f3fb3af3f3f42
UTF-8 劾귦삫C劾귦삪劾귥찈淋귳눅C劾귦삪劾귥찈B 111001011000101010111110111010101011011110100110111011001000001010101011010000111110010110001010101111101110101010110111101001101110110010000010101010101110010110001010101111101110101010110111101001011110110010110000100010001110111110100111101101011110101010110111101100111110101110001000100001010100001111100101100010101011111011101010101101111010011011101100100000101010101011100101100010101011111011101010101101111010010111101100101100001000100001000010 e58abeeab7a6ec82ab43e58abeeab7a6ec82aae58abeeab7a5ecb088efa7b5eab7b3eb888543e58abeeab7a6ec82aae58abeeab7a5ecb08842
UHC 劾귦삫C劾귦삪劾귥찈淋귳눅C劾귦삪劾귥찈B 111110101011011010000010111011011001100010101010010000111111101010110110100000101110110110011000101010011111101010110110100000101110110010101001100011001110110011111000100000101111101010110100101010100100001111111010101101101000001011101101100110001010100111111010101101101000001011101100101010011000110001000010 fab682ed98aa43fab682ed98a9fab682eca98cecf882fab4aa43fab682ed98a9fab682eca98c42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)