To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁??韋??矣??齬??愉??觀魏⑨?醫?ザ 1000100110100101001111110011111111101000111010000011111100111111111000011110000100111111001111111110101010010111001111110011111110010110111110010011111100111111111001100101011011101001101100001000011101001000001111111110011111001110001111111000001101010101 89a53f3fe8e83f3fe1e13f3fea973f3f96f93f3fe656e9b087483fe7ce3f8355
EUC-JP 翁??韋??矣??齬??愉??觀魏??醫?ザ 10110010101001110011111100111111111100001110101000111111001111111110001011100011001111110011111111110011111101110011111100111111110011001111101100111111001111111110101110110111111100101011001000111111001111111110111011010000001111111010010110110110 b2a73f3ff0ea3f3fe2e33f3ff3f73f3fccfb3f3febb7f2b23f3feed03fa5b6
UTF-8 翁띾뀍韋뤸룚矣뺤퍥齬잙벊愉녜걲觀魏⑨쭑醫묒ザ 111001111011111110000001111010111001110110111110111010111000000010001101111010011001111110001011111010111010010010111000111010111010001110011010111001111001111110100011111010111011101010100100111011011000110110100101111010011011110110101100111011001001111010011001111010111011001010001010111001101000010010001001111010111000010110011100111010101011000110110010111010001010011110000000111010011010110110001111111000101001000110101000111011001010110110010001111010011000011010101011111010111010110010010010111000111000001010110110 e7bf81eb9dbeeb808de99f8beba4b8eba39ae79fa3ebbaa4ed8da5e9bdacec9e99ebb28ae68489eb859ceab1b2e8a780e9ad8fe291a8ecad91e986abebac92e382b6
UHC 翁띾뀍韋뤸룚矣뺤퍥齬잙벊愉녜걲觀魏⑨쭑醫묒ザ 1110100010111010100011011110101110000101100010001110101011011111100011111110011010001111100101101110101111111000100101011110110010111011100111001110010111100001100111111110101110010011101011011110101011110000101100111110100110000001100110011100111010111010111010101110000010101000111011111010011110001001111011001010001010010001111011001010101110110110 e8ba8deb8588eadf8fe68f96ebf895ecbb9ce5e19feb93adeaf0b3e98199cebaeae0a8efa789eca291ecabb6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)