To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?????惟??巍リ???┷鷹??鵝?? 11100001100111110011111100111111001111110011111100111111100010001101001000111111001111111001101111011001100000111000101000111111001111110011111110000100101110001001000111101001001111110011111111101010010000000011111100111111 e19f3f3f3f3f3f88d23f3f9bd9838a3f3f3f84b891e93f3fea403f3f
EUC-JP 癲?????惟??巍リ???┷鷹??鵝?? 11100010101000010011111100111111001111110011111100111111101100001101010000111111001111111101011011011011101001011110101000111111001111110011111110101000101110101100001011101011001111110011111111110011101000010011111100111111 e2a13f3f3f3f3fb0d43f3fd6dba5ea3f3f3fa8bac2eb3f3ff3a13f3f
UTF-8 癲욌맧杻득궇惟듈뀲巍リ쑴藺듸┷鷹녿닲鵝얠칳 111001111001100110110010111011001001101010001100111010111010011110100111111011111010011110001000111010111001001110011101111010101011011010000111111001101000001110011111111010111001001110001000111010111000000010110010111001011011011110001101111000111000001110101010111011001001000110110100111011111010011110110000111010111001001110111000111000101001010010110111111010011011011110111001111010111000010110111111111010111000101110110010111010011011010110011101111011001001011010100000111011001011100110110011 e799b2ec9a8ceba7a7efa788eb939deab687e6839feb9388eb80b2e5b78de383aaec91b4efa7b0eb93b8e294b7e9b7b9eb85bfeb8bb2e9b59dec96a0ecb9b3
UHC 癲욌맧杻득궇惟듈뀲巍リ쑴藺듸┷鷹녿닲鵝얠칳 111011111010011010011110111010111001000010110000111010101111010010110101111001101000001010100000111010101110111010110101111000101000010110101000111010001110010010101011111010101011111010101001111011001110000110110101111011111010011010111010111010111110110110000110111010111000100010101000111001001011110110111110111011001010111110000110 efa69eeb90b0eaf4b5e682a0eaeeb5e285a8e8e4abeabea9ece1b5efa6baebed86eb88a8e4bdbeecaf86

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)