To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 繕煽??魏據?莎幼[繕煽??魏據?莎幼[^ 100100010101010110010000111110000011111100111111111010011011000010011101100111110011111111100100101100111001011101100011010110111001000101010101100100001111100000111111001111111110100110110000100111011001111100111111111001001011001110010111011000110101101101011110 915590f83f3fe9b09d9f3fe4b397635b915590f83f3fe9b09d9f3fe4b397635b5e
EUC-JP 繕煽??魏據?莎幼[繕煽??魏據?莎幼[^ 110000011011011011000000111110100011111100111111111100101011001011011010101000010011111111101000101101011100110111000100010110111100000110110110110000001111101000111111001111111111001010110010110110101010000100111111111010001011010111001101110001000101101101011110 c1b6c0fa3f3ff2b2daa13fe8b5cdc45bc1b6c0fa3f3ff2b2daa13fe8b5cdc45b5e
UTF-8 繕煽롛뤾魏據횓莎幼[繕煽롛뤾魏據횓莎幼[^ 111001111011100110010101111001111000010110111101111010111010000110011011111010111010010010111110111010011010110110001111111001101001001110011010111011011001101010010011111010001000111010001110111001011011100110111100010110111110011110111001100101011110011110000101101111011110101110100001100110111110101110100100101111101110100110101101100011111110011010010011100110101110110110011010100100111110100010001110100011101110010110111001101111000101101101011110 e7b995e785bdeba19beba4bee9ad8fe6939aed9a93e88e8ee5b9bc5be7b995e785bdeba19beba4bee9ad8fe6939aed9a93e88e8ee5b9bc5b5e
UHC 繕煽롛뤾魏據횓莎幼[繕煽롛뤾魏據횓莎幼[^ 111000001100101111100000110000111000111011011111100011111110101011101010111000001100101111100000110000111000111011011110111011011110101011101010010110111110000011001011111000001100001110001110110111111000111111101010111010101110000011001011111000001100001110001110110111101110110111101010111010100101101101011110 e0cbe0c38edf8feaeae0cbe0c38edeedeaea5be0cbe0c38edf8feaeae0cbe0c38edeedeaea5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)