To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
| SJIS-WIN | 螢咎沮螢咎沮B | 11100101101000111001100111101001100111111001110011100101101000111001100111101001100111111001110001000010 | e5a399e99f9ce5a399e99f9c42 |
| EUC-JP | 螢咎沮螢咎沮B | 11101010101001011101001011101011110111011111110011101010101001011101001011101011110111011111110001000010 | eaa5d2ebddfceaa5d2ebddfc42 |
| UTF-8 | 螢咎沮螢咎沮B | 11101000100111101010001011100101100100101000111011100110101100101010111011101000100111101010001011100101100100101000111011100110101100101010111001000010 | e89ea2e5928ee6b2aee89ea2e5928ee6b2ae42 |
| UHC | 螢咎沮螢咎沮B | 11111011101010111100111110100100111011101100000111111011101010111100111110100100111011101100000101000010 | fbabcfa4eec1fbabcfa4eec142 |

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
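
The conversion shown above can be approximated in a few lines of Python. This is only a sketch, not the converter's actual implementation: the helper name `encode_to_bits` and the `CODECS` mapping are hypothetical, and the mapping assumes that Python's `cp932`, `euc_jp`, and `cp949` codecs correspond to SJIS-WIN, EUC-JP, and UHC as used by the converter, which may differ for some characters. Characters a charset cannot represent are replaced with `?` (0x3f), which appears to be what the ISO-8859-1 row shows.

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: these codecs match the converter's charsets closely enough.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # Unencodable characters become b"?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Each byte is rendered as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "螢咎沮螢咎沮B"
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Padding every byte to eight digits keeps the binary string aligned with the hexadecimal one, so each pair of hex digits corresponds to one group of eight bits.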