To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???[???[^ 001111110011111100111111010110110011111100111111001111110101101101011110 3f3f3f5b3f3f3f5b5e
SJIS-WIN 稷崋妹[稷崋妹[^ 111000100110110010011011101110101001011010000101010110111110001001101100100110111011101010010110100001010101101101011110 e26c9bba96855be26c9bba96855b5e
EUC-JP 稷崋妹[稷崋妹[^ 111000111100110111010110101111001100101111100101010110111110001111001101110101101011110011001011111001010101101101011110 e3cdd6bccbe55be3cdd6bccbe55b5e
UTF-8 稷崋妹[稷崋妹[^ 111001111010100010110111111001011011010010001011111001011010011010111001010110111110011110101000101101111110010110110100100010111110010110100110101110010101101101011110 e7a8b7e5b48be5a6b95be7a8b7e5b48be5a6b95b5e
UHC 稷?妹[稷?妹[^ 11110010110000110011111111011000110110010101101111110010110000110011111111011000110110010101101101011110 f2c33fd8d95bf2c33fd8d95b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)