What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 蔗弱喉郤丈╋胚 | 1110010011110010100011101110001110001101010000011110011110111010100011111110010010000100101101001110001111110011 | e4f28ee38d41e7ba8fe484b4e3f3
EUC-JP | 蔗弱喉郤丈╋胚 | 1110100011110100101111001110010110111001101000101110111010111100101111101110011010101000101101101110011011110101 | e8f4bce5b9a2eebcbee6a8b6e6f5
UTF-8 | 蔗弱喉郤丈╋胚 | 111010001001010010010111111001011011110010110001111001011001011010001001111010011000001110100100111001001011100010001000111000101001010110001011111010001000001110011010 | e89497e5bcb1e59689e983a4e4b888e2958be8839a
UHC | 蔗弱喉?丈╋胚 | 11101101101111011110010110110000111111011010101000111111111011011101101110100110101101101101101111001111 | edbde5b0fdaa3feddba6b6dbcf

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
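
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The function name `encode_table` is hypothetical. The codec names are Python's own, and their mapping to the table's labels is an assumption: `cp932` for SJIS-WIN and `cp949` for UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), as the ISO-8859-1 and UHC rows suggest. Python's mapping tables may differ from the tool's for some characters.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_str in encode_table("あ"):
        print(label, bits, hex_str)
```

For the input "あ", the UTF-8 row of this sketch is the three bytes e38182, while ISO-8859-1 cannot represent the character and yields the single byte 3f.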