To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鄒御シ趣シ剰浚 111001111011111010001100111001001011110010001110111011111011110010001111111010001001111110110010 e7be8ce4bc8eefbc8fe89fb2
EUC-JP 鄒御シ趣シ剰浚 1110111011000000101110001110011010001110101111001011110011110001100011101011110010111110111010101101111010110100 eec0b8e68ebcbcf18ebcbeeadeb4
UTF-8 鄒御シ趣シ剰浚 111010011000010010010010111001011011111010100001111011111011110110111100111010001011011010100011111011111011110110111100111001011000100110110000111001101011010110011010 e98492e5bea1efbdbce8b6a3efbdbce589b0e6b59a
UHC 鄒御?趣??浚 1111010111011011111001011101100100111111111101101010110000111111001111111111000111011101 f5dbe5d93ff6ac3f3ff1dd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)