To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚴??誼??猿??魚 1001101010001110001111110011111110001011011000100011111100111111100010011000111000111111001111111000101110011011 9a8e3f3f8b623f3f898e3f3f8b9b
EUC-JP 嚴??誼??猿??魚 1101001111101110001111110011111110110101110000110011111100111111101100011110111000111111001111111011010111111011 d3ee3f3fb5c33f3fb1ee3f3fb5fb
UTF-8 嚴곸늾誼⒵듉猿딆젲魚 111001011001101010110100111010101011001110111000111010111000101010111110111010001010101010111100111000101001001010110101111010111001001110001001111001111000110010111111111010111001010010000110111011001010000010110010111010011010110110011010 e59ab4eab3b8eb8abee8aabce292b5eb9389e78cbfeb9486eca0b2e9ad9a
UHC 嚴곸늾誼⒵듉猿딆젲魚 1110010111110001100000011110110010001000100001111110101111111110101010011110011010001010101111001110101010111011100010101110110010100000101001101110010111100000 e5f181ec8887ebfea9e68abceabb8aeca0a6e5e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)