Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 猥?9寡??? 11100000110011100011111110000010010110001000100111000111001111110011111100111111 e0ce3f825889c73f3f3f
EUC-JP 猥?9寡??? 11100000110100000011111110100011101110011011001011001001001111110011111100111111 e0d03fa3b9b2c93f3f3f
UTF-8 猥욕9寡了욄변 111001111000110010100101111011001001101010010101111011111011110010011001111001011010111110100001111011111010011010111010111011001001101010000100111010111011001110000000 e78ca5ec9a95efbc99e5afa1efa6baec9a84ebb380
UHC 猥욕9寡了욄변 1110100011100101101111111110010110100011101110011100110111111011111010001110011110011110111001101011101010101111 e8e5bfe5a3b9cdfbe8e79ee6baaf

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a character set cannot represent are shown as "?" (byte 3f).
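
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `to_bit_strings` and `convert` are assumptions made for illustration, and Python's codecs may differ from this page's converter for some characters. Unencodable characters are replaced with "?" (3f), matching the behavior seen in the ISO-8859-1 row.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec lacks.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in the table."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("猥").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("猥", "utf-8")` yields the hex string `e78ca5`, the first three bytes of the UTF-8 row above, while the same character in ISO-8859-1 collapses to a single `3f`.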