What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?猥??娼?猥??創^ 001111111110000011001110001111110011111110001111101010010011111111100000110011100011111100111111100100010110111001011110 3fe0ce3f3f8fa93fe0ce3f3f916e5e
EUC-JP ?猥??娼?猥??創^ 001111111110000011010000001111110011111110111110101010110011111111100000110100000011111100111111110000011100111101011110 3fe0d03f3fbeab3fe0d03f3fc1cf5e
UTF-8 롚猥롌롈娼롚猥롌롈創^ 11101011101000011001101011100111100011001010010111101011101000011000110011101011101000011000100011100101101010001011110011101011101000011001101011100111100011001010010111101011101000011000110011101011101000011000100011100101100010011011010101011110 eba19ae78ca5eba18ceba188e5a8bceba19ae78ca5eba18ceba188e589b55e
UHC 롚猥롌롈娼롚猥롌롈創^ 100011101101111011101000111001011000111011010010100011101100111011110011110111101000111011011110111010001110010110001110110100101000111011001110111100111101110001011110 8edee8e58ed28ecef3de8edee8e58ed28ecef3dc5e

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
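
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that the table's charset labels correspond to the Python codec names in `CODECS`, and that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row suggests. The names `CODECS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (an assumption
    about how the page handles them).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("猥^").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.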