To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 噎荀囑噎荀囑^ 10011010011110001110010010100100100110101001011010011010011110001110010010100100100110101001011001011110 9a78e4a49a969a78e4a49a965e
EUC-JP 噎荀囑噎荀囑^ 11010011110110011110100010100110110100111111011011010011110110011110100010100110110100111111011001011110 d3d9e8a6d3f6d3d9e8a6d3f65e
UTF-8 噎荀囑噎荀囑^ 11100101100110011000111011101000100011011000000011100101100110111001000111100101100110011000111011101000100011011000000011100101100110111001000101011110 e5998ee88d80e59b91e5998ee88d80e59b915e
UHC ?荀囑?荀囑^ 0011111111100010111100001111010110110110001111111110001011110000111101011011011001011110 3fe2f0f5b63fe2f0f5b65e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)