To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????語 0011111100111111001111110011111100111111001111110011111100111111001111111000110011101010 3f3f3f3f3f3f3f3f3f8cea
EUC-JP ??????洹??語 00111111001111110011111100111111001111110011111110001111110001111011101000111111001111111011100011101100 3f3f3f3f3f3f8fc7ba3f3fb8ec
UTF-8 娛뤿챶栒녷슅洹욎꽦語 111001011010100010011011111010111010010010111111111011001011000110110110111001101010000010010010111010111000010110110111111011001000101010000101111001101011010010111001111011001001101010001110111010101011110110100110111010001010101010011110 e5a89beba4bfecb1b6e6a092eb85b7ec8a85e6b4b9ec9a8eeabda6e8aa9e
UHC 娛뤿챶栒녷슅洹욎꽦語 1110011111110100100011111110101110101010100000111110001011100011100001101110011010011010100101111110101010110111100111101110110010000100101100011110010111011110 e7f48febaa83e2e386e69a97eab79eec84b1e5de

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)