Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????W?? 00111111001111110011111100111111001111110011111100111111010101110011111100111111 3f3f3f3f3f3f3f573f3f
SJIS-WIN ??匡?似??W?? 001111110011111110001011101001110011111110001110100101110011111100111111010101110011111100111111 3f3f8ba73f8e973f3f573f3f
EUC-JP ??匡?似??W?? 001111110011111110110110101010010011111110111011111101110011111100111111010101110011111100111111 3f3fb6a93fbbf73f3f573f3f
UTF-8 렺읖匡팍似맣렯W렺팍 11101011101000001011101011101100100111011001011011100101100011001010000111101101100011001000110111100100101111001011110011101011101001111010001111101011101000001010111101010111111010111010000010111010111011011000110010001101 eba0baec9d96e58ca1ed8c8de4bcbceba7a3eba0af57eba0baed8c8d
UHC 렺읖匡팍似맣렯W렺팍 10001110110000101100000011000101110011101100010011000110110001011101111011000100101110001100010010001110101111000101011110001110110000101100011011000101 8ec2c0c5cec4c6c5dec4b8c48ebc578ec2c6c5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
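
The table can be approximated offline with a short script. The following is a minimal Python sketch, not the code behind this page. The function name encode_rows and the mapping from the charset labels above to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) are assumptions made for illustration. It assumes that a character with no representation in a charset is replaced by "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return {charset: (shown, bits, hex)} for the given text.

    shown is the text as it survives a round trip through the charset,
    bits is the encoded bytes as a binary string, hex as a hex string.
    """
    rows = {}
    for label, codec in CODECS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        shown = raw.decode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (shown, bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (shown, bits, hexstr) in encode_rows("匡W").items():
        print(label, shown, bits, hexstr)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.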