To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??砥?雋麥?旨?? 0011111100111111100100110111010100111111111010001011001011101010011011010011111110001110011111000011111100111111 3f3f93753fe8b2ea6d3f8e7c3f3f
EUC-JP 珽?砥?雋麥?旨?? 10001111110010111111111000111111110001011101011000111111111100001011010011110011110011100011111110111011110111010011111100111111 8fcbfe3fc5d63ff0b4f3ce3fbbdd3f3f
UTF-8 珽렖砥렫雋麥윙旨곌췻 111001111000111110111101111010111010000010010110111001111010000010100101111010111010000010101011111010011001101110001011111010011011101010100101111011001001110010011001111001101001011110101000111010101011001110001100111011001011011110111011 e78fbdeba096e7a0a5eba0abe99b8be9baa5ec9c99e697a8eab38cecb7bb
UHC 珽렖砥렫雋麥윙旨곌췻 1110111111101010100011101010101111110010101100101000111010111001111100011110011011011000111010101100000010101110111100101010100110110000111010101100001111110000 efea8eabf2b28eb9f1e6d8eac0aef2a9b0eac3f0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)