To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN ??砥?雋麥? 00111111001111111001001101110101001111111110100010110010111010100110110100111111 3f3f93753fe8b2ea6d3f
EUC-JP 珽?砥?雋麥? 100011111100101111111110001111111100010111010110001111111111000010110100111100111100111000111111 8fcbfe3fc5d63ff0b4f3ce3f
UTF-8 珽렖砥렫雋麥륏 111001111000111110111101111010111010000010010110111001111010000010100101111010111010000010101011111010011001101110001011111010011011101010100101111010111010010110001111 e78fbdeba096e7a0a5eba0abe99b8be9baa5eba58f
UHC 珽렖砥렫雋麥륏 1110111111101010100011101010101111110010101100101000111010111001111100011110011011011000111010101011011111110111 efea8eabf2b28eb9f1e6d8eab7f7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)