What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蠖零、ク鬢俯ャ 1110010110111101100101111110101110100100101110001110100110100100100110001110101110101100 e5bd97eba4b8e9a498ebac
EUC-JP 蠖零、ク鬢俯ャ 1110101010111111110011101110110110001110101001001000111010111000111100101010011011010000111011011000111010101100 eabfceed8ea48eb8f2a6d0ed8eac
UTF-8 蠖零、ク鬢俯ャ 111010001010000010010110111010011001101110110110111011111011110110100100111011111011110110111000111010011010110010100010111001001011111110101111111011111011110110101100 e8a096e99bb6efbda4efbdb8e9aca2e4bfafefbdac
UHC ?零???俯? 001111111101011011000011001111110011111100111111110111001111011000111111 3fd6c33f3f3fdcf63f
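
A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the function name `encode_table` is hypothetical, and the codec names are assumptions that map the labels used here onto Python's standard codecs (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the codec lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as exactly eight binary digits.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Each byte contributes eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.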

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).