What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 髮ケ鬘比コ玖撕 | 111010011001101110111001111010011010000110010100111001001011101010001011111010001001110110011001 | e99bb9e9a194e4ba8be89d99
EUC-JP | 髮ケ鬘比コ玖撕 | 1111000111111011100011101011100111110010101000111100100011100110100011101011101010110110111010101101100111111001 | f1fb8eb9f2a3c8e68ebab6ead9f9
UTF-8 | 髮ケ鬘比コ玖撕 | 111010011010101110101110111011111011110110111001111010011010110010011000111001101010111110010100111011111011110110111010111001111000111010010110111001101001001010010101 | e9abaeefbdb9e9ac98e6af94efbdbae78e96e69295
UHC | 髮??比?玖? | 11011011101001010011111100111111110111011110111100111111110011111011100000111111 | dba53f3fddef3fcfb83f

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
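
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the helper names `encode_row` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: cp932 stands in for SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (characters after round trip, binary string, hex string)."""
    # Unencodable characters become "?" instead of raising an error.
    data = text.encode(codec, errors="replace")
    round_trip = data.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return round_trip, bits, data.hex()


def convert(text):
    """Encode text in every charset and collect one row per charset."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (chars, bits, hex_string) in convert("あ").items():
        print(label, chars, bits, hex_string)
```

Different converters may map a few characters differently between SJIS variants, so results for rare characters may not match the table byte for byte.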