Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 螂ェ閾楢鳩隹キ | 111001011010010110101010111010001000011110010011111010001001010010110101111010001011000010110111 | e5a5aae88793e894b5e8b0b7
EUC-JP | 螂ェ閾楢鳩隹キ | 1110101010100111100011101010101011101111111001111100011011101010110010001011011111110000101100101000111010110111 | eaa78eaaefe7c6eac8b7f0b28eb7
UTF-8 | 螂ェ閾楢鳩隹キ | 111010001001111010000010111011111011110110101010111010011001011010111110111001101010010110100010111010011011001110101001111010011001101010111001111011111011110110110111 | e89e82efbdaae996bee6a5a2e9b3a9e99ab9efbdb7
UHC | 螂??楢鳩?? | 11010101110011000011111100111111111010101111100111001111110011010011111100111111 | d5cc3f3feaf9cfcd3f3f
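
A table like the one above can be approximated with a short Python script. This is only a sketch, not the converter's actual implementation. The codec names (latin-1, cp932, euc_jp, utf-8, cp949) are Python's nearest equivalents to the charsets listed and are an assumption, so individual bytes may differ from this page's output in edge cases. The helper names to_bits and encode_table are hypothetical. Characters a charset cannot represent are replaced with "?" (3f), as in the ISO-8859-1 and UHC rows.

```python
# Sketch: encode a string in several charsets and show the bytes as
# binary and hexadecimal. Codec names are Python's closest matches to the
# charsets in the table (an assumption, not the tool's own converter).
CHARSETS = [
    ("ISO-8859-1", "latin-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def to_bits(data: bytes) -> str:
    """Return the bytes as one continuous string of 0s and 1s."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_table(text: str):
    """Return one (charset, character, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS:
        # errors="replace" substitutes "?" for unencodable characters.
        encoded = text.encode(codec, errors="replace")
        # Decode again to show what survived the conversion.
        shown = encoded.decode(codec, errors="replace")
        rows.append((label, shown, to_bits(encoded), encoded.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("あ"):
        print(" | ".join(row))
```

For the single character "あ", the UTF-8 row of this sketch is e38182 and the SJIS-WIN row is 82a0. That shows how the same character maps to different byte sequences, and different lengths, depending on the charset.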

SJIS-WIN, EUC-JP: Legacy Japanese character encodings, mainly used on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).