What bit string is a character (or string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 悌?財???臧六 | 100100101110111000111111100011011110000000111111001111110011111111100100011010001001100001011010 | 92ee3f8de03f3f3fe468985a
EUC-JP | 悌?財???臧六 | 110001001111000000111111101110101110001000111111001111110011111111100111110010011100111110111011 | c4f03fbae23f3f3fe7c9cfbb
UTF-8 | 悌렩財쯔렓렩臧六 | 111001101000001010001100111010111010000010101001111010001011001010100001111011001010111110010100111010111010000010010011111010111010000010101001111010001000011110100111111001011000010110101101 | e6828ceba0a9e8b2a1ecaf94eba093eba0a9e887a7e585ad
UHC | 悌렩財쯔렓렩臧六 | 11110000101010101000111010110111111011101010111111000010111010101000111010101000100011101011011111101101111101011101011110111111 | f0aa8eb7eeafc2ea8ea88eb7edf5d7bf
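
A conversion like the one tabulated above can be approximated in Python. This is only a sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are made up for illustration. Python's codec tables may differ from the converter's in a few edge cases. Characters a charset cannot represent are replaced with `?` (byte `3f`), which matches the `?` entries in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one codec."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show which characters survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight zero-padded bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode `text` in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (shown, bits, hexstr) in convert("六").items():
        # Only the ASCII-safe columns are printed here.
        print(name, bits, hexstr)
```

For the single character 六, the UTF-8 entry comes out as `e585ad`, the last three bytes of the UTF-8 row in the table, while ISO-8859-1 yields `3f` because the character has no Latin-1 equivalent.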

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).