To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 驩ァ閠碁鴬閠沓 11101001100010011010011111101000100000001000110011101001100010011010011111101000100000001000110001000010 e989a7e8808ce989a7e8808c42
EUC-JP 驩ァ閠碁鴬閠沓 1111000111101001100011101010011111101111111000001011100011101011101100101010100111101111111000001011011110100011 f1e98ea7efe0b8ebb2a9efe0b7a3
UTF-8 驩ァ閠碁鴬閠沓 111010011010100110101001111011111011110110100111111010011001011010100000111001111010001010000001111010011011010010101100111010011001011010100000111001101011001010010011 e9a9a9efbda7e996a0e7a281e9b4ace996a0e6b293
UHC 驩??碁??沓 11111100101111100011111100111111110100011011001100111111001111111101001111001011 fcbe3f3fd1b33f3fd3cb
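The rows above can be approximated offline with a short Python sketch. It is an illustration, not this tool's actual implementation. The codec names in `CODECS` are assumptions about which Python codecs correspond to the charset labels in the table (for example `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

Mapping results can differ between implementations for some characters, so the output of this sketch is not guaranteed to match the table byte for byte in every charset.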

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.