To what bit string is a character (or several characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | 輿??援??淫?? | 100101110110000000111111001111111000100110000111001111110011111110001000111110100011111100111111 | 97603f3f89873f3f88fa3f3f
EUC-JP | 輿??援??淫?? | 110011011100000100111111001111111011000111100111001111110011111110110000111111000011111100111111 | cdc13f3fb1e73f3fb0fc3f3f
UTF-8 | 輿삳뿣援잍쾮淫볛봻 | 111010001011110010111111111011001000001010110011111010111011111110100011111001101000111110110100111011001001111010001101111011001011111010101110111001101011011110101011111010111011001110011011111010111011010010111011 | e8bcbfec82b3ebbfa3e68fb4ec9e8decbeaee6b7abebb39bebb4bb
UHC | 輿삳뿣援잍쾮淫볛봻 | 111001101010101110111011111010111001011110100011111010101011010110011111111001101011001010000101111010111110001010010011111000101001010010000010 | e6abbbeb97a3eab59fe6b285ebe293e29482

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
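
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The codec names below are an assumed mapping from the table's labels to Python's standard codecs (cp932 for SJIS-WIN, cp949 for UHC), and `encode_row` is a hypothetical helper name. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "輿"  # first character of the sample input in the table
    for label, codec in CODECS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

Each printed line has the charset label followed by the binary and hexadecimal forms of the encoded bytes, in the same column order as the table.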