To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 綜???咀?艤?? 100100011000111000111111001111110011111110011001111100000011111111100100011111100011111100111111 918e3f3f3f99f03fe47e3f3f
EUC-JP 綜???咀?艤?嫄 1100000111101110001111110011111100111111110100101111001000111111111001111101111100111111100011111011101010100001 c1ee3f3f3fd2f23fe7df3f8fbaa1
UTF-8 綜골렰렑咀렡艤렜嫄 111001111011011010011100111010101011001110101000111010111010000010110000111010111010000010010001111001011001001010000000111010111010000010100001111010001000100110100100111010111010000010011100111001011010101110000100 e7b69ceab3a8eba0b0eba091e59280eba0a1e889a4eba09ce5ab84
UHC 綜골렰렑咀렡艤렜嫄 111100001111110010110000111100011000111010111101100011101010011011101110101110101000111010110010111010111111101010001110101011101110101010110001 f0fcb0f18ebd8ea6eeba8eb2ebfa8eaeeab1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)