To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??C}??C{^ 001111110011111101000011011111010011111100111111010000110111101101011110 3f3f437d3f3f437b5e
SJIS-WIN ?亨C}?亨C{^ 0011111110001011100111000100001101111101001111111000101110011100010000110111101101011110 3f8b9c437d3f8b9c437b5e
EUC-JP ?亨C}?亨C{^ 0011111110110101111111000100001101111101001111111011010111111100010000110111101101011110 3fb5fc437d3fb5fc437b5e
UTF-8 뤚亨C}뤚亨C{^ 1110101110100100100110101110010010111010101010000100001101111101111010111010010010011010111001001011101010101000010000110111101101011110 eba49ae4baa8437deba49ae4baa8437b5e
UHC 뤚亨C}뤚亨C{^ 10001111110010011111101011111011010000110111110110001111110010011111101011111011010000110111101101011110 8fc9fafb437d8fc9fafb437b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)