To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???L???L^ 001111110011111100111111010011000011111100111111001111110100110001011110 3f3f3f4c3f3f3f4c5e
SJIS-WIN 谿曙襖L谿曙襖L^ 111001101010111010001111100011001000100110100110010011001110011010101110100011111000110010001001101001100100110001011110 e6ae8f8c89a64ce6ae8f8c89a64c5e
EUC-JP 谿曙襖L谿曙襖L^ 111011001011000010111101111011001011001010101000010011001110110010110000101111011110110010110010101010000100110001011110 ecb0bdecb2a84cecb0bdecb2a84c5e
UTF-8 谿曙襖L谿曙襖L^ 111010001011000010111111111001101001101110011001111010001010010110010110010011001110100010110000101111111110011010011011100110011110100010100101100101100100110001011110 e8b0bfe69b99e8a5964ce8b0bfe69b99e8a5964c5e
UHC 谿曙?L谿曙?L^ 11001101101011001101111111110101001111110100110011001101101011001101111111110101001111110100110001011110 cdacdff53f4ccdacdff53f4c5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)