Which bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????^ | 001111110011111100111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f3f3f5e
SJIS-WIN | ?然盂杭?然盂恒^ | 001111111001000101010010111000011011000110001101010110010011111110010001010100101110000110110001100011010101000001011110 | 3f9152e1b18d593f9152e1b18d505e
EUC-JP | ?然盂杭?然盂恒^ | 001111111100000110110011111000101011001110111001101110100011111111000001101100111110001010110011101110011011000101011110 | 3fc1b3e2b3b9ba3fc1b3e2b3b9b15e
UTF-8 | 뤗然盂杭뤗然盂恒^ | 11101011101001001001011111100111100001001011011011100111100110111000001011100110100111011010110111101011101001001001011111100111100001001011011011100111100110111000001011100110100000011001001001011110 | eba497e784b6e79b82e69dadeba497e784b6e79b82e681925e
UHC | 뤗然盂杭뤗然盂恒^ | 1000111111000111111001101101010011101001110111011111100111111000100011111100011111100110110101001110100111011101111110011111011001011110 | 8fc7e6d4e9ddf9f88fc7e6d4e9ddf9f65e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
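
A conversion like the one tabulated above can be approximated in a few lines of Python. This is only a sketch, not the page's actual implementation. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper names `encode_row` and `convert` are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # the note above equates SJIS-Win with CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what the charset was actually able to represent.
    shown = raw.decode(codec)
    # Eight bits per byte, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def convert(text):
    """Encode `text` in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (_shown, bits, hexstr) in convert("然^").items():
        print(f"{label}: {bits} {hexstr}")
```

Each byte contributes exactly eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal one.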