To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????[????[^ 0011111100111111001111110011111101011011001111110011111100111111001111110101101101011110 3f3f3f3f5b3f3f3f3f5b5e
SJIS-WIN ?紙?直[?紙?直[^ 001111111000111010000110001111111001001010111100010110110011111110001110100001100011111110010010101111000101101101011110 3f8e863f92bc5b3f8e863f92bc5b5e
EUC-JP ?紙?直[?紙?直[^ 001111111011101111100110001111111100010010111110010110110011111110111011111001100011111111000100101111100101101101011110 3fbbe63fc4be5b3fbbe63fc4be5b5e
UTF-8 솜紙솜直[솜紙솜直[^ 111011001000011010011100111001111011010010011001111011001000011010011100111001111001101110110100010110111110110010000110100111001110011110110100100110011110110010000110100111001110011110011011101101000101101101011110 ec869ce7b499ec869ce79bb45bec869ce7b499ec869ce79bb45b5e
UHC 솜紙솜直[솜紙솜直[^ 10111100110110001111001010110101101111001101100011110010110000010101101110111100110110001111001010110101101111001101100011110010110000010101101101011110 bcd8f2b5bcd8f2c15bbcd8f2b5bcd8f2c15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)