To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN シナ痔柴治屡シナ痔柴治質シナ痔柴治邪 101111001100010110001110101001001000111011000100100011101010000110001110110001101011110011000101100011101010010010001110110001001000111010100001100011101011111110111100110001011000111010100100100011101100010010001110101000011000111011010111 bcc58ea48ec48ea18ec6bcc58ea48ec48ea18ebfbcc58ea48ec48ea18ed7
EUC-JP シナ痔柴治屡シナ痔柴治質シナ痔柴治邪 100011101011110010001110110001011011110010100110101111001100011010111100101000111011110011001000100011101011110010001110110001011011110010100110101111001100011010111100101000111011110011000001100011101011110010001110110001011011110010100110101111001100011010111100101000111011110011011001 8ebc8ec5bca6bcc6bca3bcc88ebc8ec5bca6bcc6bca3bcc18ebc8ec5bca6bcc6bca3bcd9
UTF-8 シナ痔柴治屡シナ痔柴治質シナ痔柴治邪 111011111011110110111100111011111011111010000101111001111001011110010100111001101001111110110100111001101011001010111011111001011011000110100001111011111011110110111100111011111011111010000101111001111001011110010100111001101001111110110100111001101011001010111011111010001011001110101010111011111011110110111100111011111011111010000101111001111001011110010100111001101001111110110100111001101011001010111011111010011000001010101010 efbdbcefbe85e79794e69fb4e6b2bbe5b1a1efbdbcefbe85e79794e69fb4e6b2bbe8b3aaefbdbcefbe85e79794e69fb4e6b2bbe982aa
UHC ??痔柴治???痔柴治質??痔柴治邪 0011111100111111111101101100000011100011110000111111011010111101001111110011111100111111111101101100000011100011110000111111011010111101111100101111010100111111001111111111011011000000111000111100001111110110101111011101111011110111 3f3ff6c0e3c3f6bd3f3f3ff6c0e3c3f6bdf2f53f3ff6c0e3c3f6bddef7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)