To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 頸痔鴟スシ頸痔鴟スシ^ 1110100011110010100011101010010011101001111101001011110110111100111010001111001010001110101001001110100111110100101111011011110001011110 e8f28ea4e9f4bdbce8f28ea4e9f4bdbc5e
EUC-JP 頸痔鴟スシ頸痔鴟スシ^ 111100001111010010111100101001101111001011110110100011101011110110001110101111001111000011110100101111001010011011110010111101101000111010111101100011101011110001011110 f0f4bca6f2f68ebd8ebcf0f4bca6f2f68ebd8ebc5e
UTF-8 頸痔鴟スシ頸痔鴟スシ^ 11101001101000001011100011100111100101111001010011101001101101001001111111101111101111011011110111101111101111011011110011101001101000001011100011100111100101111001010011101001101101001001111111101111101111011011110111101111101111011011110001011110 e9a0b8e79794e9b49fefbdbdefbdbce9a0b8e79794e9b49fefbdbdefbdbc5e
UHC 頸痔???頸痔???^ 110011001111001011110110110000000011111100111111001111111100110011110010111101101100000000111111001111110011111101011110 ccf2f6c03f3f3fccf2f6c03f3f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)