What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN タス痔フス痔[タス痔フス痔[^ 1111000010111010110000001111000110001110101111011000111010100100111100001110000011001100111100011000111010111101100011101010010001011011111100001011101011000000111100011000111010111101100011101010010011110000111000001100110011110001100011101011110110001110101001000101101101011110 f0bac0f18ebd8ea4f0e0ccf18ebd8ea45bf0bac0f18ebd8ea4f0e0ccf18ebd8ea45b5e
EUC-JP ?タ?ス痔?フ?ス痔[?タ?ス痔?フ?ス痔[^ 0011111110001110110000000011111110001110101111011011110010100110001111111000111011001100001111111000111010111101101111001010011001011011001111111000111011000000001111111000111010111101101111001010011000111111100011101100110000111111100011101011110110111100101001100101101101011110 3f8ec03f8ebdbca63f8ecc3f8ebdbca65b3f8ec03f8ebdbca63f8ecc3f8ebdbca65b5e
UTF-8 タス痔フス痔[タス痔フス痔[^ 111011101000000110111001111011111011111010000000111011101000010010001001111011111011110110111101111001111001011110010100111011101000001010011111111011111011111010001100111011101000010010001001111011111011110110111101111001111001011110010100010110111110111010000001101110011110111110111110100000001110111010000100100010011110111110111101101111011110011110010111100101001110111010000010100111111110111110111110100011001110111010000100100010011110111110111101101111011110011110010111100101000101101101011110 ee81b9efbe80ee8489efbdbde79794ee829fefbe8cee8489efbdbde797945bee81b9efbe80ee8489efbdbde79794ee829fefbe8cee8489efbdbde797945b5e
UHC ????痔????痔[????痔????痔[^ 001111110011111100111111001111111111011011000000001111110011111100111111001111111111011011000000010110110011111100111111001111110011111111110110110000000011111100111111001111110011111111110110110000000101101101011110 3f3f3f3ff6c03f3f3f3ff6c05b3f3f3f3ff6c03f3f3f3ff6c05b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
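
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the function name `encode_table` is hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which matches the `3f` bytes visible in the rows above; individual mappings for vendor-specific or private-use characters may still differ from the tool's.

```python
# Assumed correspondence between the table's charset labels and Python codecs.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec lacks.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("痔[^"):
        print(label, binary, hexadecimal)
```

ASCII characters such as `[` and `^` come out as `5b` and `5e` in every charset listed, which is why each row of the table ends in `5b5e`.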