To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淨論┗窪?淨論┗姐?淨論┗窪?淨論┗姐?^ 10011111110001001001100001011111100001001010111110001100010001010011111110011111110001001001100001011111100001001010111110001000101101110011111110011111110001001001100001011111100001001010111110001100010001010011111110011111110001001001100001011111100001001010111110001000101101110011111101011110 9fc4985f84af8c453f9fc4985f84af88b73f9fc4985f84af8c453f9fc4985f84af88b73f5e
EUC-JP 淨論┗窪?淨論┗姐?淨論┗窪?淨論┗姐?^ 11011110110001101100111111000000101010001011000110110111101001100011111111011110110001101100111111000000101010001011000110110000101110010011111111011110110001101100111111000000101010001011000110110111101001100011111111011110110001101100111111000000101010001011000110110000101110010011111101011110 dec6cfc0a8b1b7a63fdec6cfc0a8b1b0b93fdec6cfc0a8b1b7a63fdec6cfc0a8b1b0b93f5e
UTF-8 淨論┗窪렜淨論┗姐받淨論┗窪렜淨論┗姐밗^ 11100110101101111010100011101000101010111001011011100010100101001001011111100111101010101010101011101011101000001001110011100110101101111010100011101000101010111001011011100010100101001001011111100101101001111001000011101011101100001001101111100110101101111010100011101000101010111001011011100010100101001001011111100111101010101010101011101011101000001001110011100110101101111010100011101000101010111001011011100010100101001001011111100101101001111001000011101011101100001001011101011110 e6b7a8e8ab96e29497e7aaaaeba09ce6b7a8e8ab96e29497e5a790ebb09be6b7a8e8ab96e29497e7aaaaeba09ce6b7a8e8ab96e29497e5a790ebb0975e
UHC 淨論┗窪렜淨論┗姐받淨論┗窪렜淨論┗姐밗^ 1110111111100100110101101110010110100110101100011110100011000001100011101010111011101111111001001101011011100101101001101011000111101110101110111011100111011110111011111110010011010110111001011010011010110001111010001100000110001110101011101110111111100100110101101110010110100110101100011110111010111011101110011101110001011110 efe4d6e5a6b1e8c18eaeefe4d6e5a6b1eebbb9deefe4d6e5a6b1e8c18eaeefe4d6e5a6b1eebbb9dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)