What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 晤??潁??占θ?巍??晤??潁??占θ?巍??B 1001110111101011001111110011111110011111111100010011111100111111100100001110100010000011110001100011111110011011110110010011111100111111100111011110101100111111001111111001111111110001001111110011111110010000111010001000001111000110001111111001101111011001001111110011111101000010 9deb3f3f9ff13f3f90e883c63f9bd93f3f9deb3f3f9ff13f3f90e883c63f9bd93f3f42
EUC-JP 晤??潁??占θ?巍??晤??潁??占θ?巍??B 1101101011101101001111110011111111011110111100110011111100111111110000001110101010100110110010000011111111010110110110110011111100111111110110101110110100111111001111111101111011110011001111110011111111000000111010101010011011001000001111111101011011011011001111110011111101000010 daed3f3fdef33f3fc0eaa6c83fd6db3f3fdaed3f3fdef33f3fc0eaa6c83fd6db3f3f42
UTF-8 晤댐쉈潁꿰빓占θ렆巍먥뵶晤댐쉈潁꿰빓占θ렆巍먥뵶B 1110011010011001101001001110101110001100100100001110110010001001100010001110011010111101100000011110101010111111101100001110101110111001100100111110010110001101101000001100111010111000111010111010000010000110111001011011011110001101111010111010100010100101111010111011010110110110111001101001100110100100111010111000110010010000111011001000100110001000111001101011110110000001111010101011111110110000111010111011100110010011111001011000110110100000110011101011100011101011101000001000011011100101101101111000110111101011101010001010010111101011101101011011011001000010 e699a4eb8c90ec8988e6bd81eabfb0ebb993e58da0ceb8eba086e5b78deba8a5ebb5b6e699a4eb8c90ec8988e6bd81eabfb0ebb993e58da0ceb8eba086e5b78deba8a5ebb5b642
UHC 晤댐쉈潁꿰빓占θ렆巍먥뵶晤댐쉈潁꿰빓占θ렆巍먥뵶B 11100111111110111011010011101111101111011010010111100111101110001011001011100111100101011011011111101111101111111010010111101000100011101010000011101000111001001001000011100010100101001011010011100111111110111011010011101111101111011010010111100111101110001011001011100111100101011011011111101111101111111010010111101000100011101010000011101000111001001001000011100010100101001011010001000010 e7fbb4efbda5e7b8b2e795b7efbfa5e88ea0e8e490e294b4e7fbb4efbda5e7b8b2e795b7efbfa5e88ea0e8e490e294b442

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
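
The same kind of table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which is how the 3f bytes in the table arise.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# Python's codecs may differ in edge cases from the converter's own tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("B").items():
        print(label, bits, hexstr)
```

For example, `encode_row("θ", "cp932")` yields the hex string `83c6`, the same byte pair that appears in the SJIS-WIN row above.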