To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 晤??潁??焉②?巍??晤??潁??焉②?巍??B 1001110111101011001111110011111110011111111100010011111100111111111000001000000110000111010000010011111110011011110110010011111100111111100111011110101100111111001111111001111111110001001111110011111111100000100000011000011101000001001111111001101111011001001111110011111101000010 9deb3f3f9ff13f3fe08187413f9bd93f3f9deb3f3f9ff13f3fe08187413f9bd93f3f42
EUC-JP 晤??潁??焉??巍??晤??潁??焉??巍??B 110110101110110100111111001111111101111011110011001111110011111111011111111000010011111100111111110101101101101100111111001111111101101011101101001111110011111111011110111100110011111100111111110111111110000100111111001111111101011011011011001111110011111101000010 daed3f3fdef33f3fdfe13f3fd6db3f3fdaed3f3fdef33f3fdfe13f3fd6db3f3f42
UTF-8 晤댐쉈潁꿨눎焉②렆巍먨쥥晤댐쉈潁꿨눎焉②렆巍먨쥥B 11100110100110011010010011101011100011001001000011101100100010011000100011100110101111011000000111101010101111111010100011101011100010001000111011100111100001001000100111100010100100011010000111101011101000001000011011100101101101111000110111101011101010001010100011101100101001011010010111100110100110011010010011101011100011001001000011101100100010011000100011100110101111011000000111101010101111111010100011101011100010001000111011100111100001001000100111100010100100011010000111101011101000001000011011100101101101111000110111101011101010001010100011101100101001011010010101000010 e699a4eb8c90ec8988e6bd81eabfa8eb888ee78489e291a1eba086e5b78deba8a8eca5a5e699a4eb8c90ec8988e6bd81eabfa8eb888ee78489e291a1eba086e5b78deba8a8eca5a542
UHC 晤댐쉈潁꿨눎焉②렆巍먨쥥晤댐쉈潁꿨눎焉②렆巍먨쥥B 11100111111110111011010011101111101111011010010111100111101110001011001011100101100001111010101011100101111010101010100011101000100011101010000011101000111001001001000011100101101000101001011111100111111110111011010011101111101111011010010111100111101110001011001011100101100001111010101011100101111010101010100011101000100011101010000011101000111001001001000011100101101000101001011101000010 e7fbb4efbda5e7b8b2e587aae5eaa8e88ea0e8e490e5a297e7fbb4efbda5e7b8b2e587aae5eaa8e88ea0e8e490e5a29742
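
The rows above can be reproduced approximately with a short script. This is only a sketch, not the page's own implementation: the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, as is the choice to replace unencodable characters with `?` (0x3f), which is what the ISO-8859-1 row suggests. The names `CODECS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the
    # charset cannot represent (an assumption about the page's behaviour).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexa) in convert("B").items():
        print(label, binary, hexa)
```

For the ASCII letter `B`, every charset in the table yields the same single byte, `01000010` (hex `42`), which matches the last byte of each row above.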

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).