What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 讌疲垰阮帶・疲垰阮孺讌疲垰阮帶・疲垰阮學^ 111001101010010110010100111001101001101010111001111010001001011010011011111001101010010110010100111001101001101010111001111010001001011010011011011111011110011010100101100101001110011010011010101110011110100010010110100110111110011010100101100101001110011010011010101110011110100010010110100110110111101101011110 e6a594e69ab9e8969be6a594e69ab9e8969b7de6a594e69ab9e8969be6a594e69ab9e8969b7b5e
EUC-JP 讌疲垰阮帶・疲垰阮孺讌疲垰阮帶・疲垰阮學^ 1110110010100111110010001110100011010100101110111110111111110110110101101110100010001110101001011100100011101000110101001011101111101111111101101101010111011110111011001010011111001000111010001101010010111011111011111111011011010110111010001000111010100101110010001110100011010100101110111110111111110110110101011101110001011110 eca7c8e8d4bbeff6d6e88ea5c8e8d4bbeff6d5deeca7c8e8d4bbeff6d6e88ea5c8e8d4bbeff6d5dc5e
UTF-8 讌疲垰阮帶・疲垰阮孺讌疲垰阮帶・疲垰阮學^ 11101000101011101000110011100111100101101011001011100101100111101011000011101001100110001010111011100101101110001011011011101111101111011010010111100111100101101011001011100101100111101011000011101001100110001010111011100101101011011011101011101000101011101000110011100111100101101011001011100101100111101011000011101001100110001010111011100101101110001011011011101111101111011010010111100111100101101011001011100101100111101011000011101001100110001010111011100101101011011011100001011110 e8ae8ce796b2e59eb0e998aee5b8b6efbda5e796b2e59eb0e998aee5adbae8ae8ce796b2e59eb0e998aee5b8b6efbda5e796b2e59eb0e998aee5adb85e
UHC ?疲?阮帶?疲?阮孺?疲?阮帶?疲?阮學^ 001111111111100110101010001111111110100011010110110100111110000100111111111110011010101000111111111010001101011011101010111010000011111111111001101010100011111111101000110101101101001111100001001111111111100110101010001111111110100011010110111110011100101001011110 3ff9aa3fe8d6d3e13ff9aa3fe8d6eae83ff9aa3fe8d6d3e13ff9aa3fe8d6f9ca5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
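
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_report` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex 3f), which resembles the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return (charset, binary bit string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    # '^' is plain ASCII, so every charset yields 01011110 / 5e.
    for label, bits, hex_string in encode_report("^"):
        print(label, bits, hex_string)
```

Passing a Japanese character such as "あ" instead of "^" shows how the byte sequences diverge between charsets, while ISO-8859-1 falls back to `3f`.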