To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 艾??竊??恂κ????毓??揄??午???? 111001001000100000111111001111111110001010000110001111110011111110011100100101101000001111001000001111110011111100111111001111111001111101111001001111110011111110011101100010010011111100111111100011001101111100111111001111110011111100111111 e4883f3fe2863f3f9c9683c83f3f3f3f9f793f3f9d893f3f8cdf3f3f3f3f
EUC-JP 艾??竊??恂κ????毓??揄??午??堉? 1110011111101000001111110011111111100011111001100011111100111111110101111111011010100110110010100011111100111111001111110011111111011101110110100011111100111111110110011110100100111111001111111011100011100001001111110011111110001111101101111111110100111111 e7e83f3fe3e63f3fd7f6a6ca3f3f3f3fddda3f3fd9e93f3fb8e13f3f8fb7fd3f
UTF-8 艾쎈끏竊뺝킊恂κ돌曆뱀뭼毓양뙴揄쒕츪午댁꼮堉볿 1110100010001001101111101110110010001110100010001110101110000001100011111110011110101011100010101110101110111010100111011110110110000010100010101110011010000001100000101100111010111010111010111000111110001100111011111010011010001011111010111011000110000000111010111010110110111100111001101010111110010011111011001001011010010001111010111001100110110100111001101000111110000100111011001001001010010101111011001011100010101010111001011000110110001000111010111000110010000001111010101011110010101110111001011010000010001001111010111011001110111111 e889beec8e88eb818fe7ab8aebba9ded828ae68182cebaeb8f8cefa68bebb180ebadbce6af93ec9691eb99b4e68f84ec9295ecb8aae58d88eb8c81eabcaee5a089ebb3bf
UHC 艾쎈끏竊뺝킊恂κ돌曆뱀뭼毓양뙴揄쒕츪午댁꼮堉볿 11100100111101011011110111101011100001011011111111101111101111001001010111100101101101001001011011100010111000011010010111101010101101011011100111100110101101111011100111101100100100101000101111101011101111101011111011100111100011001011011111101010111100011001110011101011101011101001111111100111111011011011010011101100100001001000100111101011101111001001010001000010 e4f5bdeb85bfefbc95e5b496e2e1a5eab5b9e6b7b9ec928bebbebee78cb7eaf19cebae9fe7edb4ec8489ebbc9442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)