To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????k??????? 0011111100111111001111110011111100111111001111110110101100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f6b3f3f3f3f3f3f3f
SJIS-WIN 蕭h音鐔緒秀k蕭h音鐔緒秀薔 111001010100101010000010100010001000100110111001111010000101110010001111100011111000111101000111011010111110010101001010100000101000100010001001101110011110100001011100100011111000111110001111010001111110010101001011 e54a828889b9e85c8f8f8f476be54a828889b9e85c8f8f8f47e54b
EUC-JP 蕭h音鐔緒秀k蕭h音鐔緒秀薔 111010011010101110100011111010001011001010111011111011111011110110111101111011111011110110101000011010111110100110101011101000111110100010110010101110111110111110111101101111011110111110111101101010001110100110101100 e9aba3e8b2bbefbdbdefbda86be9aba3e8b2bbefbdbdefbda8e9ac
UTF-8 蕭h音鐔緒秀k蕭h音鐔緒秀薔 11101000100101011010110111101111101111011000100011101001100111111011001111101001100100001001010011100111101101111001001011100111101001111000000001101011111010001001010110101101111011111011110110001000111010011001111110110011111010011001000010010100111001111011011110010010111001111010011110000000111010001001011010010100 e895adefbd88e99fb3e99094e7b792e7a7806be895adefbd88e99fb3e99094e7b792e7a780e89694
UHC 蕭h音??秀k蕭h音??秀薔 1110000111001011101000111110100011101011111001010011111100111111111000101011001101101011111000011100101110100011111010001110101111100101001111110011111111100010101100111110110111111001 e1cba3e8ebe53f3fe2b36be1cba3e8ebe53f3fe2b3edf9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)