To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN 膣遺墾膣遺墾^ 11100100010100111000100011100010100011011010010011100100010100111000100011100010100011011010010001011110 e45388e28da4e45388e28da45e
EUC-JP 膣遺墾膣遺墾^ 11100111101101001011000011100100101110101010011011100111101101001011000011100100101110101010011001011110 e7b4b0e4baa6e7b4b0e4baa65e
UTF-8 膣遺墾膣遺墾^ 11101000100001101010001111101001100000011011101011100101101000101011111011101000100001101010001111101001100000011011101011100101101000101011111001011110 e886a3e981bae5a2bee886a3e981bae5a2be5e
UHC 膣遺墾膣遺墾^ 11110010111100111110101110110110110010101100101111110010111100111110101110110110110010101100101101011110 f2f3ebb6cacbf2f3ebb6cacb5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)