What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遑晉キ包スイ訷ゐ┨遑晉キ包スイ訷ゐ┨^ 11100111101000011001110111100111101101111001010111101111101111011011001011111011101001001000001011101110100001001011011111100111101000011001110111100111101101111001010111101111101111011011001011111011101001001000001011101110100001001011011101011110 e7a19de7b795efbdb2fba482ee84b7e7a19de7b795efbdb2fba482ee84b75e
EUC-JP 遑晉キ包スイ訷ゐ┨遑晉キ包スイ訷ゐ┨^ 111011101010001111011010111010011000111010110111110010101111000110001110101111011000111010110010100011111101110111010100101001001111000010101000101110011110111010100011110110101110100110001110101101111100101011110001100011101011110110001110101100101000111111011101110101001010010011110000101010001011100101011110 eea3dae98eb7caf18ebd8eb28fddd4a4f0a8b9eea3dae98eb7caf18ebd8eb28fddd4a4f0a8b95e
UTF-8 遑晉キ包スイ訷ゐ┨遑晉キ包スイ訷ゐ┨^ 11101001100000011001000111100110100110011000100111101111101111011011011111100101100011001000010111101111101111011011110111101111101111011011001011101000101010001011011111100011100000101001000011100010100101001010100011101001100000011001000111100110100110011000100111101111101111011011011111100101100011001000010111101111101111011011110111101111101111011011001011101000101010001011011111100011100000101001000011100010100101001010100001011110 e98191e69989efbdb7e58c85efbdbdefbdb2e8a8b7e38290e294a8e98191e69989efbdb7e58c85efbdbdefbdb2e8a8b7e38290e294a85e
UHC 遑晉?包???ゐ┨遑晉?包???ゐ┨^ 1111110011011010111100101100101100111111111110001101000000111111001111110011111110101010111100001010011010111001111111001101101011110010110010110011111111111000110100000011111100111111001111111010101011110000101001101011100101011110 fcdaf2cb3ff8d03f3f3faaf0a6b9fcdaf2cb3ff8d03f3f3faaf0a6b95e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
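
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names in CODECS (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions about the closest Python equivalents, and the helper names to_bit_strings and convert are hypothetical. Characters a charset cannot represent are replaced with "?" (3f), which mirrors the ISO-8859-1 row above, but the exact bytes for some characters may differ from this page's output.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are the nearest standard-library codecs, not necessarily identical
# to the converter's own implementations.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ^").items():
        print(label, binary, hexadecimal)
```

For the ASCII character "^", every codec listed yields 01011110 (5e), which matches the final byte of each row in the table.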