To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????W}??????W{^ 0011111100111111001111110011111100111111001111110101011101111101001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f577d3f3f3f3f3f3f577b5e
SJIS-WIN 闌イ鬧育ァ杵W}闌イ鬧育ァ杵W{^ 11101000100011001011001011101001101001111000100011100111101001111000101101101110010101110111110111101000100011001011001011101001101001111000100011100111101001111000101101101110010101110111101101011110 e88cb2e9a788e7a78b6e577de88cb2e9a788e7a78b6e577b5e
EUC-JP 闌イ鬧育ァ杵W}闌イ鬧育ァ杵W{^ 1110111111101100100011101011001011110010101010011011000011101001100011101010011110110101110011110101011101111101111011111110110010001110101100101111001010101001101100001110100110001110101001111011010111001111010101110111101101011110 efec8eb2f2a9b0e98ea7b5cf577defec8eb2f2a9b0e98ea7b5cf577b5e
UTF-8 闌イ鬧育ァ杵W}闌イ鬧育ァ杵W{^ 1110100110010111100011001110111110111101101100101110100110101100101001111110100010000010101100101110111110111101101001111110011010011101101101010101011101111101111010011001011110001100111011111011110110110010111010011010110010100111111010001000001010110010111011111011110110100111111001101001110110110101010101110111101101011110 e9978cefbdb2e9aca7e882b2efbda7e69db5577de9978cefbdb2e9aca7e882b2efbda7e69db5577b5e
UHC ??鬧育?杵W}??鬧育?杵W{^ 0011111100111111110101111010001011101011110000000011111111101110101111100101011101111101001111110011111111010111101000101110101111000000001111111110111010111110010101110111101101011110 3f3fd7a2ebc03feebe577d3f3fd7a2ebc03feebe577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)