Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????WD????????WD^ 001111110011111100111111001111110011111100111111001111110011111101010111010001000011111100111111001111110011111100111111001111110011111100111111010101110100010001011110 3f3f3f3f3f3f3f3f57443f3f3f3f3f3f3f3f57445e
SJIS-WIN 隘搾スヲ鬲假スォWD隘搾スヲ鬲假スォWD^ 1110100010100101100011011110111110111101101001101110100110101101100110001110111110111101101010110101011101000100111010001010010110001101111011111011110110100110111010011010110110011000111011111011110110101011010101110100010001011110 e8a58defbda6e9ad98efbdab5744e8a58defbda6e9ad98efbdab57445e
EUC-JP 隘搾スヲ鬲假スォWD隘搾スヲ鬲假スォWD^ 11110000101001111011101011110001100011101011110110001110101001101111001010101111110100001111000110001110101111011000111010101011010101110100010011110000101001111011101011110001100011101011110110001110101001101111001010101111110100001111000110001110101111011000111010101011010101110100010001011110 f0a7baf18ebd8ea6f2afd0f18ebd8eab5744f0a7baf18ebd8ea6f2afd0f18ebd8eab57445e
UTF-8 隘搾スヲ鬲假スォWD隘搾スヲ鬲假スォWD^ 1110100110011010100110001110011010010000101111101110111110111101101111011110111110111101101001101110100110101100101100101110010110000001100001111110111110111101101111011110111110111101101010110101011101000100111010011001101010011000111001101001000010111110111011111011110110111101111011111011110110100110111010011010110010110010111001011000000110000111111011111011110110111101111011111011110110101011010101110100010001011110 e99a98e690beefbdbdefbda6e9acb2e58187efbdbdefbdab5744e99a98e690beefbdbdefbda6e9acb2e58187efbdbdefbdab57445e
UHC 隘搾???假??WD隘搾???假??WD^ 111001001111011011110011101101100011111100111111001111111100101010100011001111110011111101010111010001001110010011110110111100111011011000111111001111110011111111001010101000110011111100111111010101110100010001011110 e4f6f3b63f3f3fcaa33f3f5744e4f6f3b63f3f3fcaa33f3f57445e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
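
The conversion shown in the table can be approximated with a short Python sketch. It is not this tool's implementation. The function name `bit_strings` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration, and Python's codecs may differ from the tool's for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which is how the ISO-8859-1 row above behaves.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Each byte becomes exactly eight binary digits.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (binary, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("WD^").items():
        print(label, binary, hexadecimal)
```

For the ASCII input `WD^`, every charset in this sketch yields the hexadecimal string `57445e`, which matches the last three bytes of every row in the table.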