What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 逵ク?讌逶豸辷貍W}逵ク?讌逶豸辷貍W{^ 111001111001110010111000001111111110011010100101111001111001101111100110101101101110011110001000111001101011110001010111011111011110011110011100101110000011111111100110101001011110011110011011111001101011011011100111100010001110011010111100010101110111101101011110 e79cb83fe6a5e79be6b6e788e6bc577de79cb83fe6a5e79be6b6e788e6bc577b5e
EUC-JP 逵ク訵讌逶豸辷貍W}逵ク訵讌逶豸辷貍W{^ 111011011111110010001110101110001000111111011101110100111110110010100111111011011111101111101100101110001110110111101000111011001011111001010111011111011110110111111100100011101011100010001111110111011101001111101100101001111110110111111011111011001011100011101101111010001110110010111110010101110111101101011110 edfc8eb88fddd3eca7edfbecb8ede8ecbe577dedfc8eb88fddd3eca7edfbecb8ede8ecbe577b5e
UTF-8 逵ク訵讌逶豸辷貍W}逵ク訵讌逶豸辷貍W{^ 1110100110000000101101011110111110111101101110001110100010101000101101011110100010101110100011001110100110000000101101101110100010110001101110001110100010111110101101111110100010110010100011010101011101111101111010011000000010110101111011111011110110111000111010001010100010110101111010001010111010001100111010011000000010110110111010001011000110111000111010001011111010110111111010001011001010001101010101110111101101011110 e980b5efbdb8e8a8b5e8ae8ce980b6e8b1b8e8beb7e8b28d577de980b5efbdb8e8a8b5e8ae8ce980b6e8b1b8e8beb7e8b28d577b5e
UHC 逵???????W}逵???????W{^ 1101000010110000001111110011111100111111001111110011111100111111001111110101011101111101110100001011000000111111001111110011111100111111001111110011111100111111010101110111101101011110 d0b03f3f3f3f3f3f3f577dd0b03f3f3f3f3f3f3f577b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
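
The rows in the table above can be approximated with a short script. This is a minimal sketch, not the converter's actual implementation. The Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed stand-ins for the charset labels, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# The original tool may use a different library with slightly different tables.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS, keyed by label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For example, `encode_row("あ", "utf-8")` yields the hex string `e38182`, while the same character in `cp932` yields `82a0`, showing how one character maps to different byte sequences depending on the charset.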