What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN 杜?荏nR杜?荏n^[杜?荏nR杜?荏n^[^ 10010011011011010011111110001001011000000110111001010010100100110110110100111111100010010110000001101110010111100101101110010011011011010011111110001001011000000110111001010010100100110110110100111111100010010110000001101110010111100101101101011110 936d3f89606e52936d3f89606e5e5b936d3f89606e52936d3f89606e5e5b5e
EUC-JP 杜?荏nR杜?荏n^[杜?荏nR杜?荏n^[^ 11000101110011100011111110110001110000010110111001010010110001011100111000111111101100011100000101101110010111100101101111000101110011100011111110110001110000010110111001010010110001011100111000111111101100011100000101101110010111100101101101011110 c5ce3fb1c16e52c5ce3fb1c16e5e5bc5ce3fb1c16e52c5ce3fb1c16e5e5b5e
UTF-8 杜룫荏nR杜룫荏n^[杜룫荏nR杜룫荏n^[^ 1110011010011101100111001110101110100011101010111110100010001101100011110110111001010010111001101001110110011100111010111010001110101011111010001000110110001111011011100101111001011011111001101001110110011100111010111010001110101011111010001000110110001111011011100101001011100110100111011001110011101011101000111010101111101000100011011000111101101110010111100101101101011110 e69d9ceba3abe88d8f6e52e69d9ceba3abe88d8f6e5e5be69d9ceba3abe88d8f6e52e69d9ceba3abe88d8f6e5e5b5e
UHC 杜룫荏nR杜룫荏n^[杜룫荏nR杜룫荏n^[^ 1101010011100001100011111010001011101100111110110110111001010010110101001110000110001111101000101110110011111011011011100101111001011011110101001110000110001111101000101110110011111011011011100101001011010100111000011000111110100010111011001111101101101110010111100101101101011110 d4e18fa2ecfb6e52d4e18fa2ecfb6e5e5bd4e18fa2ecfb6e52d4e18fa2ecfb6e5e5b5e
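The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code this page runs. The helper names (`CODECS`, `encode_bits`, `convert`) are hypothetical, and the mapping from the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {name: encode_bits(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("杜n").items():
        print(name, binary, hexadecimal)
```

Each byte is formatted as eight bits so that leading zeros are kept, which is why the binary column is always exactly four times as long as the hexadecimal column.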

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).