To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 鳶??癌?R鳶??癌?^[鳶??癌?R鳶??癌?^[^ 1001001111001110001111110011111110001010111000000011111101010010100100111100111000111111001111111000101011100000001111110101111001011011100100111100111000111111001111111000101011100000001111110101001010010011110011100011111100111111100010101110000000111111010111100101101101011110 93ce3f3f8ae03f5293ce3f3f8ae03f5e5b93ce3f3f8ae03f5293ce3f3f8ae03f5e5b5e
EUC-JP 鳶??癌?R鳶??癌?^[鳶??癌?R鳶??癌?^[^ 1100011011010000001111110011111110110100111000100011111101010010110001101101000000111111001111111011010011100010001111110101111001011011110001101101000000111111001111111011010011100010001111110101001011000110110100000011111100111111101101001110001000111111010111100101101101011110 c6d03f3fb4e23f52c6d03f3fb4e23f5e5bc6d03f3fb4e23f52c6d03f3fb4e23f5e5b5e
UTF-8 鳶멱퀕癌큡R鳶멱퀕癌큡^[鳶멱퀕癌큡R鳶멱퀕癌큡^[^ 11101001101100111011011011101011101010011011000111101101100000001001010111100111100110011000110011101101100000011010000101010010111010011011001110110110111010111010100110110001111011011000000010010101111001111001100110001100111011011000000110100001010111100101101111101001101100111011011011101011101010011011000111101101100000001001010111100111100110011000110011101101100000011010000101010010111010011011001110110110111010111010100110110001111011011000000010010101111001111001100110001100111011011000000110100001010111100101101101011110 e9b3b6eba9b1ed8095e7998ced81a152e9b3b6eba9b1ed8095e7998ced81a15e5be9b3b6eba9b1ed8095e7998ced81a152e9b3b6eba9b1ed8095e7998ced81a15e5b5e
UHC 鳶멱퀕癌큡R鳶멱퀕癌큡^[鳶멱퀕癌큡R鳶멱퀕癌큡^[^ 1110011011101001101110001110100010110011100010101110010011011111101101000110111001010010111001101110100110111000111010001011001110001010111001001101111110110100011011100101111001011011111001101110100110111000111010001011001110001010111001001101111110110100011011100101001011100110111010011011100011101000101100111000101011100100110111111011010001101110010111100101101101011110 e6e9b8e8b38ae4dfb46e52e6e9b8e8b38ae4dfb46e5e5be6e9b8e8b38ae4dfb46e52e6e9b8e8b38ae4dfb46e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)