To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 銹?????頻銹?語銹?????頻銹??^ 11100111111101100011111100111111001111110011111100111111100101010111000011100111111101100011111110001100111010101110011111110110001111110011111100111111001111110011111110010101011100001110011111110110001111110011111101011110 e7f63f3f3f3f3f9570e7f63f8ceae7f63f3f3f3f3f9570e7f63f3f5e
EUC-JP 銹?????頻銹?語銹?????頻銹?瘀^ 111011101111100000111111001111110011111100111111001111111100100111010001111011101111100000111111101110001110110011101110111110000011111100111111001111110011111100111111110010011101000111101110111110000011111110001111110011011110001101011110 eef83f3f3f3f3fc9d1eef83fb8eceef83f3f3f3f3fc9d1eef83f8fcde35e
UTF-8 銹뤒뷜휸년혓頻銹롐語銹뤒뷜휸년혓頻銹롐瘀^ 11101001100010101011100111101011101001001001001011101011101101111001110011101101100111001011100011101011100001011000010011101101100110001001001111101001101000001011101111101001100010101011100111101011101000011001000011101000101010101001111011101001100010101011100111101011101001001001001011101011101101111001110011101101100111001011100011101011100001011000010011101101100110001001001111101001101000001011101111101001100010101011100111101011101000011001000011100111100110001000000001011110 e98ab9eba492ebb79ced9cb8eb8584ed9893e9a0bbe98ab9eba190e8aa9ee98ab9eba492ebb79ced9cb8eb8584ed9893e9a0bbe98ab9eba190e798805e
UHC 銹뤒뷜휸년혓頻銹롐語銹뤒뷜휸년혓頻銹롐瘀^ 1110001011001000100011111100001010111010111000101100100011100000101100111110001011000111111110101101111010111010111000101100100010001110110101101110010111011110111000101100100010001111110000101011101011100010110010001110000010110011111000101100011111111010110111101011101011100010110010001000111011010110111001011101110001011110 e2c88fc2bae2c8e0b3e2c7fadebae2c88ed6e5dee2c88fc2bae2c8e0b3e2c7fadebae2c88ed6e5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)