To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 賊?鬱頭?製?蹟????除??屯??僥 100100011010111100111111100111110101010010010011101010100011111110010000101110110011111110010000110101100011111100111111001111110011111110001111100111000011111100111111100100111101010000111111001111111001100101000110 91af3f9f5493aa3f90bb3f90d63f3f3f3f8f9c3f3f93d43f3f9946
EUC-JP 賊?鬱頭?製?蹟?汶佾?除??屯??僥 11000010101100010011111111011101101101011100011010101100001111111100000010111101001111111100000011011000001111111000111111000110111001011000111110110000111110110011111110111101111111000011111100111111110001101101011000111111001111111101000110100111 c2b13fddb5c6ac3fc0bd3fc0d83f8fc6e58fb0fb3fbdfc3f3fc6d63f3fd1a7
UTF-8 賊렠鬱頭稶製렩蹟펨汶佾렪除곈렧屯렕렟僥 111010001011001110001010111010111010000010100000111010011010110010110001111010011010000010101101111001111010100010110110111010001010001110111101111010111010000010101001111010001011100110011111111011011000111010101000111001101011000110110110111001001011110110111110111010111010000010101010111010011001100110100100111010101011001110001000111010111010000010100111111001011011000110101111111010111010000010010101111010111010000010011111111001011000001110100101 e8b38aeba0a0e9acb1e9a0ade7a8b6e8a3bdeba0a9e8b99fed8ea8e6b1b6e4bdbeeba0aae999a4eab388eba0a7e5b1afeba095eba09fe583a5
UHC 賊렠鬱頭稶製렩蹟펨汶佾렪除곈렧屯렕렟僥 1110111011100100100011101011000111101010101001101101010011101001111010011111001111110000101100101000111010110111111011101110011111000110111010001101101010100001111011001110101110001110101110001111000010110110101100001110100110001110101101101101010011101010100011101010101010001110101100001110100011101001 eee48eb1eaa6d4e9e9f3f0b28eb7eee7c6e8daa1eceb8eb8f0b6b0e98eb6d4ea8eaa8eb0e8e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)