To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????}???????????{^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111110100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 賊?鬱頭?????垣龜}賊?鬱頭?????垣龜{^ 1001000110101111001111111001111101010100100100111010101000111111001111110011111100111111001111111000101001011111111010101001110101111101100100011010111100111111100111110101010010010011101010100011111100111111001111110011111100111111100010100101111111101010100111010111101101011110 91af3f9f5493aa3f3f3f3f3f8a5fea9d7d91af3f9f5493aa3f3f3f3f3f8a5fea9d7b5e
EUC-JP 賊?鬱頭?????垣龜}賊?鬱頭?????垣龜{^ 1100001010110001001111111101110110110101110001101010110000111111001111110011111100111111001111111011001111000000111100111111110101111101110000101011000100111111110111011011010111000110101011000011111100111111001111110011111100111111101100111100000011110011111111010111101101011110 c2b13fddb5c6ac3f3f3f3f3fb3c0f3fd7dc2b13fddb5c6ac3f3f3f3f3fb3c0f3fd7b5e
UTF-8 賊렠鬱頭稶欌렪罹렗垣龜}賊렠鬱頭稶欌렪罹렗垣龜{^ 111010001011001110001010111010111010000010100000111010011010110010110001111010011010000010101101111001111010100010110110111001101010110010001100111010111010000010101010111011111010011110100110111010111010000010010111111001011001111010100011111010011011111010011100011111011110100010110011100010101110101110100000101000001110100110101100101100011110100110100000101011011110011110101000101101101110011010101100100011001110101110100000101010101110111110100111101001101110101110100000100101111110010110011110101000111110100110111110100111000111101101011110 e8b38aeba0a0e9acb1e9a0ade7a8b6e6ac8ceba0aaefa7a6eba097e59ea3e9be9c7de8b38aeba0a0e9acb1e9a0ade7a8b6e6ac8ceba0aaefa7a6eba097e59ea3e9be9c7b5e
UHC 賊렠鬱頭稶欌렪罹렗垣龜}賊렠鬱頭稶欌렪罹렗垣龜{^ 1110111011100100100011101011000111101010101001101101010011101001111010011111001111101101111010111000111010111000111011001011101010001110101011001110101010101111110011111100111101111101111011101110010010001110101100011110101010100110110101001110100111101001111100111110110111101011100011101011100011101100101110101000111010101100111010101010111111001111110011110111101101011110 eee48eb1eaa6d4e9e9f3edeb8eb8ecba8eaceaafcfcf7deee48eb1eaa6d4e9e9f3edeb8eb8ecba8eaceaafcfcf7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
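
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to correspond to the charsets listed above, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to match the `3f` runs in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # CP932, the Windows variant of Shift_JIS
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code, i.e. CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset above; keys are the charset labels."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in encode_table("あ").items():
        print(name, binary, hexadecimal)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column.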