Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 踵門嶋鮓コ逵ゥ踐∫ゥゥ踵門嶋鮓コ逵ゥ踐∫命 111001101111100110010110111001011001001110001000111010011011011010111010111001111001110010101001111001101111011010000001111001111010100110101001111001101111100110010110111001011001001110001000111010011011011010111010111001111001110010101001111001101111011010000001111001111001011010111101 e6f996e59388e9b6bae79ca9e6f681e7a9a9e6f996e59388e9b6bae79ca9e6f681e796bd
EUC-JP 踵門嶋鮓コ逵ゥ踐∫ゥゥ踵門嶋鮓コ逵ゥ踐∫命 111011001111101111001100111001111100010111101000111100101011100010001110101110101110110111111100100011101010100111101100111110001010001011101001100011101010100110001110101010011110110011111011110011001110011111000101111010001111001010111000100011101011101011101101111111001000111010101001111011001111100010100010111010011100110010111111 ecfbcce7c5e8f2b88ebaedfc8ea9ecf8a2e98ea98ea9ecfbcce7c5e8f2b88ebaedfc8ea9ecf8a2e9ccbf
UTF-8 踵門嶋鮓コ逵ゥ踐∫ゥゥ踵門嶋鮓コ逵ゥ踐∫命 111010001011100010110101111010011001011010000000111001011011011010001011111010011010111010010011111011111011110110111010111010011000000010110101111011111011110110101001111010001011100010010000111000101000100010101011111011111011110110101001111011111011110110101001111010001011100010110101111010011001011010000000111001011011011010001011111010011010111010010011111011111011110110111010111010011000000010110101111011111011110110101001111010001011100010010000111000101000100010101011111001011001000110111101 e8b8b5e99680e5b68be9ae93efbdbae980b5efbda9e8b890e288abefbda9efbda9e8b8b5e99680e5b68be9ae93efbdbae980b5efbda9e8b890e288abe591bd
UHC 踵門嶋??逵?踐∫??踵門嶋??逵?踐∫命 11110001101000101101101010100110110100111111011100111111001111111101000010110000001111111111010011000010101000011111001000111111001111111111000110100010110110101010011011010011111101110011111100111111110100001011000000111111111101001100001010100001111100101101100110100100 f1a2daa6d3f73f3fd0b03ff4c2a1f23f3ff1a2daa6d3f73f3fd0b03ff4c2a1f2d9a4

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
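
For readers who want to reproduce a row of the table offline, the following is a minimal sketch in Python. It is not the converter behind this page. The helper name encode_row and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


# Example: the last character of the input shown in the table.
for label, codec in CODECS.items():
    bits, hexed = encode_row("命", codec)
    print(label, bits, hexed)
```

Python's codecs may differ from the page's converter for a few vendor-specific characters, so results for unusual input may not match the table byte for byte.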