To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鞳檎┥鮨奇セ募蜀呵繻辟シ鮨奇セ募蜀竸 11101000111000111000110011100111100001001011110011101001101111011000101011101111101111101001010111100101111101001001110011100101100001101001100111101000111000111000110011100111100001001011110011101001101111011000101011101111101111101001010111100101111101001001110011100101100001101001100101011110 e8e38ce784bce9bd8aefbe95e5f49ce58699e8e38ce784bce9bd8aefbe95e5f49ce586995e
EUC-JP 鞳檎┥鮨奇セ募?蜀呵繻辟シ鮨奇セ募?蜀竸 1111000011100101101110001110100110101000101111101111001010111111101101001111000110001110101111101100101011100111001111111110100111100110110100101110101011100101111011001110110111100100100011101011110011110010101111111011010011110001100011101011111011001010111001110011111111101001111001101101000110111111 f0e5b8e9a8bef2bfb4f18ebecae73fe9e6d2eae5ecede48ebcf2bfb4f18ebecae73fe9e6d1bf
UTF-8 鞳檎┥鮨奇セ募蜀呵繻辟シ鮨奇セ募蜀竸 111010011001111010110011111001101010101010001110111000101001010010100101111010011010111010101000111001011010010110000111111011111011110110111110111001011000101110011111111011101000110110001011111010001001110010000000111001011001000110110101111001111011100110111011111010001011111010011111111011111011110110111100111010011010111010101000111001011010010110000111111011111011110110111110111001011000101110011111111011101000110110001011111010001001110010000000111001111010101110111000 e99eb3e6aa8ee294a5e9aea8e5a587efbdbee58b9fee8d8be89c80e591b5e7b9bbe8be9fefbdbce9aea8e5a587efbdbee58b9fee8d8be89c80e7abb8
UHC ?檎┥?奇?募?蜀呵????奇?募?蜀? 0011111111010000110101011010011010111110001111111101000011110100001111111101100110110100001111111111010110111001110010101010011100111111001111110011111100111111110100001111010000111111110110011011010000111111111101011011100100111111 3fd0d5a6be3fd0f43fd9b43ff5b9caa73f3f3f3fd0f43fd9b43ff5b93f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)