To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?企?障?鬱豆?訂?障?鬱淡 00111111100010101110100100111111100011111110000100111111100111110101010010010011101001000011111110010010111110010011111110001111111000010011111110011111010101001001001001010111 3f8ae93f8fe13f9f5493a43f92f93f8fe13f9f549257
EUC-JP 勖企?障?鬱豆?訂?障?鬱淡 100011111011001111101101101101001110101100111111101111101110001100111111110111011011010111000110101001100011111111000100111110110011111110111110111000110011111111011101101101011100001110111000 8fb3edb4eb3fbee33fddb5c6a63fc4fb3fbee33fddb5c3b8
UTF-8 勖企횅障렚鬱豆번訂렦障렚鬱淡 111001011000101110010110111001001011110010000001111011011001101010000101111010011001101010011100111010111010000010011010111010011010110010110001111010001011000110000110111010111011001010001000111010001010100010000010111010111010000010100110111010011001101010011100111010111010000010011010111010011010110010110001111001101011011110100001 e58b96e4bc81ed9a85e99a9ceba09ae9acb1e8b186ebb288e8a882eba0a6e99a9ceba09ae9acb1e6b7a1
UHC 勖企횅障렚鬱豆번訂렦障렚鬱淡 11101001111011011101000011101010110010001011011111101110101000011000111010101101111010101010011011010100111001111011100111111000111011111111010010001110101101011110111010100001100011101010110111101010101001101101001110111111 e9edd0eac8b7eea18eadeaa6d4e7b9f8eff48eb5eea18eadeaa6d3bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)