To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????z 00111111001111110011111100111111001111110011111100111111001111110011111101111010 3f3f3f3f3f3f3f3f3f7a
SJIS-WIN 曜???ヨ?諛??z 10010111011010100011111100111111001111111000001110001000001111111110011010000111001111110011111101111010 976a3f3f3f83883fe6873f3f7a
EUC-JP 曜??嫄ヨ?諛??z 110011011100101100111111001111111000111110111010101000011010010111101000001111111110101111100111001111110011111101111010 cdcb3f3f8fbaa1a5e83febe73f3f7a
UTF-8 曜섎끋嫄ヨ쯁諛⒲돩z 11100110100110111001110011101100100001001000111011101011100000011000101111100101101010111000010011100011100000111010100011101100101011111000000111101000101010111001101111100010100100101011001011101011100011111010100101111010 e69b9cec848eeb818be5ab84e383a8ecaf81e8ab9be292b2eb8fa97a
UHC 曜섎끋嫄ヨ쯁諛⒲돩z 11101000111110001001100011101011100001011011110111101010101100011010101111101000101010001001110111101011101100001010100111100011100010011010110001111010 e8f898eb85bdeab1abe8a89debb0a9e389ac7a

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively. A character that a charset cannot represent is shown as "?" (byte 3f).
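
A conversion like the one in the table can be sketched in Python as follows. This is only a minimal illustration, not the code behind this page: the helper names `encode_row` and `encode_table` are hypothetical, and the mapping of the table's charset names to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), as in the table, though Python's codecs may not agree with the original tool on every character.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexstr) in encode_table("曜z").items():
        print(name, bits, hexstr)
```

For example, `encode_row("曜z", "utf-8")` yields the hex string `e69b9c7a`, which matches the first and last bytes of the UTF-8 row above.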