Character and Charcode - Check how computer recognize characters

To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."

Charset	Character	Bit string (binary)	Bit String (hexadecimal)
ISO-8859-1	??????M}??????M{^	0011111100111111001111110011111100111111001111110100110101111101001111110011111100111111001111110011111100111111010011010111101101011110	3f3f3f3f3f3f4d7d3f3f3f3f3f3f4d7b5e
SJIS-WIN	癌先昻善洩腺M}癌先昻善洩腺M{^	1000101011100000100100001110011011111010110100001001000101010000100010010110101110010001010000100100110101111101100010101110000010010000111001101111101011010000100100010101000010001001011010111001000101000010010011010111101101011110	8ae090e6fad09150896b91424d7d8ae090e6fad09150896b91424d7b5e
EUC-JP	癌先?善洩腺M}癌先?善洩腺M{^	101101001110001011000000111010000011111111000001101100011011000111001100110000011010001101001101011111011011010011100010110000001110100000111111110000011011000110110001110011001100000110100011010011010111101101011110	b4e2c0e83fc1b1b1ccc1a34d7db4e2c0e83fc1b1b1ccc1a34d7b5e
UTF-8	癌先昻善洩腺M}癌先昻善洩腺M{^	1110011110011001100011001110010110000101100010001110011010011000101110111110010110010110100001001110011010110100101010011110100010000101101110100100110101111101111001111001100110001100111001011000010110001000111001101001100010111011111001011001011010000100111001101011010010101001111010001000010110111010010011010111101101011110	e7998ce58588e698bbe59684e6b4a9e885ba4d7de7998ce58588e698bbe59684e6b4a9e885ba4d7b5e
UHC	癌先昻善洩腺M}癌先昻善洩腺M{^	1110010011011111111000001011101111100100111010011110000010111100111000001101110111100000110011010100110101111101111001001101111111100000101110111110010011101001111000001011110011100000110111011110000011001101010011010111101101011110	e4dfe0bbe4e9e0bce0dde0cd4d7de4dfe0bbe4e9e0bce0dde0cd4d7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)