To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???[???[^ 001111110011111100111111010110110011111100111111001111110101101101011110 3f3f3f5b3f3f3f5b5e
SJIS-WIN 谷村遜[谷村遜[^ 100100100100101010010001101110101001000110111011010110111001001001001010100100011011101010010001101110110101101101011110 924a91ba91bb5b924a91ba91bb5b5e
EUC-JP 谷村遜[谷村遜[^ 110000111010101111000010101111001100001010111101010110111100001110101011110000101011110011000010101111010101101101011110 c3abc2bcc2bd5bc3abc2bcc2bd5b5e
UTF-8 谷村遜[谷村遜[^ 111010001011000010110111111001101001110110010001111010011000000110011100010110111110100010110000101101111110011010011101100100011110100110000001100111000101101101011110 e8b0b7e69d91e9819c5be8b0b7e69d91e9819c5b5e
UHC 谷村遜[谷村遜[^ 110011011101101111110101101111011110000111100001010110111100110111011011111101011011110111100001111000010101101101011110 cddbf5bde1e15bcddbf5bde1e15b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)