To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ›hãÀz›hãÀzB 1001101101101000111000111100000001111010100110110110100011100011110000000111101001000010 9b68e3c07a9b68e3c07a42
SJIS-WIN ?h??z?h??zB 0011111101101000001111110011111101111010001111110110100000111111001111110111101001000010 3f683f3f7a3f683f3f7a42
EUC-JP ?hãÀz?hãÀzB 00111111011010001000111110101011101010101000111110101010101000100111101000111111011010001000111110101011101010101000111110101010101000100111101001000010 3f688fabaa8faaa27a3f688fabaa8faaa27a42
UTF-8 ›hãÀz›hãÀzB 1100001010011011011010001100001110100011110000111000000001111010110000101001101101101000110000111010001111000011100000000111101001000010 c29b68c3a3c3807ac29b68c3a3c3807a42
UHC ?h??z?h??zB 0011111101101000001111110011111101111010001111110110100000111111001111110111101001000010 3f683f3f7a3f683f3f7a42

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
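
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name `to_bits_and_hex` is hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters a charset cannot represent are replaced with "?" (0x3f), as in the table, but Python's codecs may not agree with this page on exactly which characters are unrepresentable.

```python
def to_bits_and_hex(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters
    # that the target charset cannot represent.
    data = text.encode(charset, errors="replace")
    # Render every byte as 8 binary digits, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    sample = "h\u00e3\u00c0z"  # the characters h, ã, À, z
    # Assumed correspondence between the table's labels and Python codec names.
    charsets = [
        ("ISO-8859-1", "iso-8859-1"),
        ("SJIS-WIN", "cp932"),
        ("EUC-JP", "euc_jp"),
        ("UTF-8", "utf-8"),
        ("UHC", "cp949"),
    ]
    for label, codec in charsets:
        bits, hex_string = to_bits_and_hex(sample, codec)
        print(label, bits, hex_string)
```

For example, `to_bits_and_hex("h\u00e3\u00c0z", "utf-8")` yields the hexadecimal string `68c3a3c3807a`, which matches the corresponding bytes in the UTF-8 row above.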