To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????}B 0011111100111111001111110011111100111111001111110111110101000010 3f3f3f3f3f3f7d42
SJIS-WIN 澳??曜?ぐ}B 1110000001010011001111110011111110010111011010100011111110000010101011100111110101000010 e0533f3f976a3f82ae7d42
EUC-JP 澳??曜?ぐ}B 1101111110110100001111110011111111001101110010110011111110100100101100000111110101000010 dfb43f3fcdcb3fa4b07d42
UTF-8 澳랃슈曜띈ぐ}B 1110011010111110101100111110101110011110100000111110110010001010100010001110011010011011100111001110101110011101100010001110001110000001100100000111110101000010 e6beb3eb9e83ec8a88e69b9ceb9d88e381907d42
UHC 澳랃슈曜띈ぐ}B 1110011111111110100011011110111110111101101101001110100011111000101101101110100010101010101100000111110101000010 e7fe8defbdb4e8f8b6e8aab07d42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)