What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 澳??徇??慂?? 111000000101001100111111001111111001110001101101001111110011111110011100110010000011111100111111 e0533f3f9c6d3f3f9cc83f3f
EUC-JP 澳??徇??慂?? 110111111011010000111111001111111101011111001110001111110011111111011000110010100011111100111111 dfb43f3fd7ce3f3fd8ca3f3f
UTF-8 澳뉔숱徇먫젃慂㏂걧 111001101011111010110011111010111000100110010100111011001000100010110001111001011011111010000111111010111010100010101011111011001010000010000011111001101000010110000010111000111000111110000010111010101011000110100111 e6beb3eb8994ec88b1e5be87eba8abeca083e68582e38f82eab1a7
UHC 澳뉔숱徇먫젃慂㏂걧 111001111111111010000111111010011011110110100010111000101101111110010000111010001010000010000111111010011011110110100010111000111000000110010000 e7fe87e9bda2e2df90e8a087e9bda2e38190

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (byte 3f) in the table above.
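
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this tool's actual implementation: the function names `encode_to_bits` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?", which is assumed to match the behavior seen in the table; results for individual characters may still differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Encode text with the given codec and return (hex string, bit string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return raw.hex(), bits


def convert(text):
    """Return {charset label: (hex string, bit string)} for every charset."""
    return {label: encode_to_bits(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (hex_str, bits) in convert("澳").items():
        print(label, bits, hex_str)
```

Each byte is rendered as exactly eight binary digits, so the bit string is always eight times the byte count and four times the length of the hexadecimal string.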