What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????\}?????????\{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101110001111101001111110011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 昻??徇??徇??\}昻??徇??徇??\{^ 1111101011010000001111110011111110011100011011010011111100111111100111000110110100111111001111110101110001111101111110101101000000111111001111111001110001101101001111110011111110011100011011010011111100111111010111000111101101011110 fad03f3f9c6d3f3f9c6d3f3f5c7dfad03f3f9c6d3f3f9c6d3f3f5c7b5e
EUC-JP ???徇??徇??\}???徇??徇??\{^ 001111110011111100111111110101111100111000111111001111111101011111001110001111110011111101011100011111010011111100111111001111111101011111001110001111110011111111010111110011100011111100111111010111000111101101011110 3f3f3fd7ce3f3fd7ce3f3f5c7d3f3f3fd7ce3f3fd7ce3f3f5c7b5e
UTF-8 昻뽮낸徇귢낸徇뽯젌\}昻뽮낸徇귢낸徇뽯젌\{^ 1110011010011000101110111110101110111101101011101110101110000010101110001110010110111110100001111110101010110111101000101110101110000010101110001110010110111110100001111110101110111101101011111110110010100000100011000101110001111101111001101001100010111011111010111011110110101110111010111000001010111000111001011011111010000111111010101011011110100010111010111000001010111000111001011011111010000111111010111011110110101111111011001010000010001100010111000111101101011110 e698bbebbdaeeb82b8e5be87eab7a2eb82b8e5be87ebbdafeca08c5c7de698bbebbdaeeb82b8e5be87eab7a2eb82b8e5be87ebbdafeca08c5c7b5e
UHC 昻뽮낸徇귢낸徇뽯젌\}昻뽮낸徇귢낸徇뽯젌\{^ 1110010011101001100101101110101010110011101111011110001011011111100000101110101010110011101111011110001011011111100101101110101110100000100011010101110001111101111001001110100110010110111010101011001110111101111000101101111110000010111010101011001110111101111000101101111110010110111010111010000010001101010111000111101101011110 e4e996eab3bde2df82eab3bde2df96eba08d5c7de4e996eab3bde2df82eab3bde2df96eba08d5c7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
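
The kind of conversion shown in the table can be roughly reproduced offline. The following is a minimal sketch in Python, not the code behind this page. The `CODECS` mapping and the `encode_all` helper are hypothetical names. Using Python's `cp932` codec for SJIS-WIN and `cp949` for UHC is an assumption, and these codecs may differ from this page's converter for some characters. Characters a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the charset labels used in the table to Python codec names.
# cp932 for SJIS-WIN and cp949 for UHC are approximations, not confirmed equivalents.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes b"?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (bits, hexadecimal) in encode_all("あ").items():
        print(label, bits, hexadecimal)
```

For example, `encode_all("あ")["UTF-8"]` gives the hex string `e38182`, while the ISO-8859-1 entry is `3f` because that charset has no Japanese characters.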