To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???徇??徇??B 001111110011111100111111100111000110110100111111001111111001110001101101001111110011111101000010 3f3f3f9c6d3f3f9c6d3f3f42
EUC-JP 倻??徇??徇??B 1000111110110001111101100011111100111111110101111100111000111111001111111101011111001110001111110011111101000010 8fb1f63f3fd7ce3f3fd7ce3f3f42
UTF-8 倻뽮낸徇귢낸徇껊젚B 11100101100000001011101111101011101111011010111011101011100000101011100011100101101111101000011111101010101101111010001011101011100000101011100011100101101111101000011111101010101110111000101011101100101000001001101001000010 e580bbebbdaeeb82b8e5be87eab7a2eb82b8e5be87eabb8aeca09a42
UHC 倻뽮낸徇귢낸徇껊젚B 11100101101001101001011011101010101100111011110111100010110111111000001011101010101100111011110111100010110111111000001111101011101000001001011001000010 e5a696eab3bde2df82eab3bde2df83eba09642

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
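
A conversion like the one in the table above can be approximated in a few lines of Python. This is only a sketch, not the code behind this page: the helper name encode_report and the mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, and Python's codecs may map some characters differently from the converter used here. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes seen in the table.

```python
# Hypothetical helper: the codec names below are Python's and are assumed
# to correspond to the charset labels shown in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return a (charset, binary string, hex string) tuple per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, with no separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_report("B"):
        print(label, bits, hex_string)
```

For the input "B", every charset listed yields the single byte 42 (binary 01000010), because all five are ASCII-compatible for that character.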