To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 兀??淫?兀??淫?B 100110010101100100111111001111111000100011111010001111111001100101011001001111110011111110001000111110100011111101000010 99593f3f88fa3f99593f3f88fa3f42
EUC-JP 兀??淫?兀??淫?B 110100011011101000111111001111111011000011111100001111111101000110111010001111110011111110110000111111000011111101000010 d1ba3f3fb0fc3fd1ba3f3fb0fc3f42
UTF-8 兀덊렆淫엟兀덊렆淫엟B 11100101100001011000000011101011100011011000101011101011101000001000011011100110101101111010101111101100100101111001111111100101100001011000000011101011100011011000101011101011101000001000011011100110101101111010101111101100100101111001111101000010 e58580eb8d8aeba086e6b7abec979fe58580eb8d8aeba086e6b7abec979f42
UHC 兀덊렆淫엟兀덊렆淫엟B 111010001011010010001000111011011000111010100000111010111110001010011110011101101110100010110100100010001110110110001110101000001110101111100010100111100111011001000010 e8b488ed8ea0ebe29e76e8b488ed8ea0ebe29e7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)