To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 窈??甸?窈??甸?E 111000100111011100111111001111111001100110110010001111111110001001110111001111110011111110011001101100100011111101000101 e2773f3f99b23fe2773f3f99b23f45
EUC-JP 窈??甸?窈??甸?E 111000111101100000111111001111111101001010110100001111111110001111011000001111110011111111010010101101000011111101000101 e3d83f3fd2b43fe3d83f3fd2b43f45
UTF-8 窈붾쩀甸럀窈붾쩀甸럃E 11100111101010101000100011101011101101101011111011101100101010011000000011100111100101001011100011101011100111111000000011100111101010101000100011101011101101101011111011101100101010011000000011100111100101001011100011101011100111111000001101000101 e7aa88ebb6beeca980e794b8eb9f80e7aa88ebb6beeca980e794b8eb9f8345
UHC 窈붾쩀甸럀窈붾쩀甸럃E 111010011010000110010100111010111010010010011010111011111010010010001110010110011110100110100001100101001110101110100100100110101110111110100100100011100110001001000101 e9a194eba49aefa48e59e9a194eba49aefa48e6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)