To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 淏晢セ奇ィ費スェ褻 1111101101000010100111011110111110111110100010101110111110101000100101001110111110111101101010101110010111110110 fb429defbe8aefa894efbdaae5f6
EUC-JP 淏晢セ奇ィ費スェ褻 10001111110001111101100111011010111100011000111010111110101101001111000110001110101010001100100011110001100011101011110110001110101010101110101011111000 8fc7d9daf18ebeb4f18ea8c8f18ebd8eaaeaf8
UTF-8 淏晢セ奇ィ費スェ褻 111001101011011110001111111001101001100110100010111011111011110110111110111001011010010110000111111011111011110110101000111010001011001010111011111011111011110110111101111011111011110110101010111010001010010010111011 e6b78fe699a2efbdbee5a587efbda8e8b2bbefbdbdefbdaae8a4bb
UHC 淏??奇?費??褻 11111011110010000011111100111111110100001111010000111111110111101010100000111111001111111110000011100001 fbc83f3fd0f43fdea83f3fe0e1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)