To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????^ 00111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f5e
SJIS-WIN タ肛蕓タ肛葈^ 1100000011100011111010001111101110011010110000001110001111101000111110111001100001011110 c0e3e8fb9ac0e3e8fb985e
EUC-JP タ肛蕓タ肛葈^ 100011101100000011100110111010101000111111011001110001101000111011000000111001101110101010001111110110001101000101011110 8ec0e6ea8fd9c68ec0e6ea8fd8d15e
UTF-8 タ肛蕓タ肛葈^ 11101111101111101000000011101000100000101001101111101000100101011001001111101111101111101000000011101000100000101001101111101000100100011000100001011110 efbe80e8829be89593efbe80e8829be891885e
UHC ?肛蕓?肛?^ 00111111111110011111110111101001111111100011111111111001111111010011111101011110 3ff9fde9fe3ff9fd3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)