To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 藥??言??耶??^ 11100101010110100011111100111111100011001011111000111111001111111001011011101011001111110011111101011110 e55a3f3f8cbe3f3f96eb3f3f5e
EUC-JP 藥??言??耶??^ 11101001101110110011111100111111101110001100000000111111001111111100110011101101001111110011111101011110 e9bb3f3fb8c03f3fcced3f3f5e
UTF-8 藥썸씇言됭꽦耶섉씇^ 11101000100101111010010111101100100011011011100011101100100101001000011111101000101010001000000011101011100100001010110111101010101111011010011011101000100000001011011011101100100001001000100111101100100101001000011101011110 e897a5ec8db8ec9487e8a880eb90adeabda6e880b6ec8489ec94875e
UHC 藥썸씇言됭꽦耶섉씇^ 11100101101101111011110111100110100111011001111111100101111010111000100111101000100001001011000111100101101011011001100011100110100111011001111101011110 e5b7bde69d9fe5eb89e884b1e5ad98e69d9f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)