To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??T??wA??T 00111111001111110101010000111111001111110111011101000001001111110011111101010100 3f3f543f3f77413f3f54
SJIS-WIN 将、T将、wA将、T 10001111101010111010010001010100100011111010101110100100011101110100000110001111101010111010010001010100 8faba4548faba477418faba454
EUC-JP 将、T将、wA将、T 10111110101011011000111010100100010101001011111010101101100011101010010001110111010000011011111010101101100011101010010001010100 bead8ea454bead8ea47741bead8ea454
UTF-8 将、T将、wA将、T 11100101101100001000011011101111101111011010010001010100111001011011000010000110111011111011110110100100011101110100000111100101101100001000011011101111101111011010010001010100 e5b086efbda454e5b086efbda47741e5b086efbda454
UHC ??T??wA??T 00111111001111110101010000111111001111110111011101000001001111110011111101010100 3f3f543f3f77413f3f54

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)