To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???u}???u{^ 0011111100111111001111110111010101111101001111110011111100111111011101010111101101011110 3f3f3f757d3f3f3f757b5e
SJIS-WIN 将蝪鵆u}将蝪鵆u{^ 1000111110101011111001011010000111101001111110110111010101111101100011111010101111100101101000011110100111111011011101010111101101011110 8fabe5a1e9fb757d8fabe5a1e9fb757b5e
EUC-JP 将蝪鵆u}将蝪鵆u{^ 1011111010101101111010101010001111110010111111010111010101111101101111101010110111101010101000111111001011111101011101010111101101011110 beadeaa3f2fd757dbeadeaa3f2fd757b5e
UTF-8 将蝪鵆u}将蝪鵆u{^ 1110010110110000100001101110100010011101101010101110100110110101100001100111010101111101111001011011000010000110111010001001110110101010111010011011010110000110011101010111101101011110 e5b086e89daae9b586757de5b086e89daae9b586757b5e
UHC ???u}???u{^ 0011111100111111001111110111010101111101001111110011111100111111011101010111101101011110 3f3f3f757d3f3f3f757b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
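
The table above can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the mapping from the charset labels to Python codec names (for example, SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("将").items():
        print(label, bits, hexa)
```

Results may differ slightly from the table for characters where codec implementations disagree on vendor-specific mappings.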