To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 —~šæW}—~šæW{^ 10010111011111101001101011100110010101110111110110010111011111101001101011100110010101110111101101011110 977e9ae6577d977e9ae6577b5e
SJIS-WIN ?~??W}?~??W{^ 00111111011111100011111100111111010101110111110100111111011111100011111100111111010101110111101101011110 3f7e3f3f577d3f7e3f3f577b5e
EUC-JP ?~?æW}?~?æW{^ 0011111101111110001111111000111110101001110000010101011101111101001111110111111000111111100011111010100111000001010101110111101101011110 3f7e3f8fa9c1577d3f7e3f8fa9c1577b5e
UTF-8 —~šæW}—~šæW{^ 11000010100101110111111011000010100110101100001110100110010101110111110111000010100101110111111011000010100110101100001110100110010101110111101101011110 c2977ec29ac3a6577dc2977ec29ac3a6577b5e
UHC ?~?æW}?~?æW{^ 001111110111111000111111101010011010000101010111011111010011111101111110001111111010100110100001010101110111101101011110 3f7e3fa9a1577d3f7e3fa9a1577b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)