To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ゼ灼邪酌ゼ灼邪灼^ 1011111011011110100011101101110010001110110101111000111011011110101111101101111010001110110111001000111011010111100011101101110001011110 bede8edc8ed78edebede8edc8ed78edc5e
EUC-JP ゼ灼邪酌ゼ灼邪灼^ 100011101011111010001110110111101011110011011110101111001101100110111100111000001000111010111110100011101101111010111100110111101011110011011001101111001101111001011110 8ebe8edebcdebcd9bce08ebe8edebcdebcd9bcde5e
UTF-8 ゼ灼邪酌ゼ灼邪灼^ 11101111101111011011111011101111101111101001111011100111100000011011110011101001100000101010101011101001100001011000110011101111101111011011111011101111101111101001111011100111100000011011110011101001100000101010101011100111100000011011110001011110 efbdbeefbe9ee781bce982aae9858cefbdbeefbe9ee781bce982aae781bc5e
UHC ??灼邪酌??灼邪灼^ 0011111100111111111011011100011111011110111101111110110111001100001111110011111111101101110001111101111011110111111011011100011101011110 3f3fedc7def7edcc3f3fedc7def7edc75e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)