To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????^ 001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f5e
SJIS-WIN 松荀禺松荀禺^ 1000111110111100111001001010010011110000110101111110001001011011100011111011110011100100101001001111000011010111111000100101101101011110 8fbce4a4f0d7e25b8fbce4a4f0d7e25b5e
EUC-JP 松荀?禺松荀?禺^ 101111101011111011101000101001100011111111100011101111001011111010111110111010001010011000111111111000111011110001011110 bebee8a63fe3bcbebee8a63fe3bc5e
UTF-8 松荀禺松荀禺^ 11100110100111011011111011101000100011011000000011101110100000101001011011100111101001101011101011100110100111011011111011101000100011011000000011101110100000101001011011100111101001101011101001011110 e69dbee88d80ee8296e7a6bae69dbee88d80ee8296e7a6ba5e
UHC 松荀??松荀??^ 11100001111001101110001011110000001111110011111111100001111001101110001011110000001111110011111101011110 e1e6e2f03f3fe1e6e2f03f3f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)