To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????i????iB | 0011111100111111001111110011111101101001001111110011111100111111001111110110100101000010 | 3f3f3f3f693f3f3f3f6942 |
| SJIS-WIN | 迺ァ迥凡i迺ァ迥凡iB | 1110011110010010101001111110011110001010100101100111110101101001111001111001001010100111111001111000101010010110011111010110100101000010 | e792a7e78a967d69e792a7e78a967d6942 |
| EUC-JP | 迺ァ迥凡i迺ァ迥凡iB | 11101101111100101000111010100111111011011110101011001011110111100110100111101101111100101000111010100111111011011110101011001011110111100110100101000010 | edf28ea7edeacbde69edf28ea7edeacbde6942 |
| UTF-8 | 迺ァ迥凡i迺ァ迥凡iB | 111010001011111110111010111011111011110110100111111010001011111110100101111001011000011110100001011010011110100010111111101110101110111110111101101001111110100010111111101001011110010110000111101000010110100101000010 | e8bfbaefbda7e8bfa5e587a169e8bfbaefbda7e8bfa5e587a16942 |
| UHC | ???凡i???凡iB | 00111111001111110011111111011011111011010110100100111111001111110011111111011011111011010110100101000010 | 3f3f3fdbed693f3f3fdbed6942 |

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP) respectively.
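
The conversion shown in the table can be reproduced roughly as follows. This is a minimal Python sketch, not the converter's actual implementation. The function name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 and UHC rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) bit strings for text in the given charset."""
    # Characters the charset cannot represent are replaced with "?" (0x3f).
    data = text.encode(CODECS[charset], errors="replace")
    # Each encoded byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a short sample string.
    for name in CODECS:
        binary, hexadecimal = to_bit_strings("凡iB", name)
        print(name, binary, hexadecimal)
```

The binary column is simply the hexadecimal column written out bit by bit, so the two always describe the same byte sequence.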