What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 阮ー霑思阮ー霑思B 111010001001011010110000111010001011111110001110011101101110100010010110101100001110100010111111100011100111011001000010 e896b0e8bf8e76e896b0e8bf8e7642
EUC-JP 阮ー霑思阮ー霑思B 1110111111110110100011101011000011110000110000011011101111010111111011111111011010001110101100001111000011000001101110111101011101000010 eff68eb0f0c1bbd7eff68eb0f0c1bbd742
UTF-8 阮ー霑思阮ー霑思B 11101001100110001010111011101111101111011011000011101001100111001001000111100110100000001001110111101001100110001010111011101111101111011011000011101001100111001001000111100110100000001001110101000010 e998aeefbdb0e99c91e6809de998aeefbdb0e99c91e6809d42
UHC 阮?霑思阮?霑思B 111010001101011000111111111011111100010111011110110101101110100011010110001111111110111111000101110111101101011001000010 e8d63fefc5ded6e8d63fefc5ded642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
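
A table like the one above can be approximated with a short Python sketch. This is not the code behind this page. The codec names (`cp932` for SJIS-Win, `cp949` for UHC) and the helper `encode_table` are assumptions chosen for illustration, and characters that a charset cannot represent are replaced with `?`, which may differ from how the converter itself handles them.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, round-tripped text, binary string, hex string) rows."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        shown = raw.decode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, shown, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexstr in encode_table("思B"):
        print(label, shown, bits, hexstr)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.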