To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????B 00111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???鵝??艤??B 001111110011111100111111111010100100000000111111001111111110010001111110001111110011111101000010 3f3f3fea403f3fe47e3f3f42
EUC-JP 薏??鵝??艤??B 1000111111011001110111100011111100111111111100111010000100111111001111111110011111011111001111110011111101000010 8fd9de3f3ff3a13f3fe7df3f3f42
UTF-8 薏쎌쑞鵝숈씫艤썰퐰B 11101000100101101000111111101100100011101000110011101100100100011001111011101001101101011001110111101100100010001000100011101100100101001010101111101000100010011010010011101100100011011011000011101101100100001011000001000010 e8968fec8e8cec919ee9b59dec8888ec94abe889a4ec8db0ed90b042
UHC 薏쎌쑞鵝숈씫艤썰퐰B 11101011111110111011110111101100100111001011110111100100101111011001100111101100100111011011110111101011111110101011110111100100101111011001100101000010 ebfbbdec9cbde4bd99ec9dbdebfabde4bd9942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)