To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 鄒ゥ郢プ鄒ゥ郢プB 111001111011111010101001111001111011100110000011011101101110011110111110101010011110011110111001100000110111011001000010 e7bea9e7b98376e7bea9e7b9837642
EUC-JP 鄒ゥ郢プ鄒ゥ郢プB 1110111011000000100011101010100111101110101110111010010111010111111011101100000010001110101010011110111010111011101001011101011101000010 eec08ea9eebba5d7eec08ea9eebba5d742
UTF-8 鄒ゥ郢プ鄒ゥ郢プB 11101001100001001001001011101111101111011010100111101001100000111010001011100011100000111001011111101001100001001001001011101111101111011010100111101001100000111010001011100011100000111001011101000010 e98492efbda9e983a2e38397e98492efbda9e983a2e3839742
UHC 鄒??プ鄒??プB 11110101110110110011111100111111101010111101011111110101110110110011111100111111101010111101011101000010 f5db3f3fabd7f5db3f3fabd742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)