To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 牋粤獨牋粤獨B 11100000101011101110001011100011111000001101010111100000101011101110001011100011111000001101010101000010 e0aee2e3e0d5e0aee2e3e0d542
EUC-JP 牋粤獨牋粤獨B 11100000101100001110010011100101111000001101011111100000101100001110010011100101111000001101011101000010 e0b0e4e5e0d7e0b0e4e5e0d742
UTF-8 牋粤獨牋粤獨B 11100111100010011000101111100111101100101010010011100111100011011010100011100111100010011000101111100111101100101010010011100111100011011010100001000010 e7898be7b2a4e78da8e7898be7b2a4e78da842
UHC ??獨??獨B 001111110011111111010100101111000011111100111111110101001011110001000010 3f3fd4bc3f3fd4bc42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)