To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 雜セ蜿ウ雜セ蜿ウB 11101000101101101011111011100101100011111011001111101000101101101011111011100101100011111011001101000010 e8b6bee58fb3e8b6bee58fb342
EUC-JP 雜セ蜿ウ雜セ蜿ウB 1111000010111000100011101011111011101001111011111000111010110011111100001011100010001110101111101110100111101111100011101011001101000010 f0b88ebee9ef8eb3f0b88ebee9ef8eb342
UTF-8 雜セ蜿ウ雜セ蜿ウB 11101001100110111001110011101111101111011011111011101000100111001011111111101111101111011011001111101001100110111001110011101111101111011011111011101000100111001011111111101111101111011011001101000010 e99b9cefbdbee89cbfefbdb3e99b9cefbdbee89cbfefbdb342
UHC 雜???雜???B 1110110111011010001111110011111100111111111011011101101000111111001111110011111101000010 edda3f3f3fedda3f3f3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)