To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN セマ跂瀅赦趙軋 101111101100111111100110111000111111101101001110100011101100110111100110111000101110011101100001 becfe6e3fb4e8ecde6e2e761
EUC-JP セマ跂瀅赦趙軋 100011101011111010001110110011111110110011100101100011111100100110101011101111001100111111101100111001001110110111000010 8ebe8ecfece58fc9abbccfece4edc2
UTF-8 セマ跂瀅赦趙軋 111011111011110110111110111011111011111010001111111010001011011110000010111001111000000010000101111010001011010110100110111010001011011010011001111010001011101110001011 efbdbeefbe8fe8b782e78085e8b5a6e8b699e8bb8b
UHC ???瀅赦趙軋 0011111100111111001111111111101110100100110111101111010111110000111000011110010011011000 3f3f3ffba4def5f0e1e4d8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)