To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 蟾撰スェ蟾舌o蔔ェ 111001011011011110010000111011111011110110101010111001011011011110010000111000111000001010001111111001001111011110101010 e5b790efbdaae5b790e3828fe4f7aa
EUC-JP 蟾撰スェ蟾舌o蔔ェ 111010101011100111000000111100011000111010111101100011101010101011101010101110011100000011100101101000111110111111101000111110011000111010101010 eab9c0f18ebd8eaaeab9c0e5a3efe8f98eaa
UTF-8 蟾撰スェ蟾舌o蔔ェ 111010001001111110111110111001101001001010110000111011111011110110111101111011111011110110101010111010001001111110111110111010001000100010001100111011111011110110001111111010001001010010010100111011111011110110101010 e89fbee692b0efbdbdefbdaae89fbee8888cefbd8fe89494efbdaa
UHC 蟾撰??蟾舌o蔔? 111000001110101011110011101111000011111100111111111000001110101011100000110111111010001111101111110111001101101100111111 e0eaf3bc3f3fe0eae0dfa3efdcdb3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)