To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 罌??碣嚥??肉 111000111010000000111111001111111110000111110000100110101000101100111111001111111001001111110111 e3a03f3fe1f09a8b3f3f93f7
EUC-JP 罌??碣嚥??肉 111001101010001000111111001111111110001011110010110100111110101100111111001111111100011011111001 e6a23f3fe2f2d3eb3f3fc6f9
UTF-8 罌븍틷碣嚥싰퀎肉 111001111011110110001100111010111011100010001101111011011000101110110111111001111010001010100011111001011001101010100101111011001000101110110000111011011000000010001110111010001000001010001001 e7bd8cebb88ded8bb7e7a2a3e59aa5ec8bb0ed808ee88289
UHC 罌븍틷碣嚥싰퀎肉 11100101101000101011101011101011101110101001111011001010111001011110011010111111100110101110101010110011100001001110101110111111 e5a2baebba9ecae5e6bf9aeab384ebbf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)