To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???蹂??酉??佯 00111111001111110011111111100110111110000011111100111111100100111101000100111111001111111001100011010001 3f3f3fe6f83f3f93d13f3f98d1
EUC-JP ???蹂??酉??佯 00111111001111110011111111101100111110100011111100111111110001101101001100111111001111111101000011010011 3f3f3fecfa3f3fc6d33f3fd0d3
UTF-8 劣꾠렕蹂쒎틫酉몌퐵佯 111011111010011010011101111010101011111010100000111010111010000010010101111010001011100110000010111011001001001010001110111011011000101110101011111010011000010110001001111010111010101010001100111011011001000010110101111001001011110110101111 efa69deabea0eba095e8b982ec928eed8babe98589ebaa8ced90b5e4bdaf
UHC 劣꾠렕蹂쒎틫酉몌퐵佯 1110011011101011100001001110001110001110101010101110101110110011100111001110010110111010100101011110101110110111101110001110111110111101100111101110010110111010 e6eb84e38eaaebb39ce5ba95ebb7b8efbd9ee5ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)