What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f |
| SJIS-WIN | 阡慕ソォ蜍晞数 | 111010001001010010010101111001111011111110101011111001011000101110011101111010011001000010010100 | e89495e7bfabe58b9de99094 |
| EUC-JP | 阡慕ソォ蜍晞数 | 1110111111110100110010101110100110001110101111111000111010101011111010011110101111011010111010111011111111110100 | eff4cae98ebf8eabe9ebdaebbff4 |
| UTF-8 | 阡慕ソォ蜍晞数 | 111010011001100010100001111001101000010110010101111011111011110110111111111011111011110110101011111010001001110010001101111001101001100110011110111001101001010110110000 | e998a1e68595efbdbfefbdabe89c8de6999ee695b0 |
| UHC | 阡慕???晞? | 11110100110001101101100110110111001111110011111100111111111111011111010100111111 | f4c6d9b73f3f3ffdf53f |
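The rows above can be approximated offline with a short script. This is a minimal sketch, not the converter's actual implementation: the function name `encode_row` and the mapping from the table's charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions, and a different library's conversion tables may disagree on edge cases. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the pattern visible in the ISO-8859-1 and UHC rows.

```python
# Hypothetical mapping from the table's labels to Python codec names (an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "数"  # any character or short string
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

The binary column is simply the hexadecimal column written out bit by bit, so every byte contributes eight binary digits and two hexadecimal digits.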

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).