To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 晶澈ァ而晶諶ァ 10001111101110111111100110100100111110110100101110100111100011101010011110001111101110111111100110100100111110111010101010100111 8fbbf9a4fb4ba78ea78fbbf9a4fbaaa7
EUC-JP 晶?澈ァ而晶?諶ァ 101111101011110100111111100011111100100011100101100011101010011110111100101010011011111010111101001111111000111111011110101101011000111010100111 bebd3f8fc8e58ea7bca9bebd3f8fdeb58ea7
UTF-8 晶澈ァ而晶諶ァ 111001101001100110110110111011101001101110111111111001101011111010001000111011111011110110100111111010001000000010001100111001101001100110110110111011101001101110111111111010001010101110110110111011111011110110100111 e699b6ee9bbfe6be88efbda7e8808ce699b6ee9bbfe8abb6efbda7
UHC 晶?澈?而晶?諶? 1110111111011100001111111111010011001101001111111110110010111011111011111101110000111111111001001010011000111111 efdc3ff4cd3fecbbefdc3fe4a63f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)