To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 熙室紗コ﨩「 111010101010010010001110101110101000111011010001101110101111101111101010101000101111100110100110 eaa48eba8ed1bafbeaa2f9a6
EUC-JP 熙室紗コ?「? 111101001010011010111100101111001011110011010011100011101011101000111111100011101010001000111111 f4a6bcbcbcd38eba3f8ea23f
UTF-8 熙室紗コ﨩「 111001111000011010011001111001011010111010100100111001111011010010010111111011111011110110111010111011111010100010101001111011111011110110100010111011101001110010000001 e78699e5aea4e7b497efbdbaefa8a9efbda2ee9c81
UHC 熙室紗???? 11111101111101111110001111111000110111101110100100111111001111110011111100111111 fdf7e3f8dee93f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)