To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 闕オ蜻キ謚ケ螯、 111010001000110110110101111001011001000110110111111001101000101010111001111001011010011010100100 e88db5e591b7e68ab9e5a6a4
EUC-JP 闕オ蜻キ謚ケ螯、 11101111111011011000111010110101111010011111000110001110101101111110101111101010100011101011100111101010101010001000111010100100 efed8eb5e9f18eb7ebea8eb9eaa88ea4
UTF-8 闕オ蜻キ謚ケ螯、 111010011001011110010101111011111011110110110101111010001001110010111011111011111011110110110111111010001010110010011010111011111011110110111001111010001001111010101111111011111011110110100100 e99795efbdb5e89cbbefbdb7e8ac9aefbdb9e89eafefbda4
UHC 闕???謚??? 11001111111101000011111100111111001111111110110011010000001111110011111100111111 cff43f3f3fecd03f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)