To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 搖??竊????????愉??恂ル?阿 10011101100010100011111100111111111000101000011000111111001111110011111100111111001111110011111100111111001111111001011011111001001111110011111110011100100101101000001110001011001111111000100010100010 9d8a3f3fe2863f3f3f3f3f3f3f3f96f93f3f9c96838b3f88a2
EUC-JP 搖??竊??洧??孼??愉??恂ル?阿 1101100111101010001111110011111111100011111001100011111100111111100011111100011110110100001111110011111110001111101110101100001100111111001111111100110011111011001111110011111111010111111101101010010111101011001111111011000010100100 d9ea3f3fe3e63f3f8fc7b43f3f8fbac33f3fccfb3f3fd7f6a5eb3fb0a4
UTF-8 搖깅쵎竊섓㎖洧덀걶孼꾩뮇愉쇿톹恂ル쐠阿 111001101001000010010110111010101011100110000101111011001011010110001110111001111010101110001010111011001000010010010011111000111000111010010110111001101011010010100111111010111000110110000000111010101011000110110110111001011010110110111100111010101011111010101001111010111010111010000111111001101000010010001001111011001000011110111111111011011000011010111001111001101000000110000010111000111000001110101011111011001001000010100000111010011001100010111111 e69096eab985ecb58ee7ab8aec8493e38e96e6b4a7eb8d80eab1b6e5adbceabea9ebae87e68489ec87bfed86b9e68182e383abec90a0e998bf
UHC 搖깅쵎竊섓㎖洧덀걶孼꾩뮇愉쇿톹恂ル쐠阿 1110100011110100101100011110101110101100100100001110111110111100100110001110111110100111101000101110101011111011100010001110001110000001100111001110010111101101100001001110110010010010100101101110101011110000100110011110010110110111100011011110001011100001101010111110101110011100100001101110010010111001 e8f4b1ebac90efbc98efa7a2eafb88e3819ce5ed84ec9296eaf099e5b78de2e1abeb9c86e4b9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
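
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. It assumes that Python's `latin_1`, `cp932`, `euc_jp`, `utf_8`, and `cp949` codecs are close equivalents of the charsets listed above, and that unencodable characters are replaced with "?" (0x3f), as the 3f bytes in the table suggest. The names `CODECS` and `encode_table` are hypothetical, introduced only for illustration.

```python
# Map the page's charset labels to Python codec names.
# These pairings are assumptions; edge-case mappings may differ from the page's converter.
CODECS = {
    "ISO-8859-1": "latin_1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf_8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for the given text."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the charset cannot encode.
        data = text.encode(codec, errors="replace")
        # Render each byte as 8 binary digits, concatenated without separators.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexed) in encode_table("阿").items():
        print(label, bits, hexed)
```

For the single character 阿, this sketch yields e998bf for UTF-8, b0a4 for EUC-JP, and 88a2 for SJIS-WIN, which agree with the trailing bytes of the corresponding rows in the table above.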