To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 曆쇔ââ½å¾‡ë¨¬êµšäø´ì–µëè 1110111110100110100010111110110010000111100101001110001011100010101111011110010110111110100001111110101110101000101011001110101010110101100110101110010011111000101101001110110010010110101101011110101111101000 efa68bec8794e2e2bde5be87eba8aceab59ae4f8b4ec96b5ebe8
SJIS-WIN ?????????????¨¬?????´????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110000001010011101000000111001010001111110011111100111111001111110011111110000001010011000011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f814e81ca3f3f3f3f3f814c3f3f3f3f3f
EUC-JP ï¦?ì??ââ?å??먬ê??äø´ì??ëè 10001111101010111100000110001111101000101100001100111111100011111010101111000000001111110011111110001111101010111010010010001111101010111010010000111111100011111010101110101001001111110011111110001111101010111011001110100001101011111010001011001100100011111010101110110100001111110011111110001111101010111010001110001111101010011100110010100001101011011000111110101011110000000011111100111111100011111010101110110011100011111010101110110010 8fabc18fa2c33f8fabc03f3f8faba48faba43f8faba93f3f8fabb3a1afa2cc8fabb43f3f8faba38fa9cca1ad8fabc03f3f8fabb38fabb2
UTF-8 曆쇔ââ½å¾‡ë¨¬êµšäø´ì–µëè 11000011101011111100001010100110110000101000101111000011101011001100001010000111110000101001010011000011101000101100001110100010110000101011110111000011101001011100001010111110110000101000011111000011101010111100001010101000110000101010110011000011101010101100001010110101110000101001101011000011101001001100001110111000110000101011010011000011101011001100001010010110110000101011010111000011101010111100001110101000 c3afc2a6c28bc3acc287c294c3a2c3a2c2bdc3a5c2bec287c3abc2a8c2acc3aac2b5c29ac3a4c3b8c2b4c3acc296c2b5c3abc3a8
UHC ????????½?¾??¨?????ø´????? 00111111001111110011111100111111001111110011111100111111001111111010100011110110001111111010100011111010001111110011111110100001101001110011111100111111001111110011111100111111101010011010101010100010101001010011111100111111001111110011111100111111 3f3f3f3f3f3f3f3fa8f63fa8fa3f3fa1a73f3f3f3f3fa9aaa2a53f3f3f3f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)