To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 嶸??兄①????D嶸??兄①????D^ 111110101011010000111111001111111000110001011010100001110100000000111111001111110011111100111111010001001111101010110100001111110011111110001100010110101000011101000000001111110011111100111111001111110100010001011110 fab43f3f8c5a87403f3f3f3f44fab43f3f8c5a87403f3f3f3f445e
EUC-JP 嶸??兄?????D嶸??兄?????D^ 100011111011101111110100001111110011111110110111101110110011111100111111001111110011111100111111010001001000111110111011111101000011111100111111101101111011101100111111001111110011111100111111001111110100010001011110 8fbbf43f3fb7bb3f3f3f3f3f448fbbf43f3fb7bb3f3f3f3f3f445e
UTF-8 嶸뤹옚兄①츕狀⅛옱D嶸뤹옚兄①츕狀⅛옱D^ 111001011011011010111000111010111010010010111001111011001001100010011010111001011000010110000100111000101001000110100000111011001011100010010101111011111010011110111010111000101000010110011011111011001001100010110001010001001110010110110110101110001110101110100100101110011110110010011000100110101110010110000101100001001110001010010001101000001110110010111000100101011110111110100111101110101110001010000101100110111110110010011000101100010100010001011110 e5b6b8eba4b9ec989ae58584e291a0ecb895efa7bae2859bec98b144e5b6b8eba4b9ec989ae58584e291a0ecb895efa7bae2859bec98b1445e
UHC 嶸뤹옚兄①츕狀⅛옱D嶸뤹옚兄①츕狀⅛옱D^ 111001111010111010001111111001111001111010011110111110101111110010101000111001111010111010001111111011011110111010101000111110111001111010101100010001001110011110101110100011111110011110011110100111101111101011111100101010001110011110101110100011111110110111101110101010001111101110011110101011000100010001011110 e7ae8fe79e9efafca8e7ae8fedeea8fb9eac44e7ae8fe79e9efafca8e7ae8fedeea8fb9eac445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)