To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 賊?紆?調紗?寃??拒賊?紆?調紗?寃??居^ 1001000110101111001111111110001011111100001111111001001010110010100011101101000100111111100110111000001100111111001111111000101110010001100100011010111100111111111000101111110000111111100100101011001010001110110100010011111110011011100000110011111100111111100010111000111101011110 91af3fe2fc3f92b28ed13f9b833f3f8b9191af3fe2fc3f92b28ed13f9b833f3f8b8f5e
EUC-JP 賊?紆?調紗?寃??拒賊?紆?調紗?寃??居^ 1100001010110001001111111110010011111110001111111100010010110100101111001101001100111111110101011110001100111111001111111011010111110001110000101011000100111111111001001111111000111111110001001011010010111100110100110011111111010101111000110011111100111111101101011110111101011110 c2b13fe4fe3fc4b4bcd33fd5e33f3fb5f1c2b13fe4fe3fc4b4bcd33fd5e33f3fb5ef5e
UTF-8 賊렠紆렣調紗떵寃닿렋拒賊렠紆렣調紗떵寃닿렋居^ 11101000101100111000101011101011101000001010000011100111101101001000011011101011101000001010001111101000101010101011111111100111101101001001011111101011100101101011010111100101101011111000001111101011100010111011111111101011101000001000101111100110100010111001001011101000101100111000101011101011101000001010000011100111101101001000011011101011101000001010001111101000101010101011111111100111101101001001011111101011100101101011010111100101101011111000001111101011100010111011111111101011101000001000101111100101101100011000010101011110 e8b38aeba0a0e7b486eba0a3e8aabfe7b497eb96b5e5af83eb8bbfeba08be68b92e8b38aeba0a0e7b486eba0a3e8aabfe7b497eb96b5e5af83eb8bbfeba08be5b1855e
UHC 賊렠紆렣調紗떵寃닿렋拒賊렠紆렣調紗떵寃닿렋居^ 111011101110010010001110101100011110100111100001100011101011010011110000111000001101111011101001101101101011101011101010101100101011010011101010100011101010001011001011110111101110111011100100100011101011000111101001111000011000111010110100111100001110000011011110111010011011011010111010111010101011001010110100111010101000111010100010110010111101110001011110 eee48eb1e9e18eb4f0e0dee9b6baeab2b4ea8ea2cbdeeee48eb1e9e18eb4f0e0dee9b6baeab2b4ea8ea2cbdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)