Skip to main content

API Custom Endpoints

Refer to the Import.io API Docs for detailed information regarding the main Import.io API.

Import.io provides a number of custom endpoints to help you access your data in the most convenient way for your use case. All the direct links for those endpoints are available under the Integrate tab on the extractor page at app.import.io.

Custom endpoints available

  • Live query API
  • CSV/JSON from the latest successful run
  • Google sheets
  • Crawlruns history (RSS feed)

Live query API

Live query API is an easy way to extract data from a single URL via API. Live means that the data is extracted in real-time, and the response is returned immediately for the original HTTP request after the extraction is complete.

Use this endpoint in your API requests to programmatically run an extractor on a single URL. The API response consists of the extracted data in JSON format.

Each API request runs the extractor live and counts as one query toward your plan total. The endpoint contains the URL for one webpage. To retrieve data from another webpage of similar structure, replace the URL in the endpoint with the URL of the other webpage.

CSV/JSON from the latest successful run

Use these endpoints in your API requests to programmatically retrieve the compiled dataset from the most-recent run of an extractor. The API response consists of a GZIP-compressed file containing data in either CSV or JSON format.

These endpoints return data that your extractor has already collected, therefore using the endpoints do not count as queries toward your plan total. When running a scheduled extraction, use this endpoint to collect the most current data.

Google Sheets

Use this endpoint as a Google Sheets formula to extract your data directly into Google Sheets.

Because the API response returns data from the latest extractor run, you can keep your spreadsheet current by setting your extractor to run on a schedule. For example, schedule your extractor to run on Monday at 5am and have instant access to the current data in your Google Sheet when you arrive at the office.

This endpoint returns data that your extractor has already collected, therefore using the endpoint does not count as a query toward your plan total.

Crawlrun history (RSS feed)

Use this endpoint in your API requests to collect information relevant to the history of your runs for a particular extractor.

You can use Zapier to integrate your RSS feed.

This endpoint does not count as a query toward your plan total.