Contents
Get the full page contents, summaries, and metadata for a list of URLs.
Returns instant results from our cache, with automatic live crawling as fallback for uncached pages.
Authorizations
Pass your Exa API key in the x-api-key header. Alternatively, send it in an Authorization header as a Bearer token.
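The two header styles described above can be sketched as follows. This is a minimal illustration; the helper name `build_auth_headers` and its `use_bearer` flag are hypothetical and not part of any Exa SDK.

```python
def build_auth_headers(api_key: str, use_bearer: bool = False) -> dict[str, str]:
    """Return HTTP headers carrying the API key in one of the two documented forms."""
    if not api_key:
        raise ValueError("api_key must be a non-empty string")
    if use_bearer:
        # Alternative form: the key is sent as a Bearer token.
        return {"Authorization": f"Bearer {api_key}"}
    # Default form: the key is sent in the x-api-key header.
    return {"x-api-key": api_key}
```

Only one of the two headers is needed per request.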
Body
Array of URLs to crawl (backwards compatible with 'ids' parameter).
Array length: 1 - 100 elements. String length: 1 - 2048.
Example: ["https://arxiv.org/pdf/2307.06435"]
Array of document IDs obtained from searches.
Array length: 1 - 100 elements. String length: 1 - 2048.
Example: ["https://arxiv.org/pdf/2307.06435"]
Enterprise-only compliance mode. Set to hipaa to require HIPAA-safe processing. Requests fail closed or restrict features when the requested behavior requires non-HIPAA-safe processors.
Available options: hipaa
Example: "hipaa"
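The size limits on the URL and ID arrays can be checked client-side before a request is sent. The sketch below assumes that "1 - 100 elements" bounds the array length and that "1 - 2048" bounds the length of each string; the function name `validate_url_list` is hypothetical.

```python
def validate_url_list(urls: list[str]) -> list[str]:
    """Check a URL (or ID) list against the documented size limits."""
    # Assumed meaning: between 1 and 100 entries per request.
    if not 1 <= len(urls) <= 100:
        raise ValueError(f"expected 1 to 100 entries, got {len(urls)}")
    for url in urls:
        # Assumed meaning: each entry is 1 to 2048 characters long.
        if not 1 <= len(url) <= 2048:
            raise ValueError(f"entry length {len(url)} is outside 1 to 2048")
    return urls
```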
Text extraction options for each result.
Text snippets the LLM identifies as most relevant from each page.
Summary of the webpage.
Extra parameters to pass.
Deprecated: Use highlights or text instead. Returns page contents as a combined context string.
Example: true
Deprecated: Use maxAgeHours instead for content freshness control. livecrawl does not guarantee freshly fetched parser output and may be served according to server freshness policy. Do not send livecrawl and maxAgeHours together.
Available options: never, always, fallback, preferred
Example: "preferred"
The timeout for livecrawling in milliseconds.
Required range: 0 < x <= 90000
Example: 1000
Maximum age of cached content in hours. Positive values use cached content if it is less than this many hours old; 0 fetches fresh content and is the supported way to apply text rendering options to newly fetched pages; -1 always uses cache; omitted uses fallback fetching when cached content is unavailable. Maximum supported value is 720 hours.
Required range: -1 <= x <= 720
Example: 24
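The freshness rules above can be summarized in code. This is a sketch of the documented semantics only, not of the server's implementation: the function names and the returned labels are hypothetical, while `maxAgeHours` and `livecrawl` are the field names used in the descriptions above.

```python
from typing import Optional


def cache_policy(max_age_hours: Optional[float]) -> str:
    """Map a maxAgeHours value to a label for the documented behavior."""
    if max_age_hours is None:
        return "fallback"        # omitted: fetch only when no cached copy exists
    if not -1 <= max_age_hours <= 720:
        raise ValueError("maxAgeHours must be between -1 and 720")
    if max_age_hours == -1:
        return "cache-only"      # always use the cache
    if max_age_hours == 0:
        return "fresh"           # always fetch fresh content
    return "cache-if-newer"      # use the cache if younger than the limit


def freshness_fields(max_age_hours: Optional[float] = None,
                     livecrawl: Optional[str] = None) -> dict:
    """Build the freshness part of a request body, rejecting the disallowed mix."""
    if max_age_hours is not None and livecrawl is not None:
        raise ValueError("do not send livecrawl and maxAgeHours together")
    fields = {}
    if max_age_hours is not None:
        cache_policy(max_age_hours)  # range check only
        fields["maxAgeHours"] = max_age_hours
    if livecrawl is not None:
        fields["livecrawl"] = livecrawl  # deprecated option
    return fields
```

For example, `freshness_fields(max_age_hours=24)` yields a body fragment that accepts cached content up to a day old.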
The number of subpages to crawl. The actual number crawled may be limited by system constraints.
Required range: 0 <= x <= 100
Example: 1
Term to find specific subpages of search results. Can be a single string or an array of strings.
Length: 1 - 100
Example: "sources"
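Putting the body parameters together, a request might be assembled as in the sketch below. Several names here are assumptions because this section does not state them: the endpoint URL `https://api.exa.ai/contents`, the body field names `urls`, `subpages`, and `subpageTarget`, and the use of `"text": true` to request page text. Check them against the API schema before relying on this. The request is only built, not sent.

```python
import json
import urllib.request

# Assumed endpoint; the URL is not stated in this section.
API_URL = "https://api.exa.ai/contents"


def build_contents_request(api_key, urls, max_age_hours=None,
                           subpages=0, subpage_target=None):
    """Build (but do not send) a POST request for page contents."""
    body = {"urls": urls, "text": True}  # assumed field names
    if max_age_hours is not None:
        body["maxAgeHours"] = max_age_hours
    if subpages:
        body["subpages"] = subpages  # assumed field name
    if subpage_target is not None:
        body["subpageTarget"] = subpage_target  # assumed field name
    return urllib.request.Request(
        API_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would perform the call.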
Response
OK
Unique identifier for the request.
"b5947044c4b78efa9552a7c89b306d95"
Deprecated. Combined context string from search results. Use highlights or text instead.
Status information for each requested URL or document ID.
Endpoint-dependent estimated dollar cost breakdown for the completed request. Billing is computed from usage counters rather than this response object.
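Because status information is reported per URL or document ID, a client will usually want to separate the entries that failed. The sketch below assumes a response shape that this section does not spell out: a `statuses` list whose entries carry an `id` and a `status` string equal to `"success"` on success. Both the shape and the helper name `failed_ids` are assumptions, and the sample payload is hand-written for illustration, not real API output.

```python
def failed_ids(response: dict) -> list[str]:
    """Return the IDs whose status entry is anything other than 'success'."""
    return [
        entry["id"]
        for entry in response.get("statuses", [])  # assumed field name
        if entry.get("status") != "success"        # assumed success marker
    ]


# Hand-written illustrative payload (assumed shape, not real output).
sample = {
    "requestId": "b5947044c4b78efa9552a7c89b306d95",
    "statuses": [
        {"id": "https://arxiv.org/pdf/2307.06435", "status": "success"},
        {"id": "https://example.com/missing", "status": "error"},
    ],
}
retry_candidates = failed_ids(sample)
```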

