Introducing Parallel FindAll

3 min read Original article ↗

Parallel's new FindAll API turns natural language queries into custom datasets from the web. It finds entities like companies, people, or locations based on your criteria, then enriches them with structured data—all with citations. FindAll Pro achieves 61% recall, 3x better than competitors.

## Turn the web into your own structured dataset

**FindAll** finds any set of entities (companies, people, events, locations, houses, etc.) based on a set of match criteria. For example, with **FindAll, **you can run a natural language query like “Find all dental practices located in Ohio that have 4+ star Google reviews.”

Find all dental practicies in ohio with a 4+ star rating on google

![Find all dental practicies in ohio with a 4+ star rating on google](https://cdn.sanity.io/images/5hzduz3y/production/69fdb0f405893405132f44fd83130f721ee5b7c8-3278x1948.png)
An example of a FindAll query

This is a powerful way to discover the complete long tail of interesting entities from the web and filter them down with match criteria that are personalized to your unique use case. The result is an extensible tool that can produce high-quality datasets on demand, as opposed to buying static, stale, and generic datasets.

## How FindAll works

FindAll executes a three-stage pipeline optimized for both coverage and efficiency:

**1. Generate candidates from web data: FindAll** searches across our proprietary web index to identify potential entities matching your query. Unlike traditional search, which returns a fixed result set, **FindAll** generates candidates dynamically based on your specific criteria.

**2. Evaluate against match conditions: **Each candidate is evaluated against your match conditions using multi-hop reasoning across web sources. Only candidates which satisfy all conditions reach matched status and are included in the results. This staged approach means you only pay to process entities that actually matter.

**3. Extract Structured Enrichments: **For matched entities, **FindAll** automatically orchestrates our **Task API**[**Task API**](https://docs.parallel.ai/task-api/task-quickstart) to extract any additional fields you've specified— from basic attributes like revenue and employee count to complex data points like the strategic initiatives a company is prioritizing.

Illustration demonstrating deep research API concepts, web search capabilities, or AI agent integration features

![](https://cdn.sanity.io/images/5hzduz3y/production/3ef8b555150213bf564e8d57f6d24b25e6cb49bc-1066x1718.png)

## State-of-the-art performance

To test the performance of **FindAll**, we created our own benchmark of 40 complex multi-criteria queries covering public companies, startups, SMBs, specialized entities, and people (e.g., executives, researchers, and professionals). Recall measures the proportion of all correct matches within the entire competitive set of successfully identified entities.

Some sample questions:

  • - "Find all former McKinsey & Company consultants who are currently employed in C-level or VP positions at healthcare technology startups with Series A or later funding" — combines employment history, current role level, industry focus, and funding stage.
  • - "Find all wedding venues in Florida with capacity between 150-300 guests that offer both indoor and outdoor ceremony options, provide in-house catering, and have availability in 2025" — combines location, capacity range, facility features, service offerings, and temporal availability.
  • - "Find all climate technology startups that have active pilot programs with Fortune 500 companies, raised pre-Series A funding, and focus on carbon capture or renewable energy storage" — combines industry focus, corporate partnerships, funding stage, and specific technology areas.

****

**FindAll Pro** achieves state-of-the-art results with 61% recall, ~3X higher than OpenAI Deep Research, Anthropic Deep Research, and Exa. Higher recall means that Parallel **FindAll **finds more correct matches for a given query.** FindAll** **Base** also achieves 30% recall while being the lowest cost on the market, making it the most cost-effective yet performant option.

## About Parallel Web Systems

Parallel develops critical web search infrastructure for AI. Our suite of web search and agent APIs is built on a rapidly growing proprietary index of the global internet. These solutions transform human tasks that previously took days and weeks into agentic tasks that now take seconds and minutes.

Fortune 100 and 500 companies use Parallel’s web intelligence APIs in insurance, finance, and retail, as well as AI-first businesses like Clay, Starbridge, and Sourcegraph.

Parallel avatar

By Parallel

November 18, 2025

## Related Posts64

Actively + Parallel

Parallel Raises at $2 Billion Valuation to Scale Web Infrastructure for Agents

Search & Extract Benchmarks

How Finch is scaling plaintiff law with AI agents that research like associates

Genpact and Parallel Web Systems Partner to Drive Tangible Efficiency from AI Systems

Genpact & Parallel

DeepSearchQA: Parallel Task API benchmarks deepresearch

How Modal saves tens of thousands annually by building in-house GTM pipelines with Parallel

Opendoor and Parallel Case Study

Introducing stateful web research agents with multi-turn conversations

Parallel is now live on Tempo via the Machine Payments Protocol (MPP)

Kepler | Parallel Case Study

Introducing the Parallel CLI

Profound + Parallel Web Systems

How Harvey is expanding legal AI internationally with Parallel

Tabstack + Parallel Case Study

Parallel | Vercel

Product release: Authenticated page access for the Parallel Task API

Introducing structured outputs for the Monitor API

Product release: Research Models with Basis for the Parallel Chat API

Parallel + Cerebras

DeepSearch QA: Task API

Product release: Granular Basis

How Amp’s coding agents build better software with Parallel Search

Latency improvements on the Parallel Task API

Product release: Extract

Product release: Monitor API

Parallel raises $100M Series A to build web infrastructure for agents

Product release - Parallel Search API

Benchmarks: SealQA: Task API

Introducing LLMTEXT, an open source toolkit for the llms.txt standard

Starbridge + Parallel

Building a market research platform with Parallel Deep Research

How Lindy brings state-of-the-art web research to automation flows

Introducing the Parallel Task MCP Server

Introducing the Core2x Processor for improved compute control on the Task API

How Day AI merges private and public data for business intelligence

Full Basis framework for all Task API Processors

Building a real-time streaming task manager with Parallel

How Gumloop built a new AI automation framework with web intelligence as a core node

Introducing the TypeScript SDK

Building a serverless competitive intelligence platform with MCP + Task API

Introducing Parallel Deep Research reports

BrowseComp / DeepResearch: Task API

Building a Full-Stack Search Agent with Parallel and Cerebras

Webhooks for the Parallel Task API

Introducing Parallel: Web Search Infrastructure for AIs

Introducing SSE for Task Runs

A new line of advanced Processors: Ultra2x, Ultra4x, and Ultra8x

Introducing Auto Mode for the Parallel Task API

A linear dithering of a search interface for agents

Parallel Search MCP Server in Devin

Introducing Tool Calling via MCP Servers

Introducing the Parallel Search MCP Server

Starting today, Source Policy is available for both the Parallel Task API and Search API - giving you granular control over which sources your AI agents access and how results are prioritized.

The Parallel Task Group API

State of the Art Deep Research APIs

Introducing the Parallel Search API

Introducing the Parallel Chat API - a low latency web research API for web based LLM completions. The Parallel Chat API returns completions in text and structured JSON format, and is OpenAI Chat Completions compatible.

Parallel Web Systems introduces Basis with calibrated confidences - a new verification framework for AI web research and search API outputs that sets a new industry standard for transparent and reliable deep research.

The Parallel Task API is a state-of-the-art system for automated web research that delivers the highest accuracy at every price point.