3.1.0
When a server exposes hundreds or thousands of tools, sending the full catalog to an LLM wastes tokens and degrades tool selection accuracy. Search transforms solve this by replacing the tool listing with a search interface — the LLM discovers tools on demand instead of receiving everything upfront.
How It Works
When you add a search transform,list_tools() returns just two synthetic tools instead of the full catalog:
search_toolsfinds tools matching a query and returns their full definitionscall_toolexecutes a discovered tool by name
"email" would match a tool named send_email, a tool with “email” in its description, or a tool with an email_address parameter.
Search results are returned in the same JSON format as list_tools, including the full input schema, so the LLM can construct valid calls immediately without a second round-trip.
Search Strategies
FastMCP provides two search transforms. They share the same interface — two synthetic tools, same configuration options — but differ in how they match queries to tools.Regex Search
RegexSearchTransform matches tools against a regex pattern using case-insensitive re.search. It has zero overhead and no index to build, making it a good default when the LLM knows roughly what it’s looking for.
search_tools call takes a pattern parameter — a regex string:
BM25 Search
BM25SearchTransform ranks tools by relevance using the BM25 Okapi algorithm. It’s better for natural language queries because it scores each tool based on term frequency and document rarity, returning results ranked by relevance rather than filtering by match/no-match.
search_tools call takes a query parameter — natural language:
Which to Choose
Use regex when your LLM is good at constructing targeted patterns and you want deterministic, predictable results. Regex is also simpler to debug — you can see exactly what pattern was sent. Use BM25 when your LLM tends to describe what it needs in natural language, or when your tool catalog has nuanced descriptions where relevance ranking adds value. BM25 handles partial matches and synonyms better because it scores on individual terms rather than requiring a single pattern to match.Configuration
Both search transforms accept the same configuration options.Limiting Results
By default, search returns at most 5 tools. Adjustmax_results based on your catalog size and how much context you want the LLM to receive per search:
Pinning Tools
Some tools should always be visible regardless of search. Usealways_visible to pin them in the listing alongside the synthetic tools:
list_tools so the LLM can call them without searching. They’re excluded from search results to avoid duplication.
Custom Tool Names
The default namessearch_tools and call_tool can be changed to avoid conflicts with real tools:
The call_tool Proxy
The call_tool proxy forwards calls to the real tool. When a client calls call_tool(name="search_database", arguments={...}), the proxy resolves search_database through the server’s normal tool pipeline — including transforms and middleware — and executes it.
The proxy rejects attempts to call the synthetic tools themselves. call_tool(name="call_tool") raises an error rather than recursing.
Tools discovered through search can also be called directly via
client.call_tool("search_database", {...}) without going through the proxy. The proxy exists for LLMs that only know about the tools returned by list_tools and need a way to invoke discovered tools through a tool they can see.Auth and Visibility
Search results respect the full authorization pipeline. Tools filtered by middleware, visibility transforms, or component-level auth checks won’t appear in search results. The search tool querieslist_tools() through the complete pipeline at search time, so the same filtering that controls what a client sees in the listing also controls what they can discover through search.
ctx.disable_components()) are also reflected immediately in search results.
