`search()` API call. These flags are off by default. They are useful when building agents that require fine-grained retrieval control.
Mode | Approximate Latency |
---|---|
keyword_search | <10ms |
rerank | 150–200ms |
filter_memories | 200–300ms |
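
The figures in the table can be turned into a rough latency budget when deciding which modes to enable. The following is a minimal sketch, not part of any official client: the helper name `estimate_added_latency_ms` is hypothetical, and it assumes that the per-mode overheads simply add up when several modes are enabled, which the table itself does not state.

```python
# Approximate per-mode latency ranges in milliseconds, copied from the table above.
# keyword_search is listed as "<10ms", so its lower bound is taken as 0 here.
MODE_LATENCY_MS = {
    "keyword_search": (0, 10),
    "rerank": (150, 200),
    "filter_memories": (200, 300),
}


def estimate_added_latency_ms(enabled_modes):
    """Return a (low, high) estimate of extra latency for the enabled modes.

    Assumption: overheads are additive. Treat the result as a planning
    figure, not a measured or guaranteed value.
    """
    low = sum(MODE_LATENCY_MS[mode][0] for mode in enabled_modes)
    high = sum(MODE_LATENCY_MS[mode][1] for mode in enabled_modes)
    return low, high


print(estimate_added_latency_ms(["keyword_search", "rerank"]))  # (150, 210)
print(estimate_added_latency_ms(["keyword_search", "rerank", "filter_memories"]))  # (350, 510)
```

Under the additive assumption, enabling all three modes would add somewhere around 350 to 510 ms per call, which is worth weighing against an agent's response-time target.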
- `keyword_search` for broader recall when query context is limited
- `rerank` to prioritize the most relevant result
- `filter_memories` in production-facing or safety-critical agents

`search` method. There is no required sequence for these modes, and any combination can be used based on your needs.
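
To illustrate that the modes are independent and off unless requested, here is a small sketch. This section does not show the full signature of the search call, so passing the modes as boolean keyword arguments is an assumption, and both `build_search_flags` and the stand-in `search` function below are hypothetical placeholders rather than the real client.

```python
def build_search_flags(keyword_search=False, rerank=False, filter_memories=False):
    """Collect the retrieval flags; every flag defaults to False (off)."""
    return {
        "keyword_search": keyword_search,
        "rerank": rerank,
        "filter_memories": filter_memories,
    }


def search(query, **flags):
    """Hypothetical stand-in for the real search call; it only echoes its inputs."""
    enabled = sorted(name for name, is_on in flags.items() if is_on)
    return {"query": query, "enabled_modes": enabled}


# No flags passed: every mode stays off.
print(search("favorite cuisine", **build_search_flags()))

# Any combination can be requested, in any order.
print(search("favorite cuisine", **build_search_flags(filter_memories=True, keyword_search=True)))
```

Building the flags in one place keeps the defaults explicit and makes it easy to toggle modes per request without changing the call site.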