Agentset
Open-source RAG platform with built-in citations, deep research, 22+ file format support. MCP integration for agent retrieval workflows. 1,983 GitHub stars and 5 commits on main in the last 30 days.
“Open-source RAG platform with built-in citations, deep research, 22+ file format support. 1,983 GitHub stars and 5 commits on main in the last 30 days.”
INSTALL THIS SERVER
{
"mcpServers": {
"agentset": {
"command": "python",
"args": [
"-m",
"agentset"
]
}
}
}
{
"mcpServers": {
"agentset": {
"command": "python",
"args": [
"-m",
"agentset"
]
}
}
}
{
"mcpServers": {
"agentset": {
"command": "python",
"args": [
"-m",
"agentset"
]
}
}
}
{
"mcpServers": {
"agentset": {
"command": "python",
"args": [
"-m",
"agentset"
]
}
}
}
{
"mcpServers": {
"agentset": {
"command": "python",
"args": [
"-m",
"agentset"
]
}
}
}
6 TOOLS AVAILABLE
OUR ASSESSMENT
- 1,983 GitHub stars.
- 5 commits on main in the last 30 days.
- MIT license.
- 22+ file format ingestion.
- Built-in citation metadata.
- Deep-research orchestration included.
- Self-hosted setup requires DevOps capacity (vector DB plus orchestration).
- Query quality depends on chunk-size tuning per corpus.
- Community-maintained.
AGENTSET_API_KEY (hosted) is account-scoped. Ingested documents may contain PII; restrict the agent flow to a workspace with appropriate retention and access policy.
RAG-driven agent workflows that need source citations; teams that want one open-source platform across document ingestion, retrieval, and research; cost-conscious orgs avoiding closed RAG vendors.
TECHNICAL DETAILS
ADOPTION METRICS
// Reading this1,983 stars on agentset-ai/agentset. 5 commits on main in the last 30 days.
// Reading thisPairs with Pinecone, Qdrant, Cognee, Codebase Memory, Arize Phoenix, MLflow, W&B, Engram, Mastra in ai-ml. Agentset owns the open-source RAG-with-citations slot.
SOURCES & VERIFICATION
We don't take any single directory's word for it. Before scoring, we cross-reference 4 public MCP sources, install the server ourselves against the clients we cover, and record when we last re-verified.
The same server, 4 different lenses. We reconcile these signals into our editorial score, which is why our number sometimes diverges from a directory-aggregate star count.
| Source | Their rating | Their star count | Their downloads | Last synced |
|---|---|---|---|---|
| AutomationSwitch This page | 4.3editorial | 1,983 | — | MAY 14, 2026 |
| PulseMCP | — unrated | unavailable | unavailable | MAY 14, 2026 |
| MCP.so | — unrated | unavailable | unavailable | MAY 14, 2026 |
| Glama | — unrated | unavailable | unavailable | MAY 14, 2026 |
| Smithery | — unrated | unavailable | unavailable | MAY 14, 2026 |
// Counts are directory-reported; we don't adjust them. Discrepancies usually come from different snapshot times or star-caching.
OTHER AI / ML MCP SERVERS
Cognee MCP
Knowledge graph plus vector memory engine for AI agents, exposed as an MCP server with V2 session-aware memory tools (remember, recall, forget, improve) and classic V1 ingestion pipelines (cognify, codify). Three transports: stdio, SSE, Streamable HTTP. 16,965 GitHub stars, Apache-2.0.
HeyGen Hyperframes
Write HTML, render video. Built for agents. HeyGen's framework that turns HTML templates into video output via MCP. 16,400 GitHub stars and 100 commits on main in the last 30 days.
Codebase Memory MCP
High-performance code intelligence MCP server for AI coding agents. Indexes a codebase into a queryable knowledge graph in milliseconds, with 14 MCP tools spanning structural search, call-chain tracing, impact analysis, dead-code detection, and Cypher queries. Single static C binary, 66 languages via tree-sitter, zero runtime dependencies.
Arize Phoenix MCP
LLM observability platform exposing prompts, projects, traces, spans, sessions, datasets, and experiments through MCP. Published to npm as @arizeai/phoenix-mcp, current 4.0.8 (2026-04-29). 9,496 stars on parent monorepo, Elastic License 2.0.
Qdrant MCP Server
Official Qdrant vector database MCP server. Acts as a semantic memory layer on top of Qdrant: store information with metadata, retrieve via similarity search. Two tools, very small surface area, exceptionally maintained by the Qdrant team. Configurable embedding provider (fastembed default), supports remote and local Qdrant clusters.
Mastra
TypeScript framework for building AI-powered applications and agents, with built-in MCP server support. From the team behind Gatsby. 23,680 GitHub stars and 100 commits on main in the last 30 days.
DISCUSS YOUR
MCP REQUIREMENTS.
Evaluating a server, scoping an internal deployment, or working out whether MCP is the right fit at all. Start the conversation and we will point you at the right piece of the ecosystem.