One MCP server. Ask anything about AI research — and get the truth.
Semantic Scholar gives you a website. OpenAlex gives you a REST API. BrunoSan gives you a chatbar MCP — fully queryable by any AI agent, deterministic, UUID-stable, EU-hosted. No other system exposes all of this as MCP.
Every paper is a UUID-stable object. Every connection is an explicit edge.
det_uuid("arxiv_paper", arxiv_id): same input, same UUID. Always. No vectors, no approximation, no hallucination.
Pure SQL over verified ArXiv data.
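To make the "same input, same UUID" property concrete, here is a minimal sketch of a deterministic ID function built on Python's standard name-based UUIDs (version 5). The namespace value, the "kind:key" string layout, and the sample identifiers are assumptions for illustration; the actual det_uuid implementation is not shown in this document.

```python
import uuid

# Hypothetical namespace: the service's real namespace value is not published here.
NAMESPACE = uuid.uuid5(uuid.NAMESPACE_URL, "https://example.org/det-uuid-sketch")


def det_uuid(kind: str, key: str) -> uuid.UUID:
    """Derive a name-based (version 5, SHA-1) UUID from an object kind and its key."""
    # Assumed layout: kind and key joined with a colon.
    return uuid.uuid5(NAMESPACE, f"{kind}:{key}")


first = det_uuid("arxiv_paper", "2106.09685")
second = det_uuid("arxiv_paper", "2106.09685")
other = det_uuid("arxiv_paper", "2005.11401")

print(first == second)  # True: identical input yields the identical UUID
print(first == other)   # False: a different arxiv_id yields a different UUID
```

Because the UUID is a pure function of its inputs, two independent runs of an ingestion pipeline would agree on every ID without coordinating through a shared counter.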
Citation graph. Co-occurrence network. Author → Institution edges. Paper → GitHub repo links. Questions that no other platform answers as an API: "Which papers cite this paper?" "Where do LoRA and MMLU overlap?" Two JOINs. No inference.
Entity trends across months and years. When did LoRA explode? When did RAG become mainstream? When did Chain-of-Thought peak? Every data point is deterministic — grouped by month, quarter, or year. Reproducible. Auditable.
These are the questions every AI researcher, investor, and journalist asks first. No website answers them. No other MCP exposes them. BrunoSan does — deterministically.
"Which 20 papers are cited most often in AI research right now?"
arxiv_most_cited(limit=20)
COUNT(refs.target_arxiv_id) with GROUP BY: pure SQL over the citation edges. No inference. The most cited paper in our database is whichever paper has the most inbound citation edges, not an opinion.
"How did LoRA grow — month by month from 2023 to today?"
arxiv_entity_trend("LoRA", granularity="month")
Tracks entity mentions across all indexed papers over time. Watch LoRA explode in early 2023. Watch RAG go mainstream. Watch BERT slowly decline. Grouped by month, quarter, or year, and fully deterministic.
★ 30+ monthly data points per entity
"Show me all papers mentioning GPT-4 AND RLHF together."
arxiv_co_occurrence("GPT-4", "RLHF")
Intersection over all indexed papers via two JOINs. Reveals research that explicitly bridges two concepts, not just papers mentioning one. The papers where LoRA meets MMLU, or RAG meets Chain-of-Thought.
★ No hallucination: explicit graph edges only
Four capability blocks: Search, Trends, People, Graph. Each tool is purpose-built, read-only, deterministic. Every result is citable. Every query is reproducible.
Full-text search over the indexed papers using FTS5, SQLite's built-in full-text search extension. Searches title and abstract simultaneously. Supports boolean operators, phrase matching, and prefix search. Filter by category, date range, empirical-only, or papers with a code release.
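The three query styles named above (phrase, prefix, boolean) can be tried against a toy FTS5 index. This is a sketch under assumptions: the table name papers_fts, its columns, and the sample rows are invented for illustration, and it needs a Python build whose SQLite includes FTS5. The category and date filters are left out.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table and column names; arxiv_id is stored but not searched.
conn.execute(
    "CREATE VIRTUAL TABLE papers_fts USING fts5(arxiv_id UNINDEXED, title, abstract)"
)
conn.executemany(
    "INSERT INTO papers_fts VALUES (?, ?, ?)",
    [
        ("a1", "Low-rank adaptation of language models",
         "We adapt large models cheaply."),
        ("a2", "A survey of retrieval methods",
         "Covers low-rank adaptation and retrieval augmented generation."),
        ("a3", "Benchmarking reasoning",
         "Chain of thought prompting on math tasks."),
    ],
)


def search(query):
    """Return matching arxiv_ids, best match first; MATCH covers title and abstract."""
    rows = conn.execute(
        "SELECT arxiv_id FROM papers_fts WHERE papers_fts MATCH ? ORDER BY rank",
        (query,),
    )
    return [row[0] for row in rows]


print(search('"low-rank adaptation"'))   # phrase match in title or abstract
print(search("retriev*"))                # prefix search
print(search("adaptation NOT survey"))   # boolean operator
```

Matching against the table name rather than a single column is what lets one query hit both title and abstract.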
Complete paper object with all connected data — authors with their position (first/middle/last), matched entities (benchmarks, models, methods, datasets), full reference list (up to 100), and linked GitHub repos. The full knowledge graph node in one call.
Entity ranking by mention count across all papers — benchmarks, models, methods, datasets.
The title_only flag is a precision filter:
MMLU in the title means the paper IS about MMLU. MMLU in the abstract means it just uses it.
That distinction exists nowhere else.
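A small sketch of how a title_only filter changes an entity ranking. The row layout (paper, entity, found-in-title flag) and the sample rows are hypothetical; only the title_only idea comes from the description above.

```python
from collections import Counter

# Hypothetical mention rows: (arxiv_id, entity, found_in_title)
MENTIONS = [
    ("p1", "MMLU", True),
    ("p2", "MMLU", False),
    ("p3", "MMLU", False),
    ("p1", "LoRA", False),
    ("p4", "LoRA", True),
]


def rank_entities(mentions, title_only=False):
    """Count mentions per entity, optionally keeping only title mentions."""
    counts = Counter(
        entity
        for _, entity, in_title in mentions
        if in_title or not title_only
    )
    return counts.most_common()


print(rank_entities(MENTIONS))                   # [('MMLU', 3), ('LoRA', 2)]
print(rank_entities(MENTIONS, title_only=True))  # one title mention each
```

With the flag set, papers that merely use a benchmark drop out and only papers that name it in the title remain.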
How often is an entity mentioned — per month, quarter, or year? Tracks the rise and fall of any benchmark, model, method, or dataset across the entire research literature. Watch LoRA explode. Watch BERT decline. Watch RAG go from niche to mainstream. All deterministic.
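The grouping behind such a trend can be sketched with plain SQL over a toy database. The schema (papers, mentions), the column names, and the sample rows are assumptions for illustration; the function mirrors the shape of arxiv_entity_trend but is not the server's implementation.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: one row per paper, one row per (paper, entity) mention.
conn.executescript("""
CREATE TABLE papers (arxiv_id TEXT PRIMARY KEY, published TEXT);
CREATE TABLE mentions (arxiv_id TEXT, entity TEXT);
INSERT INTO papers VALUES
  ('p1', '2023-01-15'), ('p2', '2023-01-28'),
  ('p3', '2023-02-03'), ('p4', '2023-05-20');
INSERT INTO mentions VALUES
  ('p1', 'LoRA'), ('p2', 'LoRA'), ('p3', 'LoRA'), ('p4', 'LoRA'), ('p4', 'RAG');
""")

# One bucket expression per granularity; months 1-3 map to Q1, 4-6 to Q2, etc.
BUCKETS = {
    "month": "strftime('%Y-%m', p.published)",
    "quarter": "strftime('%Y', p.published) || '-Q' || "
               "((CAST(strftime('%m', p.published) AS INTEGER) + 2) / 3)",
    "year": "strftime('%Y', p.published)",
}


def entity_trend(entity, granularity="month"):
    """Count distinct papers mentioning an entity per time bucket."""
    bucket = BUCKETS[granularity]  # taken from a fixed whitelist, never user text
    sql = f"""
        SELECT {bucket} AS period, COUNT(DISTINCT m.arxiv_id) AS n
        FROM mentions m
        JOIN papers p ON p.arxiv_id = m.arxiv_id
        WHERE m.entity = ?
        GROUP BY period
        ORDER BY period
    """
    return conn.execute(sql, (entity,)).fetchall()


print(entity_trend("LoRA", "month"))    # [('2023-01', 2), ('2023-02', 1), ('2023-05', 1)]
print(entity_trend("LoRA", "quarter"))  # [('2023-Q1', 3), ('2023-Q2', 1)]
```

The same rows always produce the same buckets, which is what makes a trend line reproducible.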
Researchers ranked by paper count — with a critical role filter.
role=last_author is the PI filter:
in academic AI, the last author is typically the lab director, the grant holder, the research agenda setter.
role=first_author finds who does the work.
No other research intelligence system exposes this distinction as an API.
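The role filter reduces to each author's position in the author list. In the sketch below the paper IDs and researcher names are invented, and treating a single-author paper as first_author is an assumption, not documented behaviour.

```python
from collections import Counter

# Hypothetical data: arxiv_id -> ordered author list (names are invented).
PAPERS = {
    "p1": ["Ana Ruiz", "Ben Cho", "Dana Weiss"],
    "p2": ["Ben Cho", "Dana Weiss"],
    "p3": ["Eli Park", "Ana Ruiz", "Dana Weiss"],
    "p4": ["Ana Ruiz"],
}


def role_of(index, n_authors):
    """Map a zero-based position to first/middle/last author."""
    if index == 0:
        return "first_author"  # assumption: a sole author counts as first author
    if index == n_authors - 1:
        return "last_author"
    return "middle_author"


def rank_researchers(papers, role=None):
    """Rank researchers by paper count, optionally restricted to one role."""
    counts = Counter()
    for authors in papers.values():
        for index, name in enumerate(authors):
            if role is None or role_of(index, len(authors)) == role:
                counts[name] += 1
    return counts.most_common()


print(rank_researchers(PAPERS, role="last_author"))   # [('Dana Weiss', 3)]
print(rank_researchers(PAPERS, role="first_author"))
```

In this toy data the last-author view surfaces one name that the first-author view never shows, which is the point of exposing the role as a parameter.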
All papers by a specific researcher, newest first, with their position on each paper. Fuzzy name matching handles variations. Shows the first/middle/last role per paper, the LLM task classification, and a one-sentence contribution summary (where available).
Institutions ranked by paper count — with an optional second signal from GitHub org links. Affiliation data comes from ArXiv HTML parsing. GitHub orgs (openai, google-deepmind, microsoft) are often more complete. Combining both signals gives the most accurate picture of who produces AI research — and who ships the code.
The most cited papers in the database — ranked by inbound citation count.
COUNT(refs.target_arxiv_id) with GROUP BY over the citation edges. Pure SQL. No inference.
This is the question every researcher, VC, and journalist asks first.
No other platform answers it as a queryable API.
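The ranking described here is a single aggregate query. A minimal sketch, assuming a refs table with one row per citation edge: the column target_arxiv_id comes from the text above, while source_arxiv_id, the sample edges, and the tie-break on ID are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# One row per citation edge: source cites target. source_arxiv_id is an assumed name.
conn.executescript("""
CREATE TABLE refs (source_arxiv_id TEXT, target_arxiv_id TEXT);
INSERT INTO refs VALUES
  ('p2', 'p1'), ('p3', 'p1'), ('p4', 'p1'),
  ('p3', 'p2'), ('p4', 'p2'),
  ('p4', 'p3');
""")


def most_cited(limit=20):
    """Rank papers by inbound citation count; ties are broken by ID for stable output."""
    return conn.execute(
        """
        SELECT target_arxiv_id, COUNT(target_arxiv_id) AS cited_by
        FROM refs
        GROUP BY target_arxiv_id
        ORDER BY cited_by DESC, target_arxiv_id
        LIMIT ?
        """,
        (limit,),
    ).fetchall()


print(most_cited(limit=2))  # [('p1', 3), ('p2', 2)]
```

The explicit secondary sort key matters for reproducibility: without it, papers with equal counts could come back in a different order between runs.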
Citation graph for any paper — who cites it, or what does it cite.
Direction cited_by finds papers in our database that reference this paper.
Direction citing shows its full reference list with ArXiv IDs.
Depth 2 expands one hop further — the papers that cite the papers that cite it.
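The direction and depth parameters amount to a breadth-first walk over citation edges. The edge list and function below are a hypothetical sketch of that walk, not the tool's implementation.

```python
# Hypothetical citation edges: (citing paper, cited paper)
EDGES = [("B", "A"), ("C", "A"), ("D", "B"), ("E", "D"), ("A", "Z")]


def citation_graph(paper, direction="cited_by", depth=1):
    """Return one sorted list of papers per hop, walking edges in the given direction."""
    seen = {paper}
    frontier = [paper]
    levels = []
    for _ in range(depth):
        next_frontier = []
        for node in frontier:
            for source, target in EDGES:
                if direction == "cited_by" and target == node:
                    neighbor = source   # papers that reference this node
                elif direction == "citing" and source == node:
                    neighbor = target   # papers this node references
                else:
                    continue
                if neighbor not in seen:
                    seen.add(neighbor)
                    next_frontier.append(neighbor)
        levels.append(sorted(next_frontier))
        frontier = next_frontier
    return levels


print(citation_graph("A", "cited_by", depth=1))  # [['B', 'C']]
print(citation_graph("A", "cited_by", depth=2))  # [['B', 'C'], ['D']]
print(citation_graph("A", "citing"))             # [['Z']]
```

Tracking visited papers keeps a second hop from reporting a paper that already appeared in the first.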
Papers that mention BOTH entity A and entity B: the intersection over all indexed papers via two JOINs. Reveals research that explicitly bridges two concepts. The papers where LoRA meets MMLU. Where RAG meets Chain-of-Thought. Where GPT-4 meets RLHF. No hallucination: explicit graph edges only.
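The "two JOINs" can be written out directly: join the mention table once per entity and keep the papers that satisfy both. The schema and rows below are assumptions for illustration; only the two-JOIN intersection idea is taken from the description.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: papers plus one row per (paper, entity) mention.
conn.executescript("""
CREATE TABLE papers (arxiv_id TEXT PRIMARY KEY, title TEXT);
CREATE TABLE mentions (arxiv_id TEXT, entity TEXT);
INSERT INTO papers VALUES
  ('p1', 'Paper one'), ('p2', 'Paper two'),
  ('p3', 'Paper three'), ('p4', 'Paper four');
INSERT INTO mentions VALUES
  ('p1', 'GPT-4'), ('p1', 'RLHF'),
  ('p2', 'GPT-4'),
  ('p3', 'RLHF'), ('p3', 'LoRA'),
  ('p4', 'GPT-4'), ('p4', 'RLHF'), ('p4', 'LoRA');
""")


def co_occurrence(entity_a, entity_b):
    """Return IDs of papers that mention both entities, via one JOIN per entity."""
    rows = conn.execute(
        """
        SELECT DISTINCT p.arxiv_id
        FROM papers p
        JOIN mentions ma ON ma.arxiv_id = p.arxiv_id AND ma.entity = ?
        JOIN mentions mb ON mb.arxiv_id = p.arxiv_id AND mb.entity = ?
        ORDER BY p.arxiv_id
        """,
        (entity_a, entity_b),
    )
    return [row[0] for row in rows]


print(co_occurrence("GPT-4", "RLHF"))  # ['p1', 'p4']
```

Inner joins drop every paper that lacks either mention, so the result is an exact set intersection with no similarity scoring involved.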
GitHub organizations and repositories ranked by paper count: the open-source output of the research community. repos.org is extracted from the GitHub links in paper HTML.
Shows which orgs (openai, google-deepmind, microsoft, huggingface) ship the most research code.
Filterable by org name and date.
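Extracting an org from a GitHub link and ranking orgs by paper count might look like the sketch below. The parsing rules, the repository names, and the paper IDs are hypothetical; the real extraction pipeline is not described beyond the lines above.

```python
from collections import Counter
from urllib.parse import urlparse

# Hypothetical (arxiv_id, link) pairs as they might be found in paper HTML.
LINKS = [
    ("p1", "https://github.com/huggingface/example-repo-a"),
    ("p2", "https://github.com/huggingface/example-repo-b"),
    ("p2", "https://github.com/openai/example-repo-c"),
    ("p3", "https://github.com/OpenAI/example-repo-d/tree/main"),
    ("p3", "https://example.com/not/github"),
    ("p4", "https://www.github.com/huggingface/example-repo-a"),
]


def github_org(url):
    """Return the lowercased org of a GitHub repo URL, or None if it is not one."""
    parsed = urlparse(url)
    if parsed.netloc.lower() not in ("github.com", "www.github.com"):
        return None
    parts = [part for part in parsed.path.split("/") if part]
    if len(parts) < 2:
        return None  # an org page without a repository
    return parts[0].lower()


def orgs_by_paper_count(links):
    """Count distinct papers per org, so two repos in one paper count once."""
    pairs = {(paper, github_org(url)) for paper, url in links}
    counts = Counter(org for _, org in pairs if org is not None)
    return counts.most_common()


print(orgs_by_paper_count(LINKS))  # [('huggingface', 3), ('openai', 2)]
```

Lowercasing the org keeps "OpenAI" and "openai" in one bucket, since GitHub treats account names case-insensitively.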
Full system status in one call — database counts, pipeline progress percentages, frontier date (how far back the backfill has reached), quality report from the last automated check, and the 5 most recent quality log entries. The health dashboard for the entire knowledge graph.
One URL. Any MCP-compatible agent. API key required for all tool calls.
For Claude.ai, Claude Code, Cursor, n8n, and any MCP-compatible platform. Copy the URL — that's it.
https://arxiv.mcp.brunosan.de/mcp
api_key parameter.
All intelligence verticals, one API key. AI News · Biotech · Crypto · Cyber · Finance · Geopolitics · Regulatory · Robotics · ArXiv.
All payments via Mollie — secure, EU-based payment processing. Questions? hello@brunosan.de
Find the most cited papers in your area. Track which benchmarks dominate. Discover who is building on your work. Citation network in one API call.
Spot research trends before they reach product. Identify the PIs whose labs become startups. Track which orgs ship the most research code on GitHub.
Give your agent real research context — not hallucinated summaries. Ask about SOTA methods, benchmark results, and paper citations. All deterministic.
"What is the most cited AI paper of 2025?" — one tool call. Entity trends over time. Institutional rankings. All citable, all sourced from ArXiv directly.
Live MCP demo for students — show how AI agents query research databases in real time. The ArXiv graph is the perfect hands-on MCP integration example.
Track competitor research labs over time. Monitor which methods your competitors are publishing on. Feed research signals into internal knowledge bases.