feat(pmc): add PubMed Central integration for biomedical full-text papers#34

Open
hongkongkiwi wants to merge 5 commits into openags:main from hongkongkiwi:feature/pmc-integration

@hongkongkiwi

Add PubMed Central (PMC) integration for biomedical full-text papers. PMC provides free access to complete articles, unlike PubMed, which provides abstracts only. This PR adds the search_pmc, get_pmc_paper, download_pmc, and read_pmc_paper tools.

…ic paper search

OpenAlex is a free and open catalog of the global research system with over 200M works.
This integration provides:

- Full paper search with advanced filters (year, type, full-text availability)
- Citation and reference traversal (forward and backward citations)
- Author-based search
- Related papers discovery based on concepts and references
- DOI and OpenAlex ID lookup
- Comprehensive metadata including concepts, keywords, and open access info

Features:
- search_openalex: Main search function with filtering options
- get_openalex_paper: Get paper by OpenAlex ID
- get_openalex_paper_by_doi: Get paper by DOI
- get_openalex_citations: Get papers that cite this work
- get_openalex_references: Get papers referenced by this work
- search_openalex_by_author: Search papers by author name
- get_openalex_related: Find related papers
- download_openalex: Download PDF from open access sources
- read_openalex_paper: Extract text from PDF
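
A minimal sketch of how the search filters and forward-citation lookup could map onto the public OpenAlex `/works` endpoint. The function names and parameters here are hypothetical and are not taken from this PR; only the endpoint and the `publication_year`, `type`, `has_fulltext`, and `cites` filters come from the OpenAlex API itself.

```python
from urllib.parse import urlencode

OPENALEX_WORKS_URL = "https://api.openalex.org/works"


def build_openalex_search_url(query, year=None, work_type=None,
                              has_fulltext=None, per_page=25):
    """Build a /works search URL (hypothetical helper, not the PR's code)."""
    filters = []
    if year is not None:
        filters.append(f"publication_year:{year}")
    if work_type is not None:
        filters.append(f"type:{work_type}")
    if has_fulltext is not None:
        # OpenAlex expects lowercase booleans in filter values.
        filters.append(f"has_fulltext:{str(has_fulltext).lower()}")

    params = {"search": query, "per-page": per_page}
    if filters:
        # Multiple filters are comma-separated and combined with AND.
        params["filter"] = ",".join(filters)
    return f"{OPENALEX_WORKS_URL}?{urlencode(params)}"


def build_openalex_citations_url(openalex_id, per_page=25):
    """Build a URL listing works that cite the given OpenAlex work ID."""
    params = {"filter": f"cites:{openalex_id}", "per-page": per_page}
    return f"{OPENALEX_WORKS_URL}?{urlencode(params)}"
```

The helpers only build URLs, so they can be checked without network access; fetching and parsing the response is left to whatever HTTP client the project already uses.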
…holar

This enhancement adds comprehensive citation and reference functionality:

Semantic Scholar enhancements:
- get_semantic_citations: Get papers that cite this work (forward citations)
- get_semantic_references: Get papers referenced by this work (backward citations)
- get_semantic_related: Get related papers based on concepts and citations
- search_semantic_by_author: Search papers by author name

Note: OpenAlex already has full citation/reference support from the previous feature.

These tools enable:
- Citation graph traversal (forward and backward)
- Related paper discovery
- Author-based paper search
- Comprehensive citation analysis
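
To illustrate the citation graph traversal these tools enable, here is a rough sketch of a bounded breadth-first walk. It assumes a caller-supplied `get_neighbors` function that returns paper IDs for a given paper; in practice that could wrap `get_semantic_citations` or `get_semantic_references`, but their return shapes are not shown in this PR, so the wrapper is left hypothetical.

```python
from collections import deque


def traverse_citation_graph(start_id, get_neighbors, max_depth=2, max_papers=100):
    """Breadth-first walk over a citation graph.

    get_neighbors(paper_id) is an assumed callable returning an iterable of
    paper IDs (citing papers for a forward walk, references for a backward one).
    Returns a dict mapping each discovered paper ID to its distance from start_id.
    """
    depths = {start_id: 0}
    queue = deque([start_id])

    while queue and len(depths) < max_papers:
        current = queue.popleft()
        if depths[current] >= max_depth:
            continue  # do not expand papers at the depth limit
        for neighbor in get_neighbors(current):
            if neighbor in depths:
                continue  # already seen via a shorter or equal path
            depths[neighbor] = depths[current] + 1
            queue.append(neighbor)
            if len(depths) >= max_papers:
                break
    return depths


# Usage with an in-memory graph standing in for API calls (made-up IDs).
sample_graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": ["E"]}
reachable = traverse_citation_graph("A", lambda pid: sample_graph.get(pid, []))
```

Capping both depth and total papers keeps the number of API calls bounded, which matters for rate-limited services.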

Sci-Hub provides access to millions of research papers behind paywalls.
This integration exposes the existing Sci-Hub fetcher as an MCP tool.

Features:
- download_scihub: Download PDFs using DOI, PMID, or URL

Note:
- Sci-Hub operates in a legal gray area
- Only use for legitimate research purposes
- Ensure compliance with local laws and institution policies
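
Since `download_scihub` accepts a DOI, PMID, or URL, the input has to be classified somewhere. The sketch below shows one way that could be done; `classify_identifier` is a hypothetical helper and the patterns are simplifying assumptions, not the logic used by the existing fetcher.

```python
import re


def classify_identifier(identifier):
    """Return 'url', 'doi', or 'pmid' for a paper identifier (illustrative only)."""
    value = identifier.strip()
    if re.match(r"^https?://", value, re.IGNORECASE):
        return "url"
    # DOIs start with "10.", a registrant code, then a slash and a suffix.
    if re.match(r"^10\.\d{4,9}/\S+$", value):
        return "doi"
    # Assumption: PMIDs are treated as plain integers of up to 8 digits.
    if re.match(r"^\d{1,8}$", value):
        return "pmid"
    raise ValueError(f"Unrecognized identifier: {identifier!r}")
```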

The same paper often appears in multiple sources (arXiv, Semantic Scholar, etc.).
This feature adds intelligent deduplication based on:
- DOI matching (most reliable)
- Title similarity (>= 90% match)
- Author + year matching

Features:
- deduplicate_papers: Remove duplicates from paper list
- merge_papers: Merge duplicates by combining metadata
- find_duplicate_groups: Analyze duplicates without removing

Use cases:
- Combine results from multiple search sources
- Remove duplicate papers from aggregated results
- Merge complementary metadata from different sources
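
The three matching rules above could be sketched roughly as follows. This is not the PR's implementation: the dict field names (`doi`, `title`, `authors`, `year`) are assumed, title similarity is computed with `difflib.SequenceMatcher` as a stand-in, and because the PR does not say how author + year matching is combined with the other rules, the sketch assumes it also requires a looser title match (0.75) to avoid merging different papers by the same author.

```python
import re
from difflib import SequenceMatcher


def _norm_title(title):
    """Lowercase and collapse punctuation/whitespace for comparison."""
    return re.sub(r"[^a-z0-9]+", " ", (title or "").lower()).strip()


def _norm_doi(doi):
    """Lowercase a DOI and strip common resolver or 'doi:' prefixes."""
    doi = (doi or "").strip().lower()
    return re.sub(r"^(https?://(dx\.)?doi\.org/|doi:)", "", doi)


def is_probable_duplicate(a, b, title_threshold=0.9, relaxed_threshold=0.75):
    # Rule 1: identical DOIs (most reliable).
    doi_a, doi_b = _norm_doi(a.get("doi")), _norm_doi(b.get("doi"))
    if doi_a and doi_a == doi_b:
        return True

    # Rule 2: title similarity at or above the threshold.
    title_a, title_b = _norm_title(a.get("title")), _norm_title(b.get("title"))
    similarity = 0.0
    if title_a and title_b:
        similarity = SequenceMatcher(None, title_a, title_b).ratio()
    if similarity >= title_threshold:
        return True

    # Rule 3 (assumed form): same first author and year, plus a looser title match.
    authors_a, authors_b = a.get("authors") or [], b.get("authors") or []
    same_author = bool(authors_a and authors_b) and authors_a[0].lower() == authors_b[0].lower()
    same_year = a.get("year") is not None and a.get("year") == b.get("year")
    return same_author and same_year and similarity >= relaxed_threshold


def drop_duplicates(papers):
    """Keep the first occurrence of each paper; O(n^2), fine for small result sets."""
    unique = []
    for paper in papers:
        if not any(is_probable_duplicate(paper, kept) for kept in unique):
            unique.append(paper)
    return unique
```

Checking DOIs first means the more expensive and more error-prone fuzzy title comparison only decides cases where no shared DOI settles the question.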
…pers

PMC provides free access to full-text biomedical and life sciences literature.
This integration enables searching and downloading complete articles.

Features:
- search_pmc: Search PMC using E-utilities API
- get_pmc_paper: Get paper by PMCID
- download_pmc: Download PDF of PMC articles
- read_pmc_paper: Extract text from PMC PDFs

PMC differs from PubMed, which provides abstracts only: PMC hosts complete full-text
articles from biomedical and life sciences journals and is maintained by NIH/NLM.
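
As a rough illustration of what a PMC search through E-utilities involves, the sketch below builds an ESearch URL against the `pmc` database and pulls IDs out of a JSON response. The helper names are hypothetical and not taken from this PR; the sketch also assumes that a PMCID can be formed by prefixing the returned numeric UID with `PMC`.

```python
from urllib.parse import urlencode

EUTILS_ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_pmc_search_url(query, max_results=10):
    """Build an ESearch URL for the PMC database (hypothetical helper)."""
    params = {
        "db": "pmc",          # search PubMed Central rather than PubMed
        "term": query,
        "retmax": max_results,
        "retmode": "json",
    }
    return f"{EUTILS_ESEARCH_URL}?{urlencode(params)}"


def extract_pmcids(esearch_json):
    """Turn the UID list of a parsed ESearch JSON response into PMCIDs.

    Assumption: PMC UIDs are the numeric part of the PMCID.
    """
    id_list = esearch_json.get("esearchresult", {}).get("idlist", [])
    return [f"PMC{uid}" for uid in id_list]
```

Metadata for each returned ID would then need a second request (for example via ESummary or EFetch), which is omitted here.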