arxiv-research
Ray0907/arxiv-research-skillUse when searching academic papers on arXiv, understanding research content, building literature reviews, or generating citations for academic writing
SKILL.md
name: arxiv-research description: Use when searching academic papers on arXiv, understanding research content, building literature reviews, or generating citations for academic writing
arXiv Research Skill
Overview
This skill enables systematic academic research through three core capabilities that form the minimal complete loop of knowledge building:
connect -> understand -> evidence
Find -> Comprehend -> Cite
Core Principles
Why this exists: Research is reducing uncertainty about reality by building on existing knowledge. arXiv contains codified human knowledge. This skill helps navigate and utilize that knowledge effectively.
The Three Pillars
1. Connect (Knowledge Navigation)
Purpose: Find relevant existing knowledge
When to use:
- Starting research on a new topic
- Finding related work for a paper
- Discovering what exists in a field
Capabilities:
- Semantic search across arXiv
- Filter by category, author, date
- Rank by citation impact (via Semantic Scholar)
- Find similar papers to a known paper
Usage:
# Run the connect script
python connect.py search "transformer attention mechanism" --category cs.LG --limit 20
python connect.py similar "2301.00001" --limit 10
python connect.py recent cs.AI --days 7
python connect.py by-author "Yann LeCun"
2. Understand (Meaning Extraction)
Purpose: Comprehend what the knowledge contains
When to use:
- Need to quickly grasp a paper's contribution
- Extracting methodology details
- Comparing multiple papers
- Writing literature review sections
Capabilities:
- Structured paper analysis (problem, method, contribution, limitations)
- Key findings extraction
- Methodology breakdown
- Multi-paper comparison
Usage:
# Get paper content for analysis
python connect.py content "2301.00001"
# Then use the understanding prompts in your analysis
Analysis Prompts (use with paper content):
Quick Summary
Analyze this paper and provide:
1. Problem: What problem does it solve? (1-2 sentences)
2. Method: How does it solve it? (2-3 sentences)
3. Contribution: What's new/novel? (1-2 sentences)
4. Limitation: What are the limitations? (1-2 sentences)
Deep Methodology
Extract the methodology:
1. Core algorithm/approach
2. Key assumptions
3. Experimental setup
4. Evaluation metrics
5. Baseline comparisons
Literature Comparison
Compare these papers on:
| Aspect | Paper A | Paper B | Paper C |
|--------|---------|---------|---------|
| Problem |
| Method |
| Dataset |
| Results |
| Limitations |
3. Evidence (Source Attribution)
Purpose: Create verifiable links to sources
When to use:
- Writing academic papers
- Need proper citations
- Building bibliography
- Ensuring traceability of claims
Capabilities:
- BibTeX generation
- Multiple citation formats (APA, IEEE, ACM, Chicago)
- Batch citation export
- Citation verification
Usage:
# Generate citations
python evidence.py bibtex "2301.00001"
python evidence.py apa "2301.00001"
python evidence.py batch "2301.00001,2302.00002,2303.00003" --format bibtex
Workflow Examples
Literature Review Workflow
1. CONNECT: Find seed papers
python connect.py search "your topic" --limit 50
2. CONNECT: Rank by impact
(Results include citation counts from Semantic Scholar)
3. CONNECT: Expand with similar papers
python connect.py similar "top_paper_id"
4. UNDERSTAND: Analyze each paper
python connect.py content "paper_id" | analyze with prompts
5. EVIDENCE: Generate bibliography
python evidence.py batch "id1,id2,id3" --format bibtex > refs.bib
Finding Evidence for a Claim
1. CONNECT: Search for supporting research
python connect.py search "your claim keywords"
2. UNDERSTAND: Verify the paper supports your claim
python connect.py content "paper_id"
3. EVIDENCE: Generate proper citation
python evidence.py apa "paper_id"
API Dependencies
| Service | Purpose | Rate Limit | API Key Required |
|---|---|---|---|
| arXiv | Paper search, content | 1 req/3s | No |
| Semantic Scholar | Citations, similar papers | 100 req/5min | No (optional for higher limits) |
| Jina Reader | Full text extraction | Generous | No |
File Structure
arxiv-research-skill/
├── SKILL.md # This file - usage guide
├── connect.py # Knowledge navigation
├── understand.py # Analysis utilities
└── evidence.py # Citation generation
Common Patterns
Finding Foundational Papers
python connect.py search "topic" --sort citations --limit 10
Tracking Recent Developments
python connect.py recent cs.AI --days 30
Building a Reading List
python connect.py search "topic" > papers.json
# Review and filter
python evidence.py batch "selected_ids" --format bibtex
Error Handling
- Rate limited: Wait and retry, scripts have built-in backoff
- Paper not found: Verify arXiv ID format (YYMM.NNNNN)
- No citations: Paper may be too new for Semantic Scholar
README
arXiv Research Skill
Agent skill for academic research on arXiv.
Core Principle
Research = Building knowledge on existing knowledge
connect -> understand -> evidence
Find -> Comprehend -> Cite
Installation
uv sync
The Three Pillars
1. Connect (Knowledge Navigation)
Find relevant existing knowledge.
# Search papers
uv run python connect.py search "transformer attention" --limit 10
# Search with citation counts
uv run python connect.py search "LLM agents" --with-citations --sort citations
# Find similar papers
uv run python connect.py similar 2301.00001
# Get recent papers in a category
uv run python connect.py recent cs.AI --limit 20
# Search by author
uv run python connect.py by-author "Yann LeCun"
# Get paper details
uv run python connect.py paper 2301.00001
# Get full paper content
uv run python connect.py content 2301.00001
2. Understand (Meaning Extraction)
Comprehend what the knowledge contains.
# List available analysis prompts
uv run python understand.py list
# Get a specific prompt
uv run python understand.py get quick
uv run python understand.py get methodology
uv run python understand.py get critical
uv run python understand.py get compare
# Generate analysis request from paper content
uv run python connect.py content 2301.00001 | uv run python understand.py analyze quick
Available prompts:
quick- Fast structured summarymethodology- Detailed methodology extractioncontribution- Identify and rank contributionscritical- Critical analysis with strengths/weaknessescompare- Multi-paper comparison tableliterature- Organize for literature reviewimplementation- Extract reproduction detailsevidence- Evaluate as evidence for a claim
3. Evidence (Source Attribution)
Create verifiable links to sources.
# Generate BibTeX
uv run python evidence.py bibtex 2301.00001
# Generate APA citation
uv run python evidence.py apa 2301.00001
# Generate IEEE citation
uv run python evidence.py ieee 2301.00001
# Generate all formats
uv run python evidence.py all 2301.00001
# Batch generate citations
uv run python evidence.py batch "2301.00001,2302.00002,2303.00003" --format bibtex
# Get raw metadata
uv run python evidence.py metadata 2301.00001
Workflow Examples
Literature Review
# 1. Find seed papers
uv run python connect.py search "your topic" --limit 30 --with-citations
# 2. Get similar papers from top results
uv run python connect.py similar 2301.00001
# 3. Analyze each paper
uv run python connect.py content 2301.00001 | uv run python understand.py analyze literature
# 4. Generate bibliography
uv run python evidence.py batch "id1,id2,id3" --format bibtex > refs.bib
Finding Evidence for a Claim
# 1. Search for supporting research
uv run python connect.py search "your claim keywords" --with-citations
# 2. Verify the paper supports your claim
uv run python connect.py content 2301.00001
# 3. Generate citation
uv run python evidence.py apa 2301.00001
API Dependencies
| Service | Purpose | Rate Limit |
|---|---|---|
| arXiv API | Paper search, metadata | 1 req/3s |
| Semantic Scholar | Citation counts, similar papers | 100 req/5min |
| Jina Reader | Full text extraction | Generous |
No API keys required.
License
MIT