---
name: arxiv-research
description: Use when searching academic papers on arXiv, understanding research content, building literature reviews, or generating citations for academic writing
---

# arXiv Research Skill

## Overview

This skill supports systematic academic research through three capabilities that together form a minimal loop for building on existing knowledge:

```
connect -> understand -> evidence
  Find  ->  Comprehend -> Cite
```

## Core Principles

**Why this exists:** Research reduces uncertainty by building on existing knowledge, and arXiv hosts a large body of that knowledge as preprints. This skill helps you find, understand, and cite it.

## The Three Pillars

### 1. Connect (Knowledge Navigation)

**Purpose:** Find relevant existing knowledge

**When to use:**
- Starting research on a new topic
- Finding related work for a paper
- Discovering what exists in a field

**Capabilities:**
- Semantic search across arXiv
- Filter by category, author, date
- Rank by citation impact (via Semantic Scholar)
- Find similar papers to a known paper

**Usage:**
```bash
# Run the connect script
python scripts/connect.py search "transformer attention mechanism" --category cs.LG --limit 20
python scripts/connect.py search "LLM agents" --since 2023-01 --until 2024-06  # Date filtering
python scripts/connect.py similar "2301.00001" --limit 10
python scripts/connect.py recent cs.AI --days 7
python scripts/connect.py by-author "Yann LeCun"
python scripts/connect.py cited-by "2301.00001" --limit 20  # Forward citations
python scripts/connect.py coauthors "Yann LeCun" --limit 20  # Collaboration network
```
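
The internals of `connect.py` are not shown here, so the following is only a sketch of how a `search` call with category and date filters could map onto the public arXiv API query syntax. The helper names `month_range` and `build_search_url` are hypothetical; the endpoint, the `all:` and `cat:` field prefixes, and the `submittedDate:[... TO ...]` range filter come from the arXiv API.

```python
import calendar
from urllib.parse import urlencode

ARXIV_API = "http://export.arxiv.org/api/query"  # public arXiv API endpoint


def month_range(since, until):
    """Convert "YYYY-MM" bounds to arXiv's YYYYMMDDHHMM range format."""
    sy, sm = (int(part) for part in since.split("-"))
    uy, um = (int(part) for part in until.split("-"))
    last_day = calendar.monthrange(uy, um)[1]  # last calendar day of the end month
    return f"{sy:04d}{sm:02d}010000", f"{uy:04d}{um:02d}{last_day:02d}2359"


def build_search_url(query, category=None, since=None, until=None, limit=20):
    """Build an arXiv API query URL (hypothetical helper, not taken from connect.py)."""
    terms = [f'all:"{query}"']  # phrase search across all fields
    if category:
        terms.append(f"cat:{category}")
    if since and until:
        start, end = month_range(since, until)
        terms.append(f"submittedDate:[{start} TO {end}]")
    params = {"search_query": " AND ".join(terms), "start": 0, "max_results": limit}
    return f"{ARXIV_API}?{urlencode(params)}"


print(build_search_url("LLM agents", category="cs.AI", since="2023-01", until="2024-06"))
```

This sketch only builds the URL. Fetching it and parsing the Atom response are left out, as are citation ranking and similar-paper lookup, which rely on Semantic Scholar.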

### 2. Understand (Meaning Extraction)

**Purpose:** Comprehend what the knowledge contains

**When to use:**
- Need to quickly grasp a paper's contribution
- Extracting methodology details
- Comparing multiple papers
- Writing literature review sections

**Capabilities:**
- Structured paper analysis (problem, method, contribution, limitations)
- Key findings extraction
- Methodology breakdown
- Multi-paper comparison

**Usage:**
```bash
# Get paper content for analysis (single or batch)
python scripts/connect.py content "2301.00001"
python scripts/connect.py content "2301.00001,2302.00002,2303.00003"

# Then use the understanding prompts in your analysis
```
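
As a rough illustration of how the batch form of `content` might resolve a comma-separated ID list into full-text requests, the sketch below builds Jina Reader URLs by prepending the `https://r.jina.ai/` prefix to each paper's arXiv PDF URL. The function name `reader_urls` is hypothetical, and whether the script reads the PDF or the abstract page is an assumption.

```python
JINA_READER = "https://r.jina.ai/"  # Jina Reader: prepend this prefix to a target URL


def reader_urls(ids):
    """Turn "id1,id2" into one Jina Reader URL per arXiv paper (hypothetical helper)."""
    urls = []
    for arxiv_id in ids.split(","):
        arxiv_id = arxiv_id.strip()
        if arxiv_id:  # skip empty entries such as a trailing comma
            # Assumption: full text is read from the PDF URL, not the abstract page.
            urls.append(f"{JINA_READER}https://arxiv.org/pdf/{arxiv_id}")
    return urls


for url in reader_urls("2301.00001,2302.00002"):
    print(url)
```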

**Analysis Prompts** (use with paper content):

#### Quick Summary
```
Analyze this paper and provide:
1. Problem: What problem does it solve? (1-2 sentences)
2. Method: How does it solve it? (2-3 sentences)
3. Contribution: What's new/novel? (1-2 sentences)
4. Limitation: What are the limitations? (1-2 sentences)
```

#### Deep Methodology
```
Extract the methodology:
1. Core algorithm/approach
2. Key assumptions
3. Experimental setup
4. Evaluation metrics
5. Baseline comparisons
```

#### Literature Comparison
```
Compare these papers on:
| Aspect | Paper A | Paper B | Paper C |
|--------|---------|---------|---------|
| Problem | | | |
| Method | | | |
| Dataset | | | |
| Results | | | |
| Limitations | | | |
```

### 3. Evidence (Source Attribution)

**Purpose:** Create verifiable links to sources

**When to use:**
- Writing academic papers
- Need proper citations
- Building bibliography
- Ensuring traceability of claims

**Capabilities:**
- BibTeX generation
- Multiple citation styles (APA, IEEE, ACM, Chicago)
- Batch citation export
- RIS export for Zotero/Mendeley/EndNote

**Usage:**
```bash
# Generate citations
python scripts/evidence.py bibtex "2301.00001"
python scripts/evidence.py apa "2301.00001"
python scripts/evidence.py ris "2301.00001"  # For Zotero/Mendeley
python scripts/evidence.py batch "2301.00001,2302.00002,2303.00003" --format bibtex
python scripts/evidence.py batch "2301.00001,2302.00002" --format ris > refs.ris
```
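
To make the BibTeX output concrete, here is a minimal sketch of formatting paper metadata as an `@misc` entry with the `eprint`, `archivePrefix`, and `primaryClass` fields commonly used for arXiv preprints. The function `to_bibtex`, the metadata dictionary layout, and the citation-key scheme are all assumptions for illustration; the actual output of `evidence.py` may differ.

```python
def to_bibtex(paper):
    """Format a metadata dict as a BibTeX @misc entry (hypothetical schema)."""
    # Citation key: first author's surname plus year, e.g. "example2023".
    surname = paper["authors"][0].split()[-1].lower()
    key = f"{surname}{paper['year']}"
    fields = {
        "title": paper["title"],
        "author": " and ".join(paper["authors"]),
        "year": str(paper["year"]),
        "eprint": paper["id"],
        "archivePrefix": "arXiv",
        "primaryClass": paper["category"],
    }
    body = ",\n".join(f"  {name} = {{{value}}}" for name, value in fields.items())
    return f"@misc{{{key},\n{body}\n}}"


# Placeholder metadata for illustration only, not a real paper record.
sample = {
    "id": "2301.00001",
    "title": "An Example Paper",
    "authors": ["Ada Example", "Bob Sample"],
    "year": 2023,
    "category": "cs.LG",
}
print(to_bibtex(sample))
```

A surname-plus-year key can collide when one author has several papers in the same year, so a real bibliography may need a suffix or the arXiv ID in the key.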

## Workflow Examples

### Literature Review Workflow

```
1. CONNECT: Find seed papers
   python scripts/connect.py search "your topic" --limit 50

2. CONNECT: Rank by impact
   (Results include citation counts from Semantic Scholar)

3. CONNECT: Expand with similar papers
   python scripts/connect.py similar "top_paper_id"

4. UNDERSTAND: Analyze each paper
   python scripts/connect.py content "paper_id"
   (then apply the Analysis Prompts to the output)

5. EVIDENCE: Generate bibliography
   python scripts/evidence.py batch "id1,id2,id3" --format bibtex > refs.bib
```

### Finding Evidence for a Claim

```
1. CONNECT: Search for supporting research
   python scripts/connect.py search "your claim keywords"

2. UNDERSTAND: Verify the paper supports your claim
   python scripts/connect.py content "paper_id"

3. EVIDENCE: Generate proper citation
   python scripts/evidence.py apa "paper_id"
```

## API Dependencies

| Service | Purpose | Rate Limit | API Key Required |
|---------|---------|------------|------------------|
| arXiv | Paper search, content | 1 req/3s | No |
| Semantic Scholar | Citations, similar papers | 100 req/5min | No (optional for higher limits) |
| Jina Reader | Full text extraction | Generous | No |
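
The limits in the table translate into a minimum spacing between requests: 1 request per 3 s for arXiv, and 100 requests per 5 minutes, which also averages to one per 3 s, for Semantic Scholar. The sketch below shows one way a client could enforce such spacing. The `Throttle` class is hypothetical and is not taken from the scripts.

```python
import time


class Throttle:
    """Enforce a minimum interval between calls to one service (illustrative)."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock  # injectable so the class can be tested without waiting
        self._sleep = sleep
        self._last = None    # time of the previous call, None before the first call

    def wait(self):
        """Block until min_interval seconds have passed since the previous call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


arxiv_throttle = Throttle(3.0)             # 1 request per 3 s
semantic_scholar_throttle = Throttle(3.0)  # 100 requests per 5 min = 1 per 3 s
```

Calling `arxiv_throttle.wait()` immediately before each arXiv request keeps a loop over many papers within the stated limit. A single shared instance per service is the simplest design for a sequential script.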

## File Structure

```
arxiv-research-skill/
├── SKILL.md              # This file - usage guide
└── scripts/
    ├── connect.py        # Knowledge navigation
    ├── understand.py     # Analysis utilities
    └── evidence.py       # Citation generation
```

## Common Patterns

### Finding Foundational Papers
```bash
python scripts/connect.py search "topic" --sort citations --limit 10
```

### Tracking Recent Developments
```bash
python scripts/connect.py recent cs.AI --days 30
```

### Building a Reading List
```bash
python scripts/connect.py search "topic" > papers.json
# Review and filter
python scripts/evidence.py batch "selected_ids" --format bibtex
```

## Error Handling

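- **Rate limited:** Wait and retry; the scripts have built-in backoff
- **Paper not found:** Verify the arXiv ID format: `YYMM.NNNNN` for papers since January 2015, `YYMM.NNNN` for April 2007 through December 2014, and `archive/YYMMNNN` (e.g. `hep-th/9901001`) for older papers
- **No citations:** The paper may be too new to have citation data in Semantic Scholar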
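
Checking an ID before sending a request catches most "paper not found" errors early. The sketch below validates both the current arXiv identifier scheme (four or five digits after the dot, optional version suffix) and the pre-2007 `archive/YYMMNNN` scheme. The function `is_valid_arxiv_id` is a hypothetical helper, and the old-style pattern is a loose approximation that does not check archive names against arXiv's actual list.

```python
import re

# Current scheme: YYMM.NNNN (2007-2014) or YYMM.NNNNN (2015 onward), optional vN.
NEW_STYLE = re.compile(r"\d{2}(0[1-9]|1[0-2])\.\d{4,5}(v\d+)?")
# Pre-2007 scheme: archive[.SUBJECT]/YYMMNNN, e.g. hep-th/9901001, optional vN.
OLD_STYLE = re.compile(r"[a-z-]+(\.[A-Z]{2})?/\d{7}(v\d+)?")


def is_valid_arxiv_id(arxiv_id):
    """Return True if the string is shaped like an arXiv identifier."""
    return bool(NEW_STYLE.fullmatch(arxiv_id) or OLD_STYLE.fullmatch(arxiv_id))


for candidate in ["2301.00001", "2301.00001v2", "hep-th/9901001", "arXiv:2301.00001"]:
    print(candidate, is_valid_arxiv_id(candidate))
```

The check is purely syntactic: a well-formed ID can still refer to a paper that does not exist.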