Quick Start Guide

Get up and running with the Multi-Modal Academic Research System in 5 minutes.

Prerequisites Checklist
5-Minute Setup
Your First Query
Collecting Your First Papers
Understanding the Interface
Next Steps

Prerequisites Checklist

Before you begin, ensure you have completed:

Python 3.9+ installed (python --version)
Docker installed and running (docker --version)
Google Gemini API key (free from https://makersuite.google.com/app/apikey)
Project downloaded/cloned to your local machine

Not ready? See the full Installation Guide for detailed setup instructions.

5-Minute Setup

Step 1: Start OpenSearch (1 minute)

Open a terminal and run:

docker run -d \
  --name opensearch-research \
  -p 9200:9200 \
  -e "discovery.type=single-node" \
  -e "OPENSEARCH_INITIAL_ADMIN_PASSWORD=MyStrongPassword@2024!" \
  opensearchproject/opensearch:latest

Wait 30 seconds for OpenSearch to initialize.

Step 2: Set Up Python Environment (2 minutes)

Navigate to the project directory and run:

# Create and activate virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

Step 3: Configure API Key (1 minute)

Create your .env file:

cp .env.example .env

Edit .env and add your Gemini API key:

GEMINI_API_KEY=your_actual_api_key_here
OPENSEARCH_HOST=localhost
OPENSEARCH_PORT=9200

Step 4: Launch the Application (1 minute)

python main.py

You should see:

🚀 Initializing Multi-Modal Research Assistant...
✅ Connected to OpenSearch at localhost:9200
✅ Research Assistant ready!
🌐 Opening web interface...

Running on local URL:  http://0.0.0.0:7860
Running on public URL: https://xxxxx.gradio.live

Success! Open http://localhost:7860 in your browser.

Your First Query

The system comes with no data initially. Let's test it with a simple query to understand the interface, then collect some papers.

Test the System

Navigate to the Research tab (should be open by default)
In the query box, type: "What is machine learning?"
Click Ask Question

Expected Result: You'll see a message indicating no documents are available yet. This is normal - you need to collect data first!

Collecting Your First Papers

Let's populate the system with academic papers about a topic.

Step 1: Navigate to Data Collection

Click the Data Collection tab at the top of the interface

Step 2: Collect Papers from ArXiv

Find the Collect Papers section
Enter a topic you're interested in, for example:
- "machine learning"
- "natural language processing"
- "computer vision"
- "quantum computing"
Set Number of papers to: 5 (for a quick start)
Click Collect Papers

What happens: The system will:

Search ArXiv for the latest papers on your topic
Download the PDFs
Extract text and diagrams
Analyze diagrams using Gemini Vision
Generate embeddings
Index everything in OpenSearch

Time: Expect 2-5 minutes for 5 papers.

Step 3: Monitor Progress

You'll see status updates like:

✅ Collected 5 papers on 'machine learning'
Processing paper 1/5: "Deep Learning for Computer Vision"...
✅ Indexed paper: Deep Learning for Computer Vision
Processing paper 2/5: "Attention Is All You Need"...
...

Step 4: Run Your First Real Query

Go back to the Research tab
Enter a query related to your collected papers:
- "What are the key concepts in machine learning?"
- "Explain neural networks"
- "How do transformers work?"
Click Ask Question

Expected Result: You'll receive:

A comprehensive answer synthesized from the papers
Citations in brackets [1], [2], etc.
Source information showing which papers were used

Example response:

Machine learning is a subset of artificial intelligence that enables
systems to learn and improve from experience [1]. Key concepts include:

1. Neural Networks: Computational models inspired by biological neurons [1][2]
2. Training: The process of adjusting model parameters using data [2]
3. Deep Learning: Multi-layer neural networks for complex patterns [3]

Sources:
[1] "Deep Learning Fundamentals" (Smith et al., 2023)
[2] "Introduction to Neural Networks" (Johnson, 2023)
[3] "Modern Machine Learning" (Lee et al., 2024)

Understanding the Interface

The Research Assistant has four main tabs:

1. Research Tab

Purpose: Query your knowledge base and get AI-powered answers with citations

Key Features:

Query input box
AI-generated responses with citations
Conversation history
Source attribution

Usage Tips:

Ask specific questions for better results
Use follow-up questions to dive deeper
Check citations to verify information

2. Data Collection Tab

Purpose: Gather academic content from multiple sources

Sources Available:

Academic Papers: ArXiv, PubMed Central, Semantic Scholar
YouTube Videos: Educational channels and lectures
Podcasts: Academic and educational podcast episodes

Parameters:

Topic/search query
Number of items to collect
Source preference

Usage Tips:

Start with 5-10 papers to avoid long wait times
Choose topics that match your research interests
Mix different sources (papers, videos, podcasts) for diverse perspectives

3. Citation Manager Tab

Purpose: View and export citations from your research sessions

Features:

List of all cited sources
Export to BibTeX format
Citation details (authors, title, date, URL)

Usage Tips:

Export citations after each research session
Use BibTeX exports in your LaTeX documents
Keep track of sources for academic writing

4. Settings Tab

Purpose: Configure system settings and connections

Settings:

OpenSearch connection (host, port)
API keys (Gemini)
Index management
System health status

Usage Tips:

Check connection status if searches fail
Verify OpenSearch is running
Update API keys if needed

Quick Tips for Success

1. Collection Strategy

Start Small: Collect 5-10 papers initially to test the system

Faster processing
Easier to verify quality
Quick feedback on topics

Scale Up: Once comfortable, collect 20-50 papers per topic

Better coverage
More comprehensive answers
Diverse perspectives

2. Query Techniques

Be Specific:

Good: "What is the attention mechanism in transformers?"
Less effective: "Tell me about AI"

Ask Follow-ups:

"Can you explain that in simpler terms?"
"What are the practical applications?"
"How does this compare to other approaches?"

Request Evidence:

"What evidence supports this claim?"
"Which papers discuss this topic?"

3. Content Diversity

Mix different content types for richer research:

Papers: Detailed technical information, formulas, experiments
Videos: Visual explanations, demonstrations, lectures
Podcasts: Discussions, interviews, high-level overviews

4. Regular Maintenance

Update Your Knowledge Base:

Collect new papers weekly on your topics
Keep content current with latest research

Monitor Storage:

PDFs and processed data accumulate in data/ folder
Clean up old content periodically

Check Logs:

Review logs/ directory for any errors
Helps troubleshoot issues early

Common Quick Start Issues

Issue: "Cannot connect to OpenSearch"

Quick Fix:

# Check if OpenSearch is running
docker ps | grep opensearch

# If not running, start it
docker start opensearch-research

# If container doesn't exist, create it (see Step 1)

Issue: "GEMINI_API_KEY not found"

Quick Fix:

Verify .env file exists: ls -la .env
Check content: cat .env
Ensure key has no quotes: GEMINI_API_KEY=AIza... not GEMINI_API_KEY="AIza..."
Restart application: python main.py

Issue: Papers collecting but not processing

Quick Fix:

Check logs in logs/ directory for errors
Verify Gemini API key is valid
Try with fewer papers (1-2) to isolate issues
Check internet connection

Issue: Slow performance

Quick Fix:

Start with fewer papers (5 instead of 20)
Close other applications to free memory
Wait for first-time model downloads to complete
Check Docker has enough memory allocated (4GB+ recommended)

Example Workflows

Workflow 1: Research a New Topic

Collect: Data Collection → Enter "quantum computing" → Collect 10 papers
Explore: Research → "What is quantum computing?"
Deep Dive: Research → "How do quantum gates work?"
Compare: Research → "What are the differences between quantum and classical computing?"
Export: Citation Manager → Export BibTeX

Workflow 2: Literature Review

Broad Collection: Collect 30 papers on your research area
Overview: "What are the main research directions in [topic]?"
Specific Topics: "What methods are used for [specific problem]?"
Gaps: "What are open challenges in [topic]?"
Timeline: "How has [topic] evolved over time?"

Workflow 3: Learning a New Concept

Mixed Media: Collect papers + YouTube videos on the topic
Introduction: "Explain [concept] in simple terms"
Technical: "What is the mathematical foundation of [concept]?"
Visual: Videos provide diagrams and animations
Practice: "What are example applications of [concept]?"

Next Steps

Now that you're up and running:

Learn More

Configuration Guide: Customize settings, logging, and advanced options
- See Configuration Guide
Architecture: Understand how the system works under the hood
- See Technology Stack
Full Documentation: Explore all features and capabilities
- See Main README

Expand Your Knowledge Base

Collect papers from multiple sources (ArXiv, PubMed, Semantic Scholar)
Add YouTube lectures from educational channels
Include podcast episodes for diverse perspectives

Optimize Your Workflow

Create topic-specific collections
Use citation exports for your papers
Experiment with different query styles
Build a comprehensive research database

Troubleshooting

Still having issues?

Check Logs: logs/research_system_*.log contains detailed error information
Verify Setup: Run through the Installation Guide checklist
Review Configuration: See Configuration Guide
Common Issues: Full list in Installation Guide - Common Issues

Getting Help

Documentation: Start with CLAUDE.md in the project root
Logs: Check logs/ directory for error details
Issues: Report bugs or request features on GitHub

Congratulations! You're now ready to conduct AI-powered academic research with multi-modal sources.

Happy researching!

FilesExpand file tree

quick-start.md

Latest commit

History

quick-start.md

File metadata and controls

Quick Start Guide

Table of Contents

Prerequisites Checklist

5-Minute Setup

Step 1: Start OpenSearch (1 minute)

Step 2: Set Up Python Environment (2 minutes)

Step 3: Configure API Key (1 minute)

Step 4: Launch the Application (1 minute)

Your First Query

Test the System

Collecting Your First Papers

Step 1: Navigate to Data Collection

Step 2: Collect Papers from ArXiv

Step 3: Monitor Progress

Step 4: Run Your First Real Query

Understanding the Interface

1. Research Tab

2. Data Collection Tab

3. Citation Manager Tab

4. Settings Tab

Quick Tips for Success

1. Collection Strategy

2. Query Techniques

3. Content Diversity

4. Regular Maintenance

Common Quick Start Issues

Issue: "Cannot connect to OpenSearch"

Issue: "GEMINI_API_KEY not found"

Issue: Papers collecting but not processing

Issue: Slow performance

Example Workflows

Workflow 1: Research a New Topic

Workflow 2: Literature Review

Workflow 3: Learning a New Concept

Next Steps

Learn More

Expand Your Knowledge Base

Optimize Your Workflow

Troubleshooting

Getting Help