Karpathy's LLM Wiki project sparked discussion on r/AI_Agents about ingestion pipelines. Most wiki builders use separate tools for Google, Reddit, YouTube, and Amazon data. This tutorial replaces them all with a single API that covers multiple platforms, reducing integration complexity from four SDKs to one.
Prerequisites
- Scavio API key
- Python 3.8+
- Markdown-based wiki (git repo or filesystem)
Walkthrough
Step 1: Define wiki topic research function
One function searches all platforms for a topic.
import requests, os
H = {'x-api-key': os.environ['SCAVIO_API_KEY']}
URL = 'https://api.scavio.dev/api/v1/search'
def research_topic(topic):
sources = {}
for platform in ['google', 'reddit', 'youtube']:
data = requests.post(URL, headers=H,
json={'platform': platform, 'query': topic}).json()
sources[platform] = data.get('results', []) or data.get('organic_results', [])
return sourcesStep 2: Generate wiki entry from multi-platform data
Combine search results into a structured wiki page.
def generate_wiki_entry(topic, sources):
entry = f'# {topic}\n\n'
entry += '## Overview\n'
# Use top Google results for the overview
for r in sources.get('google', [])[:3]:
entry += f"- [{r.get('title', '')}]({r.get('link', '')}): {r.get('snippet', '')}\n"
entry += '\n## Community Discussion\n'
for r in sources.get('reddit', [])[:3]:
entry += f"- [{r.get('title', '')}]({r.get('url', '')})\n"
entry += '\n## Video Resources\n'
for r in sources.get('youtube', [])[:3]:
entry += f"- {r.get('title', '')}\n"
return entryStep 3: Build the wiki in batch
Process a list of topics and save as markdown files.
import os
def build_wiki(topics, output_dir='wiki'):
os.makedirs(output_dir, exist_ok=True)
for topic in topics:
slug = topic.lower().replace(' ', '-')
sources = research_topic(topic)
entry = generate_wiki_entry(topic, sources)
with open(f'{output_dir}/{slug}.md', 'w') as f:
f.write(entry)
print(f'Built wiki page: {slug}.md')
topics = ['transformer architecture', 'RLHF training', 'RAG pipeline', 'MCP protocol']
build_wiki(topics)Step 4: Add freshness checks
Re-research topics that are older than 7 days.
import datetime
def needs_refresh(filepath, max_age_days=7):
if not os.path.exists(filepath):
return True
mtime = datetime.datetime.fromtimestamp(os.path.getmtime(filepath))
return (datetime.datetime.now() - mtime).days > max_age_days
def refresh_wiki(topics, output_dir='wiki'):
for topic in topics:
slug = topic.lower().replace(' ', '-')
path = f'{output_dir}/{slug}.md'
if needs_refresh(path):
sources = research_topic(topic)
entry = generate_wiki_entry(topic, sources)
with open(path, 'w') as f:
f.write(entry)
print(f'Refreshed: {slug}')Python Example
import os, requests
H = {'x-api-key': os.environ['SCAVIO_API_KEY']}
def wiki_page(topic):
data = {}
for p in ['google', 'reddit', 'youtube']:
r = requests.post('https://api.scavio.dev/api/v1/search', headers=H,
json={'platform': p, 'query': topic}).json()
data[p] = r.get('results', []) or r.get('organic_results', [])
return data
# 4 topics x 3 platforms = 12 queries = $0.06JavaScript Example
const platforms = ['google', 'reddit', 'youtube'];
const sources = {};
for (const platform of platforms) {
const res = await fetch('https://api.scavio.dev/api/v1/search', {
method: 'POST',
headers: {'x-api-key': process.env.SCAVIO_API_KEY, 'Content-Type': 'application/json'},
body: JSON.stringify({platform, query: topic})
});
sources[platform] = await res.json();
}Expected Output
Markdown wiki with multi-platform sources per topic. One API key replaces separate Google, Reddit, and YouTube ingestion tools. 3 queries per topic = $0.015.