Daily Job Aggregation Workflow

A HiringCafe-style daily cron job: for each target employer, discover the latest career-page postings with Scavio search, extract each page, parse it with an LLM, dedupe, and rank.

Overview

A daily 6am cron job pulls the latest job postings for each employer on a target list, parses them with an LLM, dedupes them against history, and ranks them against end-user filters.

Trigger

Daily cron job at 6am

Schedule

Daily 6am

Workflow Steps

1

Iterate target-employer list

From a Postgres table.
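A minimal sketch of this step, assuming the employer list lives in a table named `target_employers` with a `domain` column. Both names are hypothetical; the workflow only says the list comes from Postgres. The function accepts any DB-API 2.0 connection, so a Postgres driver connection should work in production, while an in-memory `sqlite3` connection is convenient as a local stand-in for testing.

```python
from typing import Iterator


def iter_employer_domains(conn) -> Iterator[str]:
    """Yield employer domains from a DB-API 2.0 connection.

    Assumption: a table `target_employers` with a text column `domain`
    (hypothetical names, not taken from the workflow description).
    """
    cur = conn.cursor()
    try:
        cur.execute("SELECT domain FROM target_employers ORDER BY domain")
        for (domain,) in cur:
            yield domain
    finally:
        cur.close()
```

Yielding one domain at a time keeps memory flat if the list grows, and the fixed `ORDER BY` makes each daily run process employers in the same order.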

2

Per employer: Scavio dorked discovery

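Run three search queries per employer: site:{domain}/careers, site:jobs.lever.co/{slug}, and site:boards.greenhouse.io/{slug}, where {domain} is the employer's domain and {slug} is its board name on Lever or Greenhouse (by default, the first label of the domain).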
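The three discovery queries can be built with a small helper. This is a sketch, not Scavio's own code: `build_dorks` and its optional `slug` parameter are hypothetical names, and the fallback of using the first domain label as the ATS slug is an assumption that will not hold for every employer.

```python
from typing import List, Optional


def build_dorks(domain: str, slug: Optional[str] = None) -> List[str]:
    """Build the three discovery queries for one employer.

    Assumption: when no ATS slug is given, the first label of the domain
    is used (e.g. "example" for "example.com"); real slugs can differ.
    """
    domain = domain.strip().lower()
    slug = slug or domain.split(".")[0]
    return [
        f"site:{domain}/careers",
        f"site:jobs.lever.co/{slug}",
        f"site:boards.greenhouse.io/{slug}",
    ]
```

Storing an explicit slug alongside each employer and passing it in avoids silent misses when the board name differs from the domain.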

3

Per career-page URL: Scavio /extract for markdown

Clean markdown for LLM input.

4

LLM structured parse

Returns a JSON list of objects with the fields {title, location, salary_min, salary_max, summary, apply_url}.
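LLM output is not guaranteed to be valid JSON, so it helps to validate the reply before it reaches the dedupe step. The sketch below assumes the model's reply arrives as a raw string; `JobPosting`, `parse_llm_jobs`, and `_to_number` are hypothetical helper names, and the rule of dropping items without a title is an illustrative choice.

```python
import json
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class JobPosting:
    title: str
    location: Optional[str]
    salary_min: Optional[float]
    salary_max: Optional[float]
    summary: Optional[str]
    apply_url: Optional[str]


def _to_number(value) -> Optional[float]:
    """Coerce numbers and numeric strings to float; anything else becomes None."""
    try:
        return float(value)
    except (TypeError, ValueError):
        return None


def parse_llm_jobs(raw: str) -> List[JobPosting]:
    """Parse the LLM's JSON reply, skipping malformed items."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return []  # not JSON at all: treat as "no jobs parsed"
    if not isinstance(data, list):
        return []  # the step expects a JSON list, not a single object
    jobs = []
    for item in data:
        if not isinstance(item, dict) or not item.get("title"):
            continue  # a posting without a title is unusable downstream
        jobs.append(JobPosting(
            title=str(item["title"]).strip(),
            location=item.get("location"),
            salary_min=_to_number(item.get("salary_min")),
            salary_max=_to_number(item.get("salary_max")),
            summary=item.get("summary"),
            apply_url=item.get("apply_url"),
        ))
    return jobs
```

Returning an empty list on bad output keeps one malformed page from failing the whole daily run; logging the raw reply at that point would make such pages easier to debug.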

5

Dedupe against history on (employer, title, location)

The same role found on multiple aggregators collapses into one record.
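The dedupe key can be sketched as a normalized (employer, title, location) tuple checked against a set of keys already seen. This is an illustration under assumptions: jobs are plain dicts, the history is an in-memory set (in practice it would be loaded from and written back to the database), and the normalization is limited to lowercasing and collapsing whitespace.

```python
import re
from typing import Dict, Iterable, List, Optional, Set, Tuple

Key = Tuple[str, str, str]


def _norm(text: Optional[str]) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return re.sub(r"\s+", " ", (text or "").strip().lower())


def dedupe_key(job: Dict) -> Key:
    """Build the (employer, title, location) key used for deduplication."""
    return (_norm(job.get("employer")), _norm(job.get("title")), _norm(job.get("location")))


def dedupe(jobs: Iterable[Dict], seen: Set[Key]) -> List[Dict]:
    """Return jobs whose key is not in `seen`, adding new keys to `seen`."""
    fresh = []
    for job in jobs:
        key = dedupe_key(job)
        if key in seen:
            continue  # already in history or earlier in this batch
        seen.add(key)
        fresh.append(job)
    return fresh
```

Because `seen` is updated as the batch is processed, the same role arriving from both a Lever and a Greenhouse URL in one run is kept only once.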

6

Rank by user-skill match + salary + recency

For each user's saved filter, return the top-N matches.
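One way to combine the three ranking signals is a weighted sum. The sketch below is illustrative only: the 0.5 / 0.3 / 0.2 weights, the 200,000 salary cap, and the 30-day recency window are assumptions, and the `skills` and `posted` fields are hypothetical additions to the parsed job record rather than fields listed in step 4.

```python
from datetime import date
from typing import Dict, List, Set


def score(job: Dict, user_skills: Set[str], today: date) -> float:
    """Combine skill overlap, salary, and recency into one score in [0, 1].

    All weights and caps here are illustrative assumptions.
    """
    skills = {s.lower() for s in job.get("skills", [])}
    wanted = {s.lower() for s in user_skills}
    # Fraction of the user's skills that the posting mentions.
    skill_match = len(skills & wanted) / len(wanted) if wanted else 0.0
    # Salary scaled against an assumed 200k cap; missing salary counts as 0.
    salary = min((job.get("salary_max") or 0) / 200_000, 1.0)
    # Linear decay to 0 over an assumed 30-day window.
    age_days = max((today - job["posted"]).days, 0)
    recency = max(1.0 - age_days / 30, 0.0)
    return 0.5 * skill_match + 0.3 * salary + 0.2 * recency


def top_n(jobs: List[Dict], user_skills: Set[str], n: int, today: date) -> List[Dict]:
    """Return the n highest-scoring jobs for one user."""
    return sorted(jobs, key=lambda j: score(j, user_skills, today), reverse=True)[:n]
```

Passing `today` explicitly rather than calling `date.today()` inside the function keeps the scoring deterministic and easy to test.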

7

Push to user notifier (email / Slack / push)

Delivered according to each user's saved filters.

Python Implementation

Python
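import os
import requests

# Scavio authenticates with an API key sent in the x-api-key header.
H = {'x-api-key': os.environ['SCAVIO_API_KEY']}

def daily_employer_pull(domain):
    """Return deduplicated career-page URLs discovered for one employer domain."""
    slug = domain.split('.')[0]  # assumes the ATS board slug is the first domain label
    dorks = [f'site:{domain}/careers', f'site:jobs.lever.co/{slug}',
             f'site:boards.greenhouse.io/{slug}']
    urls = []
    for q in dorks:
        r = requests.post('https://api.scavio.dev/api/v1/search', headers=H, json={'query': q}, timeout=30)
        r.raise_for_status()  # fail loudly on HTTP errors instead of parsing an error body
        # Keep at most the first 10 organic results per query.
        urls.extend(o['link'] for o in r.json().get('organic_results', [])[:10])
    return list(dict.fromkeys(urls))  # dedupe while preserving discovery order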

JavaScript Implementation

JavaScript
// Same in TS.

Platforms Used

Google

Web search with knowledge graph, People Also Ask (PAA), and AI overviews

Frequently Asked Questions

This workflow runs on a daily cron schedule at 6am.

This workflow uses one Scavio platform: Google. Each platform is called via the same unified API endpoint.

Scavio's free tier includes 500 credits per month with no credit card required, which is enough to test and validate this workflow before scaling it.
