LinkedIn contains valuable data — post results, profile results, company results, post snippets, and more. Scraping this data directly means dealing with anti-bot detection, CAPTCHAs, IP rotation, and constantly breaking selectors. The Scavio API handles all of that and returns clean, structured JSON from a single POST request.
This tutorial shows you how to scrape LinkedIn using Java and the Scavio API. By the end, you will have a working Java script that fetches real-time LinkedIn data and parses the results.
Prerequisites
- Java installed on your machine
- A Scavio API key (free tier includes 500 credits/month — no credit card required)
Step 1: Install Dependencies
HttpClient is built into Java, so there is nothing to install.
# HttpClient is built into Java 11+Step 2: Make Your First LinkedIn Search
Send a POST request to the Scavio LinkedIn API endpoint with your query. The API returns structured JSON with post results, profile results, company results, and more.
import java.net.http.*;
import java.net.URI;
var apiKey = "your_scavio_api_key";
var body = "{\"query\":\"" + query + "\"}";
var request = HttpRequest.newBuilder()
.uri(URI.create("https://api.scavio.dev/api/v1/search"))
.header("x-api-key", apiKey)
.header("Content-Type", "application/json")
.POST(HttpRequest.BodyPublishers.ofString(body))
.build();
var client = HttpClient.newHttpClient();
var response = client.send(request, HttpResponse.BodyHandlers.ofString());
System.out.println(response.body());Step 3: Example Response
The API returns structured JSON. Here is an example response for a LinkedIn search:
{
"search_metadata": { "status": "success" },
"organic_results": [
{
"position": 1,
"title": "Jane Doe - Senior AI Engineer - Acme Corp | LinkedIn",
"link": "https://www.linkedin.com/in/janedoe",
"snippet": "San Francisco Bay Area · Senior AI Engineer at Acme Corp · 5,000+ connections · Experienced in LangChain, LangGraph, and RAG..."
}
]
}Every field is structured and typed — no HTML parsing, no CSS selectors, no regex extraction. Your Java code can access any field directly.
Step 4: Full Working Example
Here is a complete, runnable Java script that searches LinkedIn and prints the results:
import java.net.http.*;
import java.net.URI;
/**
* Scrape LinkedIn search results using Scavio API.
* Requires Java 11+.
*/
public class LinkedInSearch {
private static final String API_URL = "https://api.scavio.dev/api/v1/search";
private static final String API_KEY = System.getenv("SCAVIO_API_KEY");
public static String search(String query) throws Exception {
var body = "{\"query\":\"" + query + "\"}";
var request = HttpRequest.newBuilder()
.uri(URI.create(API_URL))
.header("x-api-key", API_KEY)
.header("Content-Type", "application/json")
.POST(HttpRequest.BodyPublishers.ofString(body))
.build();
var response = HttpClient.newHttpClient()
.send(request, HttpResponse.BodyHandlers.ofString());
if (response.statusCode() != 200) {
throw new RuntimeException("Scavio API error: " + response.statusCode());
}
return response.body();
}
public static void main(String[] args) throws Exception {
System.out.println(search("site:linkedin.com/in AI engineer San Francisco"));
}
}Why Use Scavio Instead of Scraping LinkedIn Directly?
- No proxy management. Direct scraping requires rotating proxies to avoid IP bans. Scavio handles all of this server-side.
- No CAPTCHA solving. LinkedIn aggressively blocks automated requests. Scavio returns clean data every time.
- Structured JSON output. No HTML parsing or CSS selector maintenance. Get typed, consistent data from every request.
- Multi-platform in one API. Search Google, Amazon, YouTube, and Walmart from the same API key with the same authentication pattern.
- Free tier included. 500 credits/month with no credit card required. Each search costs 1 credit.