n8nflow.net logo

Scrape and Analyze Websites with Custom Prompts using Gemini, Apify, and LangChain

by Msaid Mohamed el hadiβ€’Updated: Last update 2 months agoβ€’Source: n8n.io
Loading workflow viewer...

Getting Started

πŸ” AI-Powered Website Prompt Executor (Apify + OpenRouter)

This workflow combines the power of Apify and OpenRouter to scrape website content and execute any custom prompt using AI. You define what you want β€” whether it’s extracting contact details, summarizing content, collecting job offers, or anything else β€” and the system intelligently processes the site to give you results.

πŸš€ Overview

This workflow allows you to:

  1. Input a URL and define a prompt.
  2. Scrape the specified number of pages from the website.
  3. Process each page’s metadata and Markdown content.
  4. Use AI to interpret and respond to the prompt on each page.
  5. Aggregate and return structured output.

🧠 How It Works

Input Example

{
  "enqueue": true,
  "maxPages": 5,
  "url": "https://apify.com",
  "method": "GET",
  "prompt": "collect all contact informations available on this website"
}

Workflow Steps

StepAction
1Triggered by another workflow with JSON input.
2Calls the Apify actor firescraper-ai-website-content-markdown-scraper to scrape content.
3Loops through the scraped pages.
4AI analyzes each page based on the input prompt.
5Aggregates AI outputs across all pages.
6Final AI processing step to return a clean structured result.

πŸ›  Technologies Used

  • Apify – Scrapes structured content and Markdown from websites.
  • OpenRouter – Provides access to advanced AI models like Gemini.
  • LangChain – Handles AI agent orchestration and prompt interpretation.

πŸ”§ Customization

Customize the workflow via the following input fields:

  • url: Starting point for scraping
  • maxPages: Limit the number of pages to crawl
  • prompt: Define any instruction (e.g., β€œsummarize this website,” β€œextract product data,” β€œlist all emails,” etc.)

This allows dynamic, flexible use across various use cases.


πŸ“¦ Output

The workflow returns a JSON result that includes:

  • Processed prompt responses from each page
  • Aggregated AI insights
  • Structured and machine-readable format

πŸ§ͺ Example Use Cases

  • πŸ” Extracting contact information from websites
  • πŸ“„ Summarizing articles or company profiles
  • πŸ›οΈ Collecting product information
  • πŸ“‹ Extracting job listings or news
  • πŸ“¬ Generating outreach lists from public data
  • πŸ€– Used as a tool within other AI agents for real-time web analysis
  • 🧩 Integrated as an external tool in MCP (Multi-Component Prompt) servers to enhance AI capabilities

πŸ” API Credentials Required

You will need:

  • Apify API token – For running the scraper actor
  • OpenRouter API key – For AI-powered prompt processing

Set these credentials in your environment or n8n credential manager before running.