Scraping Reviews with Screaming Frog?

By Geoff Griffiths @mmatraining1980

You can scrape reviews if they are :

– In Schema
– Have their own specific class of <p> tag

e.g. class=”review-tex text-base font-secondary”

Can you scrape the reviews then?

Yes! If the reviews are in schema markup (structured data like Review, AggregateRating, etc.) or they have a specific

class, you can definitely scrape them using Screaming Frog’s Custom Extraction feature.

How to Scrape Reviews in Schema or Specific HTML Classes Using Screaming Frog
1⃣ Enable Structured Data Extraction (for Schema Reviews)
If the reviews are within schema markup (JSON-LD, Microdata, or RDFa), Screaming Frog can extract them directly:

Open Screaming Frog SEO Spider.
Go to Configuration > Spider > Enable Structured Data.
Start a crawl and Screaming Frog will extract JSON-LD, Microdata, and RDFa formats.
After the crawl is complete, go to Reports > Structured Data > Review Schema and export the data.

2⃣ Extract Reviews from a Specific

Class (e.g., class=”review-text text-base font-secondary”)
If the reviews are inside a specific

tag with a class, you can extract them using Custom Extraction:

Open Screaming Frog and go to Configuration > Custom Extraction.
Choose XPath or CSS Selector, depending on your data structure.
Use the following extraction methods:

Option 1: Using CSS Selector (Best for Class-Based Extraction)
css
Copy
Edit
p.review-text.text-base.font-secondary
This will pull all

tags that contain the class review-text text-base font-secondary.

Option 2: Using XPath (For More Advanced Queries)
xpath
Copy
Edit
//p[contains(@class, ‘review-text text-base font-secondary’)]
This will find all

elements with a class containing “review-text text-base font-secondary”.

3⃣ Run the Crawl and Export the Review Data
Start the crawl and let Screaming Frog extract the reviews.
After completion, navigate to Custom Extraction > Export Data.
Open the file in Excel or Google Sheets for further analysis.

Alternative Approach: Scraping JSON-LD Schema Reviews
If reviews are embedded in JSON-LD schema markup, you can extract them with:

Custom Extraction (JSON-LD):

xpath
Copy
Edit
//script[@type=’application/ld+json’]/text()
This extracts all JSON-LD data, which you can filter for reviews.
Use JSON formatting tools (like jq or Python json.loads()) to clean and parse the data.

Summary: Can Screaming Frog Scrape Reviews?
YES, if reviews are inside Schema Markup (JSON-LD, Microdata, RDFa).
YES, if they have a specific

class, using Custom Extraction.
YES, if the reviews are static HTML, meaning they exist in the page source.
NO, if reviews are loaded dynamically via JavaScript (use Selenium instead).

Need Help Automating This?
Would you like a step-by-step guide or a custom Python script to scrape and structure reviews automatically?