
Scraping Reviews with Screaming Frog?

By Geoff Griffiths @mmatraining1980

You can scrape reviews if they are:

– In schema markup
– In a <p> tag with its own specific class

e.g. class="review-text text-base font-secondary"

Can you scrape the reviews then?

Yes! If the reviews are in schema markup (structured data such as Review or AggregateRating), Screaming Frog's structured data extraction will pick them up. If they sit in a <p> tag with a specific class, you can scrape them using Screaming Frog's Custom Extraction feature.

✅ How to Scrape Reviews in Schema or Specific HTML Classes Using Screaming Frog
1⃣ Enable Structured Data Extraction (for Schema Reviews)
If the reviews are within schema markup (JSON-LD, Microdata, or RDFa), Screaming Frog can extract them directly:

✔ Open Screaming Frog SEO Spider.
✔ Go to Configuration > Spider > Extraction and tick the Structured Data options (JSON-LD, Microdata, RDFa).
✔ Start a crawl; Screaming Frog will extract structured data in the formats you enabled.
✔ After the crawl is complete, open the Structured Data tab, filter for pages that contain structured data, and export the results to find the Review markup.

2⃣ Extract Reviews from a Specific <p> Class (e.g., class="review-text text-base font-secondary")
If the reviews are inside a <p> tag with a specific class, you can extract them using Custom Extraction:

✔ Open Screaming Frog and go to Configuration > Custom > Custom Extraction.
✔ Add an extractor and choose XPath or CSSPath (Screaming Frog's name for CSS selectors), depending on your data structure.
✔ Use one of the following extraction methods:

📝 Option 1: Using CSS Selector (Best for Class-Based Extraction)
p.review-text.text-base.font-secondary
✔ This will pull all <p> tags that carry all three classes (review-text, text-base, and font-secondary), in any order.
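
To sanity-check what that selector should match before running a full crawl, the same rule can be sketched in Python with the standard library only. This is a rough illustration, not how Screaming Frog works internally; the HTML fragment and the ReviewParagraphs class name are made up for the example.

```python
from html.parser import HTMLParser

# Classes taken from the example selector in this article.
REQUIRED = {"review-text", "text-base", "font-secondary"}


class ReviewParagraphs(HTMLParser):
    """Collect the text of <p> elements that carry every required class."""

    def __init__(self):
        super().__init__()
        self.reviews = []
        self._buffer = None  # text chunks collected while inside a matching <p>

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            classes = set((dict(attrs).get("class") or "").split())
            if REQUIRED <= classes:  # all required classes present, any order
                self._buffer = []

    def handle_data(self, data):
        if self._buffer is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "p" and self._buffer is not None:
            self.reviews.append("".join(self._buffer).strip())
            self._buffer = None


# Hypothetical page fragment, for illustration only.
html = """
<p class="review-text text-base font-secondary">Great service.</p>
<p class="font-secondary review-text text-base extra">Arrived late.</p>
<p class="text-base">Not a review.</p>
"""

parser = ReviewParagraphs()
parser.feed(html)
print(parser.reviews)
```

The second paragraph still matches even though its classes are reordered and it has an extra class, which is the behaviour you would expect from the CSS selector.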

📝 Option 2: Using XPath (For More Advanced Queries)
//p[contains(@class, 'review-text text-base font-secondary')]
✔ This will find all <p> elements whose class attribute contains the exact string "review-text text-base font-secondary", so the classes must appear together and in that order.
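
Because contains() is a plain substring test, the XPath above can miss reviews whose classes are written in a different order. The Python sketch below mimics that behaviour to show the difference; the function names are hypothetical and exist only for this illustration. If order might vary on your site, one option is to chain separate checks in the XPath itself, e.g. contains(@class, 'review-text') and contains(@class, 'text-base').

```python
def xpath_contains(class_attr, needle):
    """Mimic XPath contains(@class, needle): a plain substring test."""
    return needle in class_attr


def has_all_classes(class_attr, required):
    """Order-independent check: every required class is a whole token."""
    return set(required) <= set(class_attr.split())


needle = "review-text text-base font-secondary"
required = needle.split()

same_order = "review-text text-base font-secondary"
reordered = "text-base review-text font-secondary"

print(xpath_contains(same_order, needle))    # True
print(xpath_contains(reordered, needle))     # False: substring not found
print(has_all_classes(reordered, required))  # True: order does not matter
```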

3⃣ Run the Crawl and Export the Review Data

✔ Start the crawl and let Screaming Frog extract the reviews.
✔ After completion, open the Custom Extraction tab and click Export.
✔ Open the file in Excel or Google Sheets for further analysis.
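
If you would rather reshape the export in code than in a spreadsheet, something along these lines may help. It is only a sketch: the column names ("Address", "Review 1", "Review 2") are assumptions about how the export is laid out with an extractor named "Review", and the sample rows are invented, so check the header row of your own file first.

```python
import csv
import io

# Stand-in for the exported file; the column names are assumptions.
sample_export = io.StringIO(
    "Address,Review 1,Review 2\n"
    "https://example.com/a,Great service.,Arrived late.\n"
    "https://example.com/b,Would buy again.,\n"
)

rows = []
for record in csv.DictReader(sample_export):
    url = record["Address"]
    for column, value in record.items():
        # Keep only non-empty extraction columns.
        if column.startswith("Review") and value:
            rows.append((url, value))

# One (URL, review) pair per line, ready for further analysis.
for url, review in rows:
    print(url, "|", review)
```

For a real export, replace the StringIO object with open("your_export.csv", newline="", encoding="utf-8").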

🚀 Alternative Approach: Scraping JSON-LD Schema Reviews
If reviews are embedded in JSON-LD schema markup, you can extract them with:

Custom Extraction (JSON-LD):

//script[@type='application/ld+json']/text()

✔ This extracts the raw JSON-LD from each page, which you can then filter for reviews.
✔ Use a JSON tool (like jq or Python's json.loads()) to clean and parse the data.
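
As a rough idea of what that parsing step could look like, the sketch below loads one JSON-LD string with json.loads() and walks it for Review objects. The sample markup and the find_reviews helper are hypothetical; real pages may nest reviews differently (for example under @graph), which is why the walk is recursive.

```python
import json


def find_reviews(node):
    """Recursively yield every dict whose @type is or includes 'Review'."""
    if isinstance(node, dict):
        node_type = node.get("@type")
        types = node_type if isinstance(node_type, list) else [node_type]
        if "Review" in types:
            yield node
        for value in node.values():
            yield from find_reviews(value)
    elif isinstance(node, list):
        for item in node:
            yield from find_reviews(item)


# Hypothetical JSON-LD, standing in for one extracted <script> block.
raw = """
{
  "@context": "https://schema.org",
  "@type": "Product",
  "name": "Example Widget",
  "review": [
    {"@type": "Review", "reviewBody": "Great service.",
     "reviewRating": {"@type": "Rating", "ratingValue": "5"}},
    {"@type": "Review", "reviewBody": "Arrived late.",
     "reviewRating": {"@type": "Rating", "ratingValue": "2"}}
  ]
}
"""

reviews = [
    (r.get("reviewBody"), r.get("reviewRating", {}).get("ratingValue"))
    for r in find_reviews(json.loads(raw))
]
print(reviews)
```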

🔹 Summary: Can Screaming Frog Scrape Reviews?
✅ YES, if reviews are inside schema markup (JSON-LD, Microdata, RDFa).
✅ YES, if they sit in a <p> tag with a specific class, using Custom Extraction.
✅ YES, if the reviews are static HTML, meaning they exist in the page source.
❌ NOT by default, if reviews are loaded dynamically via JavaScript: switch on JavaScript rendering (Configuration > Spider > Rendering) or use a browser automation tool such as Selenium instead.


