Open Screaming Frog
Go to Configuration in the top menu
Custom > Custom Extraction
Use Inspect Element (right click on the copy and choose “inspect” if you use Chrome browser) – to identify the name, class or ID of the div or element the page copy is contained in:
In this example the Div class is “prose” (f8ck knows why)
The following will scrape any text in the div called “prose”:
Choose:
Extractor 1
X Path
enter –> //div[@class=’classofdiv’] —-> in this example – //div[@class=’prose’]
“extract Text”
To extract the text inside headings (H1s) – use the instructions above, but use the code:
//h1/text()