Open Screaming Frog
Go to Configuration in the top menu
Custom > Custom Extraction
Use Inspect Element (right click on the copy and choose “inspect” if you use Chrome browser) – to identify the name, class or ID of the div or element the page copy is contained in:
In this example the Div class is “prose” (f8ck knows why)
![Custom Extraction – Page Copy – With Screaming Frog [2024] Custom Extraction – Page Copy – With Screaming Frog [2024]](https://m5.paperblog.com/i/761/7619143/custom-extraction-page-copy-with-screaming-fr-L-MPORH2.png)
The following will scrape any text in the div called “prose”:
![Custom Extraction – Page Copy – With Screaming Frog [2024] Custom Extraction – Page Copy – With Screaming Frog [2024]](https://m5.paperblog.com/i/761/7619143/custom-extraction-page-copy-with-screaming-fr-L-2F4ghc.png)
Choose:
Extractor 1
X Path
enter –> //div[@class=’classofdiv’] —-> in this example – //div[@class=’prose’]
“extract Text”
To extract the text inside headings (H1s) – use the instructions above, but use the code:
//h1/text()