- Extract data from SSR sites (React, Next.js, Remix, etc) (example).
- Some websites will have JSON-LD embedded for search engines to scrap. You can, too!
<script type="application/ld+json">
- Extract sitemaps to get a list of pages to scrap.
- Extract
<meta>
tags for basic and not-so-basic metadata. - Inject code into React apps (monkeypatching or using the browser debugger) and send the data to your own API server. Demo