In recent years, the web has gotten very hostile to the lowly web scraper. The data is public but completely inaccessible to machines because of aggressive anti-web-scraping measures. I plucked out a unique string from the data visible on the web page and took a heap snapshot of the browser's Javascript runtime via Chrome's Dev Tools.
Table of contents
puppeteer-heap-snapshot is born.Future proof web-scraping?Get started heap snapshot scraping.2 Comments
Sort: