Best of Crawling: June 2024

  1. Article
    Community Picks·2y

    Building a Netflix show recommender using Crawlee and React

    Learn how to build a Netflix show recommender by scraping Netflix content with Crawlee and visualizing it in a React app built with Vite. The guide covers prerequisites, installation steps, writing the scraping code with Cheerio, and integrating the scraped data into the React application.

  2. Article
    DEV·2y

    I built my first SaaS - NotiFast

    NotiFast is a versatile notification bot designed to alert users about changes on websites they follow, such as new items or content updates. Built on the Discord platform, it offers seamless notifications without requiring user authentication and integrates easily with Discord's payment system. Initially derived from the open-source project webscraper-bot, NotiFast aims to simplify webpage monitoring with an easy-to-use visual creator. A free beta is currently available for the first 100 users.

  3. Article
    Community Picks·2y

    Vercel + Puppeteer

    Learn how to use Puppeteer on Vercel to generate PDFs of websites, including best practices for setting up Puppeteer for Vercel and deploying your Puppeteer code there.

  4. Article
    Machine Learning News·2y

    Firecrawl: A Powerful Web Scraping Tool for Turning Websites into Large Language Model (LLM) Ready Markdown or Structured Data

    Firecrawl, developed by Mendable AI, is a state-of-the-art web scraping tool designed to efficiently extract data from websites, including those with dynamic JavaScript-rendered content. It outputs clean, well-formatted Markdown suitable for Large Language Model (LLM) applications, while incorporating caching mechanisms and generative feedback loops to enhance data quality and extraction efficiency. Users can access Firecrawl via an intuitive API and multiple SDKs for different programming languages.