Best of Crawling2023

  1. 1
    Article
    Avatar of freecodecampfreeCodeCamp·3y

    Web Scraping in JavaScript – How to Use Puppeteer to Scrape Web Pages

    Web Scraping in JavaScript - How to Use Puppeteer to Scrape Web Pages. We'll introduce the basics of web scraping in JavaScript and Node.js in this article. The ToScrape website provides two projects specifically designed for web scraping, making it a great starting point to learn the basics.

  2. 2
    Article
    Avatar of communityCommunity Picks·3y

    Web Scraping Explained

    Web scraping, also known as data scraping, is the process of extracting and collecting data from the internet or websites. The scraped data can be saved on a local system or can be used for data analysis. Web scraping is used by large corporations to extract data for sentiment research on social media, brand product information, stock and financial analysis.

  3. 3
    Article
    Avatar of asayerasayer·3y

    Playwright vs Puppeteer: which to choose?

    Playwright and Puppeteer are headless browsers designed for end-to-end (E2E) automated testing of web apps. Both are go-to for web automation programmers’ toolkits. Tests ensure that code behaves as expected and catches bugs early on.

  4. 4
    Article
    Avatar of communityCommunity Picks·3y

    Creating a website brand with ChatGPT, Stable Diffusion, React and NodeJS 🤯

    Stable Diffusion is a text-to-image model that can convert any text into a picture. ChatGPT is also an AI language model trained by OpenAI to generate text and interact with users in a human-like conversational manner.

  5. 5
    Article
    Avatar of communityCommunity Picks·3y

    Creating a website aggregator with ChatGPT, React, and Node.js 🚀

    A website aggregator is a website that collects data from other websites across the internet. ChatGPT is an AI language model trained by OpenAI to generate text and interact with users in a human-like conversational manner. Users can submit requests and get answers to questions from a wide range of topics such as history, science, mathematics.

  6. 6
    Article
    Avatar of hackernoonHacker Noon·3y

    Design of a Scraping Platform

    Design of a Scraping Platform can be split into 2 categories that require different infrastructure and techniques. The system only has one functional requirement: it must be able to execute arbitrary scrapers on schedule. Scrapyd deploy eggified Python packages Packages may include dependencies in the egg file and different scrapers may use different versions.

  7. 7
    Article
    Avatar of syscolabsslSysco LABS Sri Lanka·3y

    Web Scraping 101

    Web scraping is the automation of data extraction from web pages. It is required when a large amount of data needs to be collected from multiple sources. The correct toolset can be identified by considering the need for user interaction and the programming language choice.

  8. 8
    Article
    Avatar of awstipAWS Tip·3y

    Introducing Scrapy: The powerful Python Library For Efficient Web Scraping

    Scrapy is designed to be fast and efficient, so it can handle large amounts of data without slowing down. It is highly extensible, so you can customize it to fit your specific needs. You can write your own middleware, pipelines, and extensions to modify the behavior of Scrapey.

  9. 9
    Article
    Avatar of lambdatestLambdaTest·3y

    A Complete Guide to Web Scraping with Python

    Web scraping is a powerful tool for collecting data from websites. Python is a popular programming language for web scraping, with libraries like BeautifulSoup and Selenium making the process easier. Web scraping can be used for various purposes, such as competitor analysis, lead generation, and data analysis. It's important to be aware of the legal and ethical considerations of web scraping and to comply with a website's terms of service.

  10. 10
    Article
    Avatar of hackernoonHacker Noon·3y

    116 Stories To Learn About Web Scraping

    The amount of data available on the internet nowadays is practically limitless, with much of it no more than a few clicks away. Learn how to scrape the web using scripts written in node.js to automate scraping data off of the website and using it for whatever purpose.

  11. 11
    Article
    Avatar of inPlainEngHQPython in Plain English·3y

    One scraping browser to rule them all

    Learn about the benefits of using a web scraping browser, different headless browsers available for web scraping, and the main benefits of the Bright Data scraping browser.

  12. 12
    Article
    Avatar of freecodecampfreeCodeCamp·3y

    How to Scrape Multiple Web Pages Using Python

    Learn how to perform web scraping using Python's Beautiful Soup library. Understand the proper approach for web scraping, how to scrape a single web page, and how to scrape multiple web pages.

  13. 13
    Article
    Avatar of communityCommunity Picks·3y

    Web Scraping: Learn how to Use Tools and Techniques

    Web scraping is a powerful tool in our digital age. It’s used in a wide array of applications, from data analysis and machine learning, to content aggregation and competitive analysis. In the next section, we’ll start looking at some tools we can use to scrape data from websites.

  14. 14
    Article
    Avatar of communityCommunity Picks·3y

    Web Scraping with Bright Data, Node.js, and Puppeteer

    Web scraping is the process of extracting structured data from websites automatically. It involves using software tools to access and retrieve specific information from web pages. Bright Data stands out as a powerful and versatile solution. The legality of web scraping varies across jurisdictions and depends on factors such as website’s terms of service.

  15. 15
    Article
    Avatar of freecodecampfreeCodeCamp·3y

    Use Scrapy for Web Scraping in Python

    The Scrapy Beginners Course is broken down into 13 easy-to-follow parts that cover everything you need to know to get started with web scraping. The course also teaches you how to deploy your scraper to the cloud and schedule it to run periodically. Using fake user-agents and browser headers to avoid getting blocked.

  16. 16
    Article
    Avatar of sitepointSitePoint·3y

    Top React Testing Libraries in 2023: A Comprehensive Review

    In 2023, we’ll explore some of the top React testing libraries that developers rely on in 2023 to test React components and apps. Jest is the default testing framework for testing React apps, making it a popular choice for developers. Jasmine is a simple yet powerful test framework for browsers and Node.js.

  17. 17
    Article
    Avatar of hackernoonHacker Noon·3y

    Web scraping using a headless browser in NodeJS

    Web scraping using a headless browser in NodeJS Too Long; Didn't Read Web scraping collects and extracts unstructured data from a website to a more readable structured format like CSV and more. Organizations set restrictions for web scraping guiding how users are allowed to collect data which on every website has a guiding principle.

  18. 18
    Article
    Avatar of asayerasayer·2y

    End-to-end React apps testing made easy with Puppeteer

    This post explores how Puppeteer makes automated testing simple and effective for React applications. It explains what Puppeteer and Jest are and provides instructions on setting up a project with Puppeteer and Jest.

  19. 19
    Article
    Avatar of communityCommunity Picks·2y

    The Complete Puppeteer Cheatsheet

    Puppeteer is a Node.js library developed by Google for controlling headless Chrome and Chromium. It allows you to automate UI testing, scraping, screenshot testing, and more.

  20. 20
    Article
    Avatar of medium_jsMedium·3y

    Practical Web Scraping for Data Analysts

    Learn what web scraping is and how to scrape an e-commerce website using Python. Use libraries such as BeautifulSoup, csv, pandas, and requests.

  21. 21
    Article
    Avatar of hackernoonHacker Noon·3y

    Web Scraping with Python Using Regular Expressions

    HackerNoon provides information on top trending tech job openings, writing contests, and the ability for writers to collect emails and grow their newsletter.

  22. 22
    Article
    Avatar of kdnuggetsKDnuggets·3y

    A Step-by-Step Guide to Web Scraping with Python and Beautiful Soup

    A Step-by-Step Guide to Web Scraping with Python and Beautiful Soup - KDnuggets. Learn the basics of Web Scrapeing and its Python implementation. Also, get to know about the various methods of Beautiful Soup library.