Best of Crawling — 2022

1
Article
Towards Dev·4y
Web scraping with JavaScript and Node.js
Web scraping is the process of extracting content and data from a website. Many industries use web scraping for market analysis, price comparison, and machine learning. We are going to build a simple web scraper with javascript and Node.js. We will use it to scrape information related to a product, specifically the product’s name, price, and image.
69
3
2
Article
Towards Dev·4y
How we migrated from Bootstrap to Tailwind CSS.
Tailwind CSS is a utility-first CSS framework packed with classes like flex, pt-4, text-center, and rotate-90 that can be composed to build any design, directly in your markup. In our examples, we will use packages and commands related to the GatsbyJS static-site generation framework. This function adds the content in before the page is rendered. Don’t waste your time trying networkidle.
55
4
3
Article
Community Picks·4y
Reverse Proxy vs. Forward Proxy: The Differences
Reverse proxy vs Forward Proxy: The Differences Back to blog1666902057.72391514e Iveta Vistorskyte. In general, proxies are intermediaries between a user and a web server. In this article, we’ll explain the differences between the forward vs reverse proxy servers.
39
1
4
Article
Product Hunt·4y
Crawlee - Crawlee helps you build reliable crawlers, fast
Crawlee is an intuitive, customizable open-source library for web scraping and browser automation.
37
5
Article
Community Picks·4y
Building an interactive screen-sharing app with Puppeteer and React
Puppeteer is a Node.js library that automates several browser actions such as form submission, crawling single-page applications, UI testing, and generating screenshot and PDF versions of web pages. It can be In-App (the bell icon like you have in the Dev Community - Websockets), Emails, SMSs and so //New imports. The code snippet below starts the server using Nodemon. The code snippets below begins the server. Nodomon is the code snippet above. For more information on the code below.
33
1
6
Article
DZone·4y
Web Scraping as an API Service
Playwright is a Python tool that can be used to scrape web pages. It can also be used as a way to integrate a web scraping tool with an API service. Playwright generates Python code that can then be used in API integrations. The tool is good and has many other uses that go beyond the scope of this text.
31
1
7
Article
Community Picks·4y
The Full Guide to Web Scraping & Automation with JavaScript and NodeJS
Node has created an enormous ecosystem and use cases ranging from data science, game development, AI and of course, web automation. The Full Guide to Web Scraping & Automation with JavaScript and NodeJS is a guide for anyone who wants to start web automation in 2022 using this incredible JavaScript platform.
30
2
8
Article
Geek Culture·4y
Dynamic web scraping using selenium : scrape protected websites
In this article we will use beautiful soup to scrape the information from webpages. Beautiful Soup can only Handel static webpages not dynamic. Selenium is a framework which can run and execute the scripts and control your web browser by sending and receiving methods calls and data to or from the Web Driver.
23
1
9
Article
LogRocket·4y
Web scraping with Rust
Web scraping refers to gathering data from a webpage in an automated way. The details of web scraping highly depend on the page you’re getting the data from.
22
1
10
Article
Hashnode·4y
Find resources to learn and opportunities to earn.
13bhai helps you get everything from freelance jobs to courses to hackathons on any topic you like from the most popular and trusted sources across the internet. Just select what you want to look for, hit search and there it is. You have all the information with the original links. Scraping relevant information was the biggest challenge.
21
11
Article
HEARTBEAT·4y
How to Code a Web Scraper
When web scraping code is executed, a request is made to the specified URL (Universal Resource Locator) The server transmits the information in response to the request. After parsing the HTML or XML page, the code then extracts the data. A web scraper’s durability is also a serious issue.
20
12
Article
Community Picks·4y
Web Scraping with Go
Wikipedia defines web scraping as follows: Web scraping, web harvesting, or web data extraction. Web scraping is a process for extracting data from websites and is used in many cases, ranging from data analysis to lead generation. In this article, you will learn how to create a simple web scraper using Go.
19
13
Article
freeCodeCamp·4y
How to Use Puppeteer With Node.js
Puppeteer is a JavaScript library that allows you to script and interact with browser windows. With it, you can run tests in the browser and then see the results in real-time on your terminal. To use Puppeteer with Node.js, you'll need to install several packages and set up a few environment variables.
19
14
Article
gitconnected·4y
How I scrape lots of sites with one python script
Mykhailo Kushnir has created a python script that can scrape 90% of websites for free. He uses a dockerized version of Selenium and chromedriver for stability. The script can be used to scrape good-old quotes.toscrape.com. The result is stored in a CSV file.
16
15
Article
freeCodeCamp·4y
Web Scraping in Python – How to Scrape Sci-Fi Movies from IMDB
The script pulls in movie titles, years, ratings (PG-13, R, and so on), genres, runtimes, reviews, and votes for each movie. The script uses the sleep function, for example, to slow down the pull requests intentionally. It takes 40 min to scrape 200 webpages using the Google Col #parse the content of current iteration of the current iteration. It takes 50 movies for that page in movie-containers. For more information, see the script.
14
1
16
Article
DEV·4y
Turning React apps into PDFs with Next.js, NodeJS and puppeteer
A month ago, I rebuild my resume with Next.js and Tailwindcss. The technique can come in handy if you for example, would have to generate invoices inside your already existing applictaion. This is NOT a production ready implementation. If there's enough interest, I can make a follow-up post.
11

See all Crawling archives