Best of PandasSeptember 2024

  1. 1
    Article
    Avatar of mlmMachine Learning Mastery·2y

    Automating Data Cleaning Processes with Pandas

    Discover how to automate data cleaning processes using the Pandas library. Learn about typical data cleaning functions like filling missing values, removing duplicates, manipulating strings, and converting date formats. The post also introduces a custom class, DataCleaner, to encapsulate these steps into a reusable pipeline for an efficient and systematic approach to data cleaning.

  2. 2
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·2y

    Accelerate Pandas 20x using FireDucks

    FireDucks is a highly optimized alternative to Pandas, boasting up to 20x performance improvements by leveraging multi-core CPU capabilities and lazy execution. With the same API as Pandas, FireDucks allows for seamless integration into existing Pandas pipelines by simply changing the import statement. The library is currently available for Linux x86_64, with versions for Windows and MacOS in development.

  3. 3
    Article
    Avatar of planetpythonPlanet Python·2y

    Using Pandas to Read JSON from URL

    Learn how to use Pandas in Python to read JSON data directly from a URL into a DataFrame. This tutorial covers a basic example and explains the key parameters of the `pd.read_json()` method, enabling customization of the data reading process.