Best of Data ScienceMay 2024

  1. 1
    Video
    Avatar of fireshipFireship·2y

    Google's secret algorithm exposed via leak to GitHub…

    Leaked documents reveal potential lies about Google's search ranking algorithm, including the use of site authority, the importance of clicks, the impact of data collected from Chrome users, and the continued importance of high-quality backlinks.

  2. 2
    Article
    Avatar of communityCommunity Picks·2y

    The State of Data Engineering 2024

    The 2024 State of Data Engineering report discusses the influence of GenAI on software infrastructure, the expansion of product offerings due to the economic downturn, and the impact of open table formats and their catalogs in the data lake industry. It also highlights the importance of data version control and observability in AI/ML systems.

  3. 3
    Article
    Avatar of kdnuggetsKDnuggets·2y

    5 Free MIT Courses to Learn Math for Data Science

    Learn math for data science with free courses from MIT, covering topics such as linear algebra, calculus, statistics, and probability.

  4. 4
    Article
    Avatar of kdnuggetsKDnuggets·2y

    Top SQL Queries for Data Scientists

    Learn about the main SQL concepts for data scientists, including querying and filtering data, working with NULLs, data type conversion, data aggregation, and more.

  5. 5
    Video
    Avatar of communityCommunity Picks·2y

    How I'd Learn AI in 2024 (if I could start over)

    Learn the fundamentals of AI and data science, set up a Python work environment, work on projects, specialize in a specific area, share knowledge, continue learning and upskilling, and monetize your skills.

  6. 6
    Article
    Avatar of kdnuggetsKDnuggets·2y

    10 Free Must-Take Data Science Courses to Get Started

    A list of 10 free data science courses for beginners to kickstart their career in data science.

  7. 7
    Article
    Avatar of communityCommunity Picks·2y

    7 Best Python Visualization Libraries for 2024

    Discover the top Python libraries for data visualization in 2024.

  8. 8
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·2y

    5 LLM Fine-tuning Techniques Explained Visually

    This post explains five fine-tuning techniques for LLMs, including LoRA, LoRA-FA, VeRA, Delta-LoRA, and LoRA+.

  9. 9
    Article
    Avatar of mlmMachine Learning Mastery·2y

    Beginning Data Science (7-day mini-course)

    This post provides a 7-day mini-course on beginning data science. It covers topics such as tools in data science, target audience, and the lessons covered in the course.

  10. 10
    Video
    Avatar of TechWithTimTech With Tim·2y

    5 Coding Niches That ACTUALLY Make You Money in 2024

    Explore the highest paying coding niches in 2024, including artificial intelligence and machine learning, data science, blockchain development, cybersecurity, and DevOps.

  11. 11
    Article
    Avatar of earthlyEarthly·2y

    Top 10 Python Libraries for Data Science

    This post explores the top Python libraries for data science, including libraries for data acquisition, data analysis and processing, machine learning, and data visualization.

  12. 12
    Article
    Avatar of kdnuggetsKDnuggets·2y

    5 Simple Steps to Automate Data Cleaning with Python

    Learn how to automate the data cleaning process with a 5-step pipeline in Python. The pipeline includes steps for identifying data format, removing duplicates, handling missing values, and dealing with outliers.

  13. 13
    Article
    Avatar of medium_jsMedium·2y

    Building an Agent for Data Visualization (Plotly)

    Learn how to build an agent for data visualization using Plotly. Discover the limitations of language models in data visualization and how an agent can mitigate these issues.

  14. 14
    Article
    Avatar of awegoAwesome Go·2y

    Introduction to Generative AI with Go

    Learn about generative AI and how to integrate it into Go applications in this free remote webinar with Daniel Whitenack.

  15. 15
    Article
    Avatar of kdnuggetsKDnuggets·2y

    Navigating Your Data Science Career: From Learning to Earning

    Data science career requires a combination of academic education and self-learning. Technical skills such as programming and data manipulation are important, as well as soft skills such as communication and analytical thinking. The career offers attractive salaries and plenty of job opportunities.

  16. 16
    Article
    Avatar of mlnewsMachine Learning News·2y

    Empowering Developers and Non-Coders Alike to Build Interactive Web Applications Effortlessly

    Taipy Designer is a no-code studio that allows users to design web pages by dragging and dropping graphical widgets. It integrates Python code with web interfaces and is accessible to both Python developers and non-professional developers. It provides basic widgets for charts and easy access to chart libraries like Matplotlib and Plotly. Taipy Designer aims to empower developers and non-coders alike to build interactive web applications effortlessly.

  17. 17
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·2y

    A Visual Guide to AdaBoost

    A step-by-step explanation of how AdaBoost works, using decision trees as weak learners. AdaBoost progressively learns from previous model's mistakes and reweighs instances to improve predictions.

  18. 18
    Article
    Avatar of communityCommunity Picks·2y

    Which programming language to use for coding interviews

    The choice of programming language for coding interviews can greatly impact performance, with Python and Java being commonly preferred. Familiarity with the language is also important, and it's recommended to use a language you're already familiar with. However, there are exceptions for domain-specific positions. Learning a new language just for interviewing is generally not recommended.

  19. 19
    Article
    Avatar of medium_jsMedium·2y

    Getting Started with Rust

    A post sharing resources and recommendations for getting started with Rust programming language, emphasizing its performance and memory safety features.

  20. 20
    Article
    Avatar of kdnuggetsKDnuggets·2y

    The Best Strategies for Fine-Tuning Large Language Models

    Learn how to fine-tune large language models for specialized tasks and customize them to suit specific requirements.

  21. 21
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·2y

    Most Important Plots in Data Science

    The post discusses the most important plots in data science, including the KS Plot, SHAP Plot, ROC Curve, Precision-Recall Curve, QQ Plot, Cumulative Explained Variance Plot, Elbow Curve, Silhouette Curve, Gini-Impurity and Entropy, Bias-Variance Tradeoff, and Partial Dependency Plots.

  22. 22
    Article
    Avatar of kdnuggetsKDnuggets·2y

    Harvard’s Top Free Courses for Aspiring Data Scientists

    Harvard offers free courses for aspiring data scientists, including an introduction to programming with Python, probability from the ground up, introduction to data science with Python, and machine learning and AI with Python.

  23. 23
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·2y

    Build Interactive Data Apps of Scikit-learn Models Using Taipy

    Learn how to build interactive data apps of Scikit-learn models using Taipy, a low-code data pipeline interface. Taipy allows for parallelization and caching to optimize the execution of data pipelines. Install Taipy and Taipy Studio to get started. The code examples provided demonstrate creating a model app using Taipy, defining tasks and pipelines, and creating a graphical interface for user interaction.

  24. 24
    Article
    Avatar of kdnuggetsKDnuggets·2y

    Feature Engineering for Beginners

    This guide introduces key techniques in feature engineering, including handling missing values, encoding categorical variables, and scaling and normalizing data. It also covers advanced techniques such as feature creation, dimensionality reduction, and time series feature engineering. The post provides practical examples in Python and offers practical tips and best practices.

  25. 25
    Article
    Avatar of taiTowards AI·2y

    Data Science Interview Question: Creating ROC & Precision-Recall Curves From Scratch

    Learn how to create ROC and Precision-Recall curves from scratch in data science interviews. Import necessary libraries and generate actual data and probabilities. Combine actual and predicted values and calculate metrics for curve creation. Plot the ROC, Sensitivity-Specificity, and Precision-Recall curves.