Best of Data Science — 2023

  1. 1
    Article
    Avatar of kdnuggetsKDnuggets·3y

    Stop Doing this on ChatGPT and Get Ahead of the 99% of its Users

    The problem often lies not with the AI itself, but with the limitations and vagueness of the input provided. By training ChatGPT, you can get a personal writing assistant for free! It’s time to discover the art of crafting powerful prompts to make the most of this cutting-edge technology.

  2. 2
    Article
    Avatar of communityCommunity Picks·2y

    My favorite coding question to give candidates (and why)

    The article discusses the author's favorite coding question to ask candidates during interviews. It emphasizes the importance of clarifying questions, understanding code complexity, and optimizing solutions. The question involves generating a list of loyal customers based on log files. The article highlights different approaches taken by candidates and provides insights for interviewers.

  3. 3
    Article
    Avatar of kdnuggetsKDnuggets·3y

    Why is DuckDB Getting Popular?

    DuckDB combines the simplicity and ease of use of SQLite with the analytical performance of specialized columnar databases. DuckDB is free and open-source software, so anyone can use and modify the code. It's embedded, meaning the DBMS ( database management system) runs in the same process as the application that uses it.

  4. 4
    Article
    Avatar of freecodecampfreeCodeCamp·3y

    850+ Online Courses with Real College Credit that You Can Access for Free

    Class Central has compiled 70+ MOOC-based master’s degrees. Some courses are “freebies’ meant to give prospective students a taste of what the full online degree would be like. Others, like edX's MicroMasters, are essentially partial degree programs.

  5. 5
    Article
    Avatar of devtoDEV·3y

    20 Essential Coding Patterns to Ace Your Next Coding Interview

    Learn about coding patterns that can enhance problem-solving skills and help ace coding interviews.

  6. 6
    Article
    Avatar of communityCommunity Picks·3y

    Best coding interview question of all time.

    This post discusses the best coding interview question of all time, which is about finding a duplicate number in an array. It covers various aspects of the problem, including clarification questions, optimal solutions using HashMap and array index, and additional constraints like immutable arrays. The post also highlights the importance of competence, creativity, and attitude in solving algorithmic problems during interviews.

  7. 7
    Article
    Avatar of kdnuggetsKDnuggets·3y

    7 Ways ChatGPT Makes You Code Better and Faster

    ChatGPT is your trusty companion throughout the entire development process, offering valuable assistance every step of the way. It can help you think through the scope, architecture, requirements and feasibility of your software development or data science project. ChatGPT's suggested workflow and roadmap have also saved me time by pointing me in the right direction from the start.

  8. 8
    Article
    Avatar of theregisterThe Register·3y

    Twitter algorithm will be open sourced 'next week': Musk

    Twitter's algorithm is going open source next week, according to Elon Musk. Opening up the algorithm can help combat algorithmic bias and speed up adoption of machine learning algorithms. The article also mentions a Supreme Court case that could impact how the internet functions.

  9. 9
    Article
    Avatar of communityCommunity Picks·3y

    Bubble Sort...in PURE CSS? [No JS] đŸ˜±

    Learn how to implement a functioning bubble sort algorithm in pure CSS and add visualizations to it. Discover interesting CSS switches and booleans that may be useful in future scenarios.

  10. 10
    Article
    Avatar of tigerdataTigerData (Creators of TimescaleDB)·2y

    Quickly Building SQL Dashboards for Time Series

    Continuous aggregates in TimescaleDB are an enhanced version of PostgreSQL materialized views that automatically refresh as new data is added or modified. These aggregates enable real-time dashboards for time-series data in PostgreSQL, allowing for fast queries and analytics, even with large amounts of data.

  11. 11
    Article
    Avatar of aiplainenglishAI in Plain English·3y

    Machine Learning: Must Do Projects for Beginners

    Machine Learning: Must Do Projects for Beginners. We will examine some of the most well-liked machine learning datasets and models in this article. In the next sections, we will discuss several datasets and the models that may be used in that exercises.

  12. 12
    Article
    Avatar of mlnewsMachine Learning News·3y

    Best Free Prompt Engineering Resources (2023)

    PromptPerfect is made to automatically improve prompts for large language models (LLMs), large models (LMs), and LMOps. It can optimize prompts for the ChatGPT, GPT-3.5, DALLE, and StableDiffusion models. Using PromptPerfect’s multi-goal optimization, users may tailor prompt optimization to suit their unique requirements.

  13. 13
    Article
    Avatar of tdsTowards Data Science·2y

    System Design Cheatsheets: ElasticSearch

    ElasticSearch is a powerful search engine that excels at performing full-text searches on large datasets. It can be used as a secondary database for full-text search operations, a real-time data analysis pipeline, or a recommendations system. However, ElasticSearch is not suitable for ACID compliance, complex joins, or small datasets with simple query needs. When using ElasticSearch in system design, consider its distributed architecture, scalability, document-based data modeling, real-time data analysis capabilities, and cost implications.

  14. 14
    Article
    Avatar of kdnuggetsKDnuggets·3y

    5 Free Books to Help You Master Python

    This article presents five free books to help readers become proficient in Python and build maintainable applications. The recommended books cover topics such as data analysis, automation, patterns, recipes, and clean architecture.

  15. 15
    Article
    Avatar of kotlinKotlin·3y

    Introducing Kotlin Notebook

    Kotlin Notebook is an interactive tool that allows you to combine code, visualizations, and text in a single document. In notebooks, you can write and run code snippets (cells), view the results, and document your thought process all in one place. This post will offer insight into the plugin’s functionality and use cases.

  16. 16
    Article
    Avatar of inPlainEngHQPython in Plain English·3y

    This Book is “Perfect” for Learning Python

    The book is written in a clear and concise style, making it easy to follow along with the examples and exercises. The author has done an excellent job of breaking down complex concepts into simple, easy-to-understand language that even a beginner can comprehend. The book includes several practical projects, including data visualization and app development.

  17. 17
    Article
    Avatar of kdnuggetsKDnuggets·3y

    Docker for Data Science Cheat Sheet

    Docker is an open-source platform for building, packaging, and distributing portable applications, making it an important tool for data science. Docker helps ensure reproducibility and collaboration by allowing code and dependencies to be packaged in containers. The Docker for Data Science cheat sheet provides a handy reference for data scientists, covering topics like containerization, Docker Compose, and advanced management of Docker images.

  18. 18
    Article
    Avatar of kdnuggetsKDnuggets·3y

    Learn Machine Learning From These GitHub Repositories

    Learn Machine Learning From These GitHub Repositories Kickstart your machine learning career with these curated GitHub repositories. The best of ML with Python repository has 910 open-source projects that are grouped into 34 categories. TensorFlow is one of the most popular machine learning frameworks out there and is vital to your skillset.

  19. 19
    Article
    Avatar of freecodecampfreeCodeCamp·3y

    Create Your Own Programming Language Using Python

    The course is designed for people who already have a fundamental understanding of Python and are eager to explore the depths of computer science and language design. The hands-on curriculum teaches you the underpinnings of how programming languages work, and how you can build one using Python.

  20. 20
    Article
    Avatar of mlnewsMachine Learning News·3y

    This New API Makes It Easy And Cheap For Developers To Build Machine Learning ML-Powered Apps Using Stable Diffusion

    Stable Diffusion allows developers to create Machine Learning ML-powered applications using TuneMyAI. It manages everything, including model deployment, inference, and fine-tuning, and integrates with HuggingFace. The group is developing several fresh features, such as hosted WebUIs and support for extra models like Whisper.

  21. 21
    Article
    Avatar of freecodecampfreeCodeCamp·3y

    Free College Algebra Course (with Python Code!)

    Free College Algebra Course is designed as a full one-semester college course. The course is broken up into 15 "weeks" You will learn how to implement all the Algebra concepts using the popular Python programming language. All the coding is done in Google Colaboratory notebooks.

  22. 22
    Article
    Avatar of kdnuggetsKDnuggets·2y

    25 Free Courses to Master Data Science, Data Engineering, Machine Learning, MLOps, and Generative AI

    Discover a collection of top free courses to master data science, data engineering, machine learning, MLOps, and generative AI. Enhance your skills and embark on a new career with flexible and accessible learning opportunities.

  23. 23
    Article
    Avatar of kdnuggetsKDnuggets·3y

    Forget ChatGPT, This New AI Assistant Is Leagues Ahead and Will Change the Way You Work Forever

    Poe is a chatbot service that allows you to use state of the art models like Claude +, GPT-3.5-Turbo, G PT-4, LlaMA 2, PaLM, all new LLM models. Poe offers a wider variety of AI models that provide more accurate answers than ChatGPT.

  24. 24
    Article
    Avatar of hnHacker News·3y

    LAION-AI/Open-Assistant: OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

    Open Assistant is a chat-based assistant project that aims to improve language and provide access to a large language model. It includes a chat frontend and a data collection app for improving the capabilities of the assistant.

  25. 25
    Article
    Avatar of kdnuggetsKDnuggets·3y

    5 Free Courses to Master Machine Learning

    Learn machine learning with these free courses. Gain proficiency in both theory and building models. Recommended courses cover various topics such as K-Nearest Neighbors, Naive Bayes, logistic regression, linear regression, and more.