Best of NLP — April 2024
- 1
- 2
freeCodeCamp·2y
Mastering RAG from Scratch
Learn how to implement Retrieval-Augmented Generation (RAG) from scratch with an in-depth course on the freeCodeCamp.org YouTube channel. RAG combines retrieval systems with advanced natural language generation and is valuable in chatbot development and other fields.
- 3
- 4
Community Picks·2y
lm-sys/FastChat: An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
FastChat is an open platform for training, serving, and evaluating large language model based chatbots. It powers Chatbot Arena, a platform for LLM battles and has released several models such as Vicuna and LongChat. The repo provides instructions on installation, model weights, inference, serving, evaluation, and more.
- 5
- 6
Community Picks·2y
TransformersPHP
TransformersPHP is a toolkit for PHP developers to easily add machine learning capabilities to their projects by using pre-trained models. It utilizes the ONNX Runtime to run these models locally on the server. The project was inspired by the Xenova/transformers project for JavaScript.
- 7
- 8
Community Picks·2y
Large language models, explained with a minimum of math and jargon
This post explains how large language models work, including how they represent words using vectors, how they predict the next word, and how they are trained. It also discusses the surprising performance of GPT-3 on tasks requiring high-level reasoning and its potential to understand meanings of words.
- 9
KDnuggets·2y
Semantic Search with Vector Databases
This post discusses semantic search and how to use a Vector Database with Python. Semantic search considers the meaning of words and sentences, using numerical vectors to capture that information. Weaviate is an open-source Vector Database used for storing and searching text data. The Python implementation demonstrates the process of semantic search using Weaviate.
- 10
Medium·2y
Mastering the Art of Data: Python Code Snippets to Elevate Your Skills
Master the art of data with Python and enhance your data skills using powerful Python code snippets. From data engineering and data science to data visualization and machine learning, this article covers a wide range of data-related tasks. Explore essential Python code snippets, tips, and tricks to streamline workflows, uncover insights, and deliver high-quality data solutions. Whether you're a beginner or an experienced practitioner, this article serves as a gateway to mastering data with Python.
- 11
AI in Plain English·2y
LLaMA3: A New Era in Large Language Models
LLaMA3 is a powerful AI tool that represents a significant step forward in large language models. It aims to democratize access to state-of-the-art language models and has the potential to contribute to the development of Artificial General Intelligence (AGI).
- 12
- 13
- 14
Community Picks·2y
Four Data Cleaning Techniques to Improve Large Language Model (LLM) Performance
This post explores four common natural language processing techniques to clean text before ingestion in large language models. It highlights the importance of data cleaning to ensure accuracy, improve quality, and facilitate analysis. The post also discusses the use of retrieval-augmented generation (RAG) in enhancing the performance of large language models.