Best of Vector Search2024

  1. 1
    Article
    Avatar of freecodecampfreeCodeCamp·2y

    Learn Generative AI in 30 Hours

    Learn Generative AI through a comprehensive course on the freeCodeCamp.org YouTube channel. The course covers various aspects of generative AI and its applications, including frameworks and models. It includes hands-on projects and is taught by leading experts in the field.

  2. 2
    Article
    Avatar of communityCommunity Picks·2y

    The State of Data Engineering 2024

    The 2024 State of Data Engineering report discusses the influence of GenAI on software infrastructure, the expansion of product offerings due to the economic downturn, and the impact of open table formats and their catalogs in the data lake industry. It also highlights the importance of data version control and observability in AI/ML systems.

  3. 3
    Article
    Avatar of gopenaiGoPenAI·2y

    Build an Advanced RAG App: Query Routing

    This post explores how to build an advanced RAG application using a technique called Query Routing. Query Routing enables the application to make decisions based on a user's query, selecting the most appropriate action from predefined choices such as retrieving context from multiple data sources, using different indexes, or performing a web search. Various types of Query Routers are discussed, including LLM Selector Router, LLM Function Calling Router, Semantic Router, and more. Example implementations demonstrate how to create Query Routers and enhance the decision-making capabilities of RAG applications.

  4. 4
    Article
    Avatar of gopenaiGoPenAI·2y

    Anthropic’s New RAG Approach

    LLMs excel at general tasks but struggle with specialized domains. Fine-tuning enhances their performance in targeted areas, but it's complex and costly. Retrieval-Augmented Generation (RAG) offers a solution by connecting LLMs directly to knowledge bases, enabling domain-specific data retrieval without extensive retraining. Techniques like Contextual Retrieval and BM25 integration improve accuracy by situating chunks within their full context. This approach balances semantic understanding with traditional keyword search, addressing challenges like incomplete responses.

  5. 5
    Video
    Avatar of TechWithTimTech With Tim·2y

    Build a Full RAG Based AI App Including Deployment In Under 20 Minutes

    Learn how to build a full RAG-based AI app using Vex, a platform that allows you to quickly build and deploy applications. The app demonstrated in the post is a travel assistant that provides personalized recommendations based on travel plans and preferences.

  6. 6
    Article
    Avatar of weaviateWeaviate·2y

    Vector Search Explained

    Vector search leverages numerical representations called vector embeddings to retrieve similar items based on semantic meaning rather than exact text matches. It is used in modern AI applications, such as image retrieval, recommendation systems, and search engines. Implementing vector search involves converting data and query into vector embeddings and calculating their similarity using distance metrics like cosine similarity. Vector databases and Approximate Nearest Neighbor algorithms significantly speed up search times, making them suitable for large-scale datasets. Use cases include search systems, recommendation systems, and Retrieval Augmented Generation (RAG).

  7. 7
    Article
    Avatar of dailydoseofdsDaily Dose of Data Science | Avi Chawla | Substack·1y

    Building a RAG app using Llama-3.3

    Meta released Llama-3.3, and this post provides a hands-on demo for building a RAG app using it. The app allows users to interact with a document via chat. It uses LlamaIndex for orchestration, Qdrant for a self-hosted vector database, and Ollama for serving Llama-3.3 locally. The implementation steps include loading and parsing a knowledge base, creating embeddings, indexing and storing them, defining a custom prompt template, and setting up a query engine.

  8. 8
    Article
    Avatar of neo4jneo4j·2y

    Graph vs. Vector RAG — Benchmarking, Optimization Levers, and a Financial Analysis Example

    Exploring the use of graph and vector search in retrieval-augmented generation (RAG) systems, focusing on their application in financial analysis. Discusses the differences between graph and vector search, optimization levers for graph search, and the combination of both methods in RAG. Highlights the benefits of graph databases for modeling complex relationships and dependencies, as well as the limitations and complementarity of vector search. Demonstrates the application of graph and vector search in a financial report RAG example.

  9. 9
    Video
    Avatar of TechWithTimTech With Tim·2y

    Advanced Multi-Agent AI App Walkthrough (Python, Langflow, Streamlit & More!)

    This post provides a walkthrough on building an advanced multi-agent AI application using Python, Langflow, Streamlit, and other tools. The application can handle multiple tasks with different language models and integrates a full front end for interaction. Key technologies include Langflow for low-code AI flows, Streamlit for front-end development, and Astrab for a vector database to implement retrieval augmented generation features. The tutorial offers a comprehensive guide on setting up the application, integrating AI features, customizing flow, and connecting with different tools and APIs.

  10. 10
    Article
    Avatar of communityCommunity Picks·2y

    Building an Advanced RAG System With Self-Querying Retrieval

    Learn how to build an advanced Retrieval Augmented Generation (RAG) system that leverages self-querying retrieval to improve search relevance. This tutorial covers extracting metadata filters from natural language queries, combining metadata filtering with vector search, and generating structured outputs using LLMs. The guide focuses on developing an investment assistant to answer financial questions using MongoDB as the vector store and LangGraph for orchestration.

  11. 11
    Article
    Avatar of mongoMongoDB·2y

    MongoDB AI Course in Partnership with Andrew Ng and DeepLearning.AI

    MongoDB has partnered with Andrew Ng and DeepLearning.AI to launch a course focused on bridging database technology with modern AI applications. Titled 'Prompt Compression and Query Optimization,' this course covers Retrieval Augmented Generation (RAG) and uses MongoDB Atlas Vector Search to teach developers how to build efficient AI applications. The course also introduces techniques like prompt compression to reduce operational costs and showcases MongoDB's flexible document model ideal for AI data handling. It's tailored for developers interested in vector search, RAG applications, and AI optimization strategies.

  12. 12
    Article
    Avatar of communityCommunity Picks·2y

    A Broke B**ch’s Guide to Tech Start-up: Choosing Vector Database — Part 1 Self-Hosted

    Vector databases are crucial for GenAI applications, offering augmented knowledge bases for language models with support for fuzzy searching using text or media embeddings. The post evaluates various self-hosted vector databases like MongoDB, ChromaDB, Weaviate, Milvus, Neo4j, KDB.AI, PostgreSQL, and SQLite. Recommendations include using Docker for ease of setup and highlighting the benefits and limitations of each option. The guide emphasizes starting with self-hosted instances to control costs while prototyping and suggests evaluating multiple databases to find the optimal setup for your application.

  13. 13
    Article
    Avatar of communityCommunity Picks·2y

    Full-text search vs vector search

    Explore the pros and cons of full-text search and vector search, and discover how they can be combined in hybrid search to create fast and relevant search experiences.

  14. 14
    Article
    Avatar of medium_jsMedium·2y

    How to Build RAG Applications with Pinecone Serverless, OpenAI, Langchain and Python

    Learn how to build RAG applications using Pinecone Serverless, OpenAI, Langchain, and Python. Discover why Pinecone is a preferred vector database, and follow a step-by-step guide on building RAG apps. Upsert data to the vector database and query it to retrieve relevant information.

  15. 15
    Article
    Avatar of aiplainenglishAI in Plain English·2y

    Your guide to Vector Databases

    This post provides a guide to understanding Vector Databases, including their definition, functionality, and applications. It also highlights the features of a good Vector Database and offers best practices for choosing the right one. The post includes a list of over 80 Vector Databases for AI projects.

  16. 16
    Article
    Avatar of infoworldInfoWorld·2y

    Using PostgreSQL as a vector database in RAG

    Learn how to build a local retrieval-augmented generation (RAG) application using PostgreSQL with the pgvector extension, Ollama, and the Llama 3 large language model. This guide describes how Postgres can store both vector and tabular data, making it a versatile option for medium-sized RAG applications. It covers setting up a vector database, ingesting text from multiple sources, conducting similarity searches, and querying a large language model to generate answers. Practical coding examples and step-by-step instructions are provided to help developers get started quickly.

  17. 17
    Article
    Avatar of rubylaRUBYLAND·2y

    Building a Personal RAG Application for PDF-Based Question Answering

    Learn about building a Retrieval-Augmented Generation (RAG) application for PDF-based question answering using LLMs, embedding models, and vector databases. This guide utilizes Meta's Llama3, Qdrant VectorDB, and Llama Index for embedding with Python, providing a way to interact with PDF content through natural, conversational queries.

  18. 18
    Article
    Avatar of gopenaiGoPenAI·2y

    Introduction to Retrieval-Augmented Generation (RAG): A Beginner’s Guide

    Introduction to Retrieval-Augmented Generation (RAG): A Beginner's Guide. RAG combines retrieval and generative AI techniques to ensure accurate and meaningful responses. The RAG process involves document ingestion, retrieval, and response generation. RAG systems provide precise and top-notch text responses, elevating the performance of AI applications.

  19. 19
    Article
    Avatar of redislabsRedis·2y

    Get better RAG responses with Ragas

    Many teams struggle to measure the performance of their Retrieval Augmented Generation (RAG) applications due to issues like hallucinations and incorrect data. A metrics-driven approach, using frameworks like RAG Assessment (Ragas), can improve evaluation by quantitatively measuring faithfulness, answer relevancy, context precision, and context recall. This ensures engineers can optimize their systems without relying on anecdotal evidence. Using tools like OpenAI, LangChain, and Redis, developers can establish and test baseline metrics efficiently.

  20. 20
    Article
    Avatar of baeldungBaeldung·2y

    Building a RAG App Using MongoDB and Spring AI

    Learn how to build a Retrieval-Augmented Generation (RAG) Wiki application using MongoDB and Spring AI. The tutorial details setting up MongoDB Atlas Vector Search for storing documents, adding necessary dependencies, and configuring the application to save and retrieve documents based on context. The application leverages a vector store for similarity search and utilizes LLM for generating responses, making it suitable for developing chatbots, automated wikis, and search engines.

  21. 21
    Article
    Avatar of taiTowards AI·2y

    What are Vector Databases?

    Vector databases are designed specifically for storing vector embeddings and enhance operations such as semantic search, recommendation engines, and advanced AI/ML processes. They leverage similarity metrics for data retrieval and offer benefits like scalability, flexibility, ultra-low latency, high performance, and optimized storage and memory.

  22. 22
    Article
    Avatar of tdsTowards Data Science·2y

    Scaling RAG from POC to Production

    Retrieval Augmented Generation (RAG) is becoming a key architecture for large-scale applications of AI, balancing the capabilities of large language models with the accuracy of indexed data. Scaling from a proof of concept (POC) to production presents multiple challenges, including performance, data management, and risk mitigation. Addressing these challenges involves architectural components such as scalable vector databases, caching mechanisms, advanced search techniques, and a Responsible AI layer. Strategic planning and integration into existing workflows are crucial for successful scaling.

  23. 23
    Article
    Avatar of communityCommunity Picks·2y

    LLM app dev using AWS Bedrock and Langchain

    The post explains how to develop applications using Large Language Models (LLMs) with Amazon Bedrock and Langchain to perform tasks like Question Answering over large document corpora. It introduces the concept of retrieval-augmented generation (RAG), which uses document processing and vector embedding to fetch relevant document chunks for question answering. The process includes setting up LLM and embedding models, loading and splitting documents into chunks, creating a vector database using SingleStoreDB, and performing similarity searches to generate context-aware responses.

  24. 24
    Article
    Avatar of communityCommunity Picks·2y

    Semantic search using OpenAI, pg_embedding and Neon

    Learn how to build a semantic search app using OpenAI, Neon, and pg_embedding. The app transforms user queries into vector embeddings to perform vector similarity searches, retrieving the most relevant results based on meaning instead of keyword matches. The methodology includes generating embeddings, storing them in a Postgres database using pg_embedding, and retrieving similar items through vector similarity search. Step-by-step instructions and code are included for building the app, from gathering data to deploying the frontend and API.

  25. 25
    Article
    Avatar of taiTowards AI·2y

    Prompt Like a Pro Using DSPy: A Guide to Build a Better Local RAG Model using DSPy, Qdrant, and Ollama

    Manual prompting in AI is becoming obsolete, with DSPy offering a revolutionary approach to optimize language model (LM) prompts algorithmically. DSPy uses signatures, modules, metrics, and optimizers to attain consistent and reproducible results across different LMs. This guide details the step-by-step process of integrating DSPy with Qdrant for vector databases and Ollama for local LLM deployments. Highlights include dataset loading, creating a vector database, and implementing a Chain of Thought Reasoning with a RAG model.