Hope everyone has read my previous article about deploying Local or Fine-tuned LLMs in FastAPI and achieve streaming response in the same. However, I have received few requests on how to extend the…

Medium_JS is a curated collection of insights and tutorials on JavaScript development, designed to help developers stay informed and inspired in the ever-evolving world of web development. By featuring a selection of high-quality articles, tutorials, and expert opinions from the JavaScript community, Medium_JS offers  guidance on mastering JavaScript language features, exploring modern frameworks and libraries, and solving common development challenges. Whether you're a frontend developer, a full-stack engineer, or an aspiring JavaScript enthusiast, Medium_JS provides a  knowledge and resources to fuel your JavaScript journey.

Medium

This article explains how to extend the concept of deploying Local or Fine-tuned LLMs in FastAPI to closed source models of OpenAI, Google, etc using Langchain. It covers the architecture of Langchain, the main functions of interest, and provides next steps for improvement.

Streaming Responses from LLM using Langchain + FastAPI

Now that we have finished, lets test out our streamer