LibreQoS is an open-source project for monitoring and ensuring quality-of-experience for ISPs by tracking individual data flows. To make the data understandable, ASN (Autonomous System Numbers) needs categorization, which is automated using Rust and large language models (LLMs). The post explains how to obtain ASN data, load and deduplicate it, set up a local LLM, and categorize data using context scraped from associated websites. The process involves using crates like Serde, CSV, Itertools, Reqwest, and Scraper for efficient data handling, and leveraging Tokio for parallel processing to speed up the categorization task.
Table of contents
IntroductionObtaining the ASN DataLoading the DataSetting Up a Local LLMTalking to the LLMCategorizing the Data: First Try - Oneshot!Adding ContextLet’s Add Some Performance!Conclusion1 Comment
Sort: