How to Build a RAG Knowledge Base from Any Documentation Site in 5 Minutes
Scraping a documentation site usually yields messy HTML. The RAG Docs Extractor on Apify crawls any docs site and returns clean, structured markdown that can be used to build a knowledge base for retrieval-augmented generation (RAG). The process has three steps: load the extracted chunks, convert them to LangChain documents, and create a vector store from those documents. The knowledge base can then be queried efficiently using a language model.
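
The three steps can be sketched with the standard library alone. This is a minimal illustration, not the extractor's or LangChain's actual API. The chunk fields `text` and `url`, the `Document` dataclass, the `TinyVectorStore` class, and the bag-of-words `embed` function are all hypothetical stand-ins. A real pipeline would use LangChain's `Document` class, an embedding model, and a proper vector store.

```python
import json
import math
import re
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Document:
    # Stand-in with the same shape as a LangChain Document: text plus metadata.
    page_content: str
    metadata: dict = field(default_factory=dict)


def load_chunks(raw_json: str) -> list:
    # Assumed chunk schema: {"text": ..., "url": ...}; the real export may differ.
    return [
        Document(page_content=c["text"], metadata={"source": c.get("url", "")})
        for c in json.loads(raw_json)
    ]


def embed(text: str) -> Counter:
    # Toy bag-of-words vector; a real pipeline would call an embedding model.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class TinyVectorStore:
    # Hypothetical in-memory store: keeps each document next to its vector.
    def __init__(self, docs):
        self.entries = [(doc, embed(doc.page_content)) for doc in docs]

    def similarity_search(self, query: str, k: int = 1) -> list:
        # Rank stored documents by similarity to the query, best first.
        q = embed(query)
        ranked = sorted(self.entries, key=lambda e: cosine(q, e[1]), reverse=True)
        return [doc for doc, _ in ranked[:k]]


# Made-up sample standing in for the extractor's exported chunks.
SAMPLE_EXPORT = json.dumps([
    {"text": "Install the client with pip install example-client.",
     "url": "https://docs.example.com/install"},
    {"text": "Authenticate by passing an API token in the request header.",
     "url": "https://docs.example.com/auth"},
])

docs = load_chunks(SAMPLE_EXPORT)          # step 1 and 2: load and convert
store = TinyVectorStore(docs)              # step 3: build the vector store
top = store.similarity_search("How do I authenticate with a token?", k=1)
print(top[0].metadata["source"])
```

In a full RAG setup, the retrieved `page_content` would be passed to a language model as context for answering the question. Keeping the source URL in `metadata` lets the answer cite the page it came from.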