Allycat

⬆️ open-source/

Allycat is an end to end open-source RAG pipeline for website content. It can

  • scape websites
  • clean up content
  • index content, vectorize and store them in a vector database
  • and a UI for queries.

The entire stack is open source. It supports LLMs and embedding models running locally or using an inference service.

repo
GitHub stars GitHub forks

Here is the architecture of Allycat.

Talks / Workshops Using Allycat

2025-Nov: Allycat workshop at QConSF
session details

2025-Nov: Workshop at Tech Equity AI Summit - Fall 2025

Pics