Title
Arxiv Sanity Preserver
Description
Self-hosted literature-monitoring service for tracking and curating arXiv research, built to shorten the loop between discovery and experimentation with fast search, filtering, and similarity-based recommendations.
Catalogue
>10k papers
Embedding
SVM
Developed Description
Originally inspired by Karpathy’s arxiv-sanity, this version modernizes the experience and repositions it as a daily and personal research cockpit rather than a social paper browser. The system ingests arXiv metadata and PDFs, extracts text, builds TF‑IDF vectors, and computes similarity using an SVM-based approach for personalized and paper-to-paper recommendations. It is designed to manage 10k+ papers with non-noticeable latency for search and similarity browsing while remaining fully usable from the UI (search, filter, library, and discovery flows without CLI reliance). Compared to Karpathy’s original, this project adds a fresh and modern UI, expanded research filters, and UI-driven workflows for triage and organization. It also introduces “Hot/Slop” paper classification (based on citation signals).