// project 05

Arxiv Sanity Preserver

Overview

Self-hosted literature-monitoring service for tracking and curating arXiv research, built to shorten the loop between discovery and experimentation with fast search, filtering, and similarity-based recommendations.

Open Source · Rust · HTML · Recommendation System

Originally inspired by Karpathy's arxiv-sanity, this version modernizes the experience and repositions it as a personal, daily research cockpit rather than a social paper browser.

The system ingests arXiv metadata and PDFs, extracts text, builds TF-IDF vectors, and computes similarity with an SVM-based approach for both personalized and paper-to-paper recommendations. It is designed to handle 10k+ papers with no noticeable latency in search and similarity browsing, and every flow (search, filtering, library, and discovery) is available from the UI without relying on the CLI.
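To make the vectorization step concrete, here is a minimal sketch of TF-IDF over extracted text using only the Rust standard library. It is not the project's code. It ranks papers by plain cosine similarity, a simpler stand-in for the SVM-based ranking described above, which is not shown. All names (`tokenize`, `tfidf`, `cosine`, `most_similar`) and the smoothed IDF formula are assumptions chosen for illustration.

```rust
use std::collections::{HashMap, HashSet};

/// Sparse TF-IDF vector: term -> weight. (Hypothetical type for this sketch.)
type SparseVec = HashMap<String, f64>;

/// Lowercase the text and split it on non-alphanumeric characters.
fn tokenize(text: &str) -> Vec<String> {
    text.to_lowercase()
        .split(|c: char| !c.is_alphanumeric())
        .filter(|t| !t.is_empty())
        .map(str::to_string)
        .collect()
}

/// Build one L2-normalized TF-IDF vector per document.
fn tfidf(docs: &[&str]) -> Vec<SparseVec> {
    let tokenized: Vec<Vec<String>> = docs.iter().map(|d| tokenize(d)).collect();
    let n = docs.len() as f64;

    // Document frequency: number of documents that contain each term.
    let mut df: HashMap<&str, f64> = HashMap::new();
    for tokens in &tokenized {
        let unique: HashSet<&str> = tokens.iter().map(String::as_str).collect();
        for term in unique {
            *df.entry(term).or_insert(0.0) += 1.0;
        }
    }

    tokenized
        .iter()
        .map(|tokens| {
            let mut v: SparseVec = HashMap::new();
            for t in tokens {
                *v.entry(t.clone()).or_insert(0.0) += 1.0; // raw term count
            }
            for (term, w) in v.iter_mut() {
                // Assumed smoothed IDF: ln((1 + n) / (1 + df)) + 1.
                let idf = ((1.0 + n) / (1.0 + df[term.as_str()])).ln() + 1.0;
                *w *= idf;
            }
            // L2-normalize so that a dot product equals cosine similarity.
            let norm = v.values().map(|w| w * w).sum::<f64>().sqrt();
            if norm > 0.0 {
                for w in v.values_mut() {
                    *w /= norm;
                }
            }
            v
        })
        .collect()
}

/// Dot product of two L2-normalized sparse vectors (their cosine similarity).
fn cosine(a: &SparseVec, b: &SparseVec) -> f64 {
    a.iter()
        .map(|(term, w)| w * b.get(term).copied().unwrap_or(0.0))
        .sum()
}

/// Rank every other document by similarity to `query`, most similar first.
fn most_similar(vectors: &[SparseVec], query: usize) -> Vec<(usize, f64)> {
    let mut scored: Vec<(usize, f64)> = vectors
        .iter()
        .enumerate()
        .filter(|(i, _)| *i != query)
        .map(|(i, v)| (i, cosine(&vectors[query], v)))
        .collect();
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored
}
```

Calling `tfidf` on a slice of paper texts and then `most_similar(&vectors, i)` returns `(index, score)` pairs for paper-to-paper browsing. An SVM-based variant would instead train a linear classifier with the query paper (or a user's saved papers) as positives and rank the remaining papers by decision score.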

Compared to Karpathy's original, this project adds a modernized UI, expanded research filters, and UI-driven workflows for triage and organization. It also introduces a "Hot/Slop" paper classification based on citation signals.