A modular research framework for large‑scale text analysis: it ingests heterogeneous corpora, applies multilingual preprocessing, computes dense embeddings, performs similarity‑based clustering, creates LLM‑driven abstractive summaries, and continuously updates a temporal knowledge‑graph that captures topic evolution and inter‑document relations.
-
Updated
Oct 19, 2025 - Python