Chi-Scraper
This tool enables you to take control of your experience on ArXiv, showing you only the papers that you are interested in.
Put simply, it is a powerful filtering system, combined with a LLM powered ranking algorithm. This means you only see what is relevant to you, and what is most relevant first.
How To Use
Collect Articles
python ./main.py -s -c <config name> -lb <number of days to seach>
Rank Articles
python ./main.py -r
Display Articles
While you can browse the generated markdown files in your favourite markdown editor. E.g. Obsidian. You can also browse them in our web app.
python ./main -w
See an example output here. (Note: the filter, sort, show and settings functionality won't work on the online example.)
Do it all!
You can combine the flags to do all three.
python ./main -s -r -w -c <configname> -lb <lookback days> -sk
The -sk
flag will run the entire program without prompting you for further input. It defaults to not modifying any entries on disk in the case of a duplicate.
How To Install
For detailed install instructions, go to the guide on the side bar. For quick-start.
git pull https://github.com/ChiScraper/ChiScraper.git
pip install -r requirements
python ./ChiScraper