Skip to content

GabPrato/DailyArXiv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Filters the abstracts of all arXiv papers released on the specified date using the specified keywords. E.g.:

python -m DailyArXiv --subjects="cs.AI" --required-keywords="LLM"

Requirements

  • requests
  • beautifulsoup4
  • rich

Installation

Download the repo:

git clone https://github.com/GabPrato/DailyArXiv.git

Install required packages:

pip install -r DailyArXiv/requirements.txt

Instructions

View command arguments:

$ python -m DailyArXiv --help
usage: DailyArXiv [-h] [--subjects SUBJECTS] [--date DATE] [--required-keywords REQUIRED_KEYWORDS] [--any-keywords ANY_KEYWORDS]

Searches all arXiv papers in the chosen subjects, released on the specified date, and containing the given keywords in the abstract.
If both --required_keywords and --any_keywords are NOT specified, all papers will be selected. If both are specified,
the search will select papers where the abstract contains all of the keywords from --required-keywords and at least one keyword from --any-keyword.

options:
  -h, --help            show this help message and exit
  --subjects SUBJECTS   List of comma-separated arXiv subjects, where papers must belong to at least one of the specified subjects.
                        View the list at https://arxiv.org/. Searches all Artificial Intelligence (cs.AI) and Machine Learning (cs.LG) papers by default.
  --date DATE           Specify the date in YYYY-MM-DD format. Default is your system's current date.
  --required-keywords REQUIRED_KEYWORDS
                        List of comma separated keywords (e.g., 'quantization,4-bit') that must all appear in the abstract.
                        Keywords are case-insensitive. List is empty by default.
  --any-keywords ANY_KEYWORDS
                        List of comma separated keywords (e.g., 'LLM,language,model'), where a paper will be selected if any of the
                        keywords in this list appear in the abstract. Keywords are case-insensitive. List is empty by default.

Examples

By default, lists all machine learning (cs.LG) and AI (cs.AI) arXiv releases of today:

python -m DailyArXiv

Specify the subject, e.g., all Classical Physics and General Physics:

python -m DailyArXiv --subjects="physics.class-ph,physics.gen-ph"

Specify the date:

python -m DailyArXiv --date="2023-12-04"

Search for papers where the abstract contains LLM and NLP. Note that keywords are case-insensitive.

python -m DailyArXiv --required-keywords="LLM,NLP"

Search for papers where the abstract contains either LLM or NLP:

python -m DailyArXiv --any-keywords="LLM,NLP"

Search for papers where the abstract contains LLM and either NLP or quantization:

python -m DailyArXiv --required-keywords="LLM" --any-keywords="NLP,quantization"

About

Search daily arXiv releases.

Resources

License

Stars

Watchers

Forks

Languages