📚 Documents: paperdl.readthedocs.io
- 2026-05-30:
A simple and extensible toolkit for searching, organizing, and downloading academic papers from specific websites.
If this project helps your research workflow, please consider giving it a star ⭐. Your support helps more people discover the project and motivates future improvements.
This repository is intended for lawful, educational, academic, and research-related purposes only, such as learning Python, exploring academic paper search workflows, and assisting non-profit research or study.
Users are solely responsible for ensuring that their use of this project complies with applicable laws, website terms of service, copyright rules, publisher policies, institutional requirements, and third-party rights. This project must not be used for illegal purposes, copyright infringement, unauthorized access, abusive downloading, or any activity that may harm authors, publishers, platforms, or institutions.
This project is released under the Apache License 2.0. The authors and contributors provide no warranty, commercial authorization, indemnity, or liability commitment beyond the license terms, and are not responsible for any misuse or consequences arising from the use, modification, redistribution, or commercial application of this project.
| Client | Description | 🔎 Search | ⬇️ Download | Code Snippet |
|---|---|---|---|---|
| ArxivPaperClient | arXiv preprint search and PDF download. arXiv 预印本论文搜索与 PDF 下载。 |
✅ | ✅ | arxiv_paper_client.py |
| OpenReviewPaperClient | OpenReview paper search and PDF download, especially for conference submissions and reviews. OpenReview 论文搜索与 PDF 下载,适合会议投稿与评审数据。 |
✅ | ✅ | openreview_paper_client.py |
| ACLAnthologyPaperClient | ACL Anthology paper search and PDF download for NLP and computational linguistics papers. ACL Anthology 论文搜索与 PDF 下载,主要面向 NLP 和计算语言学论文。 |
✅ | ✅ | acl_anthology_paper_client.py |
| BioRxivPaperClient | bioRxiv preprint search and PDF download for biology-related papers. bioRxiv 生物学预印本论文搜索与 PDF 下载。 |
✅ | ✅ | biorxiv_paper_client.py |
| MedRxivPaperClient | medRxiv preprint search and PDF download for medical and health science papers. medRxiv 医学与健康科学预印本论文搜索与 PDF 下载。 |
✅ | ✅ | biorxiv_paper_client.py |
| PMLRPaperClient | PMLR paper search and PDF download for machine learning proceedings. PMLR 机器学习会议论文集搜索与 PDF 下载。 |
✅ | ✅ | pmlr_paper_client.py |
| PMCOAPaperClient | PubMed Central Open Access paper search and PDF download. PubMed Central 开放获取论文搜索与 PDF 下载。 |
✅ | ✅ | pmc_oa_paper_client.py |
Paperdl requires Python 3.10+. Using a virtual environment is recommended to avoid dependency conflicts.
Install from PyPI:
python -m pip install -U paperdlOr install the latest version from GitHub:
python -m pip install -U git+https://github.com/CharlesPikachu/paperdl.git@mainFor local development:
git clone https://github.com/CharlesPikachu/paperdl.git
cd paperdl
python -m pip install -e .Most paper clients work without browser dependencies. However, some bioRxiv / medRxiv PDF downloads may require the optional Playwright-based browser fallback.
Install with browser support:
python -m pip install -U "paperdl[browser]"
python -m playwright install chromiumFor local development with browser support:
python -m pip install -e ".[browser]"
python -m playwright install chromiumOn some Linux servers, Playwright may also require system dependencies:
python -m playwright install-deps chromium| Project | ⭐ Stars | 📦 Version | ⏱ Last Update | 🛠 Repository |
|---|---|---|---|---|
| 🎵 Musicdl 轻量级无损音乐下载器 |
🛠 Repository | |||
| 🎬 Videodl 轻量级高清无水印视频下载器 |
🛠 Repository | |||
| 🖼️ Imagedl 轻量级海量图片搜索下载器 |
🛠 Repository | |||
| 🖼️ Paperdl 轻量级学术论文搜索下载器 |
🛠 Repository | |||
| 🌐 FreeProxy 全球海量高质量免费代理采集器 |
🛠 Repository | |||
| 🌐 MusicSquare 简易音乐搜索下载和播放网页 |
🛠 Repository | |||
| 🌐 FreeGPTHub 真正免费的GPT统一接口 |
🛠 Repository |
If you use this project in your research, please cite the repository.
@misc{musicdl2020,
author = {Zhenchao Jin},
title = {Paperdl: A Unified Asynchronous Framework for Scholarly Paper Search and Download},
year = {2022},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/CharlesPikachu/paperdl}},
}
| WeChat Appreciation QR Code (微信赞赏码) | Alipay Appreciation QR Code (支付宝赞赏码) |
|---|---|
![]() |
![]() |



