- Add support to optionally use ReadabiliPy, a wrapper around Mozilla's Readability.js, to improve text extraction of web articles. This closes #53, thanks to @sirupsen for reporting the problem.
- Improve NeurIPS provider to add support for papers.neurips.cc
- Fix OpenReview provider after site change
- Add provider for SagePub
- Improve robustness of Springer provider
- Add provider for SemanticScholar papers
- Fix bug that caused the `no_crop` option to stop working
- Increase robustness to PDF issues by passing through GhostScript (fixes #51). Thanks to @sirupsen.
- Bugfix for code that removes arXiv stamp.
- Bugfix to url validation: allow underscore in subdomains.
- Corrections to code that removes the arXiv stamp (#49). Thanks to @mr-ubik.
- Further fixes for images in HTML sources (#45). Thanks to @sirupsen.
- Properly resolve image urls in HTML sources (#45). Thanks to @sirupsen.
- Allow `+` in urls
- Print to log whether removing arXiv stamp was successful.
- Fix bug where the pdf tool was not correctly detected (#42).
- Bugfix that makes removing the arXiv stamp more robust.
- The Dockerfile has been updated to use a more recent version of Cairo (#35). Thanks to @ClaytonJY.
- We've added support for optionally using qpdf instead of pdftk (#36). Thanks to @delaere.
- Resolving redirects has been improved, which solves an issue for the Springer provider (#38) and an issue with some arXiv urls (#39).
- Unit tests were added for the provider selection.
- The code that removes the arXiv stamp has been improved (#40).
- Tracebacks have been disabled outside of debug mode, showing clearer errors (#41).
- Be more robust against missing pdftoppm executable.
- Fix bug for when the shrink operation returns bigger files (#33).
- Add the option to not crop the file at all (#28).
- Add the option to right-align the file so the menu doesn't overlap (#28).
- Bugfix for validation for the JMLR provider
- Significantly speed up the program (#26)
- Add provider for JMLR (#28).
- Bugfix for creating nested directories with the `-p` option.
- Add provider for US National Bureau of Economic Research (#27).
- Automatically extract the filename from a pdf url where possible (#25).
- Speed up centering of pdfs by removing unnecessary cropping operation.
- Improve robustness against missing metadata, remove spaces in author names, and other minor improvements.
- Automatically detect when an HTML source is provided (#24)
- Add support for articles from the web using the `--html` flag (#23)
- Add `--version` command to command line interface
- Fix cropping bug that resulted in occasional rotated pages
- Add support for older arXiv URL scheme
- Add logging of long running crop/center operations
- Keep cookies during requests
- Add wait for CiteSeerX provider
- Make determining the provider more robust (issue #21)
- Bugfix for creating a directory on the reMarkable (#20)
- Add provider for CiteSeerX
- Update provider for ACM (website redesign)
- Properly use exceptions when errors occur
- Add provider for Proceedings of Machine Learning Research (#8)
- Add provider for NeurIPS papers (#12)
- Add support for alternative arXiv URLs
- Always run all redirects on specified urls before choosing provider
- Refactor code to make it a real Python package
- Rename to `paper2remarkable`