Releases: blackadad/paper-scraper
Releases · blackadad/paper-scraper
v1.8.0
What's Changed
- Fixed invalid method - status_code by @blackadad in #60
- Better stack trace for bibtex link failure by @jamesbraza in #63
- Adding
mypy
to type check by @jamesbraza in #61 - Adding
pytest-timeout
for deadlocks by @jamesbraza in #64 - Fixing
Test7.test_custom_scraper
by @jamesbraza in #67 - Expanding
find_doi
to account for more papers by @jamesbraza in #68 - Adding
dev
extra by @jamesbraza in #65 - Multiple
ThrottledClientSession
fixes by @jamesbraza in #69 - Fixing misleading failure message in
pmc_to_pdf
by @jamesbraza in #71 - Suppressing a
latexcodec
warning by @jamesbraza in #70 - Fixing misleading failure message in
find_pmc_pdf_link
by @jamesbraza in #72 - Exposing
encode_id
function by @jamesbraza in #62 - More test cases for
find_doi
by @jamesbraza in #73 - Fixing ChemRxiv in
find_doi
by @jamesbraza in #74 - Fixing URL encodings and bioRxiv in
find_doi
by @jamesbraza in #75 - Fixing
rate_limit
being a requirement for backoff by @jamesbraza in #76 - Exposing Google Scholar preprocessing by @jamesbraza in #78
- Defining
RateLimits
for DRY code by @jamesbraza in #77 preprocess_google_scholar_metadata
not requiring"link"
by @jamesbraza in #80- Fixing
format_bibtex
crash on BibTeX without author or title by @jamesbraza in #79 - Service limit reached message in backoff by @jamesbraza in #81
- More checks on serpapi return values by @blackadad in #82
- Adds more tests for reconciling DOIs by @blackadad in #83
- More links for OA by @blackadad in #84
- Increased backoff by 10x by @blackadad in #85
Full Changelog: v1.7.0...v1.8.0
v1.7.0
What's Changed
- Cleaning up
a_search
logic around SERP by @jamesbraza in #39 - Exposing Semantic Scholar fields by @jamesbraza in #40
- Adding
InvalidURL
tolink_to_pdf
's internals by @jamesbraza in #41 - Allowing
check_pdf
to accept aLogger
by @jamesbraza in #43 - Removed
url
locals by @jamesbraza in #42 - Adding more scrapers by @blackadad in #45
- Exposing paper search and paper recommendations URLs by @jamesbraza in #44
- Fixing variable confusion in
response
vsresp
by @jamesbraza in #46 - Created a scaper that does not rely on S2 by @blackadad in #47
- Documenting
test_search_papers_year
by @jamesbraza in #49 - Fixing incorrect session for Semantic Scholar no-year by @jamesbraza in #50
- Fixing
paths.update
early parenthesis by @jamesbraza in #52 - Fixing
a_search_papers
more than 20 "google" search results by @jamesbraza in #53 - Bumped a bugfix version to 1.6.1 by @jamesbraza in #54
functools.partial
instead of closure by @jamesbraza in #55- Allow single results by @mskarlin in #57
- Various improvements to speed and errors by @blackadad in #58
- Added another check for arxiv from DOI by @blackadad in #59
New Contributors
Full Changelog: v1.4.0...v1.7.0
v1.4.0
What's Changed
- Semantic scholar failure message readability by @jamesbraza in #29
- Handling when
{"openAccessPdf": None}
withopenaccess_scraper
by @jamesbraza in #28 - Newer
black
and ran on all files by @jamesbraza in #30 - Addition of
ruff
,codespell
,toml-sort
by @jamesbraza in #31 - Decomposing
parse_semantic_scholar_metadata
and exposingdefault_scraper
callback by @jamesbraza in #32 - Introducing basic CI pipeline by @jamesbraza in #33
- Fixing erroneous parenthesis from #32 by @jamesbraza in #34
- Encapsulating scraping from
a_search_papers
toScrape
itself by @jamesbraza in #36 - Type hinted the scrapers, using
r.ok
over200
comparison by @jamesbraza in #37 - Migrating from
setup.py
topyproject.toml
by @jamesbraza in #35 - Fixing CI after #35 by @jamesbraza in #38
Full Changelog: v1.3.2...v1.4.0
v1.3.2
v1.3.1
v1.3.0
What's Changed
- Gather S2 Calls in Reconcile by @blackadad in #25
- Added
py.typed
metadata file by @jamesbraza in #24
New Contributors
- @jamesbraza made their first contribution in #24
Full Changelog: v1.2.1...v1.3.0
v1.2.1
v1.2.0
v1.1.3
What's Changed
- simplified s2 fetch from google scholar by @blackadad in #21
Full Changelog: v1.1.2...v1.1.3
v1.1.2
What's Changed
- Fixed empty google search bug by @blackadad in #14
- Added google scholar pdf links by @kubalal in #15
- Fixed tests failing by @kubalal in #16
- Open Access Fix by @kubalal in #17
- Fixed robustness in s2-google correlation by @blackadad in #19
- Hot fix by @blackadad in #20
Full Changelog: v1.0.3...v1.1.2