Releases: arquivo/page-search
Releases · arquivo/page-search
Dionisius release
- Revise Page Search API log format", arquivo/pwa-technologies#1066
Basileus release
- Added test to check for validation of no status code in warc arquivo/pwa-technologies#1021
- Added null check to length and status code to fix arquivo/pwa-technologies#1021 This happens because donated collections do not have status code
- Fix error being thrown when user tries to get extracted text with a resource id invalid or not existent. arquivo/pwa-technologies#986
- encode url when searching for it. arquivo/pwa-technologies#948
- Make textextracted work with solr and nutch. arquivo/pwa-technologies#948
- Fix and add tests to verify issue arquivo/pwa-technologies#979
- when dedupValue = 0 increase by 1 the number of hits so we can know if it is the lastpage or not. nutchwax doesn't don't apply the hits multiplication factor when the dedupvalue is 0. fix arquivo/pwa-technologies#980
Responsive release
- Add Arquivo.pt collection on cdxj records arquivo/pwa-technologies#788
- Change dedupField to URL when searching for site arquivo/pwa-technologies#805
- Deduplication by URL/HOST should ignore the protocol and www prefix arquivo/pwa-technologies#807
- Collection field on textsearch metadata should use cdxj API arquivo/pwa-technologies#846
- Log the search results of each page/text search API call arquivo/pwa-technologies#928