Releases: apify/crawlee-python
0.0.7
0.0.6
Adds
- BREAKING: BasicCrawler.export_data helper method, which replaces BasicCrawler.export_to
- Configuration.get_global_configuration method
- Automatic logging setup
- Context helper for logging (context.log)
Fixes
- Handling of relative URLs in add_requests
- Graceful exit in BasicCrawler.run
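
The relative-URL fix above concerns links that lack a scheme and host. These notes do not show how add_requests resolves such links, so the following is only a minimal sketch of the general technique, using the standard library's urllib.parse.urljoin. The helper name resolve_links and the example URLs are hypothetical and not part of Crawlee.

```python
from urllib.parse import urljoin


def resolve_links(base_url: str, links: list[str]) -> list[str]:
    """Resolve each link against base_url (hypothetical helper, not a Crawlee API).

    Absolute links pass through unchanged; relative ones are joined to the base.
    """
    return [urljoin(base_url, link) for link in links]


resolved = resolve_links(
    "https://example.com/docs/index.html",
    ["/about", "guide.html", "../blog/", "https://other.example/page"],
)
for url in resolved:
    print(url)
```

Resolving against the URL of the page the link was found on, rather than the site root, is what makes document-relative links such as guide.html land in the right directory.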
0.0.5
Adds
- Add explicit error messages for missing package extras during import
- Better browser abstraction:
  - BrowserController - Wraps a single browser instance and maintains its state.
  - BrowserPlugin - Manages the browser automation framework, and basically acts as a factory for controllers.
- Browser rotation with a maximum number of pages opened per browser.
- Add emit persist state event to event manager
- Add batched request addition in RequestQueue
- Add start requests option to BasicCrawler
- Add storage-related helpers get_data, push_data and export_to to BasicCrawler and BasicContext
- Add PlaywrightCrawler's enqueue links helper
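
To make the controller/plugin split and the browser rotation entries above more concrete, here is a rough, self-contained sketch. It does not use Crawlee's real classes: FakeBrowserController, FakeBrowserPlugin, RotatingPool and the max_pages_per_browser parameter are all hypothetical names chosen for illustration, and no actual browser is launched.

```python
from dataclasses import dataclass
from itertools import count


@dataclass
class FakeBrowserController:
    """Hypothetical stand-in: wraps one browser instance and tracks its state."""

    browser_id: int
    pages_opened: int = 0

    def new_page(self) -> str:
        self.pages_opened += 1
        return f"browser-{self.browser_id}/page-{self.pages_opened}"


class FakeBrowserPlugin:
    """Hypothetical stand-in: acts as a factory for controllers."""

    def __init__(self) -> None:
        self._ids = count(1)

    def new_browser(self) -> FakeBrowserController:
        return FakeBrowserController(browser_id=next(self._ids))


class RotatingPool:
    """Rotates to a fresh browser once the active one has hit its page limit."""

    def __init__(self, plugin: FakeBrowserPlugin, max_pages_per_browser: int) -> None:
        self._plugin = plugin
        self._max_pages = max_pages_per_browser
        self._active: FakeBrowserController | None = None

    def new_page(self) -> str:
        # Ask the plugin for a new controller when none exists or the limit is reached.
        if self._active is None or self._active.pages_opened >= self._max_pages:
            self._active = self._plugin.new_browser()
        return self._active.new_page()


pool = RotatingPool(FakeBrowserPlugin(), max_pages_per_browser=2)
pages = [pool.new_page() for _ in range(5)]
print(pages)
```

Keeping the page counter on the controller and the creation logic on the plugin mirrors the division of responsibilities described in the list, so the pool only has to decide when to rotate.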
Fixes
- Fix type error in persist state of statistics
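
The batched request addition listed under 0.0.5 is not described further in these notes. As a generic illustration of the idea only, the sketch below splits a list of URLs into fixed-size batches; the chunked helper and the batch size are assumptions, and nothing here reflects how RequestQueue is actually implemented.

```python
from itertools import islice
from typing import Iterable, Iterator


def chunked(items: Iterable[str], batch_size: int) -> Iterator[list[str]]:
    """Yield consecutive batches of at most batch_size items (hypothetical helper)."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(items)
    # islice pulls up to batch_size items; an empty list means the input is exhausted.
    while batch := list(islice(iterator, batch_size)):
        yield batch


urls = [f"https://example.com/page/{i}" for i in range(5)]
batches = list(chunked(urls, batch_size=2))
for batch in batches:
    print(len(batch), batch)
```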
0.0.4
0.0.3
Another internal release, adding mainly session management and BeautifulSoupCrawler.
Adds
- HttpxClient
- SessionPool
- BeautifulSoupCrawler
- BaseStorageClient
- Storages and MemoryStorageClient were refactored
Added in 0.0.2
- EventManager & LocalEventManager
- Snapshotter
- AutoscaledPool
- MemoryStorageClient
- Storages
- BasicCrawler & HttpCrawler