This repository contains the material required for automated data extraction from the Bozner Wochenblatt for the years 1842-1848 (from 1856 onward the newspaper is called Bozner Zeitung).
The source code is stored in the Jupyter notebook BW_Data_Extraction_1842_1848.ipynb and is also provided as a Python file. The archive trained_RF_algorithms_zipped.zip holds the three trained random forest classifiers required to run the code. It was compressed with the Microsoft Windows 10 zip tool and must be decompressed before it is used with the source code.
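The decompression step can also be done from Python rather than with a desktop tool. The following is a minimal sketch using only the standard library. The helper name `extract_archive` and the destination folder `trained_RF_algorithms` are assumptions made for illustration and are not part of the repository. The sketch only unpacks the archive and lists its contents. It does not load the classifiers, because the file format they are stored in is not stated here.

```python
import zipfile
from pathlib import Path


def extract_archive(zip_path, dest_dir):
    """Unpack zip_path into dest_dir and return the extracted file paths.

    Hypothetical helper for illustration; not part of the repository code.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
        # Directory entries end with "/"; keep only regular files.
        names = [n for n in archive.namelist() if not n.endswith("/")]
    return sorted(dest / n for n in names)


if __name__ == "__main__":
    # Archive name taken from this README; the target folder is an assumption.
    archive_path = Path("trained_RF_algorithms_zipped.zip")
    if archive_path.exists():
        for path in extract_archive(archive_path, "trained_RF_algorithms"):
            print(path)
```

If the notebook expects the classifiers at a particular location, adjust the destination folder accordingly.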
The included PNG file is an example of a data table published in the Bozner Wochenblatt (source: Landesbibliothek Dr. Friedrich Teßmann, last accessed 15.9.2022; license: CC BY-NC-ND 4.0). The included CSV file contains the data automatically extracted from that PNG file.
This repository contains the materials (code and trained machine learning algorithms) required for automatic data extraction from the Bozner Wochenblatt (years 1842 - 1848) together with some example data.
michaelh00/BW_Data_Extraction_1842_1848