This proof-of-concept shows how Python can be used, from within a web-browser, to extract data for research purposes. The end-user (data donator) needs only to have a modern web-browser.
An application is provided which creates a basic user-interface for selecting a file. This file is then submitted to a Python interpreter. The relevant file API's of the web-browser have been wrapped so that the Python code does not need to be aware of running in a web-browser.
Example code for the Python part of the project is provided in the data_extractor folder. This also contains a README which explains how to modify this code and presents two examples of data extraction scripts, namely for Google Semantic Location History and Google Search History data.
To run the application execute the following command from the checkout:
python3 -m http.server
This launches a web-server that can be accessed on http://localhost:8000.
Port POC is developed by Eyra. The example data extraction scripts for Google Semantic Location History and Google Search History data packages are developed by the Research Engineering team of Utrecht University.
Do you have questions, suggestions or remarks on the technical implementation? File an issue or feel free to contact Erik-Jan van Kesteren.