SEARS SDK

The purpose of this SDK is to publish code that will help data scientists to query MongoDB using python so as to bulk download data and files directly from the SEARS backend for aggregated analysis. Case studies 6.1 and 6.2 from our main paper were conducted using this SDK.

Main SEARS platform

Please refer to our main SEARS platform repository here.

Steps to pull data.

Copy the .env file to the root directory of the project. Update the connection string to use your own MongoDB Atlas connection string. Also update the AWS S3 parameters as per your AWS settings.
Install all requirements using pip3 install -r requirements.txt
Run python3 mongo_connect.py to download data from MongoDB to a CSV file. Set search_criteria and output_file_name in the program file. Please note that mongo_connect.py has been customized to access our own schema in SEARS. As you adapt SEARS to your own needs, you may need to modify the code to suit your schema. Therefore please use this file as a reference to write your own MongoDB query code. We encourage use of AI agents to achieve this goal. Easiest way to get a copy of the schema is to go to SEARS dashboard. Next to any experiment appearing in the dashboard, click on the "W"-icon button. This will download the schema of the experiment in JSON format. You can use this schema to write your own MongoDB queries.
Run python3 AWS_Download.py to download files from AWS S3 to a local directory ./file_fetch/. All files related to experiments meeting the search criteria will be downloaded.
Run your ML model on the downloaded data and files.

Process to automate the upload of experiment data to MongoDB

#Steps

Notice the folder ./uploads in the root directory of the project. This folder is used to upload data to MongoDB.
Drop data for an experiment in the folder ./uploads. The data should be in the form of a JSON file.
Run the program python3 auto_upload.py to upload the data to MongoDB. The program will automatically upload the data to the MongoDB collection productData.

Manual upload of data to SEARS on a per experiment basis - SOP

Pool all your upload files in a single folder. The files should be in the form of a CSV file. Run CSV_Validator.ipynb to validate the CSV files. This will ensure that the files are in the correct format for upload. Address any errors that the validator reports.
Go to the SEARS dashboard.
Click on the "+" button next to any "Experiment". A dialog box will appear.
Navigate to the correct experiment tab in the dialog box. e.g. "Thickness".
Drag and Drop files into the Upload area to upload. You can upload multiple files at once.
Click on the "Save Data" button to finish.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
__pycache__		__pycache__
file_fetch		file_fetch
logs		logs
sample_schema		sample_schema
.env		.env
.gitignore		.gitignore
AWS_Download.py		AWS_Download.py
CSV_Validator.ipynb		CSV_Validator.ipynb
LICENSE		LICENSE
ReadMe.md		ReadMe.md
auto_upload.py		auto_upload.py
logging_helper.py		logging_helper.py
mongo_connect.py		mongo_connect.py
playground-1.mongodb.js		playground-1.mongodb.js
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SEARS SDK

Main SEARS platform

Steps to pull data.

Process to automate the upload of experiment data to MongoDB

Manual upload of data to SEARS on a per experiment basis - SOP

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

baskargroup/SEARS-Data-Pull

Folders and files

Latest commit

History

Repository files navigation

SEARS SDK

Main SEARS platform

Steps to pull data.

Process to automate the upload of experiment data to MongoDB

Manual upload of data to SEARS on a per experiment basis - SOP

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages