cs128h-project

Project for CS128 Honors

Group

Name	NetID
Andrew	alester3
Arul	arulhv2
Devak	devakn2
Jonathan	jsneh2

Project Introduction

This project is a web application with a Rust backend that runs MapReduce on any uploaded dataset, alongside other dataset analysis tools (to be decided). We will test the application with existing datasets found online, citing each source as we use it. We chose this project because we all enjoyed the last MP, and we think that working more on parallel data analysis algorithms will improve our understanding of Rust and concurrency.

Our goals for the project are a simple interface that requires little code, interesting data analysis features, and benchmarks for our concurrent algorithms. We believe that most of the work should go into implementing the parallel algorithms, so the user interface will be kept deliberately simple. Our data analysis features could involve graph or tree algorithms, image processing, or other data visualization.

System Overview

Major technical checkpoints will include:

Choosing a web framework for our Rust backend
Developing a front end using HTML and CSS
Developing a MapReduce algorithm in Rust that processes data in a time-efficient manner
Testing our algorithm on online datasets and publishing the results
Determining what other algorithms we could implement
Hooking up the algorithms to the web backend, including finding ways for the user to upload the data the algorithms run on, whether that is images or even video
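
As a rough illustration of the MapReduce checkpoint, here is a minimal sketch of a parallel word count using only the Rust standard library. Word counting is just a stand-in workload, and the names `map_chunk`, `reduce`, and `word_count` are hypothetical and not taken from the project's code. It assumes the dataset has already been read into memory as lines of text.

```rust
use std::collections::HashMap;
use std::thread;

/// Map phase: count word occurrences in one chunk of lines.
fn map_chunk(lines: &[String]) -> HashMap<String, usize> {
    let mut counts = HashMap::new();
    for line in lines {
        for word in line.split_whitespace() {
            *counts.entry(word.to_lowercase()).or_insert(0) += 1;
        }
    }
    counts
}

/// Reduce phase: merge the per-chunk tables into a single table.
fn reduce(partials: Vec<HashMap<String, usize>>) -> HashMap<String, usize> {
    let mut total = HashMap::new();
    for partial in partials {
        for (word, n) in partial {
            *total.entry(word).or_insert(0) += n;
        }
    }
    total
}

/// Hypothetical entry point: split the input into roughly `workers` chunks,
/// map each chunk on its own scoped thread, then reduce the results.
fn word_count(lines: &[String], workers: usize) -> HashMap<String, usize> {
    let workers = workers.max(1);
    // Ceiling division so every line lands in a chunk; `chunks` panics on 0.
    let chunk_size = ((lines.len() + workers - 1) / workers).max(1);
    let partials = thread::scope(|scope| {
        // Spawn all workers first so they run concurrently, then join them.
        let handles: Vec<_> = lines
            .chunks(chunk_size)
            .map(|chunk| scope.spawn(move || map_chunk(chunk)))
            .collect();
        handles
            .into_iter()
            .map(|handle| handle.join().expect("map worker panicked"))
            .collect::<Vec<_>>()
    });
    reduce(partials)
}
```

Calling `word_count(&lines, 4)` would count words across up to four threads. Scoped threads let each worker borrow its chunk directly, which avoids copying the input. A real implementation might instead use a thread pool or channels.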

Possible Challenges

One possible challenge will be identifying a good backend library to use with Rust and hooking our data analysis code into it. Many are available, so we will need to choose carefully at the start so that we head in the right direction. Another challenge will be designing how the user uploads datasets to our backend for use by MapReduce and/or other parallel data processing algorithms. If a dataset is very large, uploading it in one piece might not be possible, and we would have to resort to some form of data streaming. A third challenge will be implementing data visualization that fits the data: not all data can be presented the same way, so we would either handle different forms of data differently or limit the types of data we accept.
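
One way the streaming idea could look on the processing side is sketched below: read the input line by line and hand it to the analysis code in fixed-size batches, so that only one batch is held in memory at a time. The helper name `for_each_batch` and its parameters are hypothetical. The sketch assumes line-oriented text input and is independent of whichever web framework ends up being chosen.

```rust
use std::io::{self, BufRead};

/// Hypothetical helper: read `reader` line by line and pass the lines to
/// `process` in batches of at most `batch_size` lines.
/// Returns the total number of lines read.
fn for_each_batch<R, F>(reader: R, batch_size: usize, mut process: F) -> io::Result<usize>
where
    R: BufRead,
    F: FnMut(&[String]),
{
    let batch_size = batch_size.max(1);
    let mut batch = Vec::with_capacity(batch_size);
    let mut total = 0;
    for line in reader.lines() {
        // Propagate I/O errors (including invalid UTF-8) to the caller.
        batch.push(line?);
        total += 1;
        if batch.len() == batch_size {
            process(&batch);
            batch.clear();
        }
    }
    // Flush the final, possibly short, batch.
    if !batch.is_empty() {
        process(&batch);
    }
    Ok(total)
}
```

Because the helper accepts any `BufRead`, the same code could be fed from a file wrapped in `std::io::BufReader`, from an in-memory byte slice in tests, or from an upload stream if the chosen framework exposes one as a reader.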

