Skip to content

Analysis of Latin documents provided by Classics department of University of Buffalo. Performed normalization and lemmatization on the latin texts. Scaled the number documents from 2 to 80 and compared the performance on scaling and optimized the algorithm for better performance. Tools: VMWare

Notifications You must be signed in to change notification settings

yashjain284/Parallel-Text-Processing-Using-Spark

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Parallel-Text-Processing-Using-Spark

Analysis of Latin documents provided by Classics department of University of Buffalo. Performed normalization and lemmatization on the latin texts. Scaled the number documents from 2 to 80 and compared the performance on scaling and optimized the algorithm for better performance.

Tools: VMWare

About

Analysis of Latin documents provided by Classics department of University of Buffalo. Performed normalization and lemmatization on the latin texts. Scaled the number documents from 2 to 80 and compared the performance on scaling and optimized the algorithm for better performance. Tools: VMWare

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%