Welcome to the MuHSiC Github, the home of all documentation for corpus processing, web management, and data analysis
There are 4 folders containing code and files from four different aspects of the corpus: Data Analysis, Data Processing, Images, and Website Code.
-
Data Analysis - Praat Scripts for exporting acoustical data, Python Scripts gathering token counts and utterance counts, R Scripts for plotting data.
-
Data Processing - Python Scripts for cleaning data, formatting doc/pdf files, redacting wav files, generating textgrid files, and forced alignment configuration.
-
Images - png and jpg files used throughout the website.
-
Website - Jave code for building the MuHSiC Website.
