Contains script to retrieve Python Files from /git_dl
Files containing extracted data
Our corpus
The main code:
ExtractCode.py - CodeUnit / Code analysis ExtractSequences.py - Generate sequences from CodeUnits LM.py - The Language Model ProcessData.py - Gets info for LM TestModel.py - The actual test TypeUtils.py - misc. WriteData.py - Processes CodeUnits results.csv - THE LL's
Store zip files here
DL .zip files to git_dl CorpusPopulate/GetJavaFiles.py
mode = {levels, cfs} WriteData.py mode ProcessData.py mode
TestModel.py