This repository will hold the publicly released anonymized logs and scripts for DOI 10.1145/3478432.3499123
- Specified column names in the cleaned datasets. The raw logs (`final-time.tsv`, `midterm-time.tsv`) were left unmodified.
  - `curr_time`: Elapsed time in milliseconds
  - `id`: Student ID
  - `question`: Question identifier in the format `q<category_number>_<question_number>`
  - `score`: Student's score on the specific question at the current timestamp
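
As an illustration of the `question` format described above, the following minimal sketch splits an identifier into its category and question numbers. The helper name `parse_question` and the assumption that both numbers are plain decimal integers are illustrative only and are not taken from the repository's scripts.

```python
import re

# Assumed pattern for identifiers such as "q2_5"; both parts are taken
# to be plain decimal integers, which is a guess about the data.
QUESTION_RE = re.compile(r"q(\d+)_(\d+)")


def parse_question(identifier):
    """Return (category_number, question_number), or None if the format does not match."""
    match = QUESTION_RE.fullmatch(identifier)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))
```

Returning `None` for unexpected identifiers, rather than raising, makes it easy to count malformed rows before deciding how to handle them.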
 
- `clean_time_final.tsv` and `clean_time_mid.tsv` contain the cleaned time-related datasets for the final and midterm exams, respectively. Each included student ID has only valid (non-negative) `curr_time` entries and a complete set of questions.
The `.tsv` files below contain the entries discarded from the original dataset during the time-related cleaning process:

- `corrupted_time_final.tsv` and `corrupted_time_mid.tsv` contain all log entries for any student ID with at least one negative `curr_time` value. These entries were excluded under the assumption that negative timestamps indicate a corrupted log.
- `incomplete_time_final.tsv` and `incomplete_time_mid.tsv` contain all log entries for student IDs who viewed fewer or more than the number of questions assigned in the exam:
  - 7 questions for the midterm
  - 8 questions for the final

  These logs were excluded under the assumption that a valid student log should include exactly the questions assigned for the midterm/final.
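
The time-related cleaning rules above can be sketched roughly as follows. This is not the repository's actual script: the function name `partition_time_logs` is hypothetical, "number of questions viewed" is interpreted here as the number of distinct `question` values per student, and checking for negative timestamps before completeness is an assumption, since the description does not say how a log that is both corrupted and incomplete is filed.

```python
from collections import defaultdict


def partition_time_logs(rows, expected_questions):
    """Split rows (dicts with 'id', 'question', 'curr_time') into
    (clean, corrupted, incomplete) lists, keeping each student's rows together."""
    by_student = defaultdict(list)
    for row in rows:
        by_student[row["id"]].append(row)

    clean, corrupted, incomplete = [], [], []
    for student_rows in by_student.values():
        has_negative = any(float(r["curr_time"]) < 0 for r in student_rows)
        distinct_questions = {r["question"] for r in student_rows}
        if has_negative:
            # Assumption: the corrupted check takes precedence over completeness.
            corrupted.extend(student_rows)
        elif len(distinct_questions) != expected_questions:
            incomplete.extend(student_rows)
        else:
            clean.extend(student_rows)
    return clean, corrupted, incomplete
```

Under these assumptions, `expected_questions` would be 7 for the midterm logs and 8 for the final logs.
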
- `clean_non_time_final.tsv` and `clean_non_time_mid.tsv` contain the cleaned non-time-related datasets for the final and midterm exams, respectively. Each included student ID has a complete set of questions. These files may contain entries with negative `curr_time` values, as time was not a criterion for this cleaning.
The `.tsv` files below contain the entries discarded from the original dataset during the non-time-related cleaning process:

- `incomplete_non_time_final.tsv` and `incomplete_non_time_mid.tsv` contain all log entries for student IDs who viewed fewer or more than the number of questions assigned in the exam:
  - 7 questions for the midterm
  - 8 questions for the final

  These logs were excluded under the assumption that a valid student log should include exactly the questions assigned for the midterm/final.
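
For readers who want to check the completeness rule on their own copy of the data, the sketch below reads tab-separated text and returns the student IDs with exactly the expected number of distinct questions, ignoring `curr_time` entirely. It assumes the file begins with a header row containing the column names `id` and `question`; the function name `complete_student_ids` is hypothetical and not part of the released scripts.

```python
import csv
import io


def complete_student_ids(tsv_text, expected_questions):
    """Return the set of student IDs whose rows cover exactly
    `expected_questions` distinct question identifiers."""
    # Assumption: the first line is a header naming the columns.
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    questions_by_student = {}
    for row in reader:
        questions_by_student.setdefault(row["id"], set()).add(row["question"])
    return {
        student_id
        for student_id, questions in questions_by_student.items()
        if len(questions) == expected_questions
    }
```

To apply it to a file, one could pass the file's contents as `tsv_text`, with `expected_questions` set to 7 for the midterm or 8 for the final.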