Dataset and Code for "Predicting CircRNA-disease Associations through Linear Neighborhood Label Propagation Method"
Dataset1/association.csvis the circRNA-disease association matrix ofDataset1, which contains 331 associations between 312 circRNAs and 40 diseases.Dataset1/all_circRNAs.csvcontains all the circRNAs, corresponding to the rows of the association matrix.Dataset1/all_diseases.csvcontains all the diseases, corresponding to the columns of the association matrix.
Dataset2/association.csvis the circRNA-disease association matrix ofDataset2, which contains 650 associations between 603 circRNAs and 88 diseases.Dataset2/all_circRNAs.csvcontains all the circRNAs, corresponding to the rows of the association matrix.Dataset2/all_diseases.csvcontains all the diseases, corresponding to the columns of the association matrix.
-
case_study.pycalculates score matrices of case studies onDataset1andDataset2respectively. -
LNLP_method.pycontains our method function, that islinear_neighbor_predict. -
LNLP_evaluation.pyimplements LOOCV of CD-LNLP onDataset1.
-
case_study_scoresDataset1_scores.csvis the score matrix of case study onDataset1.Dataset2_scores.csvis the score matrix of case study onDataset2.
-
Dataset1_result/diseaseFor every disease in
Dataset1, the candidate circRNAs are in the text file named as the disease's name inDataset1_result/diseasefolder in descending order of score. -
Dataset2_result/diseaseFor every disease in
Dataset2, the candidate circRNAs are in the text file named as the disease's name inDataset2_result/diseasefolder in descending order of score. -
evaluation_result/loocvevaluation_result/loocvcontains our method's evaluation result on LOOCV.0.1_0.9_1.0_loo.csvcontains the values of 6 metrics.0.1_0.9_1.0_loo_pr_x.csvcontains the values of recall on different thresholds.0.1_0.9_1.0_loo_pr_y.csvcontains the values of precision on different thresholds.0.1_0.9_1.0_loo_roc_x.csvcontains the values of False Positive Rate on different thresholds.0.1_0.9_1.0_loo_roc_y.csvcontains the values of True Positive Rate on different thresholds.