ShenLab/TCR

TCR data analysis and modeling

  1. In Data_process, “CDR3 and peptide data preprocess.ipynb” processes the raw data first. “ERGOdata_process.ipynb” generates the train+validation and test data for the ERGO methods. “repetitive data process_ae.ipynb” and “repetitive data process_lstm” remove data that is repeated across the train, validation, and test datasets (see the sketch below).
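
A minimal sketch of that cross-split deduplication, assuming CSV files with cdr3 and peptide columns (file names and column names are illustrative assumptions, not taken from the notebooks):

```python
import pandas as pd

# File and column names here are assumptions for illustration.
train = pd.read_csv("train.csv")       # columns: cdr3, peptide, label
val   = pd.read_csv("validation.csv")
test  = pd.read_csv("test.csv")

def pair_keys(df):
    """Key each example by its (CDR3, peptide) pair."""
    return df["cdr3"] + "_" + df["peptide"]

# Drop any validation/test pair that already occurs in an earlier split,
# so no example is repeated across train, validation, and test.
val  = val[~pair_keys(val).isin(set(pair_keys(train)))]
test = test[~pair_keys(test).isin(set(pair_keys(train)) | set(pair_keys(val)))]
```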

  2. ERGO-result contains summaries of the data sizes used in the ERGO and deepTCR methods, along with their results. “ERGO_result_process.ipynb” summarizes those results and generates plots from them.
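
A hedged sketch of that summary step, assuming per-peptide AUC results collected in a CSV (the file layout, column names, and metric are assumptions):

```python
import pandas as pd
import matplotlib.pyplot as plt

# Assumed layout: one row per (method, peptide) with a test AUC.
results = pd.read_csv("ergo_results.csv")   # columns: method, peptide, auc

# Summary table: mean AUC per method.
print(results.groupby("method")["auc"].agg(["mean", "std", "count"]))

# Bar plot of AUC per peptide, one group of bars per method.
pivot = results.pivot(index="peptide", columns="method", values="auc")
pivot.plot.bar(ylabel="AUC", title="Per-peptide test AUC")
plt.tight_layout()
plt.savefig("per_peptide_auc.png")
```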

  3. For testing the ERGO methods, I randomly chose two datasets of different sizes (110,000 and 210,000 examples) from the entire data. “ERGO-master-test10” contains the code and results for the small dataset (110,000), and “ERGO-master-test20” contains the code and results for the large dataset (210,000). In both folders, “ERGO.py” splits the data into train and validation sets, then creates negative and positive examples for them. The “main” function in “copy_ERGO.py” trains ERGO on the train and validation data. The “pep_test” function in “new_ERGO.py” tests ERGO on the test data, and the “pep_test” function in “copy_ERGO.py” tests ERGO on each single peptide in the test data.
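
Negative examples in this setup pair a CDR3 with a peptide it is not observed to bind. A minimal sketch of that sampling step (an illustration of the idea, not ERGO's actual code):

```python
import random

def make_negatives(pairs, n_per_pos=1, seed=42):
    """Sample negative (cdr3, peptide) pairs absent from the positives.

    pairs: list of (cdr3, peptide) positive examples.
    """
    rng = random.Random(seed)
    positives = set(pairs)
    cdr3s = [c for c, _ in pairs]
    peptides = [p for _, p in pairs]
    negatives = set()
    attempts = 0
    # Re-pair random CDR3s with random peptides, keeping only unseen pairs.
    while len(negatives) < n_per_pos * len(pairs) and attempts < 100 * len(pairs):
        attempts += 1
        cand = (rng.choice(cdr3s), rng.choice(peptides))
        if cand not in positives:
            negatives.add(cand)
    return list(negatives)

# Usage: negatives = make_negatives(positive_pairs), where
# positive_pairs = [("CASSLGQAYEQYF", "GILGFVFTL"), ...]
```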

  4. In deepTCR, “ERGO_compare_DeepTCR.ipynb” converts the data into a suitable format for the deepTCR method and produces its results.
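
A sketch of reshaping the paired data for DeepTCR, assuming its folder-per-class input layout (one subfolder per label, each holding sequence files); the input file, column names, and treating each peptide as a class are assumptions, not the notebook's actual code:

```python
import os
import pandas as pd

# Assumed input: one CSV of positive (cdr3, peptide) pairs.
pairs = pd.read_csv("positive_pairs.csv")   # columns: cdr3, peptide

# Write one subfolder per peptide class, each with a TSV of CDR3 sequences.
root = "deeptcr_input"
for peptide, group in pairs.groupby("peptide"):
    folder = os.path.join(root, peptide)
    os.makedirs(folder, exist_ok=True)
    group[["cdr3"]].to_csv(os.path.join(folder, "seqs.tsv"),
                           sep="\t", index=False)
```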

  5. In baseline, “data_preprocess.ipynb” prepares the data for the baseline method. “editing disatance.ipynb” produces the results of the baseline analyses using the edit-distance method.
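
A minimal sketch of an edit-distance baseline: label each test CDR3 with the peptide of its nearest training CDR3 by Levenshtein distance. The nearest-neighbor design is an assumption about the notebook; the distance itself is the standard dynamic program:

```python
def edit_distance(a, b):
    """Classic dynamic-programming Levenshtein distance."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                  # deletion
                           cur[j - 1] + 1,               # insertion
                           prev[j - 1] + (ca != cb)))    # substitution
        prev = cur
    return prev[-1]

def predict_peptide(cdr3, train_pairs):
    """Assign a test CDR3 the peptide of its nearest training CDR3."""
    nearest = min(train_pairs, key=lambda pair: edit_distance(cdr3, pair[0]))
    return nearest[1]

# Usage: predict_peptide("CASSIRSSYEQYF", [("CASSLGQAYEQYF", "GILGFVFTL"), ...])
```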
