poetry install
- python>=3.11
- poetry>=1.8.3
- Clone the repository:
git clone https://github.com/BioTechCo/SSG.git
- Navigate to the project directory:
cd SSG - Install dependencies:
poetry install
geo_champ.ipynb: This notebook fetches data from GEO database via FTP server andGEOparse. The critical phenotypes are identified and a sample sheet suitable for ChAMP analysis is generated.
Warning
The one phenotype differentiating tumor and normal samples is identified by selecting the phenotype with exactly two unique values. Users might need to manually select the phenotype of interest if more than one phenotype has two unique values.
-
champ_analysis.ipynb: This notebook fetch Data tables by specific GEO accession. Users are encouraged to visit the GEO database and look up whether the data is normalized, which is often presented inData table header descriptionsof every sample. -
geo_phenotype.ipynb: This notebook fetch phenotypes form GEO database and export them as a csv file. Excluded phenotypes are those names containingsupplementary_file,data_processing,extract_protocol_ch1,hyb_protocol,contact.
- Generate sample sheet for ChAMP analysis by specific GEO accession.
- Fetch data tables by specific GEO accession.
- Fetch phenotypes by specific GEO accession.