Several script to merge the SNP data from different populations called with the pipeline PWGS.
Software used:
- bedtools (BEDTools, Quinlan et al. 2010)
- R (R Core Team 2015)
##merge_positions.sh
Merge all positions sequenced in all populations. it use the _multi.BED files.
##select_SNPs.sh
Create folder with the SNPs tab for each population analysed with Snape and VarScan.
##merge_SNPs.sh
merge all the SNPs files.
-
row file (p53_tab_snape)
file with all the SNPs that are in regions sequenced in all populations. First the ALT allele after the frequencies for each population. Value with NA means that there is the REF allele and is fixed. -
clean file (p53_tab_snape_clean.BED)
file in bedtools format scaffold position position REF allele ALT allele frequencies for all the populations. Criteria for selection: only biallelic SNPs, polymorphic in at least two populations, at least one population has the frequency more than 0.05. -
rfixed (p53_tab_snape_rfixed)
file with SNPs different from reference but fixed in all populations. -
rmin_freq (p53_tab_snape_rmin_freq)
file with the discharged SNPs that have frequency less than 0.05 in all populations. -
rmin_pop (p53_tab_snape_rmin_pop)
file with the discharged SNPs that are polymorphic in one population. -
rtri (p53_tab_snape_rtri)
file with the discharged triallelic SNPs.