Changes between Version 2 and Version 3 of CoverageAnalysisPipeline
- Timestamp:
- Sep 22, 2010 12:37:44 AM (14 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
CoverageAnalysisPipeline
v2 v3 12 12 * Changes the permissions such that it is in-accessible to the group and others 13 13 14 == Create a BWA index on adatabase ==14 == Create a BWA index on database == 15 15 16 16 [[Image(bwaIndexDatabase.png, 50%)]] 17 18 Gunzip fasta file. Build BWA index. Tar-gzip the results. 17 19 18 20 == Split fastq file == … … 20 22 [[Image(splitFastq.png, 50%)]] 21 23 24 Splits a large fastq file (gzipped) into several smaller files with the unix command 'split'. The results are uploaded to the directory that is specified in 'gridOutputDir' 25 22 26 == Alignment with BWA on each split file == 23 27 24 28 [[Image(BWAparam.png, 50%)]] 29 30 Runs BWA with adjustable parameter settings. 31 * Matches sequence reads to a reference database 32 * Convert sai to sam 33 * Convert sam to bam 34 * Sort bam file 35 * Index sorted bam file 36 * Tar-gzip all results. Also the intermediate files 25 37 26 38 == Merge bam files == … … 28 40 [[Image(MergeIndexSNPcall.png, 50%)]] 29 41 42 * Downloads all bai, bam, sam and tar.gz files from the gridInputDirectory 43 * Gunzip tar the tar.gz files if they are present 44 * Gunzip the reference file (fasta format) 45 * Merge all _sorted.bam files 46 * Build index on this merged file 47 * Call SNPs and make selection. Output in pileup format. 48 * Convert pileup format to bed format 49 30 50 == SNP calling with varscan, determine coverage == 31 51 32 52 [[Image(Coverage_Varscan_BaseCoverage.png)]] 53 54 * Creates a pileup file (with samtools pileup -f) Sends the output to Varscan. Calls SNPs, indels and copy number variations. 55 * Calculates coverage per 50kbp 56 * Calculates coverage per base