Changes between Version 10 and Version 11 of Impute2Pipeline
- Timestamp:
- Apr 25, 2013 8:06:38 PM (12 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Impute2Pipeline
v10 v11 7 7 * Pre-processing of study data 8 8 1. Liftover PED/MAP files from build 36 to build 37 9 1. Quality control of study data9 2. Quality control of study data 10 10 * Analysis 11 1. Phasing the study data12 1. Imputing the study data13 Alist of used software and versions can be found on the bottom of this page.11 3. Phasing the study data 12 4. Imputing the study data 13 Detailed description and a list of used software and versions can be found on the bottom of this page. 14 14 15 15 All analysis jobs are generated using [http://www.molgenis.org/wiki/ComputeStart MOLGENIS Compute], more information about MOLGENIS Compute and the other analysis pipelines can be found here: https://github.com/molgenis/molgenis-pipelines … … 17 17 == Paper ready summary == 18 18 19 TODO. 19 You can use the following text in your paper: 20 21 TODO. Including citations. 20 22 21 23 == Reference data == … … 23 25 * GoNL release 4, 499 unrelated individuals, 998 haplotypes 24 26 25 == Pre processing of study data==26 ===Liftover PED/MAP files from build 36 to build 37 ===27 == Detailed description == 28 === Step 1: Liftover PED/MAP files from build 36 to build 37 === 27 29 * Location: https://github.com/molgenis/molgenis-pipelines/blob/master/compute4/Liftover_genome_build_PEDMAP/protocols/liftoverPEDMAP.ftl 28 30 [[br]]This protocol applies the following steps: … … 33 35 * Create new plink data without unmapped SNPs using [http://pngu.mgh.harvard.edu/~purcell/plink/ Plink]. 34 36 [[br]] 35 === Quality control of study data ===37 === Step 2: Quality control of study data === 36 38 * Location: https://github.com/molgenis/molgenis-pipelines/blob/master/compute4/Imputation_imputationtool_studyQC/protocols/studyQC.ftl 37 39 [[br]]This protocol applies QC to the study data using [http://genenetwork.nl/wordpress/imputationtool/ imputationTool]. This tool employs a binary format, called [http://genenetwork.nl/wordpress/trityper/ TriTyper] for rapid loading of big genotypic data. ImputationTool performs the following checks: … … 43 45 [[br]] 44 46 [[br]] 45 == Analysis == 46 === Phasing the study data === 47 === Step 3: Phasing the study data === 47 48 * Location: https://github.com/molgenis/molgenis-pipelines/blob/master/compute4/Imputation_shapeit_phasing/protocols/shapeitPhasing.ftl 48 49 [[br]]This protocol phases the study data using [http://shapeit.fr/ shapeit], study data can be in the following formats: Ped/Map, Bed/Bim, Gen/Haps. … … 51 52 * Phase the study data using the map files. 52 53 [[br]] 53 === Imputing study data ===54 === Step 4: Imputing study data === 54 55 * Location: https://github.com/molgenis/molgenis-pipelines/blob/master/compute4/Imputation_impute2/protocols/impute2Imputation.ftl 55 56 [[br]]This protocol imputes the study data in chromosome bins of 5million bases using [http://mathgen.stats.ox.ac.uk/impute/impute_v2.html Impute2]. Afterwards the results per chromosome bin are merged into full chromosomes.