14 | | === results === |
15 | | Here is all the data that has gone through any kind processing at UMCG |
16 | | */bam/umcg/ |
17 | | ** A4 trio complete bam files |
18 | | ** pilot chromosomes 19, 20, X, Y, MT bam files |
19 | | * /snp/hg18 |
20 | | ** Pilot cleaned up VCF files from the BGI on hg18(sorted, updated to VCF4.0) |
21 | | * /snp/hg19 |
22 | | ** Pilot initial unfiltered calls from UMCG |
23 | | ** Lifted-over files from BGI |
| 5 | === /target/gpfs2/gcc/groups/gonl/sftp/ === |
| 6 | Root of the SFTP. |
| 7 | |
| 8 | === /target/gpfs2/gcc/groups/gonl/sftp/A4 === |
| 9 | Contains all the information about the A4 test trio, including all the raw and aligned data. |
| 10 | |
| 11 | === /target/gpfs2/gcc/groups/gonl/sftp/BGI === |
| 12 | Contains all the data coming from BGI, including their variant calls. The data is organized by batch in the batchX subfolders. Each of the subfolders typically contains the following: |
| 13 | * batchX/ |
| 14 | ** A set of compressed files containing the plain text data and md5 files for downloading purpose. These are named as follows: timestamp.BGI.batchX.data_type.hg1X.data_format.tar.bz2. All plain text data should be available as a compressed file, including but not limited to: CNV, InDel, InDel annotations, SNP, SNP annotation. Some of these are available in multiple formats; see BGI data page for more explanation about the BGI data and its formats. |
| 15 | ** md5 checksum files for all files. |
| 16 | * batchX/bam OR batchX/alignment |
| 17 | ** The BAM files aligned by BGI |
| 18 | * batchX/CNV |
| 19 | ** CNVs in CNV Detector format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 20 | * batchX/indel |
| 21 | ** InDels in samtools pileup format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 22 | * batchX/indel_annotation |
| 23 | ** Indels annotations in GFF format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 24 | * batchX/SNP |
| 25 | ** SNP in SOAPsnp format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 26 | * batchX/SNP_annotation |
| 27 | ** SNP annotations in GFF format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 28 | * batchX/vcf_format/CNV |
| 29 | ** CNV in VCF format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 30 | * batchX/vcf_format/indel |
| 31 | ** Indel in VCF format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 32 | * batchX/vcf_format/SNP |
| 33 | ** SNP in VCF format. If you want to download for all samples, please download the compressed archive from batchX/ |
| 34 | |
| 35 | NOTES: |
| 36 | * Unless specified otherwise, all data is aligned on hg19 |
| 37 | * Some of the folder/filenames are inconsistent from one batch to the other. This is because the original names as found on the BGI HD have been kept. |
| 38 | |
| 39 | === /target/gpfs2/gcc/groups/gonl/sftp/pilot === |
| 40 | Data fro the pilot, including aligned BAMs and SNPs. |
| 41 | |
| 42 | === /target/gpfs2/gcc/groups/gonl/sftp/resources === |
| 43 | GoNL resources tarball (Thanks Freerk!) |
| 44 | |
| 45 | === /target/gpfs2/gcc/groups/gonl/sftp/upload === |
| 46 | This is where everyone has write permissions. This directory should be used for data exchange. |