| | 1 | = RNASeq protocols= |
| | 2 | |
| | 3 | In a nutshell, all raw RNAseq datasets on Grid will look like the following: |
| | 4 | |
| | 5 | * sample AD1NNNACXX-1-2 |
| | 6 | * srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/bbmri.nl/RP3/RNASeq/AD1NNNACXX-1-2/AD1NNNACXX-1-2_R1.fg.gz |
| | 7 | * srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/bbmri.nl/RP3/RNASeq/AD1NNNACXX-1-2/AD1NNNACXX-1-2_R2.fg.gz |
| | 8 | |
| | 9 | * sample AD1NNNACXX-1-4 |
| | 10 | * srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/bbmri.nl/RP3/RNASeq/AD1NNNACXX-1-4/AD1NNNACXX-1-4_R1.fg.gz |
| | 11 | * srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/bbmri.nl/RP3/RNASeq/AD1NNNACXX-1-4/AD1NNNACXX-1-4_R2.fg.gz |
| | 12 | |
| | 13 | The locations of sample information, file location, md5sum, etc are stored in the [wiki:BIOS_Metadatabase Metadatabase]. |
| | 14 | |
| | 15 | == SRM location == |
| | 16 | All RNA sample datasets are stored at srm://srm.grid.sara.nl/pnfs/grid.sara.nl/data/bbmri.nl/RP3/RNASeq/. |
| | 17 | |
| | 18 | Note: all data are also copied to the Grid archive and where they are backuped. |
| | 19 | |
| | 20 | == Directory naming == |
| | 21 | Each sample is to be uploaded into its own, separate directory. The directory name is formatted as follows: |
| | 22 | {{{ |
| | 23 | <flowcell ID>-<lane ID>-<index ID>/ |
| | 24 | }}} |
| | 25 | For instance: |
| | 26 | {{{ |
| | 27 | AD1NNNACXX-1-2/ |
| | 28 | }}} |
| | 29 | |
| | 30 | == File naming == |
| | 31 | Files should follow the same naming scheme as the directories: |
| | 32 | |
| | 33 | === GZipped .fastq's === |
| | 34 | Fastq files, required for alignment, are to be formatted as follows: |
| | 35 | {{{ |
| | 36 | <flowcell ID>-<lane ID>-<index ID>_R1.fq.gz |
| | 37 | <flowcell ID>-<lane ID>-<index ID>_R2.fq.gz |
| | 38 | }}} |