Difference between revisions of "GBS Data Management"
From Poland Lab Wiki
Altschuler (Talk | contribs) |
|||
(2 intermediate revisions by 2 users not shown) | |||
Line 20: | Line 20: | ||
[[Media:GBS_Check-In_Hudson_Alpha_Automated.pdf|GBS Data Management for Hudson Alpha Facility]] | [[Media:GBS_Check-In_Hudson_Alpha_Automated.pdf|GBS Data Management for Hudson Alpha Facility]] | ||
+ | |||
+ | [[Media:GBS_Check-In_Novogene_Automated.pdf|GBS Data Management for Novogene Facility]] |
Latest revision as of 18:48, 12 September 2019
GBS Sequence Data Management consists of the following tasks:
- Download of GBS sequence data files from sequencing facilities.
- Verify the integrity of downloaded GBS files.
- Create or rename GBS sequence files to conform to the standard GBS file naming format.
- Filtering of GBS sequence file to remove short reads of less than 75bp (Nextseq files only).
- Update wheatgenetics gbs table flowcell, lane, num_lines and md5sum columns for each file.
- Checking % valid reads in each GBS file and % reads found in each blank well.
- Checking DNA quantification values for blank wells relative to other wells.
- Storage of GBS files on Beocat in /bulk/jpoland/sequence GBS sequence file repository
- Backup of GBS files to external NAS.
The exact method for executing these tasks is dependent on the sequencing facility that produced the data.
The data management procedure for facilities that are currently used by the Poland Lab are document below.
GBS Data Management for KSU Genomics Facility
GBS Data Management for Genome Quebec NGS Facility