Difference between revisions of "Genotyping Database"

From Poland Lab Wiki
Jump to: navigation, search

Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/customer/www/wiki.wheatgenetics.k-state.edu/public_html/includes/diff/DairikiDiff.php on line 434

Deprecated: assert(): Calling assert() with a string argument is deprecated in /home/customer/www/wiki.wheatgenetics.k-state.edu/public_html/includes/diff/DairikiDiff.php on line 437
(Example Queries)
(Key Query)
 
Line 26: Line 26:
 
=== Key Query ===
 
=== Key Query ===
 
The query below shows an example of how to obtain the information used to construct a key file that is required as input to TASSEL 5 GBSv2 pipeline.
 
The query below shows an example of how to obtain the information used to construct a key file that is required as input to TASSEL 5 GBSv2 pipeline.
SELECT  
+
SELECT  
 
   gbs.flowcell as 'Flowcell',  
 
   gbs.flowcell as 'Flowcell',  
 
   gbs.lane as 'Lane',  
 
   gbs.lane as 'Lane',  
Line 50: Line 50:
 
   dna.sample_id,
 
   dna.sample_id,
 
   dna.tissue_id
 
   dna.tissue_id
FROM dna LEFT JOIN gbs ON gbs.dna_id = dna.plate_id
+
FROM dna LEFT JOIN gbs ON gbs.dna_id = dna.plate_id
 
   LEFT JOIN plant ON dna.tissue_id = plant.plant_id
 
   LEFT JOIN plant ON dna.tissue_id = plant.plant_id
 
   INNER JOIN barcodes ON dna.well_A01 = barcodes.well_A01 AND gbs.plexing LIKE barcodes.`set`  
 
   INNER JOIN barcodes ON dna.well_A01 = barcodes.well_A01 AND gbs.plexing LIKE barcodes.`set`  
WHERE gbs.project LIKE "%MyProject%"   
+
WHERE gbs.project LIKE "%MyProject%"   
ORDER BY gbs.gbs_id, dna.well_01A ASC
+
ORDER BY gbs.gbs_id, dna.well_01A ASC

Latest revision as of 19:08, 27 May 2016

The genotyping database stores information related to DNA libraries. The core set of tables related to genotyping are:

  • The gbs table stores information about gbs libraries, DNA sequencing orders and DNA sequence files.
  • The dna table stores information about DNA plates and the samples contained in each well on a plate.
  • The dnaQuant table stores DNA quantification and fluorescence values for each sample.
  • The barcodes table stores information about the barcode sequences used for each plexing scheme for each well on a plate.

In addition to the core tables, the following tables provide details on the tissue and plant DNA sources (if available):

  • The tissue table stores information about the tissue used to prepare a DNA sample.
  • The plant table stores information about the plant that the tissue was collected from.

The figure below illustrates the table definitions and the key relationships used to formulate queries related to genomic analysis. (PK indicates the primary key for each table.)


Gbs.png


Example Queries

Key Query

The query below shows an example of how to obtain the information used to construct a key file that is required as input to TASSEL 5 GBSv2 pipeline.

SELECT 
  gbs.flowcell as 'Flowcell', 
  gbs.lane as 'Lane', 
  barcodes.barcode as 'Barcode', 
  dna.sample_name as 'DNASample',
  dna.plate_id as 'LibraryPlate', 
  substring(dna.well_A01,1,1) as 'Row',
  substring(dna.well_A01,2,2) as 'Col',
  concat(gbs.gbs_id,barcodes.barcode) as 'LibraryPrepID',
  barcodes.well_A01 as 'BarcodeWell',
  gbs.enzyme as 'Enzyme',
  gbs.gbs_id as 'LibraryPlateID',
  dna.plate_id as 'DNA_Plate',
  dna.well_A01 as 'SampleDNA_Well',
  'triticum' as 'Genus',
  'aestivum' as 'Species', 
  'NA' as 'Pedigree', 
  'NA' as 'Population',
  'NA' as 'SeedLot', 
  dna.sample_name as 'FullSampleName',
  gbs.plexing, 
  gbs.project, 
  dna.sample_id,
  dna.tissue_id
FROM dna LEFT JOIN gbs ON gbs.dna_id = dna.plate_id
 LEFT JOIN plant ON dna.tissue_id = plant.plant_id
 INNER JOIN barcodes ON dna.well_A01 = barcodes.well_A01 AND gbs.plexing LIKE barcodes.`set` 
WHERE gbs.project LIKE "%MyProject%"  
ORDER BY gbs.gbs_id, dna.well_01A ASC