Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencingReport as inadecuate




Deep sampling of the Palomero maize transcriptome by a high throughput strategy of pyrosequencing - Download this document for free, or read online. Document in PDF available to download.

BMC Genomics

, 10:299

First Online: 06 July 2009Received: 02 December 2008Accepted: 06 July 2009

Abstract

BackgroundIn-depth sequencing analysis has not been able to determine the overall complexity of transcriptional activity of a plant organ or tissue sample. In some cases, deep parallel sequencing of Expressed Sequence Tags ESTs, although not yet optimized for the sequencing of cDNAs, has represented an efficient procedure for validating gene prediction and estimating overall gene coverage. This approach could be very valuable for complex plant genomes. In addition, little emphasis has been given to efforts aiming at an estimation of the overall transcriptional universe found in a multicellular organism at a specific developmental stage.

ResultsTo explore, in depth, the transcriptional diversity in an ancient maize landrace, we developed a protocol to optimize the sequencing of cDNAs and performed 4 consecutive GS20–454 pyrosequencing runs of a cDNA library obtained from 2 week-old Palomero Toluqueño maize plants. The protocol reported here allowed obtaining over 90% of informative sequences. These GS20–454 runs generated over 1.5 Million reads, representing the largest amount of sequences reported from a single plant cDNA library. A collection of 367,391 quality-filtered reads 30.09 Mb from a single run was sufficient to identify transcripts corresponding to 34% of public maize ESTs databases; total sequences generated after 4 filtered runs increased this coverage to 50%. Comparisons of all 1.5 Million reads to the Maize Assembled Genomic Islands MAGIs provided evidence for the transcriptional activity of 11% of MAGIs. We estimate that 5.67% 86,069 sequences do not align with public ESTs or annotated genes, potentially representing new maize transcripts. Following the assembly of 74.4% of the reads in 65,493 contigs, real-time PCR of selected genes confirmed a predicted correlation between the abundance of GS20–454 sequences and corresponding levels of gene expression.

ConclusionA protocol was developed that significantly increases the number, length and quality of cDNA reads using massive 454 parallel sequencing. We show that recurrent 454 pyrosequencing of a single cDNA sample is necessary to attain a thorough representation of the transcriptional universe present in maize, that can also be used to estimate transcript abundance of specific genes. This data suggests that the molecular and functional diversity contained in the vast native landraces remains to be explored, and that large-scale transcriptional sequencing of a presumed ancestor of the modern maize varieties represents a valuable approach to characterize the functional diversity of maize for future agricultural and evolutionary studies.

Electronic supplementary materialThe online version of this article doi:10.1186-1471-2164-10-299 contains supplementary material, which is available to authorized users.

Download fulltext PDF



Author: Julio C Vega-Arreguín - Enrique Ibarra-Laclette - Beatriz Jiménez-Moraila - Octavio Martínez - Jean Philippe Vielle-Calz

Source: https://link.springer.com/article/10.1186/1471-2164-10-299



DOWNLOAD PDF




Related documents