TY - JOUR
T1 - De novo assembly and characterization of Gleditsia sinensis transcriptome and subsequent gene identification and SSR mining
AU - Han, S.
AU - Wu, Z.
AU - Wang, X.
AU - Huang, K.
AU - Jin, Y.
AU - Yang, W.
AU - Shi, H.
N1 - Funding Information:
Research partially supported by the National Natural Science Foundation of China (grant #31270316 to W. Yang and #31328004 to H. Shi) and by the Excellent Doctoral Dissertation Cultivation Grant from the Central China Normal University (grant #2013YBZD24) to S. Han.
Publisher Copyright:
© FUNPEC-RP.
PY - 2016/1/26
Y1 - 2016/1/26
N2 - Gleditsia sinensis is a deciduous tree native to China with high economic and medicinal value. However, there is limited knowledge on the molecular processes responsible for the medicinal properties of this species owing to the lack of bioinformatic resources such as whole-genome sequences. In the present study, RNA sequencing data were used to analyze the transcriptome of G. sinensis, and a series of bioinformatic tools was used to explore the main genes involved in important molecular processes. A total of 75.57 million paired-end reads, each 101 bp long, were acquired from G. sinensis. Using the assembly tool Trinity, 233,751 transcripts were discovered. Among these, 85,795 were identified as unique transcripts, and 59,326 unique transcripts were found to contain coding regions. Gene ontology analysis identified 27,637 unique transcripts that were clustered into 56 functional groups. Genes involved in flavonoid and terpenoid backbone biosynthesis and those encoding transcription factors were further analyzed. Sequence analysis revealed four putative G. sinensis chalcone isomerase genes (GsCHI) encoding the enzymes for flavonoid biosynthesis. GsCHI1 was found to be phylogenetically related to the chalcone isomerase of the family Leguminosae, and its transcript levels in different tissues were higher than those of GsCHI2, GsCHI3, and GsCHI4. Furthermore, 15,014 simple sequence repeat (SSR) markers were discovered in the transcript library, and 5,170 primers were generated for the SSR loci. The genetic and genomic information presented in this study will be helpful for future studies on gene discovery and molecular processes in G. sinensis.
KW - Chalcone isomerase
KW - Gene identification
KW - Gleditsia sinensis
KW - SSR mining
KW - Transcriptome assembly
KW - Unique transcripts
UR - http://www.scopus.com/inward/record.url?scp=84961720132&partnerID=8YFLogxK
U2 - 10.4238/gmr.15017740
DO - 10.4238/gmr.15017740
M3 - Article
C2 - 26909943
AN - SCOPUS:84961720132
SN - 1676-5680
VL - 15
JO - Genetics and Molecular Research
JF - Genetics and Molecular Research
IS - 1
M1 - 15017740
ER -