Analysis of antisense expression by whole genome tiling microarrays and siRNAs suggests mis-annotation of Arabidopsis orphan protein-coding genes

Casey R. Richardson, Qing Jun Luo, Viktoria Gontcharova, Ying Wen Jiang, Manoj Samanta, Eunseog Youn, Christopher D. Rock

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Background: MicroRNAs (miRNAs) and trans-acting small-interfering RNAs (tasi-RNAs) are small (20-22 nt long) RNAs (smRNAs) generated from hairpin secondary structures or antisense transcripts, respectively, that regulate gene expression by Watson-Crick pairing to a target mRNA and altering expression by mechanisms related to RNA interference. The high sequence homology of plant miRNAs to their targets has been the mainstay of miRNA prediction algorithms, which are limited in their predictive power for other kingdoms because miRNA complementarity is less conserved, yet transitive processes (production of antisense smRNAs) are active in eukaryotes. We hypothesize that antisense transcription and associated smRNAs are biomarkers that can be computationally modeled for gene discovery.
Principal Findings: We explored rice (Oryza sativa) sense and antisense gene expression in publicly available whole genome tiling array transcriptome data and sequenced smRNA libraries (as well as C. elegans) and found evidence of transitivity of MIRNA genes similar to that found in Arabidopsis. Statistical analysis of antisense transcript abundances, presence of antisense ESTs, and association with smRNAs suggests that several hundred Arabidopsis 'orphan' hypothetical genes are noncoding RNAs. Consistent with this hypothesis, we found novel Arabidopsis homologues of some MIRNA genes on the antisense strand of previously annotated protein-coding genes. A Support Vector Machine (SVM) was applied using thermodynamic energy of binding plus novel expression features of sense/antisense transcription topology and siRNA abundances to build a prediction model of miRNA targets. The SVM, when trained on targets, could predict the "ancient" (deeply conserved) class of validated Arabidopsis MIRNA genes with an accuracy of 84%, and the "new" rapidly evolving MIRNA genes with an accuracy of 76%.
Conclusions: Antisense and smRNA expression features and computational methods may identify novel MIRNA genes and other non-coding RNAs in plants and potentially other kingdoms, which can provide insight into antisense transcription, miRNA evolution, and post-transcriptional gene regulation.

Original language: English
Article number: e10710
Journal: PLoS ONE
Volume: 5
Issue number: 5
DOIs
State: Published - 2010
