z-logo
open-access-imgOpen Access
Multi-Omics Driven Assembly and Annotation of the Sandalwood (Santalum album) Genome
Author(s) -
H. B. Mahesh,
Pratigya Subba,
Jayshree Advani,
Meghana Deepak Shirke,
Ramya Malarini Loganathan,
Shankara Chandana,
S Shilpa,
Oishi Chatterjee,
Sneha M. Pinto,
Thottethodi Subrahmanya Keshava Prasad,
Malali Gowda
Publication year - 2018
Publication title -
plant physiology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.554
H-Index - 312
eISSN - 1532-2548
pISSN - 0032-0889
DOI - 10.1104/pp.17.01764
Subject(s) - biology , genome , proteogenomics , santalum album , proteome , gene , transcriptome , computational biology , genomics , sandalwood , genome project , sequence assembly , proteomics , genetics , dna sequencing , gene expression , botany , history , archaeology
Indian sandalwood ( Santalum album ) is an important tropical evergreen tree known for its fragrant heartwood-derived essential oil and its valuable carving wood. Here, we applied an integrated genomic, transcriptomic, and proteomic approach to assemble and annotate the Indian sandalwood genome. Our genome sequencing resulted in the establishment of a draft map of the smallest genome for any woody tree species to date (221 Mb). The genome annotation predicted 38,119 protein-coding genes and 27.42% repetitive DNA elements. In-depth proteome analysis revealed the identities of 72,325 unique peptides, which confirmed 10,076 of the predicted genes. The addition of transcriptomic and proteogenomic approaches resulted in the identification of 53 novel proteins and 34 gene-correction events that were missed by genomic approaches. Proteogenomic analysis also helped in reassigning 1,348 potential noncoding RNAs as bona fide protein-coding messenger RNAs. Gene expression patterns at the RNA and protein levels indicated that peptide sequencing was useful in capturing proteins encoded by nuclear and organellar genomes alike. Mass spectrometry-based proteomic evidence provided an unbiased approach toward the identification of proteins encoded by organellar genomes. Such proteins are often missed in transcriptome data sets due to the enrichment of only messenger RNAs that contain poly(A) tails. Overall, the use of integrated omic approaches enhanced the quality of the assembly and annotation of this nonmodel plant genome. The availability of genomic, transcriptomic, and proteomic data will enhance genomics-assisted breeding, germplasm characterization, and conservation of sandalwood trees.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom