z-logo
Premium
De novo genome assembly of the stress tolerant forest species Casuarina equisetifolia provides insight into secondary growth
Author(s) -
Ye Gongfu,
Zhang Hangxiao,
Chen Bihua,
Nie Sen,
Liu Hai,
Gao Wei,
Wang Huiyuan,
Gao Yubang,
Gu Lianfeng
Publication year - 2019
Publication title -
the plant journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.058
H-Index - 269
eISSN - 1365-313X
pISSN - 0960-7412
DOI - 10.1111/tpj.14159
Subject(s) - biology , sequence assembly , genome , gene , casuarina equisetifolia , whole genome sequencing , genome size , genetics , illumina dye sequencing , reference genome , gc content , dna sequencing , single molecule real time sequencing , computational biology , transcriptome , botany , gene expression , dna sequencer
Summary Casuarina equisetifolia ( C. equisetifolia ), a conifer‐like angiosperm with resistance to typhoon and stress tolerance, is mainly cultivated in the coastal areas of Australasia. C. equisetifolia , making it a valuable model to study secondary growth associated genes and stress‐tolerance traits. However, the genome sequence is unavailable and therefore wood‐associated growth rate and stress resistance at the molecular level is largely unexplored. We therefore constructed a high‐quality draft genome sequence of C. equisetifolia by a combination of Illumina second‐generation sequencing reads and Pacific Biosciences single‐molecule real‐time (SMRT) long reads to advance the investigation of this species. Here, we report the genome assembly, which contains approximately 300 megabases (Mb) and scaffold size of N50 is 1.06 Mb. Additionally, gene annotation, assisted by a combination of prediction and RNA‐seq data, generated 29 827 annotated protein‐coding genes and 1983 non‐coding genes, respectively. Furthermore, we found that the total number of repetitive sequences account for one‐third of the genome assembly. Here we also construct the genome‐wide map of DNA modification, such as two novel forms N 6 ‐adenine (6mA) and N4‐methylcytosine (4mC) at the level of single‐nucleotide resolution using single‐molecule real‐time (SMRT) sequencing. Interestingly, we found that 17% of 6mA modification genes and 15% of 4mC modification genes also included alternative splicing events. Finally, we investigated cellulose, hemicellulose, and lignin‐related genes, which were associated with secondary growth and contained different DNA modifications. The high‐quality genome sequence and annotation of C. equisetifolia in this study provide a valuable resource to strengthen our understanding of the diverse traits of trees.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here