Premium
De novo assembly and SSR loci analysis in Gasterophilus nasalis (Diptera: Oestridae)
Author(s) -
Zhang Tiange,
Zhang Ke,
Zhou Tong,
Zhou Ran,
Ge Yan,
Wang Zhenbiao,
Shao Huimin,
Zhang Dong,
Li Kai
Publication year - 2021
Publication title -
entomological research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.421
H-Index - 20
eISSN - 1748-5967
pISSN - 1738-2297
DOI - 10.1111/1748-5967.12505
Subject(s) - biology , sequence assembly , genetics , obligate , lucilia cuprina , gene , contig , kegg , homology (biology) , computational biology , genome , transcriptome , larva , botany , calliphoridae , gene expression
Gasterophilus nasalis , an obligate parasite of equids, is distributed worldwide and causes severe damage when abundant in the host's digestive tract. However, detailed genomic information is lacking for this pathogen. In this study, we generated 27.47 Gb of clean data on G. nasalis using the Illumina Hiseq™ 4000 sequencing platform. De novo analysis of sequencing reads revealed 37,289 unigenes obtained from G. nasalis , with an average length of 1,335 bp and N50 length of 1,808 bp. All unigenes were searched against the non‐redundant (NR) databases using BlastX and were primarily annotated to the NCBI nr database (13 396, 35.92%), exhibiting highest homology with sequences from Lucilia cuprina . Further, a total of 9255 unigenes were annotated to three functional categories and 54 subclasses under the Gene Ontology (GO) database. Meanwhile, 6817 unigenes were assigned to 43 biochemical pathways under KEGG database, of which the signal transduction pathway was the most enriched. Moreover, 28 250 simple sequence repeats were found to be located in 16 538 unigenes, and A/T was the dominant repeat in 275 repeated motif types. The results of this study will lay a theoretical foundation for the development of biological control strategies for G. nasalis .