z-logo
open-access-imgOpen Access
Ongoing global and regional adaptive evolution of SARS-CoV-2
Author(s) -
Nash D. Rochman,
Yuri I. Wolf,
Guilhem Faure,
Pascal Mutz,
Feng Zhang,
Eugene V. Koonin
Publication year - 2021
Publication title -
proceedings of the national academy of sciences of the united states of america
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 5.011
H-Index - 771
eISSN - 1091-6490
pISSN - 0027-8424
DOI - 10.1073/pnas.2104241118
Subject(s) - biology , phylogenetic tree , evolutionary biology , pandemic , phylogenetics , negative selection , coronavirus , genetics , genetic diversity , epistasis , genome , computational biology , covid-19 , gene , demography , population , medicine , disease , pathology , sociology , infectious disease (medical specialty)
Understanding the trends in severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) evolution is paramount to control the COVID-19 pandemic. We analyzed more than 300,000 high-quality genome sequences of SARS-CoV-2 variants available as of January 2021. The results show that the ongoing evolution of SARS-CoV-2 during the pandemic is characterized primarily by purifying selection, but a small set of sites appear to evolve under positive selection. The receptor-binding domain of the spike protein and the region of the nucleocapsid protein associated with nuclear localization signals (NLS) are enriched with positively selected amino acid replacements. These replacements form a strongly connected network of apparent epistatic interactions and are signatures of major partitions in the SARS-CoV-2 phylogeny. Virus diversity within each geographic region has been steadily growing for the entirety of the pandemic, but analysis of the phylogenetic distances between pairs of regions reveals four distinct periods based on global partitioning of the tree and the emergence of key mutations. The initial period of rapid diversification into region-specific phylogenies that ended in February 2020 was followed by a major extinction event and global homogenization concomitant with the spread of D614G in the spike protein, ending in March 2020. The NLS-associated variants across multiple partitions rose to global prominence in March to July, during a period of stasis in terms of interregional diversity. Finally, beginning in July 2020, multiple mutations, some of which have since been demonstrated to enable antibody evasion, began to emerge associated with ongoing regional diversification, which might be indicative of speciation.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here