Premium
Full‐length nucleotide sequence of ERMAP alleles encoding Scianna (SC) antigens
Author(s) -
Srivastava Kshitij,
Lee Eunah,
Owens Eric,
Rujirojindakul Pairaya,
Flegel Willy A.
Publication year - 2016
Publication title -
transfusion
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.045
H-Index - 132
eISSN - 1537-2995
pISSN - 0041-1132
DOI - 10.1111/trf.13801
Subject(s) - genetics , sequence (biology) , allele , nucleotide , biology , antigen , nucleic acid sequence , microbiology and biotechnology , gene
BACKGROUND Scianna (SC) blood group system comprises two antithetical antigens, Sc1 and Sc2, and five additional antigens. The antigens reside on a glycoprotein encoded by the erythroblast membrane–associated protein ( ERMAP ) gene. For the common ERMAP alleles, we determined the full‐length nucleotide sequence that encodes the Scianna glycoprotein. STUDY DESIGN AND METHODS Blood donor samples from five populations were analyzed including 20 African Americans, 10 Caucasians, 10 Thai, five Asians, and five Hispanics for a total of 100 chromosomes. An assay was devised to determine the genomic sequence of the ERMAP gene in one amplicon, spanning 21.4 kb and covering Exons 2 to 12 and the intervening sequence (IVS). All alleles (confirmed haplotypes) were resolved without ambiguity. RESULTS Among 50 blood donors, we found 80 single‐nucleotide polymorphisms (SNPs), including six novel SNPs, in 21,308 nucleotides covering the coding sequence of the ERMAP gene and including the introns. The noncoding sequences harbored 75 SNPs (68 in the introns and seven in the 3′‐UTR). No SNP indicative of a nonfunctional allele was detected. The nucleotide sequences for 48 ERMAP alleles (confirmed haplotypes) were determined by allele‐specific polymerase chain reaction and sequencing in 100 chromosomes. CONCLUSIONS We documented 48 ERMAP alleles of 21,308 nucleotides each. The two nucleotide sequences available in GenBank for ERMAP alleles of similar length have not been found in our 100 chromosomes. Alleles determined without ambiguity can be used as templates to analyze next generation sequencing data, which will enhance the reliability in clinical diagnostics.