Immunogenetics
Benjamin D. Solomon, MD, PhD
Pediatric Resident Physician
Stanford University
Palo Alto, California, United States
Hong Zheng, PhD
Research engineer
Stanford University
Palo Alto, California, United States
Laura W. Dillon, PhD
National Institutes of Health
Bethesda, Maryland, United States
Jason D. Goldman, MD, MPH
Physician
Swedish Foundation
Seattle, Washington, United States
Christopher S. Hourigan, MD, DPhil
Principal Investigator
National Institutes of Health
Bethesda, Maryland, United States
James Heath, PhD
President and Professor
Institute for Systems Biology
Seattle, Washington, United States
Purvesh Khatri, PhD
Associate Professor (Research), Medicine
Stanford University
Stanford, California, United States
The human leukocyte antigen (HLA) locus plays a central role in adaptive immune function and has significant clinical implications for tissue transplant compatibility and allelic disease associations. Studies using bulk-cell RNA sequencing have demonstrated that HLA transcription may be regulated in an allele-specific manner and single-cell RNA sequencing (scRNA-seq) has the potential to better characterize these expression patterns. However, quantification of allele-specific expression (ASE) for HLA loci requires sample-specific reference genotyping due to extensive polymorphism. While genotype prediction from bulk RNA sequencing is well described, the feasibility of predicting HLA genotypes directly from single-cell data is unknown. Here we evaluate and expand upon several computational HLA genotyping tools by comparing predictions from human single-cell data to gold-standard, molecular genotyping. The highest 2-field accuracy averaged across all loci was 76% by arcasHLA and increased to 86% using a composite model of multiple genotyping tools. We also developed a highly accurate model (AUC 0.93) for predicting HLA-DRB345 copy number in order to improve genotyping accuracy of the HLA-DRB locus. Genotyping accuracy improved with read depth and was reproducible at repeat sampling. Using a metanalytic approach, we also show that HLA genotypes from PHLAT and OptiType can generate ASE ratios that are highly correlated (R2 = 0.8 and 0.94, respectively) with those derived from gold-standard genotyping.