Heng Li, Ph.D.

Heng Li, PhD

Associate Professor of Biomedical Informatics, Dana-Farber Cancer Institute and Harvard Medical School

Heng Li studies advanced computational algorithms to solve practical biological problems, currently with a focus on sequence alignment, variant calling, de novo assembly, data storage, and information query. He developed and maintains several widely used software packages, such as BWA, samtools, minimap2, and seqtk, for analyzing high-throughput sequencing data. He has also collaborated with multiple research groups and published work on the analysis of single-cell sequence data, chromosome conformation, cancer genomics, population genetics and species evolution.


DBMI Research Areas
Genome sequence of a 45,000-year-old modern human from western Siberia.
Authors: Fu Q, Li H, Moorjani P, Jay F, Slepchenko SM, Bondarev AA, Johnson PL, Aximu-Petri A, Prüfer K, de Filippo C, Meyer M, Zwyns N, Salazar-García DC, Kuzmin YV, Keates SG, Kosintsev PA, Razhev DI, Richards MP, Peristov NV, Lachmann M, Douka K, Higham TF, Slatkin M, Hublin JJ, Reich D, Kelso J, Viola TB, Pääbo S.
Nature
View full abstract on Pubmed
Toward better understanding of artifacts in variant calling from high-coverage samples.
Authors: Li H.
Bioinformatics
View full abstract on Pubmed
Ancient human genomes suggest three ancestral populations for present-day Europeans.
Authors: Lazaridis I, Patterson N, Mittnik A, Renaud G, Mallick S, Kirsanow K, Sudmant PH, Schraiber JG, Castellano S, Lipson M, Berger B, Economou C, Bollongino R, Fu Q, Bos KI, Nordenfelt S, Li H, de Filippo C, Prüfer K, Sawyer S, Posth C, Haak W, Hallgren F, Fornander E, Rohland N, Delsate D, Francken M, Guinet JM, Wahl J, Ayodo G, Babiker HA, Bailliet G, Balanovska E, Balanovsky O, Barrantes R, Bedoya G, Ben-Ami H, Bene J, Berrada F, Bravi CM, Brisighelli F, Busby GB, Cali F, Churnosov M, Cole DE, Corach D, Damba L, van Driem G, Dryomov S, Dugoujon JM, Fedorova SA, Gallego Romero I, Gubina M, Hammer M, Henn BM, Hervig T, Hodoglugil U, Jha AR, Karachanak-Yankova S, Khusainova R, Khusnutdinova E, Kittles R, Kivisild T, Klitz W, Kucinskas V, Kushniarevich A, Laredj L, Litvinov S, Loukidis T, Mahley RW, Melegh B, Metspalu E, Molina J, Mountain J, Näkkäläjärvi K, Nesheva D, Nyambo T, Osipova L, Parik J, Platonov F, Posukh O, Romano V, Rothhammer F, Rudan I, Ruizbakiev R, Sahakyan H, Sajantila A, Salas A, Starikovskaya EB, Tarekegn A, Toncheva D, Turdikulova S, Uktveryte I, Utevska O, Vasquez R, Villena M, Voevoda M, Winkler CA, Yepiskoposyan L, Zalloua P, Zemunik T, Cooper A, Capelli C, Thomas MG, Ruiz-Linares A, Tishkoff SA, Singh L, Thangaraj K, Villems R, Comas D, Sukernik R, Metspalu M, Meyer M, Eichler EE, Burger J, Slatkin M, Pääbo S, Kelso J, Reich D, Krause J.
Nature
View full abstract on Pubmed
The complete genome sequence of a Neanderthal from the Altai Mountains.
Authors: Prüfer K, Racimo F, Patterson N, Jay F, Sankararaman S, Sawyer S, Heinze A, Renaud G, Sudmant PH, de Filippo C, Li H, Mallick S, Dannemann M, Fu Q, Kircher M, Kuhlwilm M, Lachmann M, Meyer M, Ongyerth M, Siebauer M, Theunert C, Tandon A, Moorjani P, Pickrell J, Mullikin JC, Vohr SH, Green RE, Hellmann I, Johnson PL, Blanche H, Cann H, Kitzman JO, Shendure J, Eichler EE, Lein ES, Bakken TE, Golovanova LV, Doronichev VB, Shunkov MV, Derevianko AP, Viola B, Slatkin M, Reich D, Kelso J, Pääbo S.
Nature
View full abstract on Pubmed
Pseudo-Sanger sequencing: massively parallel production of long and near error-free reads using NGS technology.
Authors: Ruan J, Jiang L, Chong Z, Gong Q, Li H, Li C, Tao Y, Zheng C, Zhai W, Turissini D, Cannon CH, Lu X, Wu CI.
BMC Genomics
View full abstract on Pubmed
The anatomy of successful computational biology software.
Authors: Altschul S, Demchak B, Durbin R, Gentleman R, Krzywinski M, Li H, Nekrutenko A, Robinson J, Rasband W, Taylor J, Trapnell C.
Nat Biotechnol
View full abstract on Pubmed
Mapping the human reference genome's missing sequence by three-way admixture in Latino genomes.
Authors: Genovese G, Handsaker RE, Li H, Kenny EE, McCarroll SA.
Am J Hum Genet
View full abstract on Pubmed
Using population admixture to help complete maps of the human genome.
Authors: Genovese G, Handsaker RE, Li H, Altemose N, Lindgren AM, Chambert K, Pasaniuc B, Price AL, Reich D, Morton CC, Pollak MR, Wilson JG, McCarroll SA.
Nat Genet
View full abstract on Pubmed
SOAPindel: efficient identification of indels from short paired reads.
Authors: Li S, Li R, Li H, Lu J, Li Y, Bolund L, Schierup MH, Wang J.
Genome Res
View full abstract on Pubmed
An integrated map of genetic variation from 1,092 human genomes.
Authors: Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE, Kang HM, Marth GT, McVean GA.
Nature
View full abstract on Pubmed