Tianxi Cai, ScD
Professor of Biomedical Informatics, Harvard Medical School
John Rock Professor of Population and Translational Data Sciences, Harvard T.H. Chan School of Public Health
Director, Translational Data Science Center for a Learning Health System (CELEHS)
10 Shattuck Street, Room 434, Boston, MA 02115

Tianxi Cai is a leader in developing analytical tools for mining electronic health record (EHR) data and for predictive modeling with biomedical data. She provides statistical leadership on several large-scale projects, including the NIH-funded Undiagnosed Diseases Network at DBMI. Her research lab develops novel statistical and machine learning methods for areas including clinical trials, real-world evidence, and personalized medicine using genomic and phenomic data. Cai received her ScD in Biostatistics from Harvard and was an assistant professor at the University of Washington before returning to Harvard as a faculty member in 2002.

DBMI Research Areas
Using a surrogate marker for early testing of a treatment effect.
Authors: Parast L, Cai T, Tian L.
Decreased Total Iron Binding Capacity May Correlate with Ruptured Intracranial Aneurysms.
Authors: Can A, Lai PMR, Castro VM, Yu S, Dligach D, Finan S, Gainer V, Shadick NA, Savova G, Murphy S, Cai T, Weiss ST, Du R.
Sci Rep
Assessing the value of a censored surrogate outcome.
Authors: Parast L, Tian L, Cai T.
Lifetime Data Anal
Semi-supervised estimation of covariance with application to phenome-wide association studies with electronic medical records data.
Authors: Chan SF, Hejblum BP, Chakrabortty A, Cai T.
Stat Methods Med Res
Associations of antepartum suicidal behaviour with adverse infant and obstetric outcomes.
Authors: Zhong QY, Gelaye B, Karlson EW, Avillach P, Smoller JW, Cai T, Williams MA.
Paediatr Perinat Epidemiol
Automated feature selection of predictors in electronic medical records data.
Authors: Gronsbell J, Minnier J, Yu S, Liao K, Cai T.
Feature Extraction for Phenotyping from Semantic and Knowledge Resources.
Authors: Ning W, Chan S, Beam A, Yu M, Geva A, Liao K, Mullen M, Mandl KD, Kohane I, Cai T, Yu S.
J Biomed Inform
Variance components genetic association test for zero-inflated count outcomes.
Authors: Goodman MO, Chibnik L, Cai T.
Genet Epidemiol
Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior: towards a solution to the complex classification problem.
Authors: Zhong QY, Mittal LP, Nathan MD, Brown KM, Knudson González D, Cai T, Finan S, Gelaye B, Avillach P, Smoller JW, Karlson EW, Cai T, Williams MA.
Eur J Epidemiol
Probabilistic record linkage of de-identified research datasets with discrepancies using diagnosis codes.
Authors: Hejblum BP, Weber GM, Liao KP, Palmer NP, Churchill S, Shadick NA, Szolovits P, Murphy SN, Kohane IS, Cai T.
Sci Data