Dr Liu Boxiang - Department of Biomedical Informatics

LIU, Boxiang

Assistant Professor


Brief Introduction

Genome-wide association studies (GWAS) have identified tens of thousands of genetic variants associated with human diseases. Because the majority of GWAS variants fall in non-coding regions of the genome, their functional mechanisms are usually unclear. Our group uses multi-omics methods to study complex diseases. Using a combination of transcriptomic and epigenomic information, we have identified risk genes for coronary artery disease [Liu (2018) AJHG; Wirka (2019) Nature Medicine] and age-related macular degeneration [Liu (2019) Communications Biology]. Our group has a strong interest in expression quantitative trait loci (eQTL) mapping and was part of the Genotype-Tissue Expression (GTEx) project [GTEx consortium (2017) Nature]. Our group is currently part of the Asian Immune Diversity Atlas (AIDA).

Novel scientific questions and data modalities require computational methods beyond existing ones. Our group develops statistical methods, machine learning models (especially deep learning models), and visualization techniques to fill these gaps. The computational techniques developed by our group are rooted in biological questions, but often borrow ideas from other domains such as natural language processing and computer vision. For the [Liu (2018) AJHG] paper, we developed fast software to approximate the distribution of a sum of independent non-identical binomial random variables [Liu and Quertermous (2017), R Journal]. Combining microfluidic multiplex PCR and ancestry inference techniques, we developed the ANTseq pipeline, which reduces the cost of ancestry determination 5-fold [Liu (2016)].
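To make the binomial-sum problem concrete, the sketch below computes the exact distribution of a sum of independent binomials with different parameters by repeated convolution, and compares it with a simple moment-matched normal approximation. This is only an illustration of the problem, not the method of the cited R Journal paper; the function names and the example values of `ns` and `ps` are hypothetical.

```python
import math

def binom_pmf(n, p):
    """PMF of Binomial(n, p) as a list indexed by k = 0..n."""
    return [math.comb(n, k) * p**k * (1 - p)**(n - k) for k in range(n + 1)]

def convolve(a, b):
    """Discrete convolution: PMF of the sum of two independent variables."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

def sum_binom_pmf(ns, ps):
    """Exact PMF of S = sum_i Binomial(n_i, p_i), assuming independence."""
    pmf = [1.0]  # point mass at 0 before any term is added
    for n, p in zip(ns, ps):
        pmf = convolve(pmf, binom_pmf(n, p))
    return pmf

def normal_cdf_approx(s, ns, ps):
    """Normal approximation to P(S <= s), matching mean and variance,
    with a continuity correction of 0.5."""
    mean = sum(n * p for n, p in zip(ns, ps))
    var = sum(n * p * (1 - p) for n, p in zip(ns, ps))
    z = (s + 0.5 - mean) / math.sqrt(var)
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))

# Hypothetical example: three binomials with different n and p.
ns = [10, 20, 30]
ps = [0.1, 0.5, 0.8]

pmf = sum_binom_pmf(ns, ps)
exact_cdf = sum(pmf[:36])                   # exact P(S <= 35)
approx_cdf = normal_cdf_approx(35, ns, ps)  # approximate P(S <= 35)
print(f"exact={exact_cdf:.4f} approx={approx_cdf:.4f}")
```

Exact convolution costs time quadratic in the total number of trials, which is one reason fast approximations are attractive when many such tail probabilities are needed.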

Our group has a strong interest in deep learning. We developed a deep learning architecture to jointly model the cis- and trans-regulators of gene expression. Our method outperformed the previous state of the art by as much as 20% [Liu (2017), NeurIPS]. Our group is also interested in applying deep learning to biomedical natural language processing. We developed ParaMed, the first biomedical English-Chinese machine translation dataset [Liu (2021) BMC Medical Informatics]. A state-of-the-art transformer model trained on this dataset outperformed the baseline by 24 BLEU points (a 2-fold improvement). We also showed that deep learning models underperform traditional rule-based methods in certain domains [Church and Liu (2021) Frontiers in Artificial Intelligence].

Journals & Publications

Highlight Publications (¹=joint first authors, *=corresponding author)

Boxiang Liu¹*, Michael J. Gloudemans¹, Abhiram S. Rao, Erik Ingelsson, and Stephen B. Montgomery*. (2019). Abundant associations with gene expression complicate GWAS follow-up. Nature Genetics. (IF = 38.33) [paper]
Boxiang Liu, Milos Pjanic, Ting Wang, Trieu Nguyen, Michael Gloudemans, Abhiram Rao, Victor G. Castano, Sylvia Nurnberg, Daniel J Rader, Susannah Elwyn, Erik Ingelsson, Stephen B Montgomery, Clint L Miller, Thomas Quertermous. (2018). Genetic regulatory mechanisms of smooth muscle cells map to coronary artery disease risk loci. American Journal of Human Genetics. (IF = 11.025) [paper]
Boxiang Liu¹, Melissa A. Calton¹, Nathan S. Abell, Gillie Benchorin, Michael J. Gloudemans, Ming Chen, Jane Hu, Xin Li, Brunilda Balliu, Dean Bok, Stephen B. Montgomery, Douglas Vollrath. (2019). Genetic analyses of human fetal retinal pigment epithelium gene expression suggest ocular disease mechanisms. Communications Biology. (IF = 6.268) [paper]
Yingmei Li¹, Boxiang Liu¹, Ian David Connolly¹, Bina Wasunga Kakusa, Wenying Pan, Seema Nagpal, Stephen B. Montgomery, and Melanie Hayden Gephart. (2018). Recurrently mutated genes differ between leptomeningeal and solid lung cancer brain metastases. Journal of Thoracic Oncology. (IF = 15.609) [paper]
Boxiang Liu¹*, Kaibo Liu¹, He Zhang, Liang Zhang, Yuchen Bian, and Liang Huang. (2020). CoV-Seq, a new tool for SARS-CoV-2 genome analysis and visualization: Development and usability study. Journal of Medical Internet Research. (IF = 5.43) [paper]
Boxiang Liu*, and Stephen B. Montgomery*. (2020). Identifying causal variants and genes using functional genomics in specialized cell types and contexts. Human Genetics. (IF = 5.331) [paper]
Boxiang Liu¹*, Yanjun Li¹, Liang Zhang. (2022). Analysis and Visualization of Spatial Transcriptomic Data. Frontiers in Genetics. (IF = 4.6) [paper]
Boxiang Liu*, and Thomas Quertermous. (2017). Approximating the sum of independent non-identical binomial random variables. R Journal (IF = 984) [paper]
Boxiang Liu*, Liang Huang. (2021). ParaMed: A Parallel Corpus for English-Chinese Translation in the Biomedical Domain. BMC Medical Informatics and Decision Making. (IF = 3.394) [paper]
Badsha Bahadur¹, Rui Li¹, Boxiang Liu¹, Yang I. Li, Min Xian, Nicholas E. Banovich, and Audrey Qiuyan Fu. (2020). Imputation of single-cell gene expression with an autoencoder neural network. Quantitative Biology. (IF = 1.161) [paper]
Boxiang Liu, Nadine Hussami, Avanti Shrikumar, Tyler Shimko, Salil Bhate, Scott Longwell, Stephen Montgomery, and Anshul Kundaje. (2017). A multi-modal neural network for learning cis and trans regulation of stress response in yeast. NeurIPS Machine Learning in Computational Biology Workshop. [paper] Note: Computer Science venues do not have IF.

Google Scholar

Personal Website

Curriculum Vitae