Director of the Cornell EpiGenomics Core Facility.
Assistant Research Professor in Molecular Biology and Genetics.
Assistant Research Professor in Computational Biology.
458 Biotechnology Building, Cornell, Ithaca, NY, USA 14850

William KM Lai

The primary focus of my research program is to understand how protein-complexes target the genome, how protein binding changes across cell states, and how mis-regulation of these interactions can result in disease-phenotypes. My group combines novel genomic assays with custom bioinformatic software development. We pursue ‘big-data’ projects that generate thousands of unique (epi)genomic datasets across human, mouse, and yeast model systems.

As the Director of the Cornell EpiGenomics Core facility, my group works hand-in-hand with physician scientists at Cornell Weill medical campus to apply our epigenomic technology and algorithmic approaches to biomedical specimens to investigate the fundamental nature of human diseases. Combining wet-bench biochemical genomic technology with analytical bioinformatic algorithmic approaches allows us to understand the fundamental rules of protein-DNA interactions and how these rules are broken in diseased patient samples.

The volume and dimensionality of our data typically requires application of machine learning approaches as well as eXplainable AI algorithms. To support our analysis, we heavily leverage NSF-provided ACCESS resources. Our students are trained to perform advanced informatic analysis across a wide spectrum of heterogeneous and high-performance compute systems.

The greatest biological discoveries are achieved when we can pair biochemical and bioinformatic approaches to build something greater than the sum of its parts.

Publications

View More

2023

  • GenoPipe: identifying the genotype of origin within (epi)genomic datasets.
    Lang O, Srivastava D, Pugh BF, Lai WKM.
    Nucleic Acids Research 2023, Nucleic Acids Research 51 (22), 12054-12068. PMID: 37933851; PMCID: PMC10711449.

  • Joint sequence & chromatin neural networks characterize the differential abilities of Forkhead transcription factors to engage inaccessible chromatin.
    Arora S, Yang J, Akiyama T, James DQ, Morrissey A, Blanda TR, Badjatia N, Lai WKM, Ko MSH, Pugh BF, Mahony S.
    bioArxiv 2023, https://doi.org/10.1101/2023.10.06.561228