Academics

High-dimensional IV regression for genetical genomics data incorporating network structures

Time:Mon., 11:00 -12:00 am, June 26, 2023

Venue:Conference Room 3 Jin Chun Yuan West Bldg.;Zoom ID: 271 534 5558; PW: YMSC

Speaker:Yuehua Cui 崔跃华 Michigan State University

Abstract

Genetical genomics data present promising opportunities for integrating gene expression and genotype information. Lin et al. (2015) proposed an instrumental variables (IV) regression framework to select important genes with high-dimensional genetical genomics data. The IV regression addresses the issue of endogeneity caused by potential correlations between gene expressions and error terms, thereby improving gene selection performance. Knowing that genes function in networks to fulfill their joint task, incorporating network structures into a regression model can further enhance gene selection performance. In this presentation, I will introduce a graph-constrained penalized IV regression framework for high-dimensional genetical genomic data, aiming to improve gene selection performance by incorporating gene network structures. We propose a two-step estimation procedure that adopts a network-constrained regularization method and establishes selection consistency. Furthermore, considering that gene expressions are time-dependent, we extend the framework to allow for the effect of gene expressions to vary over time within a varying-coefficients IV regression framework. We demonstrate the utility of our method through simulations and real data analysis.

This is a joint work with Bin Gao, Jialin Qu, Xu Liu and Hongzhe Li.


Yuehua Cui 崔跃华

Michigan State University

崔跃华教授目前为美国密西根州立大学统计与概率系教授,研究生部主任,美国统计协会ASA fellow,国际统计学院ISI elected member。担任美国和中国国家自然科学基金评审专家,并担任多家国际学术期刊的副主编和编委,如BMC Genomic Data,Statistics and Probability letters和Computational and Structural Biotechnology Journal等。其主要从事统计遗传和基因组学的方法学研究,发表论文一百余篇,研究获得美国NSF和NIN的资助。

DATEJune 26, 2023
SHARE
Related News
    • 0

      Covariate-shift Robust Adaptive Transfer Learning for High-Dimensional Regression

      AbstractThe main challenge that sets transfer learning apart from traditional supervised learning is the distribution shift, reflected as the shift between the source and target models and that between the marginal covariate distributions. High-dimensional data introduces unique challenges, such as covariate shifts in the covariate correlation structure and model shifts across individual featur...

    • 1

      Data analysis: A shift from linear regression to network modeling

      Speaker IntroRongling Wu, received a Ph.D. in Quantitative Genetics from the University of Washington (Seattle) in 1995. He was a Distinguished Professor of Statistics and Public Health Sciences at Pennsylvania State University, and Director of the Center for Statistical Genetics. He is currently the Zeng Siming Chair Professor of Yau Mathematical Sciences Center, Tsinghua University. He is als...