Manifold learning for noisy and high-dimensional datasets: challenges and some solutions

Time: Monday, June 24, 2024, 15:00-16:00

Venue: C548, Shuangqing Complex Building A, Tsinghua University

Organizers: 吴昊, 杨帆, 姜建平, 顾陈琳

Speaker: Xiucai Ding (丁秀才), University of California, Davis

Abstract:

Manifold learning theory has garnered considerable attention in the modeling of expansive biomedical datasets, showcasing its ability to capture data essence more effectively than traditional linear methodologies. Nevertheless, prevalent algorithms are primarily designed for low-dimensional and clean datasets, whereas contemporary biomedical datasets tend to be high-dimensional and noisy. This presentation addresses the adaptation of these algorithms to effectively accommodate the challenges posed by high dimensionality and noise in modern datasets.

Date: June 23, 2024