网站地图 | 联系我们 | English | 意见反馈 | 主任信箱
 
首页 中心概况 新闻动态 科研进展 交流合作 人才培养 研究队伍 人才招聘 政策规章 数学交叉科学传播
科研进展
科研成果
研究专题
获奖
现在位置:首页 > 科研进展 > 科研成果
异质型单细胞多模态数据整合的流形对齐算法
【打印】【关闭】

  2022-4-29

 

Motivation: Single-cell multi-omics sequencing data can provide a comprehensive molecular view of cells. However, effective approaches for the integrative analysis of such data are challenging. Existing manifold alignment methods demonstrated the state-of-the-art performance on single-cell multi-omics data integration, but they are often limited by requiring that single-cell datasets be derived from the same underlying cellular structure. 

  Results: In this study, we present Pamona, a partial Gromov-Wasserstein distance-based manifold alignment frame- work that integrates heterogeneous single-cell multi-omics datasets with the aim of delineating and representing the shared and dataset-specific cellular structures across modalities. We formulate this task as a partial manifold align- ment problem and develop a partial Gromov-Wasserstein optimal transport framework to solve it. Pamona identifies both shared and dataset-specific cells based on the computed probabilistic couplings of cells across datasets, and it aligns cellular modalities in a common low-dimensional space, while simultaneously preserving both shared and dataset-specific structures. Our framework can easily incorporate prior information, such as cell type annotations or cell-cell correspondence, to further improve alignment quality. We evaluated Pamona on a comprehensive set of publicly available benchmark datasets. We demonstrated that Pamona can accurately identify shared and dataset- specific cells, as well as faithfully recover and align cellular structures of heterogeneous single-cell modalities in a common space, outperforming the comparable existing methods. 

 

 

  Availability and implementation: Pamona software is available at https://github.com/caokai1073/Pamona. 

  Contact: lwan@amss.ac.cn 

  Supplementary information: Supplementary data are available at Bioinformatics online.    

 

  Publication: 

  Bioinformatics, Volume 38, Issue 1, 1 January 2022, Pages 211–219.

    

  Author:  

  Kai Cao  

  LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences 

  School of Mathematical Sciences, University of Chinese Academy of Sciences 

 

  Yiguang Hong  

  LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences 

  Department of Control Science and Engineering, Tongji University     

 

  Lin Wan  

  LSC, NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences 

  School of Mathematical Sciences, University of Chinese Academy of Sciences 

 

欢迎访问国家数学与交叉科学中心 
地址:北京海淀区中关村东路55号 邮编:100190 电话: 86-10-62613242 Fax: 86-10-62616840 邮箱: ncmis@amss.ac.cn