小柯机器人

基于在线学习的迭代单细胞多组学集成
2021-04-21 16:53

美国密歇根大学Joshua D. Welch团队使用在线学习来迭代单细胞多组学集成。相关论文发表在2021年4月19日出版的《自然-生物技术》杂志上。

他们描述了在线集成非负矩阵分解(iNMF),这是一种用于集成大型、多样且不断达到单细胞数据集的算法。他们的方法使用固定内存扩展到任意数量的单元,并在生成新数据集时迭代地合并这些数据集,并允许许多用户通过在互联网上流式传输大型数据集的单个副本,同时进行分析。迭代数据添加还可用于将新数据映射到参考数据集。与先前方法的比较表明,效率的提高不会牺牲数据集的对齐方式和聚类保留性能。

他们通过在标准笔记本电脑上整合超过一百万个细胞,整合大型单细胞RNA测序和空间转录组数据集,并迭代构建小鼠运动皮层的单细胞多组图谱,证明了在线iNMF的有效性。

据悉,整合大型单细胞基因表达、染色质可及性和DNA甲基化数据集需要通用且可扩展的计算方法。

附:英文原文

Title: Iterative single-cell multi-omic integration using online learning

Author: Chao Gao, Jialin Liu, April R. Kriebel, Sebastian Preissl, Chongyuan Luo, Rosa Castanon, Justin Sandoval, Angeline Rivkin, Joseph R. Nery, Margarita M. Behrens, Joseph R. Ecker, Bing Ren, Joshua D. Welch

Issue&Volume: 2021-04-19

Abstract: Integrating large single-cell gene expression, chromatin accessibility and DNA methylation datasets requires general and scalable computational approaches. Here we describe online integrative non-negative matrix factorization (iNMF), an algorithm for integrating large, diverse and continually arriving single-cell datasets. Our approach scales to arbitrarily large numbers of cells using fixed memory, iteratively incorporates new datasets as they are generated and allows many users to simultaneously analyze a single copy of a large dataset by streaming it over the internet. Iterative data addition can also be used to map new data to a reference dataset. Comparisons with previous methods indicate that the improvements in efficiency do not sacrifice dataset alignment and cluster preservation performance. We demonstrate the effectiveness of online iNMF by integrating more than 1 million cells on a standard laptop, integrating large single-cell RNA sequencing and spatial transcriptomic datasets, and iteratively constructing a single-cell multi-omic atlas of the mouse motor cortex. A new algorithm enables scalable and iterative integration of single-cell datasets.

DOI: 10.1038/s41587-021-00867-x

Source: https://www.nature.com/articles/s41587-021-00867-x

Nature Biotechnology:《自然—生物技术》,创刊于1996年。隶属于施普林格·自然出版集团,最新IF:68.164
官方网址:https://www.nature.com/nbt/
投稿链接:https://mts-nbt.nature.com/cgi-bin/main.plex


本期文章:《自然—生物技术》:Online/在线发表

分享到:

0