||
用传统方法在一个大型数据库中查询数百万个DNA序列,可能需要几个小时到数周,莱斯大学的计算机科学家发布兰博(RAMBO,repeated and merged bloom filter)程序,可快速对庞大的DNA数据库进行搜索,比现有方法快35倍。由于DNA测序非常流行,基因组数据集每两年就翻一番,快速搜索数据的工具非常重要。
RAMBO uses a data structure that has a significantly faster query time than state-of-the-art genome indexing methods as well as other advantages like ease of parallelization, a zero false-negative rate and a low false-positive rate.
Gaurav Gupta, Minghao Yan, Benjamin Coleman, Bryce Kille, R. A. Leo Elworth, Tharun Medini, Todd Treangen, Anshumali Shrivastava. Fast Processing and Querying of 170TB of Genomics Data via a Repeated And Merged BloOm Filter (RAMBO). SIGMOD/PODS '21: Proceedings of the 2021 International Conference on Management of Data, June 2021; DOI: http://dx.doi.org/10.1145/3448016.3457333
Archiver|手机版|科学网 ( 京ICP备07017567号-12 )
GMT+8, 2024-4-19 08:46
Powered by ScienceNet.cn
Copyright © 2007- 中国科学报社