Xiaoke Robot

Study presents WENGAN, an efficient tool for de novo genome assembly
2020-12-15 16:18

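A research team led by Marie-France Sagot and Alex Di Genova at the University of Lyon, France, has reported an advance in genome assembly. Their latest study uses WENGAN for efficient de novo assembly of human genomes. The work was published in Nature Biotechnology on 14 December 2020.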

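The researchers developed WENGAN, an algorithm for hybrid assembly that delivers very high quality at low computational cost. They demonstrated de novo assembly of four human genomes using combinations of sequencing data generated with ONT PromethION, PacBio Sequel, Illumina and MGI technologies. WENGAN implements efficient algorithms to improve both assembly contiguity and consensus quality. The resulting assemblies show high contiguity (contig NG50: 17.24–80.64 Mb), few assembly errors (contig NGA50: 11.8–59.59 Mb), good consensus quality (QV: 27.84–42.88) and high gene completeness (BUSCO complete: 94.6–95.2%), while consuming few computational resources (CPU hours: 187–1,200).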
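To make the consensus-quality figures easier to read, here is a minimal sketch that converts a QV into a per-base error rate. It assumes the reported QV is Phred-scaled (QV = -10 · log10 of the error probability), which is the usual convention but is not spelled out in this summary. The function names are illustrative and do not come from WENGAN.

```python
import math


def qv_to_error_rate(qv: float) -> float:
    """Convert a Phred-scaled quality value to a per-base error probability.

    Assumption: QV = -10 * log10(p), the standard Phred definition.
    """
    return 10 ** (-qv / 10)


def error_rate_to_qv(p: float) -> float:
    """Inverse conversion: per-base error probability to Phred-scaled QV."""
    return -10 * math.log10(p)


# The two ends of the QV range quoted in the article.
for qv in (27.84, 42.88):
    p = qv_to_error_rate(qv)
    print(f"QV {qv}: about one error per {1 / p:,.0f} bases")
```

Under that assumption, the quoted range runs from roughly one error per 600 bases to roughly one error per 19,000 bases.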

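In particular, the WENGAN assembly of the haploid CHM13 sample reached a contig NG50 of 80.64 Mb (NGA50: 59.59 Mb), surpassing the contiguity of the current human reference genome (GRCh38 contig NG50: 57.88 Mb).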
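For readers unfamiliar with the contiguity metric, the sketch below shows how a contig NG50 is commonly computed. Contigs are sorted from longest to shortest, and NG50 is the length of the contig at which the running total first reaches half of the estimated genome size. The contig lengths are toy values chosen for illustration, not data from the paper, and the function is not WENGAN code.

```python
from typing import Iterable, Optional


def ng50(contig_lengths: Iterable[int], genome_size: int) -> Optional[int]:
    """Return the contig NG50, or None if the contigs cover < 50% of the genome.

    NG50 uses the estimated genome size as the denominator; passing the total
    assembly length instead gives the ordinary N50.
    """
    target = genome_size / 2
    covered = 0
    for length in sorted(contig_lengths, reverse=True):
        covered += length
        if covered >= target:
            return length
    return None


# Hypothetical contig lengths (arbitrary units), for illustration only.
toy_contigs = [80, 60, 40, 30, 20, 10]

print(ng50(toy_contigs, genome_size=300))               # NG50 against a genome of size 300
print(ng50(toy_contigs, genome_size=sum(toy_contigs)))  # N50 against the assembly size
```

NGA50, as usually defined (for example in QUAST), applies the same calculation to aligned blocks after contigs are split at misassemblies, which is why it is read as an error-aware contiguity measure.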

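By way of background, generating accurate assemblies of large, repeat-rich human genomes has proved difficult using only error-prone long reads, and most human genomes assembled from long reads add accurate short reads to polish the consensus sequence.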

Appendix: original English text

Title: Efficient hybrid de novo assembly of human genomes with WENGAN

Author: Alex Di Genova, Elena Buena-Atienza, Stephan Ossowski, Marie-France Sagot

Issue&Volume: 2020-12-14

Abstract: Generating accurate genome assemblies of large, repeat-rich human genomes has proved difficult using only long, error-prone reads, and most human genomes assembled from long reads add accurate short reads to polish the consensus sequence. Here we report an algorithm for hybrid assembly, WENGAN, that provides very high quality at low computational cost. We demonstrate de novo assembly of four human genomes using a combination of sequencing data generated on ONT PromethION, PacBio Sequel, Illumina and MGI technology. WENGAN implements efficient algorithms to improve assembly contiguity as well as consensus quality. The resulting genome assemblies have high contiguity (contig NG50: 17.24–80.64Mb), few assembly errors (contig NGA50: 11.8–59.59Mb), good consensus quality (QV: 27.84–42.88) and high gene completeness (BUSCO complete: 94.6–95.2%), while consuming low computational resources (CPU hours: 187–1,200). In particular, the WENGAN assembly of the haploid CHM13 sample achieved a contig NG50 of 80.64Mb (NGA50: 59.59Mb), which surpasses the contiguity of the current human reference genome (GRCh38 contig NG50: 57.88Mb).

DOI: 10.1038/s41587-020-00747-w

Source: https://www.nature.com/articles/s41587-020-00747-w

 

Nature Biotechnology: launched in 1996 and published by Springer Nature. Most recent impact factor: 68.164
Official website: https://www.nature.com/nbt/
Submission link: https://mts-nbt.nature.com/cgi-bin/main.plex


This article: Nature Biotechnology, published online
