小柯机器人

科学家开发出评估病毒基因组质量和完整性的新工具
2020-12-22 22:27

美国劳伦斯伯克利国家实验室Nikos C. Kyrpides、Stephen Nayfach等研究人员合作开发出评估病毒基因组质量和完整性的新工具。相关论文于2020年12月21日在线发表在《自然—生物技术》杂志上。

研究人员报道了CheckV,这是一种用于识别封闭的病毒基因组、估计基因组片段的完整性并从整合的原病毒中去除侧翼宿主区域的自动化方法。CheckV通过将序列与完整病毒基因组的大型数据库进行比较来评估完整性,该数据库包括从对公开可获得的元基因组、元转录组和元病毒组的系统搜索中识别出的76262个病毒基因组。

在对模拟数据集进行验证并与现有方法进行比较之后,研究人员将CheckV应用到了由元基因组组装的病毒序列的各种庞大集合中,包括IMG/VR和全球海洋病毒组。这项研究揭示了44,652个高质量的病毒基因组(>90%完整),尽管绝大多数序列是小片段,这突出了从短读本元基因组中组装病毒基因组的挑战。此外,研究人员发现去除宿主污染大大改善了辅助代谢基因的准确识别和对病毒编码功能的解释。

据介绍,数百万个新的病毒序列已经从元基因组中鉴定出,但是这些序列的质量和完整性差异很大。

附:英文原文

Title: CheckV assesses the quality and completeness of metagenome-assembled viral genomes

Author: Stephen Nayfach, Antonio Pedro Camargo, Frederik Schulz, Emiley Eloe-Fadrosh, Simon Roux, Nikos C. Kyrpides

Issue&Volume: 2020-12-21

Abstract: Millions of new viral sequences have been identified from metagenomes, but the quality and completeness of these sequences vary considerably. Here we present CheckV, an automated pipeline for identifying closed viral genomes, estimating the completeness of genome fragments and removing flanking host regions from integrated proviruses. CheckV estimates completeness by comparing sequences with a large database of complete viral genomes, including 76,262 identified from a systematic search of publicly available metagenomes, metatranscriptomes and metaviromes. After validation on mock datasets and comparison to existing methods, we applied CheckV to large and diverse collections of metagenome-assembled viral sequences, including IMG/VR and the Global Ocean Virome. This revealed 44,652high-quality viral genomes (that is, >90% complete), although the vast majority of sequences were small fragments, which highlights the challenge of assembling viral genomes from short-read metagenomes. Additionally, we found that removal of host contamination substantially improved the accurate identification of auxiliary metabolic genes and interpretation of viral-encoded functions.

DOI: 10.1038/s41587-020-00774-7

Source: https://www.nature.com/articles/s41587-020-00774-7

Nature Biotechnology:《自然—生物技术》,创刊于1996年。隶属于施普林格·自然出版集团,最新IF:68.164
官方网址:https://www.nature.com/nbt/
投稿链接:https://mts-nbt.nature.com/cgi-bin/main.plex


本期文章:《自然—生物技术》:Online/在线发表

分享到:

0