


New Transformer-based method accurately predicts DNA methylation from nanopore sequencing
DNA methylation plays an important role in various biological processes, including cell differentiation, aging and cancer development. The most important methylation in mammals is 5-methylcytosine, which occurs primarily in the context of CpG dinucleotides. Sequencing methods such as
Whole-genome bisulfite sequencing can successfully detect 5-methylcytosine DNA modifications. However, they suffer from the serious drawback of short read lengths, which may introduce amplification bias.
Researchers at Singapore A*STAR have developed a deep learning algorithm Rockfish that significantly improves read-level 5-methylcytosine by using Oxford Nanopore Sequencing (ONT) Pyrimidine detection capability.
The study was titled "Rockfish: A transformer-based model for accurate 5-methylcytosine prediction from nanopore sequencing" and was published in "Nature Communications" on July 3, 2024.
The researchers trained the model using high-quality human and mouse datasets and tested it on multiple R9.4.1 and R10.4.1 datasets, including:
- In-house sequenced R9.4.1 H1 embryonic stem cell (H1ESc) native dataset
- R9.4.1 and R10.4.1 neonatal mouse (C57BL/6 neonatal) data
- Some publicly available human cancer and blood datasets
Given that both R9.4.1 and R10.4.1 NA12878 as well as neonatal mouse datasets were used for evaluation, the researchers pointed out the well versions to distinguish them. The remaining data sets were sequenced using only the R9.4.1 well version.
Extensive evaluation of Rockfish models and comparisons with the following tools:
- Megalodon Remora, Megalodon Rerio and Nanopolish for R9.4.1 dataset
- Remora for R10.4.1 dataset
The comparison includes:
- Read-level prediction
- Site-level prediction
- Site-level correlation with WGBS
- Call coverage
- Execution time
- Resource utilization
Illustration: Read -level evaluation. (Source: Paper)
Single base accuracy and F1 metric improved by up to 5 percentage points on the R.9.4.1 dataset and up to 0.82 percentage points on the R10.4.1 dataset.
In addition, Rockfish exhibits high correlation with whole-genome bisulfite sequencing, requires lower read depth, and is computationally efficient with higher confidence in biologically important regions such as CpG-rich promoters. Spend.
Its excellent performance in human and mouse samples highlights its versatility in studying 5-methylcytosine methylation in different organisms and diseases. Finally, its adaptable architecture ensures compatibility with new versions of pores and chemistries and modification types.
Despite this, Rockfish is currently unable to distinguish between 5mC and 5hmC methylation, due to the lack of high-quality control data sets for other types of modifications. There is still room for improvement in computational efficiency of the model, and efficiency is expected to be improved through architecture and engineering optimization in the future.
Rockfish demonstrated the ability to extract methylation information from ONT raw signals, with its small model performing better and taking shorter run times on all datasets, demonstrating the benefits of additional data and knowledge distillation.
5mC modification is related to a variety of biological phenomena, such as transcriptional regulation, disease, aging, etc. Therefore, it is crucial to deeply understand the role of DNA methylation through single-base resolution detection, which may help in the prevention of diseases. Early diagnosis and treatment strategy selection. Rockfish's architecture makes it easily scalable to detect various types of DNA and RNA modifications.
Paper link: https://www.nature.com/articles/s41467-024-49847-0
The above is the detailed content of New Transformer-based method accurately predicts DNA methylation from nanopore sequencing. For more information, please follow other related articles on the PHP Chinese website!

AI Augmenting Food Preparation While still in nascent use, AI systems are being increasingly used in food preparation. AI-driven robots are used in kitchens to automate food preparation tasks, such as flipping burgers, making pizzas, or assembling sa

Introduction Understanding the namespaces, scopes, and behavior of variables in Python functions is crucial for writing efficiently and avoiding runtime errors or exceptions. In this article, we’ll delve into various asp

Introduction Imagine walking through an art gallery, surrounded by vivid paintings and sculptures. Now, what if you could ask each piece a question and get a meaningful answer? You might ask, “What story are you telling?

Continuing the product cadence, this month MediaTek has made a series of announcements, including the new Kompanio Ultra and Dimensity 9400 . These products fill in the more traditional parts of MediaTek’s business, which include chips for smartphone

#1 Google launched Agent2Agent The Story: It’s Monday morning. As an AI-powered recruiter you work smarter, not harder. You log into your company’s dashboard on your phone. It tells you three critical roles have been sourced, vetted, and scheduled fo

I would guess that you must be. We all seem to know that psychobabble consists of assorted chatter that mixes various psychological terminology and often ends up being either incomprehensible or completely nonsensical. All you need to do to spew fo

Only 9.5% of plastics manufactured in 2022 were made from recycled materials, according to a new study published this week. Meanwhile, plastic continues to pile up in landfills–and ecosystems–around the world. But help is on the way. A team of engin

My recent conversation with Andy MacMillan, CEO of leading enterprise analytics platform Alteryx, highlighted this critical yet underappreciated role in the AI revolution. As MacMillan explains, the gap between raw business data and AI-ready informat


Hot AI Tools

Undresser.AI Undress
AI-powered app for creating realistic nude photos

AI Clothes Remover
Online AI tool for removing clothes from photos.

Undress AI Tool
Undress images for free

Clothoff.io
AI clothes remover

AI Hentai Generator
Generate AI Hentai for free.

Hot Article

Hot Tools

SublimeText3 Linux new version
SublimeText3 Linux latest version

mPDF
mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

Atom editor mac version download
The most popular open source editor

DVWA
Damn Vulnerable Web App (DVWA) is a PHP/MySQL web application that is very vulnerable. Its main goals are to be an aid for security professionals to test their skills and tools in a legal environment, to help web developers better understand the process of securing web applications, and to help teachers/students teach/learn in a classroom environment Web application security. The goal of DVWA is to practice some of the most common web vulnerabilities through a simple and straightforward interface, with varying degrees of difficulty. Please note that this software

VSCode Windows 64-bit Download
A free and powerful IDE editor launched by Microsoft