ATOM-1: A Foundation Model for RNA Structure and Function Built on Chemical Mapping Data

2 min read Original article ↗

New Results

, View ORCID ProfileBrandon M. Anderson, View ORCID ProfileBrent Townshend, View ORCID ProfileRyan Chow, View ORCID ProfileConnor J. Stephens, View ORCID ProfileRamya Rangan, View ORCID ProfileMatias Kaplan, View ORCID ProfileMeredith Corley, Akshay Tambe, View ORCID ProfileYuzu Ido, View ORCID ProfileJake Yukich, View ORCID ProfileTabitha Tcheau, View ORCID ProfileAyah Abdeldayem, View ORCID ProfileGabriel Ferns, View ORCID ProfileHarsh Patel, View ORCID ProfileShaon Barman, April Schleck, View ORCID ProfileAdrian L. Sanborn, Stephan Eismann, View ORCID ProfileRaphael J. L. Townshend

doi: https://doi.org/10.1101/2023.12.13.571579

Loading

Abstract

RNA-based medicines and RNA-targeting drugs are emerging as promising new approaches for treating disease. Optimizing these therapeutics by naive experimental screening is a time-consuming and expensive process, while rational design requires an accurate understanding of the structure and function of RNA. To address this design challenge, we present ATOM-1, the first RNA foundation model trained on chemical mapping data, enabled by data collection strategies purposely developed for machine learning training. Using small probe neural networks on top of ATOM-1 embeddings, we demonstrate that this model has developed rich internal representations of RNA. Trained on limited amounts of additional data, these small networks achieve state-of-the-art accuracy on key RNA prediction tasks, suggesting that this approach can enable the design of therapies across the RNA landscape.

Competing Interest Statement

All authors are current or former employees of Atomic AI. There is a pending patent application in relation to this work.

Copyright 

The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.