Developing Tools To Analyze Individual RNA Molecules
COS/BioE Professor Meni Wanunu and BioE Assistant Professor Sara Rouhanifard, in collaboration with Anna-Marie Pyle from Yale University, were awarded a $4.2 million grant from the National Institutes of Health (NIH)/National Human Genome Research Institute (NHGRI) for “Direct RNA Sequencing Using Electro-optical Zero-mode Waveguides and Custom Click Fluorescent Nucleotides.” The goal is to read the sequence and detect the diversity of chemical modifications in RNA molecules, which would be a huge step forward in understanding the biology and disease-causing events in human cells.
Current methods are lacking, which promoted Wanunu, Rouhanifard and collaborators, to develop an ultra-sensitive technology that directly reads RNA molecules, one at a time, at low cost, and with high throughput. The method will use an array of nanometer-scale devices in which individual enzymes replicate RNA and report on the RNA sequence and presence of modifications, all from minute amounts that are equivalent to the number of RNA molecules in a single human cell.
Abstract Source: NIH
Advances in genome technologies ushered in vast cost reductions in DNA sequencing and increased read lengths, the latter afforded by development of new single-molecule sequencing technologies. As a result, much of the genome’s “dark matter” has been elucidated, and higher-quality reference genomes were made available. In addition to genome sequencing, these single-molecule methods have enabled new applications for probing chemical modifications in DNA, by either probing the kinetics of sequencing-by-synthesis using optical waveguides, or by electrically distinguishing modified bases using nanopores. Despite progress, a critical barrier in genomics is understanding the roles of RNA in biology, which demands methods for quantitative analysis of RNA molecules in a cell. The myriad of types of RNAs in a cell, their dynamic chemical modifications, and their elaborate structural and functional diversity, all hint at a tremendous level of regulation and biological significance. Traditional RNA sequencing methods have primarily relied on conversion to complementary DNA (cDNA) followed by cDNA sequencing using either high-throughput second-generation methods or third generation single-molecule methods, the latter of which offers long reads. Using these methods, some RNA modifications can be read through prior chemical functionalization of the RNA prior to conversion to cDNA (for example, m6A, pseudouridine, A-to-I editing, 1-methyluridine, and dihydrouridine). However, the chemical reactions involved in these methods are not 100% quantitative or specific, and further, detection is often done through incomplete reads due to reverse transcription blocks, which precludes detection of multiple modifications. The only available method for direct RNA sequencing, the Oxford Nanopore Technologies platform, suffers from several drawbacks that include high input requirements, limited ability to probe RNA modifications, and incomplete reads, particularly near the RNA 5’ end. We address these limitations by developing a new single-molecule method that can be scaled to allow long read direct RNA sequencing at high throughputs, all with very low input requirements of several picograms. Building on zero-mode waveguides (ZMWs) originally developed by Pacific Biosciences, we have recently developed electro-optical ZMWs (eZMWs) that allow low-input capture of DNA and RNA molecules. We demonstrated using these devices identification of DNA fragments from low inputs by rapid capture of single molecules and their flash sequencing. Together with the Pyle group, we are developing the integration of MarathonRT, an ultra- processive reverse transcriptase that converts RNA molecules to cDNA molecules with high processivity and accuracy, into our electro-optical eZMWs for direct RNA sequencing. We have already fused MarathonRT to a streptavidin protein and demonstrated its functionality in eZMWs. Here we will build on these developments to develop a direct RNA sequencing method that can detect single base edits and chemical modifications, all with high coverage from single-cell inputs of a few pg per run.