Molecule of the Month: DNA-Sequencing Nanopores

Designer nanopores work with enzymes to offer pocket-sized DNA sequencing

This article was written and illustrated by Jennifer Jiang, Katherine H. Park, and Kiranmayi Vemuri as part of a week-long boot camp on "Science Communication in Biology and Medicine" for undergraduate and graduate students hosted by the Rutgers Institute for Quantitative Biomedicine in January 2021.

DNA-sequencing nanopore, with DNA in red, CsgG in blue, DNA polymerase in orange, and the membrane shown schematically in gray.
DNA-sequencing nanopore, with DNA in red, CsgG in blue, DNA polymerase in orange, and the membrane shown schematically in gray.
Download high quality TIFF image
DNA sequencing has revolutionized the study of biology, revealing the information that encodes the processes of life. The Human Genome Project took a total of 13 years and cost almost 3 billion dollars using the conventional Sanger chain-termination method. As the enthusiasm for genomics continues to grow, the need for faster, better and more cost-effective sequencing techniques is increasingly clear. Nanopore sequencing is a rapidly developing tool to meet these demands, with devices often costing less than $1000. Today, this technology is used everywhere - from Antarctica to the International Space Station!

Pocket-sized Sequencers

Nanopore sequencing decodes a DNA strand as it is drawn through a tiny pore embedded in a membrane. Compared to conventional DNA sequencing approaches, nanopore sequencing directly detects nucleotides without relying on DNA synthesis/amplification. This technique also enables sequencing of long fragments of DNA, which can be used to piece together entire genomes of organisms quickly and reliably. A single device houses thousands of individual nanopores and yet is simple, inexpensive and can even fit in your pocket. Scientists are currently minimizing challenges such as reducing error rates and increasing yields per run.

Nanopores in Action

Nanopore sequencers are composed of nanopores embedded in a membrane that splits a salt solution into two chambers. A voltage is applied across the membrane, which causes ionic flow that can be measured. When a DNA strand is pulled through the pore, the ion flow is partially blocked, leading to a reduction in the observed current. The four types of nucleic acid bases are each associated with a different level of ion current change, which enables their identification. Commercially-available sequencers currently use derivatives of the bacterial protein CsgG as the go-to nanopore. CsgG, shown here from PDB ID 4uv3, is an ungated, non-selective outer membrane protein with a pore diameter of about 1 nanometer, which allows the easy passage of single-stranded DNA. To improve its DNA-reading ability, CsgG has been engineered to reshape the nanopore, as described below, resulting in greater yields and more accurate sequencing reads.

Regulating DNA Threading

Precise control of the DNA strand is required to maintain efficient sequencing rates as it passes through the nanopore. Free DNA strands pass through pores too fast to get a clear read of each base. Scientists have found that they can regulate the speed by adding DNA-binding enzymes such as polymerases or helicases to the nanopore machinery. Bacteriophage phi29 DNA polymerase is shown here (PDB ID 1xhz). It acts as a stepper motor to slow down the DNA. In association with phi29 DNA polymerase, DNA ratchets through the CsgG pore at single nucleotide resolution.

Cross-sectional view of α-hemolysin (left) and MspA (right) highlighted the different sizes of their sensing regions when used for DNA sequencing.
Cross-sectional view of α-hemolysin (left) and MspA (right) highlighted the different sizes of their sensing regions when used for DNA sequencing.
Download high quality TIFF image

Nature’s Pores

Other pore-forming membrane proteins are abundant in nature. α-hemolysin (PDB ID 7ahl), a bacterial toxin, was the first biological nanopore used for sequencing. It features a long channel with a range of pore diameters (1.5-2.5 nanometers), making it suboptimal for discerning individual nucleic acid bases. The mycobacterial protein, MspA (PDB ID 1uun), is a naturally occurring nanopore with a shorter sensing region. This is an advantage, because in pores with shorter sensing regions, fewer nucleotides influence the characteristics of the recorded current, resulting in more accurate sequence reading. Since MspA exhibits a channel diameter of about 1.2 nanometers and a shorter sensing region, it is a better sequencer compared to α-hemolysin.

Exploring the Structure

Dual constriction nanopores in action

Scientists have engineered nanopores with greater pore stability and tuned for optimal function. For example, dual constriction nanopores add an accessory protein to CsgG to introduce a second constriction for detecting DNA. The CsgG-CsgF complex shown here (PDB ID 6si7) highlights the CsgG molecule in blue with its pore-forming region in yellow, and the CsgF molecule in pink. CsgG forms the initial constriction point for DNA and CsgF adds a second constriction. Integrating the signal from two constriction sites improves base-reading accuracy, particularly in DNA stretches with multiple consecutive copies of the same base. Click on the image to explore this structure in an interactive JSmol.

Topics for Further Discussion

  1. If you want to explore the structure of DNA bases, PDB ID 2rpd includes a single DNA strand with one of each of the four types of bases.
  2. Aerolysin is another commonly used nanopore sensor borrowed from a bacterium. You can see the wild-type pore in the PDB ID 5jzt.

References

  1. 6si7: Van der Verren, S.E., Van Gerven, N., Jonckheere, W., Hambley, R., Singh, P., Kilgour, J., Jordan, M., Wallace, E.J., Jayasinghe, L., Remaut, H. (2020) A dual-constriction biological nanopore resolves homonucleotide sequences with high fidelity. Nat Biotechnol 38, 1415-1420.
  2. Carter, J.M., Hussain, S. (2017) Robust long-read native DNA sequencing using the ONT CsgG Nanopore system. Wellcome Open Res, 2, 23.
  3. Castro-Wallace, S.L., Chiu, C.Y., John, K.K., Stahl, S.E., Rubins, K.H., McIntyre, A.B.R., Dworkin, J.P., Lupisella, M.L., Smith, D.J., Botkin, D.J., Stephenson, T.A., Juul, S., Turner, D.J., Izquierdo, F., Federman, S., Stryke, D., Somasekar, S., Alexander, N., Yu, G., Mason, C.E., Burton, A.S. (2017) Nanopore DNA sequencing and genome assembly on the International Space Station. Sci Rep 7, 18022.
  4. Johnson, S.S., Zaikova, E., Goerlitz, D.S., Bai, Y., Tighe, S.W. (2017) Real-time DNA sequencing in the Antarctic dry valleys using the Oxford Nanopore sequencer. J Biomol Tech 28, 2-7.
  5. Deamer, D., Akeson, M., Branton, D. (2016) Three decades of nanopore sequencing. Nat Biotechnol 34, 518-524.
  6. 4uv3: Goyal, P., Krasteva, P.V., Van Gerven, N., Gubellini, F., Van den Broeck, I., Troupiotis-Tsailaki, A., Jonckheere, W., Pehau-Arnaudet, G., Pinkner, J.S., Chapman, M.R., Hultgren, S.J., Howorka, S., Fronzes, R., Remaut, H. (2014) Structural and mechanistic insights into the bacterial amyloid secretion channel CsgG. Nature 516, 250-253.
  7. Lieberman, K.R., Cherf, G.M., Doody, M.J., Olasagasti, F., Kolodji, Y., Akeson, M. (2010) Processive replication of single DNA molecules in a nanopore catalyzed by phi29 DNA polymerase. J Am Chem Soc 132, 17961–17972.
  8. 1uun: Faller, M., Niederweis, M., Schulz, G.E. (2004) The structure of mycobacterial outer-membrane channel. Science 303, 1189-1192.
  9. 1xhz: Kamtekar, S., Berman, A.J., Wang, J., Lazaro, J.M., de Vega, M., Blanco, L., Salas, M., Steitz, T.A. (2004) Insights into strand displacement and processivity from the crystal structure of the protein-primed DNA polymerase of bacteriophage phi29. Mol Cell 16, 609-618.
  10. 7ahl: Song, L., Hobaugh, M.R., Shustak, C., Cheley, S., Bayley, H., Gouaux, J.E. (1996) Structure of staphylococcal alpha-hemolysin, a heptameric transmembrane pore. Science 274, 1859-1866.

September 2021, Jennifer Jiang, Katherine H. Park, Kiranmayi Vemuri, David Goodsell

http://doi.org/10.2210/rcsb_pdb/mom_2021_9
About Molecule of the Month
The RCSB PDB Molecule of the Month by David S. Goodsell (The Scripps Research Institute and the RCSB PDB) presents short accounts on selected molecules from the Protein Data Bank. Each installment includes an introduction to the structure and function of the molecule, a discussion of the relevance of the molecule to human health and welfare, and suggestions for how visitors might view these structures and access further details.More