Restriction–Modification Systems with Specificity GGATC, GATGC and GATGG. Part 2. Functionality and Structural Issues

Cover Page

Cite item

Full Text

Open Access Open Access
Restricted Access Access granted
Restricted Access Subscription Access

Abstract

The structural and functional issues of protein functionality of restriction-modification systems recognizing one of the GGATC/GATCC, GATGC/GCATC, and GATGG/CCATC sites have been studied using bioinformatics methods. Such systems include a single restriction endonuclease and either two separate DNA methyltransferases or a single fusion DNA methyltransferase with two catalytic domains. For a subset of these systems, it was known that both adenines within the site are methylated to form 6-methyladenine, but the role of each of the two DNA methyltransferases comprising the system was unknown. In this work, the functionality of most known systems of this kind is proven. Based on analysis of the structures of related DNA methyltransferases, it is hypothesized which of the adenines within the site is modified by each of the DNA methyltransferases of the system. A possible molecular mechanism of DNA methyltransferase specificity change from GATGG to GATGC during horizontal transfer of its gene is described.

Full Text

Restricted Access

About the authors

S. A. Spirin

Lomonosov Moscow State University; Higher School of Economics; Scientific Research Institute of System Development

Author for correspondence.
Email: sas@belozersky.msu.ru
Russian Federation, Moscow; Moscow; Moscow

A. V. Grishin

Gamaleya National Research Center for Epidemiology and Microbiology; All-Russia Research Institute of Agricultural Biotechnology

Email: sas@belozersky.msu.ru
Russian Federation, Moscow; Moscow

I. S. Rusinov

Lomonosov Moscow State University

Email: sas@belozersky.msu.ru
Russian Federation, Moscow

A. V. Alexeevsky

Lomonosov Moscow State University; Scientific Research Institute of System Development

Email: sas@belozersky.msu.ru
Russian Federation, Moscow; Moscow

A. S. Karyagina

Lomonosov Moscow State University; Gamaleya National Research Center for Epidemiology and Microbiology; All-Russia Research Institute of Agricultural Biotechnology

Email: sas@belozersky.msu.ru
Russian Federation, Moscow; Moscow; Moscow

References

  1. Williams, R. J. (2003) Restriction endonucleases: classification, properties, and applications, Mol. Biotechnol., 23, 225-244, https://doi.org/10.1385/mb:23:3:225.
  2. Roberts, R. J. (2003) A nomenclature for restriction enzymes, DNA methyltransferases, homing endonucleases and their genes, Nucleic Acids Res., 31, 1805-1812, https://doi.org/10.1093/nar/gkg274.
  3. Madhusoodanan, U. K., and Rao, D. N. (2010) Diversity of DNA methyltransferases that recognize asymmetric target sequences, Crit. Rev. Biochem. Mol. Biol., 45, 125-145, https://doi.org/10.3109/10409231003628007.
  4. Vasu, K., and Nagaraja, V. (2013) Diverse functions of restriction-modification systems in addition to cellular defense, Microbiol. Mol. Biol. Rev., 77, 53-72, https://doi.org/10.1128/mmbr.00044-12.
  5. Mistry, J., Chuguransky, S., Williams, L., Qureshi, M., Salazar, G. A., Sonnhammer, E. L. L., Tosatto, S. C. E., Paladin, L., Raj, S., Richardson, L. J., Finn, R. D., and Bateman, A. (2020) Pfam: the protein families database in 2021, Nucleic Acids Res., 49, D412-D419, https://doi.org/10.1093/nar/gkaa913.
  6. Roberts, R. J., Vincze, T., Posfai, J., and Macelis, D. (2014) REBASE – a database for DNA restriction and modification: enzymes, genes and genomes, Nucleic Acids Res., 43, D298-D299, https://doi.org/10.1093/nar/gku1046.
  7. Edgar, R. C. (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput, Nucleic Acids Res., 32, 1792-1797, https://doi.org/10.1093/nar/gkh340.
  8. Waterhouse, A. M., Procter, J. B., Martin, D. M. A, Clamp, M., and Barton, G. J. (2009) Jalview Version 2 – a multiple sequence alignment editor and analysis workbench, Bioinformatics, 25, 1189-1191, https://doi.org/10.1093/bioinformatics/btp033.
  9. Lefort, V., Desper, R., and Gascuel, O. (2015) FastME 2.0: A comprehensive, accurate, and fast distance-based phylogeny inference program, Mol. Biol. Evol., 32, 2798-2800, https://doi.org/10.1093/molbev/msv150.
  10. Kumar, S., Stecher, G., and Tamura, K. (2016) MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets, Mol. Biol. Evol., 33, 1870-1874, https://doi.org/10.1093/molbev/msw054.
  11. Letunic, I., and Bork, P. (2021) Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation, Nucleic Acids Res., 49, W293-W296, https://doi.org/10.1093/nar/gkab301.
  12. Li, W., and Godzik, A. (2006) Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences, Bioinformatics, 22, 1658-1659, https://doi.org/10.1093/bioinformatics/btl158.
  13. Mirdita, M., Schütze, K., Moriwaki, Y., Heo, L., Ovchinnikov, S., and Steinegger, M. (2022) ColabFold: making protein folding accessible to all, Nat. Methods, 19, 679-682, https://doi.org/10.1038/s41592-022-01488-1.
  14. Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., et al. (2021) Highly accurate protein structure prediction with AlphaFold, Nature, 596, 583-589, https://doi.org/10.1038/s41586-021-03819-2.
  15. DeLano, W. L. (2002) Pymol: An open-source molecular graphics tool, CCP4 Newsl. Protein Crystallogr., 40, 82-92.
  16. Crooks, G. E., Hon, G., Chandonia, J. M., and Brenner, S. E. (2004) WebLogo: A sequence logo generator, Genome Res., 14, 1188-1190, https://doi.org/10.1101/gr.849004.
  17. Gingeras, T. R., Milazzo, J. P., and Roberts, R. J. (1978). A computer assisted method for the determination of restriction enzyme recognition sites, Nucleic Acids Res., 5, 4105-4127, https://doi.org/10.1093/nar/5.11.4105.
  18. Higgins, L. S., Besnier, C., and Kong, H. (2001) The nicking endonuclease N.BstNBI is closely related to type IIS restriction endonucleases MlyI and PleI, Nucleic Acids Res., 29, 2492-2501, https://doi.org/10.1093/nar/29.12.2492.
  19. Kachalova, G. S., Rogulin, E. A., Yunusova, A. K., Artyukh, R. I., Perevyazova, T. A., Matvienko, N. I., Zheleznaya, L. A., and Bartunik, H. D. (2008) Structural analysis of the heterodimeric type IIS restriction endonuclease R.BspD6I acting as a complex between a monomeric site-specific nickase and a catalytic subunit, J. Mol. Biol., 384, 489-502, https://doi.org/10.1016/j.jmb.2008.09.033.
  20. Malone, T., Blumenthal, R. M., and Cheng, X. (1995) Structure-guided analysis reveals nine sequence motifs conserved among DNA amino-methyltransferases, and suggests a catalytic mechanism for these enzymes, J. Mol. Biol., 253, 618-632, https://doi.org/10.1006/jmbi.1995.0577.
  21. Yang, Z., Horton, J. R., Zhou, L., Zhang, X. J., Dong, A., Zhang, X., Schlagman, S. L., Kossykh, V., Hattman, S., and Cheng, X. (2003) Structure of the bacteriophage T4 DNA adenine methyltransferase, Nat. Struct. Biol., 10, 849-855, https://doi.org/10.1038/nsb973.
  22. Horton, J. R., Liebert, K., Hattman, S., Jeltsch, A., and Cheng, X. (2005) Transition from nonspecific to specific DNA interactions along the substrate-recognition pathway of dam methyltransferase, Cell, 121, 349-361, https://doi.org/10.1016/j.cell.2005.02.021.
  23. Horton, J. R., Liebert, K., Bekes, M., Jeltsch, A., and Cheng, X. (2006) Structure and substrate recognition of the Escherichia coli DNA adenine methyltransferase, J. Mol. Biol., 358, 559-570, https://doi.org/10.1016/j.jmb.2006.02.028.
  24. Nell, S., Estibariz, I., Krebes, J., Bunk, B., Graham, D. Y., Overmann, J., Song, Y., Spröer, C., Yang, I., Wex, T., Korlach, J., Malfertheiner, P., and Suerbaum, S. (2018) Genome and methylome variation in Helicobacter pylori with a cag pathogenicity island during early stages of human infection, Gastroenterology, 154, 612-623, https://doi.org/10.1053/j.gastro.2017.10.014.
  25. Friedrich, T., Fatemi, M., Gowhar, H., Leismann, O., and Jeltsch, A. (2000) Specificity of DNA binding and methylation by the M.FokI DNA methyltransferase, Biochim. Biophys. Acta, 1480, 145-159, https://doi.org/10.1016/s0167-4838(00)00065-0.
  26. Tomilova, J. E., Kuznetsov, V. V., Abdurashitov, M. A., Netesova, N. A., and Degtyarev, S. K. (2010) Recombinant DNA-methyltransferase M1.Bst19I from Bacillus stearothermophilus 19: purification, properties, and amino acid sequence analysis, Mol. Biol., 44, 606-615, https://doi.org/10.1134/S0026893310040163.

Supplementary files

Supplementary Files
Action
1. JATS XML
2. Appendix
Download (1MB)
3. Fig. 1. Plot of the ER sequence alignment of our class together with the sequence of the Nt.BstNBI nickase. Red stars indicate residues E418, D456, E469, E482 of the Nt.BstNBI nickase, the role of which in catalytic activity was confirmed experimentally, yellow star - residue H489, presumably involved in catalysis

Download (447KB)
4. Fig. 2. Tree of 12 MTases with GGATC, GATGC and GATGG specificities together with the MTases M.EcoKDam, M.EcoT4Dam and M1.DpnII, for which spatial structures are known, and the well-studied M.EcoRV, M.FokI and M1.Bst19I. For the two-domain MTases, phylogeny was reconstructed separately for the N-terminal and C-terminal domains. The colours of the names correspond to specificity: red - GGATC, green - GATGG, blue - GATGC. Numbers on branches indicate bootstrap support; branches with less than 25% support have been removed (‘collapsed’)

Download (246KB)
5. Fig. 3. Schematic of gene structure and recognisable sequences of MTase representatives from systems recognising GGATG, GATGG, GATGC and GGATC sites. The figure uses images of red and yellow flags taken from the REBASE website, which mark the positions of the conserved F-x-G-x-G and D-P-P-Y motifs; a blue asterisk at the flag indicates that the motif is imprecise (F-x-G-x-A and D-T-P-Y, respectively). Two hypothetical methylation variants are shown for each GGATC site, with the pink background highlighting the more likely variant

Download (293KB)
6. Fig. 4. Models of the structures of complexes of group B MTases and similar complexes with DNA. Blue indicates the catalytic domain of MTase, green - TRD, red - a loop potentially involved in the recognition of the right nucleotide pairs of the recognised site in the figure, magenta - a site presumably involved in the recognition of the left nucleotide pairs. Four nucleotides corresponding to the nucleotides of the GATC recognition site of M.EcoKDam are highlighted in pale pink on the DNA, and the nucleotide pairs framing them are highlighted in light lilac and pink. The nucleotide designations of the recognition site and ends refer to the chain containing adenine methylated by MTase B

Download (568KB)
7. Fig. 5. LOGO-diagrams for the alignments of MTase loops of group B, highlighted in red in Fig. 4. a - Diagram for MTase loops with GGATC specificity homologous to the loop indicated in red on the M.AlwI-N structure (M.AlwI residues 199-218, sequence VPISEYSDFKRYTKEQFYLE). b - Diagram for MTase loops with GGATG specificity homologous to the loop indicated in red on the M.FokI-C structure (M.FokI residues 552-571, sequence LITTGSYNDGNRGNRGFKDWNRL). The recognition site of MTases of the corresponding family is shown below each LOGO diagram, the nucleotide pair presumably recognised by the corresponding loop is shown in bold, with the arrowed G probably bound by a conserved arginine

Download (190KB)
8. Fig. 6. LOGO-diagrams for the alignments of MTase loops of group B, highlighted in magenta in Fig. 4. a - Diagram for MTase loops with GATGC specificity (blue) homologous to the one marked in magenta on the M.SfaNI-C structure (M.SfaNI residues 490-509, sequence LSNSKMYGYNYNYYYKTSSAKGL). b - Diagram for loops of length 23 of MTase with GATGG specificity (green) homologous to that indicated in magenta on the M2.McaCI structure (residues 111-133, sequence in M2.McaCI LSCSYLSITVPDELKKKYVKYVKTYY). c - Same as b but for loops of length 24. The recognition site of MTases of the corresponding family is given under each LOGO diagram, with the nucleotide pair presumably recognised by the corresponding loop in bold

Download (240KB)
9. Fig. 7. Phylogenetic trees of representatives of group A MTases (a), group B MTases (b), and ERs (c) with GATGC (blue) and GATGG (green) specificity, and a model of the spatial structure of group A MTase M1.Hmu12714II (d). The GC pair corresponding to the 3′-terminal cytosine in the GATGC sequence is highlighted in red on the DNA model. The protein rod model shows S-adenosyl-L-homocysteine and side chains of the N-x-R-S-N motif, the catalytic domain of the MTase is indicated in blue, and TRD is shown in green

Download (291KB)

Copyright (c) 2025 Russian Academy of Sciences