We study the molecular mechanisms that define the extent of transcription units generated by RNA polymerase II (Pol II) across mammalian genomes. Especially how do protein coding transcripts differ from long noncoding transcripts in their mode of synthesis and coupled RNA processing?
Quick links
Defining RNA Polymerase II Transcription Units across Mammalian Genomes.
As shown in the diagram, protein coding transcripts (in red) are separated into exonic and intronic sequence and subjected to a variety of co-transcriptional RNA processing reactions to generate translatable mRNA; 5’ end capping, intron removal coupled to exon ligation (splicing) and 3’ end cleavage and polyadenylation (CPA). Added to these well-established steps in mRNA synthesis, several additional co-transcriptional RNA processing mechanisms operate, as actively investigated by our research group. First, a fraction of Pol II transcript prematurely terminates at either cryptic polyA signals (PAS) or through transcript cleavage mediated by the Integrator complex (I). Second hairpin structures may be excised from introns by the microprocessor complex (M). This releases pre-microRNA that are subsequently converted into microRNA by cytoplasmic Dicer. Third CPA cleavage at the gene 3’ end not only promotes release of polyadenylated mRNA, but also exposes nascent transcript to 5’->3’ exonuclease activity by Xrn2 (X). This ultimately forces termination of Pol II from the gene template by the torpedo mechanism.
As shown in the diagram, long noncoding transcripts (lncRNA, in green) are also synthesized by Pol II but may be formed and processed in a distinct manner to protein coding transcripts. Many lncRNA derive from R-loop structures usually detected near the ends of protein coding transcripts. These RNA:DNA hybrids force the non-template DNA strand out of the DNA helix. The single stranded DNA so formed can act as a template for de novo antisense transcription by Pol II. Such R-loop promoter activity may explain the origin of many lncRNA, especially over protein coding gene promoters (promoter antisense or PROMPT lncRNA) and terminators (gene antisense lncRNA). Furthermore, transcriptional enhancer elements may similarly generate R-loop dependent lncRNA (eRNA). Some lncRNA are formed independently of protein coding genes in intergenic regions. These are referred to as long intergenic noncoding RNA (lincRNA). All classes of lncRNA are subject to coupled RNA processing and degradation. However, unlike protein coding transcripts they are usually only weakly spliced and polyadenylated and often depend on Integrator to end and restrict their transcription.
All aspects of these coupled transcription and RNA processing mechanisms are under active investigation by our research team and are funded by a five year Investigator Award from the Wellcome Trust running until end of 2026.
2023
Elongation roadblocks mediated by dCas9 across human genes modulate transcription and nascent RNA processing.
Zukher, I., Dujardin, G., Sousa-Luis, R., Proudfoot, N.J.
Nature Structure and Molecular Biology – In press
2022
Mechanism of lncRNA biogenesis as revealed by nascent transcriptomics.
Nojima, T. and Proudfoot, N.J.
Nature Reviews in Molecular Cell Biology – 23: 389-406
2021
POINT technology illuminates the processing of polymerase associated intact nascent transcripts.
Sousa-Luis, R., Dujardin, G., Zukher, I., Kimura, H., Weldon, C., Carmo-Fonseca, M. and Proudfoot, N.J., Nojima, T.
Molecular Cell – 81: 1935-1950.
2019
R-Loops Promote Antisense Transcription across the Mammalian Genome.
Tan-Wong, S.M., Dhir, S. and Proudfoot, N.J.
Molecular Cell – 76(4): 600-616
2019
Selective Roles of Vertebrate PCF11 in Premature and Full-Length Transcript Termination.
Kamieniarz-Gdula, K., Gdula, M.R., Panser, K., Nojima, T., Monks, J., Wiśniewski, J.R., Riepsaame, J., Brockdorff, N., Pauli, A. and Proudfoot, N.J.
Molecular Cell – 74(1): 158-172
2017
Distinctive Patterns of Transcription and RNA Processing for Human lincRNAs.
Schlackow, M., Nojima, T., Gomes, T., Dhir, A., Carmo-Fonseca, M. and Proudfoot, N.J.
Molecular Cell – 65(1): 25-38
2016
Transcriptional termination in mammals: Stopping the RNA polymerase II juggernaut.
Proudfoot, N.J.
Science – 52(6291): aad9926.
2015
Mammalian NET-seq reveals genome-wide nascent transcription coupled to RNA processing.
Nojima, T., Gomes, T., Grosso, A.R.F., Kimura, H., Dye, M.J., Dhir, S., Carmo-Fonseca, M.* and Proudfoot, N.J.*
Cell – 161(3): 526-540.
dCas9 roadblocks paving new ways for CRISPRi use
February 2024
In a study published in Nature Structural & Molecular Biology, the Proudfoot lab demonstrates how the use of dCas9 might affect transcriptional elongation and RNA processing, and how it can be harnessed to manipulate Pol II progression along the gene.
Improved therapy for spinal muscular atrophy
June 2022
Published in Cell, a collaboration between the groups of Alberto Kornblihtt (Universidad de Buenos Aires, Argentina), Adrian Krainer (Cold Spring Harbor Laboratory, USA) and Nick Proudfoot (Dunn School) reports an improved treatment for spinal muscular atrophy. Spinal muscular atrophy (SMA) is a genetic disease of the central nervous system, causing muscle weakness and wasting. Affecting...