Promoters – DNA sequences that provide direction for RNA SEQUENCE OF EVENTS
polymerase
- Promoter region to which RNA pol binds is closer to the
3’ end of the template strand than the actual gene for the RNA to be synthesized - RNA formed from 5’3’ end - Polymerase moves along template strand from 3’5’ end - Coding strand = 5’3’ - Binding site upstream (5’ of coding strand; 3’ of the template strand)
Transcription Start Site (TSS) – first base incorporated into RNA
chain, at position +1
Pribnow Box – first promoter element, -10 region
Core promoter - -35 region/element
UP element – upstream of the core promoter, which enhances
the binding of RNA polymerase; usually extends from -40 to -60
Extended promoter – region from the end of the UP element to
the TSS
Consensus sequences – base sequence of promoter regions that
contain many bases in common CHAIN ELONGATION - Promoter regions are A-T rich, with 2 hydrogen - After the strands have separated, a transcription bubble bonds/base pair; more easily unwound than G-C rich of about 17 base pairs moves down the DNA sequence to regions, which have 3 hydrogen bonds/base pair be transcribed - Promoter base sequence controls the frequency with - RNA polymerase catalyzes formation of phosphodiester which the gene is transcribed bonds between the incorporated ribonucleotides - A strong promoter binds RNA pol tightly gene is - When about 10 nucleotides have been incorporated, the transcribed more often sigma subunit dissociates, later recycled to bind to CHAIN INITIATION – begins when RNA pol binds to the promoter another RNA polymerase core enzyme & forms a closed complex Transcription process supercoils DNA - The sigma-subunit directs the polymerase to the Negative supercoiling – upstream promoter bridges the -10 and -35 regions of the promoter to the RNA polymerase core via a flexible “flap” Positive supercoiling – downstream in the sigma-subunit Topoisomerases – relax the supercoils in front and behind the - Core enzymes lacking the sigma-subunit bind to areas of advancing transcription bubble DNA that lack promoters - Holoenzyme may bind to “promterless” DNA, but it - Rate of chain elongation is not constant RNA pol dissociates without transcribing moves quickly though some DNA regions - Chain initiation requires formation of the open complex - Abortive transcription – instead of finishing every RNA - Portion of the beta-prime and the sigma-subunits chain, RNA pol releases most chains near the beginning initiate strand separation, melting about 14 bp surround of the process (5-10 nucleotides) this is due to failure TSS of RNA pol to break its own bonds to the promoter via - Purine ribonucleoside triphosphate first base in RNA, the sigma subunit binds to its complementary DNA base at position +1 - For chain elongation to occur RNA pol must be able to - Adenine tends to occur more often than G launch itself off the promoter - First residue retains its 5’-triphosphate group - Tight binding between the sigma subunit and promoter 2. ENHANCERS – upstream activation sequences (UAS) requires substantial energy - Sequences bound by transcription factors; sequences - RNA polymerase bound tightly to the DNA promoter that assist in initiation it “scrunches” the DNA into itself causing torsional - Location not fixed, still functions as enhancers strain of the separated DNA strands - Usually upstream of the promoter - Polymerase does not bind to enhancers CHAIN TERMINATION – involves specific sequences downstream - Acts even if located downstream from start of of the actual gene for the RNA to be transcribed transcription TWO TYPES OF MECHANISMS - Bidirectional sequence may be reversed - Promiscuous enhance expression of any gene in its 1. Intrinsic termination – controlled by termination sites vicinity - Termination sites characterized by 2 inverted repeats sequences of bases complementary s/t they can UPSTREAM SITES OF GENES FOR rRNA PRODUCTION: loop back on themselves 1. Fis sites – binding sites for Fis protein - DNA encodes a series of uracils o Extend from the end of the UP element at -60 to -150 - When RNA is created, inverted repeats form a hairpin (examples of a class of DNA sequences called loop tends to stall the advancement of RNA pol enhancers) - Presence of Uracils causes series of A-U base pairs bet. o Enhance transcription of rRNA Template strand and RNA - A-U pairs weakly than G-C RNA dissociates from the 3. OPERONS – assemblage of operators, promoter, structural transcription bubble ending transcription genes encode enzymes for metabolic pathways 2. Rho-dependent - also cause hairpin loop - Genes not transcribed all the time - Rho protein binds to RNA & chases the polymerase - Inducer – triggers production induction - When pol transcribes RNA that forms hairpin loop, it - Operator – site of regulation stalls giving the rho protein a chance to catch up - Promoter – site of interaction w/ RNA pol - When rho protein reaches termination site Regulatory protein: inducer – turns on operon (induction) facilitates dissociation of pol-DNA complex Repressor – turn off (repression) - Movement of Rho + dissociation requires ATP LAC OPERON – controlled by negative regulation TRANSCRIPTION REGULATION IN PROKARYOTES Lactose – disaccharide, substrate of beta-galactosidase 1. SIGMA FACTORS – viruses & bacteria produce different - Will use lactor/galactose as substrate only in absence of sigma-subunits that direct the RNA pol to different genes glucose a. Phage SPO1 – virus that infects Bacillus subtilis - Beta-galactosidase – hydrolyzes the glycosidic linkage between Early genes – transcribed by the host’s RNA pol; using its galactose and glucose (monosaccharide components) regular sigma-subunit codes for gp28 another sigma- - Glucose preferred energy and carbon source subunit directs RNA pol to transcribe preferentially more - Enzymes that act on galactose are not turned on until cell is of the viral genes during middle phase glucose-starved Middle genes – gp33 and gp34 make up another sigma - Structural genes are not expressed factor directs transcription of late genes - Repressor protein is usually expressed - Note: sigma factors are recycled initially B. subtilis - Protein binds to operator site blocking RNA pol II from binding uses standard sigma factor as more & more gp28 is to promoter produced, it competes for binding with standard sigma - Operator site has palindromic DNA sequence for RNA pol eventually subverting transcription for - LacZ codes for beta-galactosidase virus instead of the bacterium - LacY lactose permease allows lactose to enter the cell b. Heat shock – normal sigma-subunit is called sigma^70 - lacA transacetylase inactivate certain antibiotics that may - When E. coli is grown at high temp than optimum enter the cell via lactose permease produce another set of proteins - lacI regulatory gene controls gene expression - Sigma^32 directs RNA pol to bind to diff. promoters responsible for the production of a protein (repressor) not normally recognized by sigma^70 - Repressor – inhibits the expression of structural genes - Presence of inducer inhibition is removed - Inducer – binds to repressor, convert it into a conformation - Genes that lack TATA box have Inr (initiator element) that’s unable to bind to operator site structural genes 2. Regulatory elements transcribed - Bind to transcription factors CATABOLITE ACTIVATOR PROTEIN (CAP) - Response to physiological signals (hormones) - cAMP receptor protein - Heat shock - overrides influence of any inducers - activate adenylyl cyclase PROMOTERS – complex - high levels of cAMP - cAMP induce CAP synthesis TYPES: CONTROL OF OPERONS 1. TATA box 1. positive control – genes expressed only if regulatory 2. CAAT box protein is present 3. GC box 2. negative control - Found in “housekeeping genes” CLASSIFICATION OF OPERONS 1. inducible ENHANCERS 2. repressible 1. Response element – promoter modules that are responsive to common regulation 4. TRANSCRIPTION ATTENUATION – control mechanism by a. Heat shock element which transcription is altered before it has begun b. Glucocorticoidal response c. Metal response element TRANSCRIPTION IN EUKARYOTES - More complicated due to presence of histones nucleosomes 2. Transcription factors for different response elements & chromosomes expression is repressed initiate transcription of proteins - HSTF bind to HSE Transcription co-regulators – enzymes that loosen DNA:histone INITIATION OF TRANSCRIPTION interactions - Formation of eukaryotic transcription initiation complex - ATP-dependent chromatin remodeling - Requires: Pol II, 5 GTFs, mediator protein (Srb Med) - Complexed change shape of chromosome - Process: THREE CLASSES 1. A TBP (TATA binding protein) bind to TATA 1. RNA Pol I – synthesis major rRA 2. TAF & TBP form complex that serves as bridge 2. RNA Pol II – synthesis mRNA 3. DNA is bent s/t DNA sequence is upstream & 3. RNA Pol III – synthesis tRNA, 5S rRNA, small RNAs downstream to TATA including those involved in mRNA processing 4. GTFs convene RNA Pol II 5. Transcription begins Yeast RNA Pol I homologous to human RNA Pol II - Eukaryotic transcription depends on: RNA Pol II – 12 different polypeptides; numbered RPB1 to RPB12, 1. Relief of repression d/t chromatin structure RPB3 2. Interaction of RNA Pol II with promoter RPB4 & RPB7 unique to RPII 1. HATs (Histone Acetyl Transferases) All others shared by RPI, RPII, RPIII - Acetylation of epsilon-NH3+ groups in lysine residues in histone tails RPB1 – C-terminal domain (CTD) - Removes (+) charge destabilizes histone-DNA 27 repeats of YSPTSPS interactions - Hydrophilic - Works w/ phosphorylation of serine residues - Contains many sites for phosphorylation (OH grp) - Introduces (-) charge destabilize histone-DNA - Essential to RNA Pol II action interaction HISTONE DEACETYLASE COMPLEXES TWO SEPARATE SEQUENCES (Eukaryotic Gene Regulation) - Restores (+) charge in histones restores histone-DNA 1. Core element interaction - Contains TATA box - Consensus sequence: TATAAAA - Usually -25 bp (upstream) CHROMATIN REMODELLING COMPLEXES - DNA-binding proteins w/ leucine zippers bind the - ATP-dependent conformational DNA in the major groove via the strong electrostatic - Essential features – alter nucleosomes interactions between the basic region & sugar - Interaction of RNA Pol II phosphates ROLE OF MEDIATOR PROTEINS - Protein dimers form & the leucine half interacts with ????? the other subunit, while the basic part interacts with the DNA DNA recognition by gene regulatory proteins Principle: structural & chemical complementarities TRANSCRIPTION-ACTIVATION DOMAINS - H-bonding, DNA sugar-phosphate backbone Motifs whereby transcription factors recognize other proteins - Protein – contact with major groove 1. Acidic domains – regions rich in acidic AA 3 MOTIFS IN DNA-BINDING PROTEINS Gal4 transcription factor in yeast that activates genes 1. HTH (Helix-turn-helix) – dimensions of alpha-helix for metabolizing galactose - Segment of alpha-helix that fits into major groove - Domain of 49 AAs, 11 of which are acidic - (2) alpha-helices separated by beta-turn 2. Glutamine-rich domains - seen in transcription factors - Additional interaxn: methylation of nucleobases Sp1 upstream transcription factor that activates - Width of major groove & alpha-helix is the same transcription in the presence of an additional promoter - Most common motif element (GC box) - Sequence of 20 AA that’s relatively conserved in many - It has 2 glutamine-rich domains, one of which contains different DNA-binding proteins 39 glutamines in 143 AAs - 1st helical region first 8 residues of the region - CREB & CREM have this domain - 2nd helical region seq. of 3 or 4 AA 3. Proline-rich domain – seen in activator CTF-1 - Position 9 glycine involved in beta-turn - Domain of 84 AA, 19 of which are prolines 2. Zn-finger - motif gets its name from the shape adopted - CTF-1 member of a class of transcription factors that by the 12 AA that are looped out from the intersection of bind to an extended promoter element (CCAAT box) the 2 cysteines and 2 histidines with Zinc ion - N-terminal domain regulate transcription of genes - When TFIIIA binds to DNA, the repeated zinc fingers - C-terminal end transcription regulator; bind to follow the major groove around the DNA histone proteins via proline repeats TWO CLASSES a. C2H2 – 2 cysteins; 2 histidines INDIRECT READOUT b. Cx - DNA sequence produces a unique contour of dsDNA that 3. Leucine zipper base region (Basic-region Leucine zipper proteins can recognize motif) - Proteins usually interact - Third major class of sequence-dependent DNA- - From the major groove binding protein - With part of nucleobases not involved in H-bonding - Periodic repetition of Leucine residues w/in helical regions (every 7 residues) POST-TRANSCRIPTIONAL RNA MOD - Dimerize to form y-shaped molecules 1. tRNA & rRNA - Stem – zipper of leucine residues Trimming and addition of terminal sequences – produce - Arm – contain basic regions A & B tRNAs with proper size & base sequence - Half of the protein is composed of the basic region, - tRNA contains a CCA sequence at 3’ end presence of with many conserved residues of lysine, arginine & this portion is important d/t 3’ end is the acceptor for AA histidine to be added to growin protein chain - Second half contains series of leucines every 7 - Trimming nucleus residues - Methylation cytosol - Takes 3.6 AA to make a turn of an alpha-helix - Processing of rRNA methylation & trimming to proper - With a seven-residue spacing, the leucines all line up size on one side of an alpha-helix - base modification Methylation & substitution of sulfur - Zipper d/t line of hydrophobic residues interacts for oxygen with a second analogous protein fragment via - 3 rRNAs sedimentation coefficient of 70S hydrophobic bonds, interweaving like a zipper - Smaller subunit SC of 30S, one RNA mol sc of 16S - 50S subunit contains 2 kinds of RNA sedimentation Ribonucleoprotein particles (RNPs) coefficient of 5S & 23S - Interact with RNA as it’s formed, keeping it in a form that can - Eukaryotic ribosomes sedimentation coefficient of be accessed by other proteins and enzymes 80S, with 40S & 60S subunits - Substrate for splicing capped, poly-adenylated pre-mRNA - 40S subunit contains an 18S RNA & the 60S subunit - Splicing requires cleavage at 5’ and 3’ end of introns & joining contains a 5S RNA, a 5.8S RNA & 28S RNA of the two ends 2. Messenger RNA - Splice sites - sequence with GU at 5’ end, AG at 3’ end - Extensive processing happens in eukaryotic mRNA - Branch sites – within the intron; conserved sequence - Mods include: - Found 18 to 40 bases upstream from the 3’ splice site a. capping of 5’ end - PyNPyPuAPy (Py – pyrimidine; Pu – purine, N – any b. polyadenylating (adding a poly-A sequence) to the 3’ nucleotide, A is invariant) end - G always present on 5’ end of intron lopps back in c. splicing of coding sequences close contact with the invariant A from the branch point - processing is not a feature of prokaryotic mRNA synthesis - 2’ OH of the A performs nucleophilic attack on phosphodiester backbond at 5’ splice site forming POST TRANSCRIPTIONAL MOD OF Mrna lariat structure & releasing exon 1 - the cap at the 5’ end of eukaryotic mRNA is guanylate - AG at 3’ end of exon, does the same to G at 3’ splice site residue methylated at N-7 position fusing two exons - methylated guanylate residue attached to Small nuclar ribonucleoproteins (snRNPs) neighboring residue by 5’-5’ triphosphate linkage - Splicing depends on snRNPs to mediate process - 2’-hydroxyl group of the ribosyl portion of neighboring - Basic type of RNA contain both RNA & proteins residue frequently methylated - RNA portion between 100-200 nucleotides + 10 or - Polyadenylate tail (poly-A) at 3’ end of a message (100- more proteins 200 nucleotides long) added before the mRNA leaves - One of the most abundant gene products the nucleus - Enriched in Uridine residues - Poly-A tail protects the mRNA from nucleases & - Have internal consensus sequence of AUUUUUG phosphatases which degrades it - Bind to RNAs being spliced via complementary regions - Adenylate residues are cleaved off before the portion of between the snRNP & branch and splice sites the molecule that contains the message is attacked - Certain snRNPs stimulate transcription elongation - Presence of 5’ cap protects the mRNA from - Some RNAs catalyze self-splicing exonuclease degradation Spliceosomes – 50S to 60S particle involved in splicing - Prokaryotic genes continuous - Eukaryotic genes discontinuous d/t intervening CAPPING & METHYLATION SPLICEOSOME sequences that do not appear in the final base sequence of the mRNA for the gene product ALTERNATIVE RNA SPLICING 1. Constitutive splicing – every SPLIT GENES – introns and exons 2. Alternative (combinational) splicing – gene expression 1. INTRON –noncoding; intervening sequences that are not - Produce different protein isoforms expressed Tau protein accumulates in the brain of Alzheimer’s - Removed, linking the exons together patients - 3’ end is modified by adding a poly-A tail & 7-mG cap - Has 6 isoforms generated by different splicing forms this yields a matura mRNA appearing during specific devt stages - Actin (muscle protein) 1 intron Troponin T gene produces muscle protein with isoforms - Alpha & beta chains of Hb two introns for both - Lysozyme 3 introns 2. EXON – DNA sequences that are expressed or the one retained in final mRNA product POST TRANSCRIPTIONAL PROCESSING Removal of intervening sequences takes places in the nucleus RNA forms ribonucleoprotein particles (RNPs) through association with a set of nuclear proteins RIBOZYMES – catalytic RNA catalyzes own self-splicing 1. Group I – requirement for external guanosine becomes covalently bonded to the splice site in excision - Transesterification (of phosphoric acid esters) releases one end of the intron - Free 3’-OH end of the exon attacks the 5’ end of the other exon splicing the two exons & releasing the intron - Free 3’-OH end of the intron then attacks a nucleotide 15 residues from 5’ end cyclizing the intron & releasing a 5’ terminal sequence - Precision of reaction depend on folded conformation of RNA which remains internally H bonded - Catalytic RNA can act many times regenerated
2. Group II – display lariat mechanism; no req’t for external
nucleotide; the 2’-OH of internal adenosine attacks the phosphate at 5’ splice site - DNA can’t self-splice because it doesn’t have a 2’-OH - Folding of RNA crucial to catalytic activity, as is with protein catalysts - Divalent cation (Mg2+, Mg2+) is required metal ions stabilize the folded structure by neutralizing negative charges on the phosphate groups of the RNA SELF-SPLICING OF pre-RNA of ciliate protest Tetrahymena 1. A guanine nucleotide attacks at the splice site of the exon giving a free 3’-OH end 2. Free 3’-OH end of the exon attacks the 5’ end of the exon on the right splicing the two exons & releasing the intron 3. Free 3’-OH end of the intron then attacks a nucleoside 15 residues from the 5’ end cyclizing the intron & releasing a 5’ terminal sequence TRANSLATION – CODON-ANTICODON PAIRING AND WOBBLE 1. Amino acid activation – aminoacyl-tRNA synthetase - Codon forms bp with complementary anticodon of tRNA AA + tRNA aminoacyl-tRNA when an AA is incorporated during protein synthesis - AA covalently bonded to tRNA - Some tRNAs bond to one codon exclusively, but many of 2. Chain Initiation – first aminoacyl-tRNA is bound to them can recognize more than one codon d/t variations mRNA at the site that encodes the start of (wobble) in the allowed pattern of hydrogen bonding polypeptide synthesis - Wobble applies to first base of anticodon Ribosome + mRNA + 1st aminoacyl-tRNA Possibilities of codon-anticodon bp mRNA and ribosome bound to each other 1. Highly specific? All 61 tRNAs must be present 3. Chain Elongation – peptide bond formed between 2. Not strictly followed? amino acids; repeats itself until polypeptide chain is -A poly(U) sequence binds all Phe-tRNAPhe including UUC formed (specific to Phe) 2nd aminoacyl tRNA brought to ribosome peptide - Yeast tRNAAla bind GCU, GCC, GCA bond formed Therefore: Variations in pairing for 3rd bases are allowed 4. Chain Termination – THE GENETIC CODE Trp & Met only one codon tRNA – carry AA to site of protein synthesis - Single AA can have as many as six codons, as is the case mRNA – contain information of AA sequence with leucine & arginine tRNA – interacts with mRNA through anticodon loop - Codons sharing the same first letter often code for AA a. Triplet – sequence of three bases (codon) that are products of one another or precursors of one b. Nonoverlapping – no bases shared between consecutive another codons; ribosome moves along the mRNA three bases at CODON USAGE and RARE Trna a time - Some codons are used more than others (i.e. Leu, Pro, c. Commaless – no intervening bases exist between codons Ala, Lys, Glu) d. Degenerate – more than one triplet can encode the same - Use of rare tRNAs result to delay in translation amino acid e. Unambiguous – only one AA for each codon “Wobble” Hypothesis third base; buffer against deleterious f. Universal code - mutations DETERMINATON OF GENETIC CODE - 1st two bases pair with canonical partners 1. Synthetic mRNA + NTPs polypeptide - 3rd base “Wobble interaction” UUUUUU polyphenylalanin Therefore: Not all tRNAs have to be present UUU Phenylalanine - Any mutation in the third base would not change the AAAAA Polylysine amino acid AAA Lysine; CCC Proline; GGG glycine - Silent mutation mutation that doesn’t change 2. Filter-binding assay – various tRNA molecules, one of - When the wobble base of the anticodon is uracil can which is labeled with Carbon-14 are mixed with base pair not only with adenine, but also with guanine ribosomes and synthetic trinucleotides that are bound to - When wobble base is guanine can bp with cytosine & a filter also with uracil - Mixture of tRNAs is passed through a filter, and some - Purine base hypoxanthine frequently occurs in the bind and others pass through wobble position in many tRNAs can bp with ACU in the - If radioactive label is detected on the filter, then it is codon known that the particular tRNA did bind - A and C do not form any bp other than the expected ones - If radioactive label is found in a solution that flowed with U and G, respectively through the filter, then the tRNA did not bind - So, when the wobble position is occupied by I (from - This depends on the fact that aminoacyl-tRNAs bind inosine, nucleoside made up of ribose & hypoxanthine), strongly to ribosomes in the presence of the correct G, or U, variations in H bonding are allowed trinucleotide - Trinucleotide plays to role of an mRNA codon - EX: if the aminoacyl-tRNA for histidine binds to the ribosome in the presence of the trinucleotide CAU, the sequence CAU is established as a codon for His CONSEQUENCES OF WOBBLE BASE PAIRING SELECTIVITY AT TWO LEVELS 1. Fewer tRNAs are needed at a given time – other tRNAs in 1. Aminoacyl-AMP remains bound to the enzyme same family may substitute for missing tRNA EX: isoleucyl-tRNA synthetase can form an aminoacyl- 2. Misreading a base may not affect the AA incorporation in AMP of isoleucine or structurally similar Valine the protein - If the valyl moiety is transferred to tRNA for isoleucine, it 3. Mutations may or may not alter the protein is detected by an editing site in tRNA synthetase hydrolyze the incorrectly acylated aminoacyl-tRNA ROLE OF AMINOACYL-tRNA SYNTHETASES – need for activation - Selectivity resides in tRNA, not AA of AA and aminoacyl-tRNA 2. Depends on the selective recognition of tRNAs by 1. AA forms covalent bond to an adenine nucleotide aminoacyl-tRNA synthetases producing an aminoacyl-AMP - Specific binding sites on tRNA are recognized by 2. Free energy of ATP hydrolysis provides energy for aminoacyl-tRNA synthetases bond formation - Exact position of the recognition site varies with different 3. Aminoacyl moiety is then transferred to tRNA forming synthetases an aminoacyl-tRNA PROKARYOTIC TRANSLATION Aminoacyl-AMP mixed anhydride of a carboxylic acid + Ribosomal Architecture phosphoric acid - Protein synthesis requires the specific binding of mRNA - Anhydrides reactive compounds free energy and aminoacyl-tRNAs to the ribosome change for the hydrolysis of aminoacyl-AMP favors the - Ribosomes have specific arki that facilitates the binding 2nd step of overall rxn - 30S & 50S – small & large subunits - Another point that favors the process energy released CHAIN INITIATION when pyrophosphate (PPi) is hydrolyzed to - Synthesis of polypeptide chain starts at N-terminal orthophosphate (Pi) to replenish phosphate pool - Chain growth N-terminal to C-terminal - Coding strand & mRNA read from 5’3’ AMINO ACID ACTIVATION - 1. AA + ATP aminoacyl-AMP + PPi *anhydride bond bet. Carboxylate + phosphate
2. Aminoacyl-AMP + tRNA aminoacyl-tRNA* + AMP
*ester bond bet. Carboxylate + 3’OH of ribose NET: AA + ATP + tRNA aminoacyl-tRNA + AMP + PPi
TWO CLASSES OF AMINOACYL-tRNA SYNTHETASES
1. Class I – AA is added to 2’OH first before transfers to 3’OH 2. Class II – uses 3’ OH Synthetase requires Mg2+ and is highly specific both for the AA and for tRNA
SECOND GENETIC CODE – two-stage rxn allows for selectivity to