PETER PAZMANY SEMMELWEIS UNIVERSITYCATHOLIC UNIVERSITY Development of Complex Curricula for Molecular Bionics and Infobionics Programs within a consortial* framework** Consortium leader PETER PAZMANY CATHOLIC UNIVERSITY Consortium members SEMMELWEIS UNIVERSITY, DIALOG CAMPUS PUBLISHER The Project has been realised with the support of the European Union and has been co-financed by the European Social Fund *** **Molekuláris bionika és Infobionika Szakok tananyagának komplex fejlesztése konzorciumi keretben * * * A p ro j e kt a z Eu ró p a i U n i ó t á m o g a t á sá v a l , a z E u r ó p a i S z o c i á l i s A l a p t á rsf i n a n sz í ro z á sá v a l v a l ó su l m e g . INTRODUCTION TO BIOPHYSICS (Bevezetés a biofizikába) STRUCTURE OF PROTEINS (A fehérjék szerkezete) GYÖRFFY DÁNIEL, ZÁVODSZKY PÉTER Introduction to biophysics: Structure of proteins Introduction • Proteins are linear polymers of amino acids • Four main levels of the structure of proteinsare discerned – Sequence of amino acids – Local conformational elements – Global spatial arrangement of the whole protein – Subunit structure of proteins consisting of two ormore chains • Several secondary structural elements canform motives • Several motives can form domains determine the structure and drive the foldingof proteins • An entropic force, called hydrophobic effect isthe main contributor to the stability of proteins • The structure of proteins can be described andstudied by means of statistical physics • The native state of proteins is usually the statewith the lowest free energy • The folding of proteins is often a cooperative process • A funnel-shaped free energy landscapedescribes the folding and association ofproteins • Dynamic properties of proteins are importantfor the function • Amino acids are compounds containing both acarboxyl and an amino group • Amino acids forming proteins are amino acids in which the amino group is connected to the .-carbon • Only twenty of the .-amino acids take part inbuilding up protein chains • Protein forming amino acids can be groupedbased on their chemical properties such as hydrophobicity or acid-base properties • Different kinds of amino acids have different side chains while their backbone atoms are the same • A general amino acid can be seen in thefollowing figure • The .-amino group is coloured red and the .­carboxyl group is coloured blue • R represents the side chain General forms of amino acids Chirality of amino acids • Amino acids -except glycine -have asymmetriccarbon atoms • Due to asymmetric .-carbon atoms, opticalisomerism occurs, so two isomers of such amino acids exist which are mirror images of each other(this phenomenon is called chirality) • In proteins, only the L-conformers of amino acids are found (Figure 2.) • Amino acid derivatives forming the protein chains are often referred to as 'residues' L and D-amino acids Introduction to biophysics: Structure of proteins Electrical properties of amino acids • In some pH range, amino acids can beelectrically neutral but they have dissociablegroups such as carboxyl group or amino group • At low pH, the carboxyl group is protonatedwhile at high pH, the H+ ion dissociates from it • At high pH, the amino group is deprotonatedwhile at small pH, it carries one more H+ ion • There exists a pH range where both the .­carboxyl and the .-amino groups are chargedwhich is called a zwitter-ion (Figure 3.) Zwitter-ion • Several amino acids have multiple dissociable groups (see the following table) Dissociable groups Introduction to biophysics: Structure of proteins Groups of proteinogenic amino acids • Amino acids can be clustered into several groups based on their chemical properties – Hydrophophobic, aliphatic – Aromatic – Uncharged polar – Positively charged – Negatively charged Hydrophobic, aliphatic amino acids Aromatic amino acids Uncharged polar amino acids Positively charged amino acids Negatively charged amino acids 20 Peptide bond • Amino acids are connected to each other bythe peptide bond • Peptide bond is an ester-bond between the .­carboxyl group of an amino acid and the .­amino group of another one • The peptide bond is approximately planar dueto delocalization of electrons between singleand double covalent bonds (see the followingfigure) – The bond between C. and amino-N by the . angle – The bond between C.and the carbon atom in the carbonyl group by the . angle • . and . angles are called torsion angles • The energy of a given conformation is afunction of the two torsion angles • The plot depicting the energy versus torsionangles is called a Ramachandran plot • On the Ramachandran plot, there are distinctareas corresponding to special local structuralelements The planar peptide bond The torsion angles Introduction to biophysics: Structure of proteins Levels of protein structure • The primary structure of proteins is the linear sequence of amino acids forming thepolypeptide chain • The secondary structure consists of local, oftenregular and periodic structural elements • The tertiary structure of a protein is the globalspatial arrangement of its atoms • Quaternary structure is the subunit structureof proteins consisting of more than one chains Elements of secondary structure • Secondary structure consists of local structuralelements stabilized by interactions betweenatoms being close to each other • The elements are determined according to thepermitted areas on the Ramachandran plotand the hydrogen bonds possible to be formed • More such secondary structural elements canform structural motifs and supersecondaryelements Main types of secondary structural elements • Helices are produced by a translation followed by a rotation • Several helices are found in the structures of proteins, each of which is characterized by thetorsion angles of the part of the chain forming the helix • Most of the helices are right-handed except the.-helix of which there exists a left-handed variation • Helices are stabilized by hydrogen bonds withinthe backbone • Due to the electrostatic properties of aminoacids, helices have a dipole moment pointingfrom the C towards the N-terminus Helix types * The number of atoms in the ring formed by making the hydrogen bond .-helix Hydrogen bonds within .-helix Introduction to biophysics: Structure of proteins Extended structures • Extended structures can also be considered as helices because they can be produced byapplying translations and rotations • Extended structures are the most stretched conformations permitted by excluded volumeconstraints • The most frequent type of extended structuresis the ß-strand • Several ß-strands can form a ß-sheet within which the strands can be in parallel or antiparallel orientation Antiparallel ß-sheet Hydrogen bonds in antiparallel ß-sheets Parallel beta sheet Hydrogen bonds in parallel ß-sheets Non-periodic elements: turns and loops • Turns are short elements usually consisting ofonly a few residues • Turns are stabilized by a hydrogen bondformed within the turn • Loops are longer, extended elements but theydo not have a regular structure • Usually loops are the most flexible parts ofproteins Types of turns ß-turn Introduction to biophysics: Structure of proteins Structural motifs or supersecondary elements • Secondary structural elements can assembleto bigger units called motifs or supersecondaryelements • Only a small number of such motifs are knownbut they can be found in a huge number of structures • Supersecondary elements together can formstructural domains Introduction to biophysics: Structure of proteins Supersecondary elements • The tertiary structure of proteins is the globalarrangement of their secondary structureelements, stabilized by interactions betweenelements far away from each other along the sequence • By tertiary structure, the main classes of proteins are: globular, fibrillar and membraneproteins • These types of structures are stabilized by thesame covalent bonds and non-covalent interactions Structural domains • The most important structural unit of proteinsis the domain • Domains are defined as portions of structurewithin which there are more interactions than between them • Structural domains usually correspond to folding units and even functional units • Structural classifications of proteins are basedon domains Structural classification of proteins in the CATH database Mainly . architecture types 47 Mainly . architecture types Introduction to biophysics: Structure of proteins Mainly ß architecture types Mainly ß architecture types Table 8. Introduction to biophysics: Structure of proteins 54 Few secondary structures Quaternary structure • Several protein chains together can form together a non-covalent complex, called anoligomer • The arrangement of such chains defines thequaternary structure of such proteins • The sequence of chains can be identical(homooligomers) or different (heterooligomers) Quaternary structure of hemoglobin • Some non-protein molecules can be attachedto the proteins • Co-enzymes bind to enzymes via non-covalentinteractions • Prosthetic groups bind tightly or covalently to the protein chain • In enzymes, the cofactors are responsible forthe reaction type and the protein – theapoenzyme – is responsible for the substratespecificity • Short-range repulsion is due to the repulsion of the orbitals of electrons • Van der Waals interactions are attractive forces between induced dipole moments • Short-range repulsion and van der Waalsattraction can be treated in one expression, theLennard-Jones potential • Lennard-Jones potential contains a termcorresponding to the r-12 repulsion and a term corresponding to the r-6 van der Waals attraction • The Lennard-Jones potential is: Interactions stabilizing the structure Table 9. ...12 ..6 . rm rm V LJ =.-2· rr where . is the depth of the potential well, r is the distance of the two particles and rm is the distance where the potential reaches itsminimum The Lennard-Jones potential • Electrostatic interactions are formed between a positively charged – for example lysine – anda negatively charged – for example aspartate –residue • Due to the screening effect of water, theinfluence of these interactions is restricted to short distances • The disulfide bond is a covalent bond formed by two SH-groups of cysteine residues • Because the formation of disulfide bonds requires oxidative conditions, onlyextracellular proteins have disulfide bonds • Disulfide bonds have an important role in thestabilization of small proteins Disulfide bond • Protein stability and folding are strongly determined by the hydrophobic effect • The hydrophobic effect is based on an entropyincrease upon the association of hydrophobic groups • Around hydrophobic surfaces, water moleculesadopt the ordered arrangement which has alow entropy • Reducing hydrophobic surface allows the water molecules to be released, accompanied by anentropy increase Introduction to biophysics: Structure of proteins • The increase in solvent compensates for the decrease in the entropy of the protein chain • Increasing the entropy decreases the free energy • Due to hydrophobic effect, residues withhydrophobic side chains collapse to a core ofthe structure of protein • So water-soluble proteins have a hydrophobic core and a polar surface • Proteins with hydrophobic side chains on theirsurface have an intrinsic propensity for aggregation The hydrophobic effect Introduction to biophysics: Structure of proteins Protein folding • It is known since the famous experiment of Anfinsen that the primary structure of proteins determines the spatial structure under thegiven conditions • The structure in which proteins can performtheir physiological function is called the native state • The native state corresponds to the global freeenergy minimum • The process through which protein chains reach their native state is called folding 1916-1995) • Native state of proteins corresponds to theglobal free energy minimum • Assuming only three distinct conformationalstates per residue, and time of 10-13 seconds to switch between states, it would take 1.6·1027 years for an only 100 residue long peptidechain to reach its state with minimal free energy • In real proteins, this time would be far longer because of the practically infinite number ofconformers per residue • Contrary to this, proteins in nature reach their native state in at most a few seconds • Based on his calculation, Levinthal postulatedpaths by which a protein folds and assumedthat not only the final structure but also thepath to it is encoded in the primary structure • The funnel-shaped landscape of proteins willresolve this paradox Introduction to biophysics: Structure of proteins Statistical mechanical description of protein structure and folding • The protein chain and the solvent surrounding it can be considered as a subsystem which is in thermal equilibrium with the environment • Thus, the Boltzmann distribution is valid for the microstates of the subsystem consisting ofthe protein chain and the solvent • • Although the native state contains only one or a few chain conformations and thus relativelyfew microstates correspond to it, it can be themost favourable state because of its low energy • The protein chain-solvent system has manydegrees of freedom so we are forced to makethe model of it simpler • Chain conformation can be described by thepositions of its atoms • Many solvent arrangements belong to a singlechain conformation • Potential energy functions can be constructedto determine the energy of a givenconformation so the energy is the function ofdegrees of freedom • If we average the energy over the solventarrangements, we can consider only thedegrees of freedom of the chain itself • Degrees of freedom can be merged to a feworder parameters • Energy can be plotted as a function of degreesof freedom or as a function of order parameters . The shape of such surfaces resembles a funnel • Levinthal's paradox can be resolved based onthe funnel-shaped energy landscape • Folding can proceed through several paths butevery path has its end on the bottom of thefunnel 2D folding funnel Dill KA, Chan Levinthal to pathways to funnel. Nat. Struct. Biol. 4(1):10-9. 09/10/11. TÁMOP – 4.1.2-08/2/A/KMR-2009-0006 • Some mesostates, containing one or more microstates can be defined and a partition function can be calculated for it • The free energy of mesostates can be calculated from the partition function by F =-kT ln Q where Q is the partition function and F is the free energy . If we plot energy as a function of degrees offreedom we obtain a free energy surface Free energy profile of two-state folding