首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 0 毫秒
1.
An analysis of the geometry and the orientation of metal ionsbound to histidine residues in proteins is presented. Cationsare found to lie in the imidazole plane along the lone pairon the nitrogen atom. Out of the two tautomeric forms of theimidazole ring, the NE2-protonated form is normally preferred.However, when bound to a metal ion the ND1-protonated form ispredominant and NE2 is the ligand atom. When the metal coordinationis through ND1, steric interactions shift the side chain torsionalangle, X2 from its preferred value of 90 or 270. The orientationof histidine residues is usually stabilized through hydrogenbonding; ND1-protonated form of a helical residue can form ahydrogen bond with the carbonyl oxygen atom in the precedingturn of the helix. A considerable number of ligands are foundin helices and ß-sheets. A helical residue hound toa heme group is usually found near the C-terminus of the helix.Two ligand groups four residues apart in a helix, or two residuesapart in a ß-strand are used in many proteins to bindmetal ions.  相似文献   

2.
A method has been developed to detect pairs of positions withcorrelated mutations in protein multiple sequence alignments.The method is based on reconstruction of the phylogenetic treefor a set of sequences and statistical analysis of the distributionof mutations in the branches of the tree. The database of homology-derivedprotein structures (HSSP) is used as the source of multiplesequence alignments for proteins of known three-dimensionalstructure. We analyse pairs of positions with correlated mutationsin 67 protein families and show quantitatively that the presenceof such positions is a typical feature of protein families.A significant but weak tendency is observed for correlated residuepairs to be close in the three-dimensional structure. With furtherimprovements, methods of this type may be useful for the predictionof residue-residue contacts and subsequent prediction of proteinstructure using distance geometry algorithms. In conclusion,we suggest a new experimental approach to protein structuredetermination in which selection of functional mutants afterrandom mutagenesis and analysis of correlated mutations providesufficient proximity constraints for calculation of the proteinfold  相似文献   

3.
Secondary structure formation in model polypeptide chains   总被引:1,自引:0,他引:1  
Model polypeptide chains were folded into 3-D compact conformationsusing distance geometry techniques. Interresidue distances werepredicted from the hydrophobicity of the monomers and wererefined by repeated projections into lower-dimensional spaces.Main-chain hydrogen bond networks were constructed and propagatedthrough the structure by adjusting local conformations to complywith ideal distance constraints around hydrogen bonds. The resultingfolds were compact globules with distinct hydrophobic coresand contained secondary structure elements like real proteinmolecules. Apart from similarity in appearance, several propertiesof the model chains were also very close to those of nativefolded polypeptides. The method in its present form can serveas a starting point for the development of a novel structureprediction algorithm  相似文献   

4.
An artificial neural network system is used for pattern recognitionin protein side-chain-side-chain contact maps. A back-propagationnetwork was trained on a set of patterns which are popular inside-chain contact maps of protein structures. Several neuralnetwork architectures and different training parameters weretested to decide on the best combination for the neural network.The resulting network can distinguish between original (fromprotein structures) and randomized patterns with an accuracyof 84.5% and a Matthews' coefficient of 0.72 for the testingset. Applications of this system for protein structure evaluationand refinement are also proposed. Examples include structuresobtained after the application of molecular dynamics to crystalstructures, structures obtained from X-ray crystallography atvarious stages of refinement, structures obtained from a denovo folding algorithm and deliberately misfolded structures.  相似文献   

5.
A data bank merging related protein structures and sequences   总被引:1,自引:0,他引:1  
A data collection which merges protein structural and sequenceinformation is described. Structural superpositions amongstproteins with similar main-chain fold were performed or collectedfrom the literature. Sequences taken from the protein primarystructure databases were associated with the multiple structuralalignments providing they were at least 50% homologous in residueidentity to one of the structural sequences and at least 50%of the structural sequence residues were alignable. Such restrictionsallow reasonable confidence that the primary sequences sharethe conformation of the tertiary structural templates, exceptin the less conserved loop regions. Multiple structural superpositionswere collected for 38 familial groups containing a total of209 tertiary structures; 45 structures had no superposable matesand were used individually. Other information is also providedas main-chain and side-chain conformational angles, secondarystructural assignments and the like. Wedding the primary andtertiary structural data resulted in an 8-fold increase of databank sequence entries over those associated with the known three-dimensionalarchitectures alone.  相似文献   

6.
An empirical relationship between occupancy and the atomic displacementparameter of water molecules in protein crystal structures hasbeen found by comparing a set of well refined sperm whale myoglobincrystal structures. The relationship agrees with a series ofindependent structural features whose impact on water occupancycan easily be predicted as well as with other known data andis independent of the protein fold. The estimation of the wateroccupancy in protein crystal structures may help in understandingthe physico-chemical properties of the protein–solventinterface and can allow the monitoring of the accuracy of theprotein crystal structure refinement.  相似文献   

7.
We present an efficient technique for the comparison of proteinstructures. The algorithm uses a vector representation of thesecondary structure elements and searches for spatial configurationsof secondary structure elements in proteins. In such recurringprotein folds, the order of the secondary structure elementsin the protein chains is disregarded. The method is based onthe geometric hashing paradigm and implements approaches originatingin computer vision. It represents and matches the secondarystructure element vectors in a 3-D translation and rotationinvariant manner. The matching of a pair of proteins takes onaverage under 3 s on a Silicon Graphics Indigo2 workstation,allowing extensive all-against-all comparisons of the data setof non-redundant protein structures. Here we have carried outsuch a comparison for a data set of over 500 protein molecules.The detection of recurring topological and non-topological,secondary structure element order-independent protein foldsmay provide further insight into evolution. Moreover, as theserecurring folding units are likely to be conformationalHy favourable,the availability of a data set of such topological motifs canserve as a rich input for threading routines. Below, we describethis rapid technique and the results it has obtained. Whilesome of the obtained matches conserve the order of the secondarystructure elements, others are entirely order independent. Asan example, we focus on the results obtained for Che Y, a signaltransduction protein, and on the profilin-ß-actincomplex. The Che Y molecule is composed of a five-stranded,parallel ß-sheet flanked by five helices. Here weshow its similarity with the Escherichia coli elongation factor,with L-arabinose binding protein, with haloalkane dehalogenaseand with adenylate kinase. The profilin–ß-actincontains an antiparallel ß-pleated sheet with -helicaltermini. Its similarities to lipase, fructose disphosphataseand ß-lactamase are displayed.  相似文献   

8.
A new similarity score (-score) is proposed which is able tofind the correct protein structure among the very close alternativesand to distinguish between correct and deliberately misfoldedstructures. This score is based on the general principle `similarlikes similar', and it favors hydrophobic and hydrophilic contacts,and disfavors hydrophobic-to-hydrophilic contacts in proteins.The values of -scores calculated for the high-resolution proteinstructures from the representative set are compared with thoseof alternatives: (i) very close alternatives which are onlyslightly distorted by conformational energy minimization invacuo; (ii) alternatives with subsequently growing distortions,generated by molecular dynamics simulations in vacuo; (iii)structures derived by molecular dynamics simulation in solventat 300 K; (iv) deliberately misfolded protein models. In nearlyall tested cases the similarity score can successfully distinguishbetween experimental structure and its alternatives, even ifthe root mean square displacement of all heavy atoms is lessthan 1 Å. The confidence interval of the similarity scorewas estimated using the high-resolution X-ray structures ofdomain pairs related by non-crystallographic symmetry. The similarityscore can be used for the evaluation of the general qualityof the protein models, choosing the correct structures amongthe very close alternatives, characterization of models simulatingfolding/unfolding, etc.  相似文献   

9.
A relational database of protein structure has been developedto enable rapid and flexible enquiries about the occurrenceof many aspects of protein architecture. The coordinates of294 proteins from the Brookhaven Data Bank have been processedby standard computer programs to generate many additional termsthat quantify aspects of protein structure. These terms includesolvent accessibility, main-chain and side-chain dihedral angles,and secondary structure. In a relational database, the informationis stored in tables with columns holding the different termsand rows holding the different entries for the terms. The differentrelational base tables store the information about the proteincoordinate set, the different chains in the protein, the aminoacid residues and ligands, the atomic coordinates, the saltbridges, the hydrogen bonds, the disulphide bridges and theclose tertiary contacts. The database was established underORACLE management system. Enquiries are constructed in ORACLEusing SQL (structured query language) which is simple to useand alleviates the need for extensive computer programs. A singletable can be searched for entries that meet various criteria,e.g. all protein solved to better than a given resolution. Thepower of the database occurs when several tables, or the entriesin a single table, are cross-correlated. For example the dihedralangles of proline in the fourth position in an -helix in highresolution structures can be rapidly obtained. The structuraldatabase provides a powerful tool to obtain empirical rulesabout protein conformation. This database of protein structuresis part of a joint project between Birkbeck College and LeedsUniversity to establish an integrated data resource of proteinsequences and structures (ISIS) that encodes the complex patternsof residues and coordinates that define protein conformation.The entire data resource (ISIS) will provide a system to guideall areas of protein modelling including structure prediction,site-directed mutagenesis and de novo protein design. The availabilityof ISIS is described in the paper.  相似文献   

10.
We present a statistical analysis of protein structures basedon interatomic Ca distances. The overall distance distributionsreflect in detail the contents of sequence-specific substructuresmaintained by local interactions (such as -helixes) and longerrange interactions (such as disulfide bridges and ß-sheets).We also show that a volume scaling of the distances makes distancedistributions for protein chains of different length superimposable.Distance distributions were also calculated specifically foramino acids separated by a given number of residues. Specificfeatures in these distributions are visible for sequence separationsof up to 20 amino acid residues. A simple representation, whichpreserves most of the information in the distance distributions,was obtained using six parameters only. The parameters giverise to canonical distance intervals and when predicting coarse-graineddistance constraints by methods such as data-driven artificialneural networks, these should preferably be selected from theseintervals. We discuss the use of the six parameters for determiningor reconstructing 3-D protein structures.  相似文献   

11.
Genetic algorithms are very efficient search mechanisms whichmutate, recombine and select amongst tentative solutions toa problem until a near optimal one is achieved. We introducethem as a new tool to study proteins. The identification andmotivation for different fitness functions is discussed. Theevolution of the zinc finger sequence motif from a random startis modelled. User specified changes of the repressor structurewere simulated and critical sites and exchanges for mutagenesisidentified. Vast conformational spaces are efficiently searchedas illustrated by the ab initio folding of a model protein ofa four ß strand bundle. The genetic algorithm simulationwhich mimicked important folding constraints as overall hydrophobicpackaging and a propensity of the betaphilic residues for transpositions achieved a unique fold. Cooperativity in the ßstrand regions and a length of 3–5 for the interconnectingloops was critical. Specific interaction sites were considerablyless effective in driving the fold.  相似文献   

12.
A new method based on neural network theory is presented toanalyze and quantify the information content of far UV circulardichroism spectra. Using a backpropagation network model witha single hidden layer between input and output, it was possibleto deduce five different secondary structure fractions (helix,parallel and antiparallel ß-sheet, ß-turnand random coil) with satisfactory correlations between calculatedand measured secondary structure data. We demonstrate that foreach wavelength interval a specific network is suitable. Theremaining discrepancy between the secondary structure data fromneural network prediction and crystallography may be attributedto errors in the determination of protein concentration andrandom noise in the CD signal, as indicated by simulations.  相似文献   

13.
A methodology is proposed to solve a difficult modeling problemrelated to the recently sequenced P39 protein. This sequenceshares no similarity with any known 3D structure, but a foldis proposed by several threading tools. The difficulty in aligningthe target sequence on one of the proposed template structuresis overcome by combining the results of several available predictionmethods and by refining a rational consensus between them. Insilico validation of the obtained model and a preliminary cross-checkwith experimental features allow us to state that this borderlineprediction is at least reasonable. This model raises relevanthypotheses on the main structural features of the protein andallows the design of site-directed mutations. Knowing the geneticcontext of the P39 reading frame, we are now able to suggesta function for the P39 protein: it would act as a periplasmicsubstrate-binding protein.  相似文献   

14.
An automatic procedure for building a protein polyalanine backbonefrom C positions and ‘spare parts’ retrieved froma data base of 66 high-resolution protein structures is described.Protein backbones are constructed from over-lapping fragmentsof variable length, which allows the backbone of regular secondarystructure elements to be built in one block. The procedure isshown to yield backbones which compare very favourably withthose from highly refined X-ray structures (r.m.s. deviationbetween generated and crystal structures <lÅ). Themethod is furthermore quite insensitive to experimental errorsin C positions as well as to the size of the data base, andis seen to yield valuable insight into the relationships betweensequence and 3-D structure: one example on triose phosphateisomerase, a ß-barrel protein, shows that ßloops can be considered as structurally more uncommon than ßloops. The ‘spare parts’ approach is also foundto be useful for general-purpose modelling of local structuralchanges produced by insertion or deletion of residues. It should,however, be used with caution. Crude selection criteria basedsolely on fragment length and geometric fit to the loop baseregions yield realistic backbones in about two-thirds of thetest cases (r.m.s. deviations from refined crystal structure{small tilde}lÅ). In the remaining cases, sequence information,in particular the presence of glycine residues which tend toadopt more unusual backbone conformations, must be consideredto obtain comparable results.  相似文献   

15.
By random approaches we have previously isolated many variantsof Escherichia coli ß-galactosidase within a shortcontiguous tract near the N-terminus (residues 8–12 ofwildtype enzyme), some of which have increased stability towardsheat and denaturants. The activity of these mutants was originallyanalysed and quantitated in situ in activity gels without theaddition of magnesium ions to the buffer system. We now showthat the improved stability is only observable under such conditionsof limiting magnesium ion concentrations or in the presenceof appropriate concentrations of a metal chelator. In the presenceof EDTA, purified preparations of one of these mutant enzymeswere much more resistant to denaturants than wild-type, butthis differential was completely nullified in the presence of1 mM Mg2+. However, the stability of this mutant enzyme in EDTAwas lower than that shown by it, or the wild-type enzyme, inthe presence of magnesium ions. In addition, certain alterationswithin another N-terminal tract (residues 27–31 of wild-type)resulted in enzymes with greater dependence on Mg2+ than naturalß-galactosidase. We conclude that a small number ofresidue changes in a large protein can profoundly modulate therequirement for metal ion stabilization, allowing partial abrogationof this need in certain cases. Thus, some enzymes which requiredivalent metal ions for structural purposes only may be engineeredtowards metal independence.  相似文献   

16.
The prediction and characterization of metal binding sites in proteins   总被引:2,自引:0,他引:2  
The rational engineering of novel functions into proteins canonly be attempted when the underlying structural scaffold onwhich the new function is displayed and the structure of thetarget protein are both well understood. To introduce functionsmediated by metals it is therefore necessary to identify theprincipal liganding residues for the chosen metal, the requiredarchitecture of the metal-ligand complex and sites within thetarget protein that could accommodate such sites. Here we presenta method that applies structural information from the proteindata bank to the ab initio design and characterization of novelmetal binding sites. The prediction method has been tested on28 metalloprotein structures from the Brookhaven Protein DataBank. It successfully identified >90% of the metal bindingsites. In addition, we have used the method to design and characterizezinc binding sites in two antibody structures. Metal bindingstudies on one of these putative metalloantibodies showed metalbinding, confirming the predictive power of the method.  相似文献   

17.
A multiple sequence alignment algorithm is described that usesa dynamic programming-based pattern construction method to aligna set of homologous sequences based on their common patternof conserved sequence elements. This pattern-induced multi-sequencealignment (PUMA) algorithm can employ secondary-structure dependentgap penalties for use in comparative modelling of new sequenceswhen the three-dimensional structure of one or more membersof the same family is known. We show that the use of secondarystructure information can significantly improve the accuracyof aligning structure boundaries in a set of homologous sequenceseven when the structure of only one member of the family isknown  相似文献   

18.
19.
In search of the ideal protein sequence   总被引:1,自引:0,他引:1  
The inverse of a folding problem is to find the ideal sequencethat folds into a particular protein structure. This problemhas been addressed using the topology fingerprintbased threadingalgorithm, capable of calculating a score (energy) of an arbitrarysequence-structure pair. At first, the search is conducted byunconstrained minimization of the energy in sequence space.It is shown that using energy as the only design criterion leadsto spurious solutions with incorrect amino acid composition.The problem lies in the general features of the protein energysurface as a function of both structure and sequence. The proposedsolution is to design the sequence by maximizing the differencebetween its energy in the desired structure and in other knownprotein structures. Depending on the size of the database ofstructures ‘to avoid’, sequences bearing significantsimilarity to the native sequence of the target protein areobtained using this procedure.  相似文献   

20.
We tested a disulfide-rich antifreeze protein as a potential scaffold for design or selection of proteins with the capability of binding periodically organized surfaces. The natural antifreeze protein is a beta-helix with a strikingly regular two-dimensional grid of threonine side chains on its ice-binding face. Amino acid substitutions were made on this face to replace blocks of native threonines with other amino acids spanning the range of beta-sheet propensities. The variants, displaying arrays of distinct functional groups, were studied by mass spectrometry, reversed-phase high performance liquid chromatography, thiol reactivity and circular dichroism and NMR spectroscopies to assess their structures and stabilities relative to wild type. The mutants are well expressed in bacteria, despite the potential for mis-folding inherent in these 84-residue proteins with 16 cysteines. We demonstrate that most of the mutants essentially retain the native fold. This disulfide bonded beta-helical scaffold, thermally stable and remarkably tolerant of amino acid substitutions, is therefore useful for design and engineering of macromolecules with the potential to bind various targeted ordered material surfaces.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号