Molecular Modeling for Biological Systems (Supercomputer Teleconference) Part 4
- 1990-Jan-24
These captions and transcript were generated by a computer and may contain errors. If there are significant errors that should be corrected, please let us know by emailing digital@sciencehistory.org.
Transcript
00:00:00 Where a factor of a hundred more computer time power would be very useful in
00:00:05 Letting us really understand on a nanosecond timescale how protein active sites relax on response to different ligands
00:00:13 So there may be other you know algorithmic improvements, which helped us along
00:00:17 But I think this is an area where a couple or more orders of magnitude would be very useful. Thank you Peter
00:00:23 We now have Herschel on line 5. Herschel?
00:00:27 Hello Peter, this is Herschel Weintraub at Merrill Dow in Cincinnati
00:00:31 There's been some controversy that variations in
00:00:35 Windowing and other parameters will allow prediction of any either sign or value at the Delta Delta G
00:00:41 If this is the case that would limit the predictive value of the method although it would still allow
00:00:47 Retrospective analysis is this a result of the inability to calculate a small enough window or a long enough?
00:00:55 Simulation or is it something inherent in a instability of the technique?
00:01:00 That's a very good question Herschel nice to hear from you
00:01:03 the I think that with suitably chosen potential functions one can get very
00:01:09 Reliable numbers as I try to illustrate particularly for polar molecules reproducible numbers
00:01:15 and
00:01:16 And for ionic systems also with with the cutoff correction one gets very reproducible results if one can run the simulation
00:01:24 For let's say a hundred picoseconds
00:01:27 however for systems where there are large van der Waals changes
00:01:32 Then you may have to run into the nanoseconds for instance for
00:01:37 Mutating methane to nothing in water to get your answers to converge to tens of kilocalories
00:01:42 So a lot of your your things that you've heard I agree with some of it
00:01:47 But not with others in other words if you're happy with one kilocalorie
00:01:50 I think the current methodologies on solvation can do this for many molecules
00:01:55 All right, I believe you also have a question for Peter
00:01:58 Well, it's it's related to the whole problem of how much time does a computation really take?
00:02:06 We've talked about the computer time, but really how much time does it take?
00:02:11 How much time do you does a graduate student or somebody have to devote to a calculation?
00:02:16 To you know produce a reasonable answer that they're
00:02:21 That they're willing to publish I guess right that's a good question
00:02:24 And let me it gives me the opportunity to bring it to relate some of what my remarks to what Bill Jorgensen?
00:02:30 alluded to with base pairing in water study that was done by Liam Dang and in which we
00:02:36 calculated the absolute free energy of association of
00:02:40 Base pairs either stacked or hydrogen bonded in water an AT base pair
00:02:45 so we did the complementary simulation to what Bill had done in chloroform and
00:02:49 Confirmed what he thought would happen in that
00:02:53 It the stack form was more stable the hydrogen bonded one is virtually not now that simulation took of the order of
00:03:02 You know took hundreds of hours of Cray time because one had to set sample so much carefully
00:03:08 Carefully the association, and we're not sure we even in that case we sampled enough possible confirmations of base stacking
00:03:16 Although we know it's more stable because that kind of error would only increase the stability of the stacked form
00:03:23 But that calculation took I would say you know six months of sort of setup time thinking
00:03:30 Calculation on Liam Dang's part, and he is a very experienced modeler
00:03:34 So many of these calculations take a long time and careful thought I mean most of them do
00:03:39 Okay
00:03:40 We have a caller on line six Jan or Ben. Would you like to state your question, please hello?
00:03:46 I would like to know how you decide your starting confirmations
00:03:51 While you make the complex with the enzyme?
00:03:55 I mean to say how do you orient your inhibitors with the binding site and
00:04:02 How do you select the starting confirmation?
00:04:06 That's a very good question
00:04:08 The what we have done so far the two examples. I showed you on thermo lysin and
00:04:14 Ribonucleic nuclease t1 we had the structure of an inhibitor in each case
00:04:19 In the case of ribonuclease we didn't have the structure of the AMP model
00:04:24 And we created that by mutating the GMP to the AMP so that's a problem that
00:04:30 Combined use of what Jeff Blaney has done coming up with if one doesn't have the structure of a complex that is using distance geometry
00:04:37 And perhaps the second technique he he mentioned that was so successful on
00:04:44 methotrexate
00:04:45 DHFR interactions using that as a starting guess and then doing the mutation with with a few
00:04:51 Representative confirmations that come out of that guess if one doesn't have the structure of the crystal structure of a complex
00:04:57 So some of those distance geometry methods might be useful if one doesn't have a starting geometry for the protein inhibitor complex
00:05:05 Peter we have penny on line six penny your question
00:05:11 Can you hear me?
00:05:13 Line nine are you there?
00:05:15 short time span
00:05:17 Individual motions is there any effort to use such methods to look at large-scale?
00:05:24 Cooperative motions of larger pieces of a molecule
00:05:30 Yes, there are such efforts the time the limitation as I
00:05:35 Alluded to in my talk is that in molecular dynamics one can take only very the standard molecular dynamics one can take only very small
00:05:43 time steps
00:05:45 femtosecond and thus one is limited to
00:05:48 More short timescales just by computer limitations. There are attempts to develop new algorithms for instance by
00:05:56 Tamar schlick of New York University has another algorithm for doing molecular dynamics that might allow 10 to the minus 13th
00:06:03 second time steps, but these are areas of constant
00:06:08 Development attempting to make them the dynamics algorithm more efficient and allowing us to simulate larger and larger changes
00:06:16 Bonnie on line eight your question
00:06:19 Yes, I'm Bonnie Gun from Oxford, Mississippi
00:06:22 I'm interested you mentioned the fact that the free energy is greater in the salvation
00:06:28 So then the hydrogen bonds your stacking energy is being greater is the base stacking energy for the free for the new
00:06:36 They repeat this is the base stacking free energy and nucleic acids greater than that
00:06:41 Which are present in hydrogen bonds in solution if one applies these principles to polymeric DNA
00:06:50 That's a good question I think in polymeric DNA again coming back to what Bill Jorgensen said
00:06:56 We've got these ionic
00:06:57 Salvation problems and we have all the complexity of the other bases stacking on top of the the one base
00:07:03 And I think we haven't sorted that out yet
00:07:05 I think that's a an issue that's going to be sorted out in more complex DNA the role of hydrogen bonding and base stacking
00:07:12 but our preliminary analysis based on molecular mechanics and experimental results by people like Doug Turner of
00:07:18 Rochester University
00:07:20 Suggests that both play a critical role in DNA and RNA stability
00:07:26 Gentlemen, we need to break this up and we'll come back to it after we hear from Dr. David Case. Peter
00:07:31 We'll see you one more time for our panel discussion. Thanks very much
00:07:35 Dr. Case is a colleague of yours Art
00:07:38 Yes, Whitney. Dave has been a member in the Department of Molecular Biology at Scripps Clinic since 1986
00:07:44 He's been applying molecular modeling techniques to answer a number of biological questions
00:07:48 Including simulating the dynamics of oxygen escaping from myoglobin and hemoglobin
00:07:53 Today Dave is going to talk about the work. He's been doing tying molecular dynamics to experimental data
00:07:59 So here is Dr. Case with determining three-dimensional solution structures of proteins from NMR data
00:08:09 Thank you Whitney
00:08:11 As Art indicated one of the most promising areas of application of molecular modeling
00:08:16 Involves a close combination of theory and experiment in which the observed data serves as constraints in a molecular dynamics simulation
00:08:23 This powerful combination allows the modeler to make a fairly wide survey of conformational space
00:08:29 Looking for conformations that satisfy experimental constraints and at the same time have low computed energies
00:08:35 Two prominent sources of experimental data are x-ray crystallography and NMR spectroscopy
00:08:41 Essentially the same methods can be used in both fields and I will be talking about the ways in which
00:08:46 Dynamical simulations can be used in conjunction with NMR to determine the three-dimensional solution structures of proteins
00:08:54 The basic outline of the procedure is shown in the first slide
00:08:58 One begins with sequence specific assignments that is
00:09:02 Identifications of peaks in the NMR spectrum with particular protons in the sample
00:09:07 The ways in which these assignments are made are fascinating in their own right, but are outside the scope of my talk today
00:09:14 As I will illustrate briefly
00:09:16 Cross peaks in these spectra enable one to assign distance constraints that the structure must satisfy
00:09:22 Since cross peaks will appear only when two protons are within about five angstroms of each other
00:09:27 the theoretical procedure involves the use of distance geometry with which Jeff Blaney has already talked about and
00:09:34 Restrained molecular dynamics simulations the final structures can approach the quality of crystallographic structures
00:09:41 I'll be using as an example the zinc finger structure recently solved in a collaboration
00:09:46 Between Peter Wright's NMR lab at Scripps Clinic and my group
00:09:50 Zinc fingers are a recently discovered class of DNA binding domains
00:09:54 Each domain is a short stretch of polypeptide that has four side chains bound to zinc
00:10:00 The regulatory protein may contain many such domains
00:10:03 Shown in here is the sequence of the peptide that we studied
00:10:07 Which is a member of the class of zinc fingers that bind the metal through two cysteine and two histidine side chains
00:10:14 Our peptide is 25 amino acids in length and the circled residues are conserved or nearly conserved in all members of this class
00:10:23 Although we studied a synthetic peptide with this sequence domains like this appear repeatedly in real regulatory proteins
00:10:31 This slide shows a portion of the nuclear overhauser effect spectrum for the zinc finger peptide
00:10:37 The amide amide region is shown here and later on. We'll see the amide in this slide
00:10:42 We see the amide alpha proton cross peaks peaks along the diagonal of these two-dimensional spectra correspond to unique
00:10:49 identified protons in the spectrum
00:10:51 The off-diagonal peaks several of which are labeled here
00:10:55 Indicate pairs of protons that must be close in space since magnetization has been transferred between them during the course of the experiment
00:11:03 About a hundred and fifty such cross peaks have been measured for the zinc finger
00:11:07 For a larger protein say of a hundred amino acids it can be possible to identify more than a thousand such peaks a
00:11:15 Summary of the short-range peaks is shown here
00:11:18 The boxes across the top show sequential cross peaks that is cross peaks between the amide or C
00:11:25 Alpha-proton of residue I and the amide proton of residue I plus one
00:11:30 Peaks like this are diagnostic of the local secondary structure
00:11:34 Typically alpha amide cross peaks are seen on extended structures like sheets and amide amide peaks occur in helices
00:11:42 further evidence for helix in the region of residues 13 to 22 is the existence of I to I plus 2 I
00:11:49 To I plus 3 and I to I plus 4 cross peaks which are shown as bars below the boxes
00:11:55 In many instances one can simply read off helix and sheet structure from plots like this without doing any calculations at all
00:12:03 The distribution of longer range cross peaks is shown here
00:12:08 These are the peaks that eventually determine how the protein secondary structure elements interact with each other to give the tertiary structure of the protein
00:12:17 In a peptide as small as the zinc finger only a dozen or so such peaks are seen
00:12:22 But we shall see that these are sufficient to determine the overall structure and again for larger proteins
00:12:28 The number of observed long-range peaks is of course much larger
00:12:32 The experimental input then consists of a number of distance constraints to divide from the nuclear overhauser effect data
00:12:39 In addition as Jeff talked about we know a lot of information about distances just from knowing the chemical structure of the protein
00:12:47 Turning distances into three-dimensional structures is generally accomplished through distance geometry calculations
00:12:53 The basic mathematical ideas are reviewed in the next slide
00:12:57 The metric matrix G is defined to give the dot product between the coordinate vectors of two atoms
00:13:03 Of course, we don't know the coordinates if we don't know the structure
00:13:07 But from the law of cosines the dot products can be computed from the distances. I
00:13:12 Don't show it on this slide, but it is also the case that distances from the center of gravity position
00:13:17 Oh can also be determined just from inner atomic distances
00:13:22 The net result is that the metric matrix G can be determined if you know all of the inner atomic distances
00:13:29 Since only some of these are known one has to make a guess for the remainder
00:13:33 Repeatedly running through the algorithm with different guesses then generates a variety of structures
00:13:40 Once the metric matrix is available three-dimensional coordinates can be obtained by the method outlined in the next slide
00:13:46 This slide shows a theorem which states that the metric matrix must be of rank 3 in order to identify with a real object
00:13:54 that means that the metric matrix G must have just three nonzero eigenvalues if
00:14:01 Lambdas are the eigenvalues and W is the eigenvector then the relation boxed at the bottom of this slide
00:14:07 Allows one to construct the coordinates X from these
00:14:11 If the metric matrix G actually has more than three nonzero
00:14:15 Eigenvalues one can simply ignore all but the first three and still use this formula
00:14:21 This procedure amounts to projecting the object defined by the distances on to three-dimensional space
00:14:28 The structures directly resulting from distance geometry are generally quite poor by NMR standards
00:14:34 Partially because not all of the distances are known and partially because of the loss of information
00:14:39 Entailed in throwing away all but the first three eigenvalues
00:14:44 We typically use restrained molecular dynamics to improve or refine these structures
00:14:49 in this procedure the energy function used is a combination of a molecular mechanics force field and
00:14:55 Penalty functions that are defined to become increasingly positive as the NMR constraints are violated
00:15:02 One could just perform an energy minimization in this combined potential
00:15:06 but energy minimization tends to get trapped very easily in local minima a
00:15:10 More robust optimization procedure uses molecular dynamics to anneal the system as illustrated in the next slide
00:15:18 The heavy line in this slide shows the system temperature as a function of simulation time for the protocol that we most often use
00:15:26 In a vacuum calculation there is no problem with heating the system to 1200 degrees Kelvin
00:15:31 Especially since the presence of many NMR constraints forces the system to stay near its experimental geometry
00:15:37 what the high temperature equilibration does is allow the system to explore many regions of conformational space as
00:15:44 The temperature is slowly reduced back to zero most of the molecule most often will fall into a relatively good local minimum
00:15:52 Although this is certainly not a global minimization algorithm
00:15:55 We have found that the more slowly you can afford to cool the system the better on the average the results will be
00:16:01 Although one quickly hits a law of diminishing returns where further simulations don't lead to much improvement
00:16:08 How good are the structures that one obtains by this method?
00:16:12 This is a difficult question to answer since the true solution structures are not known
00:16:17 One approach is to look for convergence of structures for different choices of inter atomic distances
00:16:22 If randomized starting points all lead to the same final structure one might have some
00:16:27 Confidence that these are being driven by experimental data. That is that the resulting structures are quote real unquote a
00:16:35 Second approach to judging quality is to compare the results with x-ray crystallographic structures where these are available
00:16:41 I'll give an example of each
00:16:44 The first example is of the zinc finger
00:16:46 I mentioned earlier where the NMR spectra were taken by Min Lee and Peter Wright at Scripps Clinic and the computer analysis was done by
00:16:53 Dr. Soman, Gary Gifford and myself
00:16:56 The overall structure of the zinc finger is shown here
00:16:59 there is an extended portion and a turn at the end terminal end at the top of the video and
00:17:04 A fairly long helix toward the C terminal end at the bottom
00:17:08 the zinc atom is bound to two cysteines that are on opposite sides of a reverse turn and
00:17:13 To two histidines that are a part of the long helix
00:17:16 you first saw a ribbon representation of the polypeptide backbone backbone and now a space-filling view of the entire molecule as
00:17:24 An aside this was this video was recorded in real time on a Sun workstation with a graphics accelerator board
00:17:30 And gives you a feeling of the interactive rendering power of relatively inexpensive equipment
00:17:36 The next slide shows a superposition of 37 structures for the zinc finger peptide
00:17:41 The backbone is shown in orange along with the yellow cysteine and blue histidine ligands to the zinc
00:17:47 It is clear that the backbone and ligation geometries are quite well determined
00:17:51 Some statistics about this structure are shown in this figure
00:17:55 Which shows the standard deviations in the backbone first to the angle Phi and in the next slide of the angle Psi
00:18:02 In the turn region from residues four to eight and at the C terminus there is considerable uncertainty in the backbone structure
00:18:08 But the backbone confirmation in the helical region from reach from residues 13 to 22 is very well described
00:18:15 In the next slide I add side chains to the picture of the molecule
00:18:19 You can see that there is much more uncertainty about the positions of side chains than there was about the polypeptide backbone
00:18:26 Still some interesting conclusions can be drawn
00:18:29 Possibly such charge side chains are shown in blue
00:18:31 And it is clear in spite of the spread in the structures that one side of the long helix along the right
00:18:37 has an array of positively charged side chains that could be well situated to interact with a groove of DNA an
00:18:44 Artist rendering of what might be happening is shown in the next slide
00:18:47 Where here you are looking down the long helix of the peptide as it is positioned in the major groove of DNA I?
00:18:54 Should emphasize that we have no clear experimental evidence for this mode of binding although further experiments are in progress
00:19:01 It is also worth remembering that the actual regulatory proteins had many such domains
00:19:05 And that is likely the binding to DNA involves cooperative interactions among these domains
00:19:12 As a second example, I'd like to talk briefly about plastocyanin, which is an electron transport protein active in photosystem one
00:19:21 Here the NMR spectra were provided by John Moore and Peter Wright in my department and the computational work was done by Gary Gifford and myself
00:19:29 This protein has 99 amino acids and is more of a real protein than is the zinc finger
00:19:34 And we can compare our solution structures with crystallographic results from a slightly different plastocyanin
00:19:40 The NMR structure uses the protein from French bean leaves whereas the x-ray structure is of the poplar leaf protein
00:19:47 But the two are expected to be very close
00:19:50 The second video clip shows the NMR structure in blue and the x-ray structure in yellow
00:19:56 in addition to the backbone tube you will see a copper atom and it's for ligands at the top of the molecule and
00:20:03 For phenylalanine side chains that pack closely to form a hydrophobic core in the center of the protein
00:20:09 In this case we have much more extensive data than is currently available for the zinc finger
00:20:14 And we are able to pinpoint the locations of the side chains with greater confidence
00:20:18 I think you will agree that there was a remarkable agreement between the solution and crystal structures as you saw
00:20:24 That last animation was not made in real time, but consisted of 720 frames that differed by a rotation about the vertical axis
00:20:32 It took us about 12 hours to render the images and about a day to record them one by one on the videotape
00:20:40 One important difference between the data set for plastocyanin and that for the zinc finger is our knowledge in plastocyanin of
00:20:47 Stereospecific assignments for many of the methylene beta protons and for the methyl groups in valine and leucine
00:20:53 That is in favorable circumstances the two pro chiral protons on the c-beta side chain can be individually assigned
00:21:01 This allows one to generate much more specific and tighter experimental constraints and the next slide
00:21:07 I show a portion of plastocyanin, which was calculated from structures before such stereospecific assignments were available
00:21:14 That data set is about the same quality as I talked earlier with for the zinc finger
00:21:20 As you can see most of the side chains are poorly described with large uncertainties in their positions
00:21:25 and this again is roughly comparable to the results we found for the zinc finger and
00:21:29 Arises largely from a lack of cross peaks that could help restrict the values of the side chain dihedral angles
00:21:35 the next figure shows the same structures in red that you saw in a previous slide and
00:21:41 Superimposed on top of them structures determined with additional constraints the spread in position has now been substantially reduced
00:21:48 furthermore the new positions closely resemble those seen in the x-ray structure, which are shown in light blue a
00:21:54 Similar figure is shown next for the four central phenylalanine residues
00:21:59 Again, the red structures were determined with an early data set and the yellow with more complete information
00:22:05 The newer structures are better defined and in general closer to the x-ray configurations
00:22:10 This is especially noticeable for phenylalanine 82 at the center left
00:22:18 Aside from just applying this general procedure to new proteins
00:22:21 There are a number of improvements in the refinement procedure that appear to be feasible
00:22:26 My group is looking closely at quantitation of the cross peak intensities
00:22:30 Rather than simply saying that the existence of a cross peak implies a short distance, which is really only a qualitative statement
00:22:37 We are back calculating the nuclear overhauser spectrum from our structures and refining the structures into this data
00:22:44 The basic problem mathematical problem is outlined in this slide
00:22:49 The magnetization transfer process in the experiment involves not just two spins at a time
00:22:55 But actually all of the spins in the protein which interact according to a set of linear kinetic equations the block equations
00:23:02 Which are shown at the top of the slide?
00:23:04 The rate matrix elements are depend upon the inverse sixth power of the distance and also upon the spectral density function J
00:23:12 Which in simple cases has the Lorentzian form shown at the bottom of the slide?
00:23:17 Here tau is the rotational correlation time for a rotational diffusion of the entire protein in
00:23:23 Many cases it appears that this simple model for motion is adequate to understand the motional contribution to the nuclear overhauser spectrum
00:23:31 by solving the differential equation shown at the top of this slide the
00:23:35 Intensity of each peak can be calculated and compared with experiment in a process that is completely analogous to that used in crystallography
00:23:43 where computed and experimental structure factors are also compared as
00:23:47 in crystallography initial structures can be updated to improve agreement with the observed spectrum and
00:23:53 Overall figures of merit such as our factors can be determined to quantify the agreement
00:23:58 This is a very new area, but one of them that offers exciting possibilities for getting additional accurate structural information
00:24:05 from NMR data
00:24:20 While we're waiting for your calls to come through I believe art has a question nice graphics
00:24:28 Actually I wanted to pursue that the last point that you made about actually doing refinement on a figure of merit
00:24:35 analogous to crystallography
00:24:37 It sounds like a great idea especially being a lapse crystallographer to have a number to associate with
00:24:43 The quality of a structure, but where do you think the bottlenecks are in and actually implementing that approach?
00:24:49 Well, that's a very computationally intensive approach right now, and that's one of the bottlenecks
00:24:54 But I think we can actually do it on on proteins of the size of system
00:24:57 We can afford to do NMR on right now
00:24:59 We're seeing
00:25:01 Crystallographic our factors are things that are calculated in the same way as crystallographic our factors in the range of 10 to 20 percent for
00:25:07 Many of our structures. We don't know if that 10 to 20 percent number. It means the same thing as a crystallographic our factor
00:25:14 The big bottleneck for NMR is really that we're still limited to proteins of less than about 150 amino acids
00:25:22 So we can't begin to approach the size of problem yet that
00:25:26 Crystallography can can handle all right we have bill on line 5 bill your question
00:25:32 Yes, this is a question for anyone on the panel
00:25:36 I have a
00:25:38 Group of University of Tennessee pharmacy students with me taking a course in drug design
00:25:44 Could you provide them with any examples of drugs?
00:25:49 In clinical trials or beyond that have been discovered through molecular modeling
00:25:56 Well, I guess I'm the only person up here with it with a microphone on right now
00:26:00 And I don't know the answer to that
00:26:03 I maybe we'll ask Jeff that question during the panel discussion, and we'll save that one
00:26:06 I think that that's a good question to
00:26:09 Get everybody's opinion on
00:26:13 Do you have another question?
00:26:15 well actually I was interested in the general problem of
00:26:20 Getting ensembles of structures out
00:26:22 I mean we're used to looking at a crystal structure and thinking of it as the structure and now
00:26:28 More and more people are beginning to realize that that these this might not be the structure
00:26:33 But part of an ensemble of structures, especially
00:26:36 the solution
00:26:37 structures
00:26:38 How how do you think we're going to deal with?
00:26:42 An explosion of information in one way or another how are we going to represent that when we distribute that kind of data?
00:26:47 Well distributing it. I think just means we have to distribute more
00:26:51 examples from the family
00:26:53 I'm not sure that's any worse of a problem in solution than it is in crystals because most protein crystals
00:26:58 have a lot of water in them and actually all the evidence we have indicates that the
00:27:02 Uncertainty or the spread of the family and crystals is about the same as it is in solution at room temperature
00:27:08 crystallographers are facing up to this problem finally as
00:27:11 NMR spectroscopists are in the sense that that one has to refine a number of structures into the data and
00:27:18 Eventually in places like the protein databank one has to report those to the to the community and then we're still developing those methods
00:27:24 But but I think they're coming all right you have calls waiting the first one is ping on line six your question ping
00:27:31 Yes
00:27:33 This is Ping Yang call from Pima more in Terra Haute, Indiana. I have a question
00:27:40 When you have fast exchange in my proton in your
00:27:45 system
00:27:46 It's very hard to get the NOE data, and do you have method to?
00:27:53 special especially
00:27:55 deal with that problem, or if you don't then do you miss the
00:28:00 distance measurement on that set of data and
00:28:05 The distance geometric calculation method suffer from the point. That's a good question
00:28:11 Clearly it's easiest to do the experiment in d2o
00:28:14 Because you don't have to get rid of the water peak in the NMR spectrum
00:28:17 And so for the protons as you mentioned that don't exchange rapidly
00:28:21 With solvent one can put make the solvent become d2o and do the experiment
00:28:25 But for proteins one can also do these experiments in water solution where all of the amide peaks are there and although
00:28:32 There are problems with the pre saturation of the water pulse
00:28:35 Spectrometers and pulse sequences keep getting better and better and we could actually get all of the amide amide data
00:28:41 We need an amide alpha data and so on from experiments in water as well as experiments in d2o
00:28:46 So it's not as big a problem as as you might have guessed from your question
00:28:50 The problem is not the water exchange is the a my proton it has the fast exchange
00:28:59 Situation not just with the water if the amide proton itself
00:29:03 You will get the overlapping peak on the broke raw peak amide region. It's not that water exchange
00:29:10 Directly, okay. It may have the fast
00:29:13 exchange on the amide proton itself and then in that case you have the difficulty to locate the
00:29:20 position on
00:29:21 the
00:29:23 NOE and
00:29:25 NH proton, okay. I'm not talking about a pre saturation water signal technique itself
00:29:32 Well, certainly the broad
00:29:34 Certainly the broadness of the peaks is a problem both from exchange effects and once as the protein gets bigger and bigger
00:29:41 rotational diffusion slows down and the peaks become broader for that reason too and and it's a real
00:29:47 Science and I'm not really an NMR spectroscopist, but I'm a computer person myself
00:29:51 But but it's it's a real art and science to be able to resolve and quantitate
00:29:56 NOE information off of amide protons as you say and I don't know completely where the limits of that will be
00:30:04 some
00:30:05 Improvement can be can be obtained for example by random partial due duration of large proteins
00:30:10 Frank line eight go ahead with your call. Oh, Dave. This is Frank Brown. I'm with Glaxo and RTP, North Carolina
00:30:18 My question is about the protocol that you outline at the beginning your talk
00:30:23 In the protocol that you have stated you would collect the data and the distances and go on into your structure generation
00:30:30 But how do you know if you have enough data to give you a decent structure as the protocol you outline?
00:30:37 Without enough data, you'd go and make a bunch of structures which become meaningless
00:30:42 Well, I don't think I think in most cases in proteins
00:30:45 You don't get completely meaningless data
00:30:46 but you can see with the less data you have the the wider the spread and the structures is and
00:30:52 And certainly one has to also worry about errors in the data
00:30:55 If there's not a lot of redundant data because then individual mistakes can make a bigger a bigger importance
00:31:00 There also is an iterative effect of these things in the sense that once you have some preliminary structures
00:31:06 You can also often use those structures to to assign more of the of the cross peaks in your spectrum
00:31:11 And therefore you bootstrap your way into more data
00:31:14 Right now we don't really have any way of saying before you do the calculation whether or not you're going to get a good structure
00:31:21 But but it's generally pretty evident from both the number of structures that converge and the divergence amongst the converge structures
00:31:28 Whether or not you have enough data in the end
00:31:31 The examples I showed tended to be for proteins where we do have a lot of data
00:31:35 Things don't look nearly that good when you have a lot less data as you might expect
00:31:40 Line 5 like she me your question, please
00:31:44 This is Lakshmi Narasimhan from Upjohn Company, Kalamazoo, Michigan
00:31:48 Dr. Case, are there any examples of cases where even tight binding inhibitors
00:31:55 their distances to protein atoms have been
00:32:01 Identified through NMR techniques
00:32:05 Yes, there are there's a whole class of experiment called a transferred NOE experiment in which
00:32:11 Essentially you you do the NMR on the this is the case where you have a ligand that's an exchange
00:32:16 It has to be actually fast exchange with
00:32:18 With bound state in a free state and you actually do the NMR on the free state where the assignments are easy
00:32:24 but because of
00:32:25 That those nosy peaks are actually remembering what where they were in the in the bound state
00:32:31 The people have done the most with this or the NIH group of chlorine grown and born their co-workers
00:32:36 And there are a number of examples of this
00:32:39 It won't work in really tight binding situations because then the off rate for getting the ligand off of the protein is too slow
00:32:46 But in a fairly wide range of intermediate situations the transferred NOE experiment
00:32:51 I think will start to tell us a lot about inhibitor binding
00:32:55 even in cases where maybe the
00:32:57 Thing that is binding to as much too large to do the complete NMR experiment on on the protein itself
00:33:03 David we have a call from you on Richard from Richard on line six
00:33:08 Yes, Dr. Case
00:33:10 Many molecules of biological interest are embedded within a lipid matrix such as a membrane
00:33:17 have
00:33:18 structures of this sort studies on structures of this sort been undertaken and
00:33:23 What approaches might be needed to determine?
00:33:27 a structure of a protein within this environment
00:33:30 Yeah, that's a really very good question and and actually the methods I talked about won't work
00:33:35 At least in their present form in situations like that because the high resolution NMR
00:33:41 Which is required to assign all the resonances to get the distance information that I use is
00:33:46 Generally available only when the molecule is tumbling fairly rapidly in a solvent like water or some other similar NMR like solvent
00:33:53 There has been some recent work not necessarily in membranes
00:33:57 But in lipid vesicles at getting some amount of high-resolution NMR
00:34:01 And actually getting enough tumbling time for for molecules that are embedded in in my cells and kind of artificial
00:34:09 nonpolar environments
00:34:11 but the level of precision and detail that one is seeing in those experiments is really reminiscent of
00:34:17 Water NMR ten years ago where you may get a few distances that actually may be very important
00:34:22 But nothing yet like the ability to determine an entire structure in those kinds of environments
00:34:28 All right, Frank is back again on line eight with a follow-up question for you
00:34:35 About the data beforehand you can actually look at the correlation in the data before you actually undergo
00:34:42 minimization or your dynamics to understand whether or not you have a high enough correlation to give you a
00:34:49 Decent refined structure and that's called the template
00:34:53 analysis
00:34:55 The NMR template analysis and it's an available technique that you can use to identify regions, which will have a stable structure
00:35:04 That's right
00:35:04 Frank is referring to some work that was published this year or last year in the Journal of the American Chemical Society and some other
00:35:10 places on
00:35:11 on methods both to handle kind of determining how good the structures might be and also to handle their the very interesting problem of
00:35:19 Deciding in a peptide or some other small system that might be floppy or a drug system
00:35:24 Where you might not even know in advance whether there was a quote native structure or not
00:35:28 Whether it was worth your while to to pursue that or whether what what the data should be expected to to determine it
00:35:35 I'm glad you you brought that up
00:35:37 when I said that you really had to do the calculation first time is maybe looking at a
00:35:41 Situation where it was clear that one had hundreds to thousands of cross peaks and a native structure of a protein
00:35:47 But the real kind of detailed question about the quality of the structure, I don't know of a good way of deciding
00:35:54 Beforehand for that kind of problem right now
00:35:57 Thank you, David
00:35:58 We're going to take a three-minute stretch break right now so that we can set up the stage for one of our panel
00:36:03 Discussions the phones are open and we encourage you to start placing your calls now one last time the numbers to call are
00:36:10 1-800-942-1515 for California
00:36:15 1-800-972-1515 for the US and 1-619-265-6429
00:36:22 For Canada
00:39:30 We have callers waiting the first one is Nathan on line 5. Nathan your question, please
00:39:36 Hello, my name is Nathan Singh from the University of Kentucky
00:39:39 Could someone on the panel, please elaborate on the benefits and drawbacks of various minimization procedures
00:39:46 Such as steepest descent conjugate gradient and Newton Rolfson procedures
00:39:52 Thank you
00:39:54 Let's start out Peter
00:39:55 yeah
00:39:57 the the techniques steepest descent and
00:40:02 conjugate gradient
00:40:04 Are the simpler methods they require less computer memory
00:40:09 But they converge less well near the minimum
00:40:14 The Newton Rafson techniques have the disadvantage in that they require a storage of an n by n matrix where?
00:40:22 n is the number of
00:40:24 Coordinates 3n in terms of the number of atoms
00:40:27 So one tends to use Newton Rafson if one has systems of up to 400 or 500 atoms Dave case has actually done some
00:40:35 Minimizations on large systems like that, but for much larger proteins or other macromolecules
00:40:42 one needs to use conjugate gradient, which is
00:40:45 quite convergent
00:40:47 but is
00:40:49 Doesn't require them the storage
00:40:51 Steepest descent is just a way to get rid of the first really terrible gradients and doesn't converge that well
00:40:57 So one abandons that quickly and goes to conjugate gradient
00:41:01 Jeff on line 6. Let's hear your question
00:41:05 Jeff Noss calling from the Walter Reed Army Institute of Research in Washington DC
00:41:10 There's a variety of commercial micro graphics and micro modeling packages available on the market
00:41:16 And they use different potential fields charm versus amber amber for example
00:41:21 Is there any studies of differences of these force fields on the results that you can get?
00:41:26 using different classes of model compounds nucleic acids proteins
00:41:31 carbohydrates and lipids
00:41:35 Anyone on the panel
00:41:38 The
00:41:40 The most recent sort of study that I know of is one that took place in
00:41:45 1984 although there are many unpublished papers in the literature or on unpublished unpublished works on
00:41:52 on selected classes of molecules
00:41:55 and I think this is an evolving field where
00:41:59 We don't really know the answer and force fields are continually being refined. So
00:42:04 You just need to look I would advise you just to look at the class of molecules you're interested in and
00:42:11 You know and make an evaluation there mm2 and mm3 is of course
00:42:16 Is the best bet for the most refined molecular mechanics force field for organic molecules?
00:42:22 And it's most likely to be correct for those kinds of molecules
00:42:27 Dr. Coleman, we have Serena on line 8 who also has a question for you
00:42:32 Hello, Dr. Coleman
00:42:33 We're interested in modeling nucleic acid systems that are have platinum
00:42:39 Metal ions interacting with the nucleic acids. I was wondering with the amber force field
00:42:46 What is the best methods of introducing metals into the force field and how does one?
00:42:52 determine force constants and the like
00:42:55 The the best method that
00:42:58 one would use in that case for very covalent interaction is to actually put it in as a covalent bond and the specific example you
00:43:05 Mentioned the platinum nucleic acids has already been studied by a number of groups
00:43:10 Kozelka
00:43:12 Petsko Lippard at MIT have published a number of papers. I think around 1987 on exactly that problem using
00:43:19 Using our molecular mechanics software
00:43:21 I think Kozelka has gone back to Paris and done other things and what they find is an equilibrium between
00:43:27 Non-kinked and kinked DNA in the presence of platinum
00:43:31 All right, Dr. Blaney, Surat is on line 11. Surat your question
00:43:36 My question is directed for Dr. Jeff Blaney
00:43:40 It has two parts
00:43:41 Can the information obtained from 2d NMR be used for docking of this small ligand molecule?
00:43:48 onto a receptor or DNA
00:43:50 If yes, then how do you feed the information of intermolecular context detected by 2d NMR?
00:44:00 All right
00:44:01 The answer is yes, you could use 2d NMR information if you had it for example if you had
00:44:07 Intermolecular NOEs that told you something about distances between specific
00:44:12 Atoms in the ligand and and your protein site you can express those as distance constraints and use them to guide the docking
00:44:20 Other experiments that can give you that beyond the transfer NOE that the Dave mentioned
00:44:25 You might look in papers published by Steve Fessick's lab in which they've done a series of isotope edited experiments
00:44:32 For example with n15 labeled peptides or even n15 labeled protein to get intermolecular NOEs out
00:44:39 All right, dr. Jorgensen you have a question on line 7 from Bonnie you're on Bonnie your question
00:44:49 Are you there Bonnie if so pick up the phone all right perhaps she'll get back to us
00:44:55 But art I believe you had a follow-up comment
00:44:57 All right, actually I wanted to get back to the question that was asked of Dave case during his talk
00:45:02 Which is where are the examples? What are they the examples of?
00:45:06 Say drugs or or other biologically active molecules designed by these methods and I'd like to put it open to the panel
00:45:14 But I'd also like to to open it up to the the callers if any of you out there
00:45:19 Know of and can talk about any of these
00:45:23 success stories call in and tell us
00:45:26 Jeff you're probably the first person to I can mention the examples
00:45:30 I know of two of them are in the references and in my section at the back
00:45:34 The only one I know of that's actually I believe gone as far as clinical trials was the work from welcome on the anti-sickling agents
00:45:40 I don't know how far they've gone with those compounds except. I believe they did make it into the clinic the
00:45:46 Work on the phospholipase a2 inhibitors at DuPont is still going on, but has not made it to clinical trials as yet
00:45:52 There are I believe several examples in the agricultural chemical area and pesticide or herbicide design
00:46:00 And I think if you look for a review article by Toshio Fujita, you may find examples on use of QSAR
00:46:07 That was successfully used to design compounds that I believe may have even gone into commercial production there
00:46:13 There are a variety of other examples Lee Kuypers work at welcome on
00:46:17 using modeling to design better inhibitors
00:46:20 But they have not turned out to be
00:46:23 Compounds useful in the clinic. All right. Dr. Jorgensen. We have Bonnie back on the line on line five
00:46:29 Bonnie
00:46:32 I'm sorry line eight Bonnie
00:46:35 This is Bonnie Gun from Oxford, Mississippi
00:46:38 It has been suggested that the bifurcated hydrogen bond themed in the AT region of a of DNA is
00:46:46 The reason for the stability which has been seen in these melting behaviors
00:46:50 Could these hydrogens bond be in fact strong enough to account for this behavior?
00:47:00 I
00:47:02 Think the
00:47:06 I'm not sure exactly how that relates to some of the results
00:47:09 I I showed and I'm not as familiar with with the I guess the topic as I'd like to be my impression
00:47:17 Is that AT and GC mismatches don't show huge variations in some oligonucleotides?
00:47:25 in melting
00:47:27 Behavior which is surprising in the sense of GC is much has much more
00:47:34 strong hydrogen bonding
00:47:36 on the other hand as a key point of my talk was that you have to keep in mind all of the other effects and
00:47:44 including the solvent effect on these
00:47:47 Interactions, which is very strong and damps out the strong GC hydrogen bonding as compared to
00:47:54 AT in water that we don't see in solvents such as chloroform
00:48:01 All right, we have bill on line one to any of our panelists bill your question
00:48:06 Yes, my question is directed to Peter Coleman
00:48:11 Peter this is Bill Welsh from St. Louis
00:48:14 I enjoyed your program very much
00:48:17 my question relates to
00:48:19 Activity in our group involving metalloproteins and
00:48:23 Could you comment on?
00:48:27 When you're conducting mutations say of one metal ion to another and in particular
00:48:33 Say one metal ion to another one of a different balance state like Fe 2 to Fe 3
00:48:38 What particular problems should one be watchful of in dealing with?
00:48:43 In dealing with
00:48:45 Say perturbation calculations or MD calculations in general regarding such a transformation and number two if I could just interject one
00:48:53 more question
00:48:55 in terms of
00:48:57 parameterizing for metal ions contained in metalloproteins
00:49:02 Could you comment on as you know, there's a lack of of
00:49:06 Good parameters for metal ions in these environments. Could you comment on how one best obtains?
00:49:14 reliable
00:49:15 Force field parameters for metal ions in these situations. Thank you
00:49:20 Nice nice to hear from you Bill
00:49:23 the your question is a very good and difficult one to answer in a general way, but
00:49:30 Kenny Murs, for instance, we worked very hard on zinc parameters
00:49:35 Zinc has a charge of plus two as soon as you get to something that has a charge of more than plus one
00:49:39 It is a great challenge to develop
00:49:43 Molecular mechanical parameters for them and you can use two approaches one is the one I mentioned for the platinum
00:49:49 compounds where you use explicit covalent bonds and don't allow a lot of flexibility and the second is to put a
00:49:56 Formal charge and use a non-bonded interaction to represent the metal which can lead to very large
00:50:02 Electrostatic fields and often give you artifacts. We had to struggle with this a lot in both our thermal isin
00:50:08 Calculations and the ones that Kenny Murs has done on carbonic anhydrase
00:50:12 So I guess I'd have to be fairly wishy-washy here and say you use as much
00:50:16 Experimental data as you can and you test out your model for instance to make sure it's behaving correctly on small model systems
00:50:23 Before you go on to more complicated systems
00:50:26 But this is an example of a kind of thing that Bill Jorgensen
00:50:30 Answered with respect to ionic systems doing the free energy calculations on those is a tremendously
00:50:36 Challenging and difficult thing because there are such large energies and there's so large cutoff corrections. Can I just add something to that Peter?
00:50:43 One one thing to avoid or to recognize is that one transition metal ions, especially are in are in ligation complexes
00:50:51 The
00:50:52 Concentration of charge on the metal ion is not anyplace nearly as large as you might think from the formal oxidation stage
00:50:57 And one generally has to do some quantum mechanical calculation to get a distributed charge model
00:51:02 So that in going from iron 2 to iron 3 one doesn't change just one electron charge on the metal
00:51:08 But the but the charge is almost always distributed over quite a large number of atoms
00:51:12 Jeff it's just one other comment that I had on that
00:51:16 Donuts has published at least two papers that I can think of on a force field. They've developed for doing metalloproteins
00:51:23 This has been published sometime in the last few years
00:51:27 Dick line one go ahead, please
00:51:31 Dick Keys Cal State LA this is a general question to any of the panelists
00:51:38 The question is whether there's enough experience with these systems to apply
00:51:46 expert systems to looking for
00:51:49 local minima
00:51:54 That's a hot topic who would like to pick up
00:51:57 You should answer it art. Well, I'll answer it in one way and that is that that I think that
00:52:04 When the experts are still battling amongst themselves, there's no sense in setting up an expert system on the other hand
00:52:11 There's a lot of tedium in setting up the input for any of these programs takes a lot of
00:52:17 basic grunt work to do it and I think that
00:52:20 Expertise exists in those areas and could be encoded to make the interface for for preparation
00:52:27 Much more facile. I don't know if anybody here has any other comments about it
00:52:34 Okay, if not, Dr. Blaney, Eric is on line two
00:52:39 Hi
00:52:41 If you have a series of five or six compounds with diverse
00:52:45 and flexible chemical structures
00:52:47 That seem to bind to the same site based on biochemical evidence, but there's no obvious way to relate them
00:52:53 Are there computational techniques that can help define the karma for pharmacophore?
00:52:58 There
00:53:00 There are some but I think you're still going to have to rely a lot on
00:53:04 What you probably call chemical intuition?
00:53:07 Approaches that seem to be popular for that and will work occasionally maybe
00:53:11 Generate an electrostatic potential surface for example and try and align the molecules in that way
00:53:16 You'll find several papers in the journal molecular graphics and maybe some in journal of computational chemistry
00:53:22 from Phil Dean's group describing
00:53:24 Methods of doing that I think with with me it usually comes down to
00:53:29 trying to find
00:53:31 Which sites in each one of the molecules have structure activity relationships that are similar?
00:53:36 I might look for example for an aromatic substituent and a position on aromatic ring that has the same behavior in each different class of
00:53:44 Structures and then from infer infer from that that they might overlap the same way
00:53:49 That's always a challenging problem. We typically will try several orientations
00:53:54 Until we hopefully converge on one that we think is unique
00:53:57 Our last question is for anyone on the panel it is from from Arthur on line five
00:54:03 Hi, I'm calling from Storrs, Connecticut, and I'm interested in finding out
00:54:08 besides the
00:54:10 Distance geometry approach that has been
00:54:13 very well discussed thus far
00:54:15 There there apparently are some other approaches in general that have not been discussed in any great detail to this point
00:54:23 What I'm getting at is the burning issue. I guess amongst additional chemists and agricultural chemists is
00:54:30 an inability to be able to take a large number of compounds and create let's say a hypothetical active site or a
00:54:38 receptor site to model into
00:54:41 Would like to address this question in general to the panel and I'll await your response
00:54:45 All right
00:54:49 The the approach is available right right now
00:54:53 You know there's the confer approach that was published by Kramer and Bunce that Peter talked about I
00:54:58 Think that one looks very encouraging
00:55:00 Provided you already have an idea of your own of how to overlap these molecules in a pharmacophore
00:55:05 That's always going to be your bottleneck. Okay. You have a must interrupt. I'm very sorry
00:55:10 I want to thank our panelists David case Peter Coleman Bill Jorgensen and Jeff Blaney and our major underwriter digital equipment corporation
00:55:23 Question what computer company has the largest number of software applications for chemists answer
00:55:32 digital equipment corporation
00:55:35 question
00:55:36 What company is the leader in providing integrated solutions for research scientists answer
00:55:43 digital equipment corporation
00:55:46 question
00:55:48 What company recently added vector processing to their family of computer systems?
00:55:53 answer
00:55:55 digital equipment corporation
00:55:58 Still things are not so black and white in today's world of computational chemistry and digital knows that
00:56:06 For years, we've been working closely with scientists helping them search for the unknown and make new discoveries
00:56:12 in fact, the first vac system ever sold was purchased to perform advanced computational chemistry applications and
00:56:20 Digital is collaborating with the quantum chemistry program exchange as well as with other software suppliers that provide chemistry related software
00:56:29 Presently digital is working with Dr. Peter Coleman and associates at the University of California, San Francisco
00:56:34 To optimize amber for the backs 9,000 and back 6,000 computers with vector processing options
00:56:41 The university will make this version available to the scientific community
00:56:45 the backs 9,000 systems with vectors deliver supercomputer performance and
00:56:50 Like other backs is the backs 9,000 is fully compatible with our entire family of backs computers and workstations
00:56:57 All offer a choice of either digitals VMS or Ultrex operating systems
00:57:02 Our risk-based deck system family of high-performance workstations and systems running Ultrex
00:57:07 Demonstrates our continuing commitment to open systems, and there is more to come
00:57:13 We know how important visualization technologies are to molecular biology
00:57:18 Digital backs and risk workstations along with leading public and commercial visualization software provide a broad set of scientific
00:57:26 visualization tools our commitment in this area is strong and active
00:57:31 For presentation of research results digital's compound document architecture integrates text and graphics even from different systems
00:57:39 into scientific documents
00:57:42 Scientific collaboration and communication depend on network technology, which links the research community locally and worldwide
00:57:51 Digital is the leader in supporting network standards and in providing advanced network technology
00:57:57 Digital products are designed for use in a distributed environment
00:58:00 With systems from other vendors connecting resources including supercomputers
00:58:06 From the lab bench to the supercomputer from your desk to information sources worldwide
00:58:11 Digital has the integrated computing solutions for your research environment
00:58:17 Digital Equipment Corporation make that discovery
00:58:30 I
00:58:41 Know that you the viewers have received a great deal of information today
00:58:45 If you have more questions
00:58:47 You can contact our panelists by calling their institutions or by getting their phone numbers from the San Diego
00:58:54 supercomputer Center or the American Chemical Society
00:58:57 One last special thanks this to Dr.
00:59:00 Roseanne Steckler from the San Diego Supercomputer Center for acting as technical consultant and lining up our speakers for today
00:59:08 On behalf of the supercomputer Center, San Diego State University and the American Chemical Society
00:59:15 Continuing Education Department. I want to thank all of you for joining us. We hope you have found the program informative
00:59:22 Please don't forget to complete your course evaluations and return them to ACS and please note that the American Chemical Society's
00:59:30 Next video conference on polymer properties is scheduled for March 16th
00:59:35 1990 the ACS will be announcing their lineup of courses for the autumn very soon. We hope you'll join us again
00:59:42 Thank you and good day