Track your comments!
[x]


When you register, comments on your articles and replies to your comments appear here. Register Now!

Sign in to your account
[x]

Not a Scientific Blogging member yet?

Register Now for a Free Scientificblogging.com Account

  • Customize your profile with pictures, banner, a blogroll and more.
  • Leave comments on articles, add other members to your friend lists, chat with people on the site.
  • Write blog posts that can be seen by hundreds of thousands of readers.

It's free and it only takes a minute!

Already a Scientific Blogging member?

Sign In Now

Banner
By Michael White | November 14th 2008 01:55 PM | 7 comments | Print | E-mail | Track Comments
About Michael

Welcome to Adaptive Complexity, where I write about genomics, systems biology, evolution, and the connection between science and literature,

...

View Michael's Profile

Molecular biologists have long operated on the principle that knowing the structure of a biological entity is critical for understanding how it works. Most famously, this was the premise behind one of biology's most iconic discoveries, Watson and Crick's model of the structure of DNA. Structure-function studies have been the foundation of much of molecular biology ever since.

Although the structure of DNA yielded almost immediate insight into an important biological problem, solving structures hasn't always resulted in a eureka moment. The same year Watson and Crick received their Nobel Prize, two other scientists, John Kendrew and Max Perutz, were also awarded the Nobel for determining the structure of a biological molecule. Unfortunatly for Kendrew and Perutz, instead of a flash of insight the result was incomprehension. They had determined the structure of two related proteins, myoglobin and hemoglobin, and these structures at first glance looked like just an irregular mass of thousands of atoms.

Happily, the befuddlement didn't last long. Scientists quickly learned how protein structures explain their function, and today we have amazing structural snapshots of proteins in action. These studies of structure have helped biologists understand the gritty details of key biological processes, such as how membrane-embedded ion pumps enable our nerves to conduct electrical signals. Using a protein's structure to understand its function has now become routine.

But today biologists are facing another moment of incomprehension. We're staring at structures of a different type of biological entity: a network, not an irregular mass of atoms, but one of connections. We know that biological networks give cells their ability to make sense of the world, to process information, to sense the environment or the cells' own internal state, and to take appropriate action. Scientists have been mapping these networks in great detail for years now, but the result is frequently just a giant, molecular hairball (or 'ridiculogram', as a friend calls it).

In other words, scientists are facing yet another giant structure-function problem. How do the strucures of biological networks result in something functional?


Hemoglobin (left) and the Yeast Protein Interaction Network (right)


Biologists began to make functional sense of protein structures by detecting patterns. In the first structures, Max Perutz and John Kendrew identified helices which had been predicted from theory by the physical chemist Linus Pauling. As more structures came out, scientists began identifying recurring protein folds, called domains, which are modular structures that can be involved in carrying out specific tasks. These domains have been reused over and over in nature - nearly all proteins, of the millions of proteins identified in the thousands of organisms sequenced, contain one of several thousand known protein folding domains. These domains are usually clearly recognizable from the amino acid sequence of the protein, which means that often scientists do not need to actually do the experiment to determine a protein's structure - enough of the structure can usually be predicted computationally.

Domains are structures that provide important clues to the function of a protein. Certain protein domains bind to DNA for example, and thus if you discover a new protein that contains a homeodomain, you can make a good bet that your new protein binds a DNA sequence called a homeobox.

Protein structure information tells us more than just what domains make up a protein. By carrying out detailed structure-function experiments, biologists are able provide a physical explanation of how enzymes and other proteins work. Why is hemoglobin more likely to bind a second oxygen molecule after binding one? Because when the first oxygen binds, it distorts to structure of the protein and changes the shape of the binding site for the next oxygen molecule. How does the digestive enzyme pepsin chop up other proteins? An aspartate amino acid positioned just right 'activates' a water molecule, which can in turn break a peptide bond.

What this means is that our knowledge of how proteins work is based on efforts to understand how structure generates function. At first, the functional properties of protein structures were not intuitively obvious, but after detailed, hypothesis-testing experiments, scientists quickly figured out some general ideas, and now structure-function studies are routine (though not easy by any means).

Network Structure-Function Studies

Biologists are now facing a new structure-function challenge: networks. New technologies have made it possible to map cellular networks on a scale not possible just a decade ago. For many biological systems, we have a good picture of which proteins interact with each other, which regulators control which genes, and what molecular path signals follow as they are passed from the outside environment of the cell to the relevant cellular machinery.

But these maps are just static masses of data. What scientists really want to know is how the dynamics work - how a cell makes a circadian oscillator, or how regulatory circuits produce "sniffers, buzzers, toggles and blinkers." How does a cell flip a switch and keep it on? How is a gene timed to come on and shut off at just the right moment?

As in the case of protein structures, scientists are tackling these questions by looking for patterns in the network maps. Protein structures are largely defined by modular domains, and it turns out that biological networks are also made up of modular structures. These structures, called network motifs provide important clues to understanding how the structure of a regulatory circuit produces its effects. Network motifs are small sets of interacting genes that make up various types of feedback or feed-forward loops, which are just small biological circuits.

These network motifs were identified because they show up in biological networks much more frequently than you would expect if those networks were just wired together randomly, giving researchers a hint that these motifs were playing a significant role in the function of these networks. Now these network motifs are the subject of intense focus. Researchers are studying them in simple systems like bacteria, which can be stimulated with an environmental stimulus and measured for a defined response. Some network motifs operate as a response-delaying mechanism, preventing the activation of a response until the cell can be sure a stimulus is real and not just background noise. Other motifs form negative feedback loops, like your thermostat, that activate a pathway when it is needed and shut it off when the job is done. In many cases a cell needs to integrate several signals before making a decision, and these network motifs can do the job, functioning as AND or OR logic gates.

A major challenge in the effort to make sense of all this is the fact that, as was the case with proteins, the function of a network motif is not obvious from its structure. You need to do the math by building analytical models. And in fact, depending on the numbers you plug into your model, the actual output of a network motif can show dramatically different behaviors. Within certain parameters, you may get a simple response curve, but with other parameters, you get an oscillator. The challenge then becomes determining which behavior is occurring in the cell, and that involves difficult experiments to measure the critical parameters.

In fact, parameterizing a biological network model can be a major experimental challenge - so difficult at times that you may be tempted to ask, why even bother? Why bother making detailed parameter measurements to plug into a mathematical model, when instead you could just go ahead and do the experiments to find out what is actually happening inside the cell? If you want to know whether a network motif is producing oscillations, why not just do the experiment, instead of trying to model oscillations with a network map?

Why Build Models?

This is largely how molecular biology has been done for years. Rather than developing theories based on mathematical models, it has been much easier to experimentally determine a cell's qualitative behavior, and scientists have made tremendous progress with this approach. But there are at least three good reasons why we should turn to making models of networks.

First, network models can help use test whether our maps are correct, or whether we are missing critical interactions. Scientists can ask, 'can this network structure possibly produce this behavior?' Models can tell us, for example, that the network connections we have cannot possibly produce the oscillator that we observe in our experiments, and thus we're missing some critically important component in our network map. This is what good models should do: generate new hypotheses, which we can go and test.

Second, it is important to understand not just what is going on in the cell, but how things actually work. It is possible to study proteins in the complete absence of any structural information, and this is what scientists did for decades before the first protein structure came out. But that kind of phenomenological study of proteins does nothing for our ability to look at a brand new protein sequence and predict what that protein does. Without understanding how the structure of a protein produces its function, we also can't predict what the impact of a mutation will be, or engineer a protein to have a new function.

The same is true of networks. We want to know how those networks produce their effects, so that we can predict and understand the effects of changes to the network (such as when we knock out one component with an anti-cancer drug), or even design new biological circuits to have new functions.

And third, modeling networks helps us get around the misleading question of why - why a network is structured a particular way. It's often not very helpful to answer the question of why, because a big part of that answer is evolutionary contingency - a regulatory circuit is structured the way it is simply because of its evolutionary history, and not because that was the best design for a given function. A better question to ask is how - how the structure of a network gives rise to its function. To answer that requires modeling.

All of this means that biologists have to start thinking seriously about how to do the hard experiments necessary to parameterize network models. Non-quantitative, genome-scale experiments have generated masses of network maps, but these experiments are poorly suited to the kinds of measurements needed for modeling. Like physicists, biologists now have to be comfortable with mathematical theory, and with hard experiments that generated the detailed measurements to test those theories. At least biologists can take comfort in the fact their situation isn't quite as bad as the physicists', who, in order to test their models, have to build multi-billion dollar colliders that cannot even be run all 12 months of the year.

For some further reading:

"Modeling the chemotactic response of Escherichia coli to time-varying stimuli", Tu, Shimizu, and Berg, 2008.

"Dynamics and Design Principles of a Basic Regulatory Architecture Controlling Metabolic Pathways", Chin, Chubukov, Jolly, DeRisi, Li, 2008.

An Introduction to Systems Biology, Uri Alon, 2007.

The protein network image is from Zotenko, et al., "Why Do Hubs in the Yeast Protein Interaction Network Tend To Be Essential: Reexamining the Connection between the Network Topology and Essentiality", PLoS Comp. Biol 4(8): e1000140.

Comments

briantaylor's picture

"How does a cell flip a switch and keep it on? How is a gene timed to come on and shut off at just the right moment?"
I would be interested to hear you compare and contrast network motifs and binary systems.
I've begun researching the dna/protein/electricity relationship and keep coming back to: This switch is either on or off, at the right time. Which, of course, is very familiar to computer scientists.
Is there room in the computer language model to help explain biological networks and can we then take that information further to explain all natural physical systems?


Thanks for making me think,


Brian Taylor



 



adaptivecomplexity's picture
Sorry for the really, really delayed comment - I was going to say something this weekend, and now suddenly it's Monday night.

You're right that this deserves a more in-depth discussion. Boolean models of biological networks are becoming more common in the field of systems biology. Stuart Kauffman has been working on then for decades. His book The Origins of Order is full of technical detail.

But most modelers are sticking with dynamic models, using a thermodynamic or kinetic formalism.  As you probably know, with the right set of feedback loops you get get very switch-like behavior, so ODE models often work well.

As this Nature piece indicates though (subscription required unfortunately, but it might be findable somewhere else on the web), biologists and computer scientists still struggle to find the right formalism for biological models.

Gerhard Adam's picture
I will go a step farther and suggest that computer networks and operating systems are, in part, derived from biological systems.  Whether consciously or not, the biological model is the only one in existence that allows us to consider the operation of an element without external intervention.  Therefore when an engineer designs a system with error handling and recovery, at some level the concept of "healing" enters the picture.

When we observe networks (like the internet), we see a complex structure which is the result of many simpler structures and concepts.  A router can determine the optimal path for a data packet to take by employing various rules and algorithms to gain "insight" into its environment and its neighbors.  As a result we have a series of "dumb" machines that are capable of producing "intelligent", or at least logical, actions.

Just as the router in our example is bound by its design and algorithmic rules, so is an organic molecule bound by its chemistry.  In part we need to be careful to avoid assigning too much significance to the outcomes when trying to understand such systems, since in many cases it may have no greater probability than drawing a royal flush versus any five cards randomly.  The royal flush only has significance because we assign it, not because those five cards have any greater probability of occurring than any other five.

To continue the comparison between biological systems and computer systems a bit more .... the research into the behaviors of these molecules is fundamentally no different (albeit on a simpler scale in computers), than the hacker that is discovering unintended functions because of the structure of a software program.

While I'm not suggesting that there is any functional similarly between biological systems and computers, I think that biology could benefit from the skills of people versed in the  mechanisms of debugging  software and networks.

Molecular biologists have long operated on the principle that knowing the structure of a biological entity is critical for understanding how it works.

But is this the right question?

How do the structures of biological networks result in something functional?

It sounds a bit like creationist thinking dressed up as science. How did the structures get there in the first place? If structures emerge then this implies a process. And process implies function over structure.

Is this a chicken and the egg problem? Which came first - was it the function of the chickens having sex that lead to the structure of the egg or the other way around? But then all objects have both structure and function. They are simultaneously active agents and self sacrificing parts of a network.

So maybe the answer is both the chicken and the egg rather than one or the other- E=Mc2. Maybe the universe started as pure energy (E) and structure evolved out as the initial temperature cooled (m). In this case function leads structure in the sense that it initiates an interdependent process.

In Ayurveda and Chinese medicine - both systems medicines - the idea is that function creates and changes/adapts structure (like the positive feedback motifs). At the same time, structure supports and maintains function (like negative feedback loop motifs).

Gerhard's example of the evolution of the internet is a good example of a network we have seen emerge in real time in front of our very eyes. The structure did not create the function, We did not suddenly wake up one day to a structure called the internet that delivered the functionality it does. Structure adapts to its environment. Therefore we have to understand the environment to understand the structure - and we need to understand the historical environment too.

Very recent research released in the Journal of Clinical Endocrinology and Metabolism on childhood obesity is also raising the idea the idea that function leads structure - an inflammatory environment caused by obesity leads to structural changes in the thyroid and its consequential malfunction as opposed to the traditional view that thyroid problems are the cause of obesity. http://news.bbc.co.uk/2/hi/health/7762471.stm

But old habits die hard. Tradition is not sure it agrees and wants more studies - Professor David Dunger, a paediatric endocrinologist from the University of Cambridge, said that while it was an "interesting observation", it was unlikely to challenge the accepted view that thyroid problems could contribute to obesity, rather than be caused by it.
He said: "It tends to turn thinking about the thyroid on its head, and the findings would need to be reproduced by other studies." - the old guard have a lot to loose.

From the perspective of yogis and Buddhists etc., all objects are interconnected and in constant flux. Something only exists in relationship to other objects - in of itself it is meaningless. In other words an object is defined by the network of relationships it is in. These relationships create the common or shared identity which is another name for a cohesive structure.

My theory is that the biologists have awakened to a new level of complexity that transcends the old reductionist command and control model. They have realised that what they thought of as separate objects are in fact part of networks of relationships. Their current problem is that they are using their old world view to try and make sense of this new expanded world view. This is natural first step and it will change as their neurones and world view rewire to the new reality.

As the Chinese say, things that have a yin, or supporting, structure have a yang, or active, function. And, those things that have a yang structure have a yin function. Moreover, yin and yang are relative to each other - something which is yang relative to one object is yin relative to another object. In other words the cells in my body are sacrificing their individual identities and co-operating together to create a single identity - me - this common identity is built up of shared information which the white blood cells pass to each other when they make contact. Therefore my cells are yin relative to my body - they support my body. But my cells are also yang relative to their constituents. Each cell has its own individual identity as well a communal one. And each cell comprised of a number of components that are sacrificing their individual identities to make up a cohesive network called single cell.

So function and structure are relative and two sides of the same coin. They depend on the perspective of the observer and the definition of the object. We humans forget that our definition of the world is "relative". If everything is connected and in constant flux then in the long run there is impermanence - in the short run we see objects and we get "fooled". But these objects are real in our world of time and space.

In my opinion the future of the species depends on us waking up to our interdependence. Self consciousness is a positive feedback loop. It creates a sense of "identity" and "agency" and a sense of separation from the environment. Through it we have created language and technology and the capacity to adapt our environment. We call this capacity to adapt our environment "progress". But the main thing that is progressing now is the sixth great extinction and the "progressive" degradation of our support base or network.

Are we really making "progress"? I can tell you that the subjective practice of yoga - or observing oneself and mind produced a cosmological theory of emergence or evolution called Samkhya that looks very similar to current models. Self consciousness is simultaneously our defining strength and our Achilles heel. It is our evolutionary design fault. It has allowed us to make "long term' impacts to the environment that we are unable to comprehend because the environmental conditions we evolved in only needed short term feedback systems for success.

I've heard many people comment on a possible marriage between computer science and biology. Personally I would like to see the same between electrical engineering and biology. I see far more overlap between concepts found in EE than I see in computer science.

adaptivecomplexity's picture
It would be good if more biologists could be exposed to these various formalisms. Not being well versed in EE or comp sci., I obviously can't make definitive statements about which formalisms from other fields would be most useful for biologists to use, but I can see how various ideas in EE about control systems and feedback would be more useful for biologists than, say, ideas about algorithms which make up a good chunk of computer science.  (I've also heard your talks on modeling biological systems at various conferences, so I can see where you're coming from.)

Few biologists currently have the necessary background in EE, unfortunately. Biologists are more likely to get training in physical sciences rather than engineering, and in my personal experience physicists and computer scientists are more likely to branch into biology than engineers. It would be great to have more people trained in EE bring their ides over to biology.

Gerhard Adam's picture
I'm not sure that there's a marriage between computers/electrical engineering and biology but I am believe that they both deal with relatively simple implementations that must result in significantly more complex emergent properties.

As a result, they may provide insight without necessarily being required to be the same.

Add a comment

The content of this field is kept private and will not be shown publicly.
  • Allowed HTML tags: <sup> <sub> <a> <em> <strong> <center> <cite> <code> <TH><ul> <ol> <li> <dl> <dt> <dd> <img> <br> <p> <blockquote> <strike> <object> <param> <embed> <del> <pre> <b> <i> <table> <tbody> <div> <tr> <td> <h1> <h2> <h3> <h4> <h5> <h6> <hr> <iframe>
  • Lines and paragraphs break automatically.
  • Web page addresses and e-mail addresses turn into links automatically.
CAPTCHA
If you register, you will never be bothered to prove you are human again. And you get a real editor toolbar to use instead of this HTML thing that wards off spam bots.