By Michael White | April 25th 2007 09:14 AM | 13 comments

About Michael White

Welcome to Adaptive Complexity, where I write about genomics, systems biology, evolution, and the connection between science and literature, government, and society.

I'm a biochemist


Just recently Science published the paper describing the latest primate genome - the rhesus macaque genome. (Check out Science's macaque website for some good (and free) articles on the subject.) Sequencing a large genome like this one is resource intensive (unlike microbial genomes, which are now easily and routinely sequenced), so why did scientists sequence yet another primate genome? In addition to the human genome we already have the chimp genome, and we also have several non-primate mammalian genomes - the mouse, rat, cow, dog, and opossum genomes. Is this a good use of our money? Why put in so much effort just to study evolution?

Evolution is worth studying in and of itself, but it is also so tightly connected with every field of biology that it's hard to avoid when you're studying anything else. We sequence these genomes because we know we can use evolutionary principles to understand the nuts and bolts of the genome. This strategy has already been used with great success in major genetic model organisms, including flies, worms, and yeast.

Most of us, I would bet, are more interested in the human genome than any other, and ultimately we sequence these primate genomes to understand our own. The chimp genome is helpful, but we need a more distantly related species before genome comparison can really teach us about all those parts in our DNA. The rhesus macaque is a great pick, because it has been used extensively in medical research, and because it is an Old World monkey - one of our closest relatives outside of the great apes.

Genome sequencing simply gives us raw sequence, such as this region from human chromosome 11:
(Sequence is read from left to right, line by line, like regular text.)

GAGGAGGCGGCGGAGGAGGGGCCGCCCGCGGCCCCCGGCTCACTCCGGCACTCCG
GGCCGCTCGGCCCCCATGCCTGCCCGACCGCGCTGCCGGAGCCCCAGGTCCGGGG
GCGGAGGGGAGCGCTGCCGCGGGGGTGGGCGGGCGGGGCGCGGGGGCCATGTGCG
AGCGCGGCAGGGAGGCGGGCGGGGCGGGCTGCAGGCGGGGTCCGACTCTGGGGCC
AGTCCGGGCCACGGTTGGGACCCAGTCGAGGGTCGGACTGGTCAGGGTTCAGGCG
GGATCCGGCGTCCGAGTCCTGGTGGGCCGGCCTGGGGCAGGATCTGGCTCTGGCT
GCGGGTCCTGACTCGGGTCAGGGTTGGGCCTCCGATCCAGCCCGCTCCGGGGCAG
GGTTCAATCCCGCATTTGCCGAAGTCCCTGGGGCTGGCCGGGGTGGAAGACGGGG
AGGGCTCTATGTCTGGGAAGGGGCTCTGAAGACCACGTGGGGGCGCTCGAAGGGG
CCTGGGGCCACCCTCCTCTCTGGGTCAAAGGTCATCGCACCGGCAGGGGAGAACT
TCCTCCTCCTTGGCTCTCCCCACTTACTTCCTGATAACCTGGTAGAGGTCTCCCG
CGGGCGGGGAGGGGGAGGCGTAGCAACTTTAGGCAACTTCCCAAAGGTGTGCGCA
GGTTGGGGGCGGGACGCGGCGCCCCGGGAGGTGGCGGCCTCTGCGACAGCGGGAG
TATAAGAGTGGACCTGCAGGCTGGTCGCGAGGAGGTGGAGCGGCGCCCGCCGTGT
GCCTGGGACCGGCATGCTGGGGCAGGAGGGCAGCCGCGTGTCAGGTGTGAAAAGC
TCTGGAGGTGTTTTCATGAGTCCGTGCCTGTGCGTGTGGATGTGGGGAGACCTAG
TGAGAGTGTGTGTGATCATGAGCCTTGACTGAGTTCGTGGATGGGGTGTGCGCTC
CAGGAGAAGTGTGTGAGCACAAGTGTGAGCAGGAGTGAGCACGGGTTTGGGAAGG
CCGGTGCAAGTGTGAAAGCCCTCAGCAGAGAGCGAGCCTGCGTGGGCTTGTGGGG
CTCCTGAGCACCCCGGTGAGTGGAGTGTGTGAACTCGGTGTGAGCACGTCCACTG
GCCTTGGGTCTGCTCTCCAATGCAGAATACCCAGATGAGGGCAGGGTCTCAGAGG
TCCCCCCAACATCTGGAGAAAACTGGGAAGTATCCTGCTCCTGGCTAGGGATTCC
AGGTGGGGTTGAAGGTTGCCTGGGGGCTACGGTTACCCTGCTCCCTGGCCTGGGT
GGGAGTAGGGGCTTTCTAAGCCTCCCCCAGGTTCCCAAGGGGGAGACCTGCTGTC
AGTTACTGGCCCTGAAGACTCTGTTTCCATGGCAACAGCTAGGAGGGGGCAGTGT
TCCTGGGCAGTCCTTCCTTGGACTCTGCCCCCCTTCTTCCCCACTTGCTGGGCTT
GGAAGCCTGGCCCTAGGCCCGAGGTTGGGCAACCCGTGTGGCAGGGTGTCTCCCA
TCCCCCATACCAGTGCTTTCCTGCGAACCTATGGGTCTCTCCGTGCAGGTGACCA
GCGCCATGTCCAGCCAGGTGGTGGGCATTGAGCCTCTCTACATCAAGGCAGAGCC
GGCCAGCCCTGACAGTCCAAAGGGTTCCTCGGAGACAGAGACCGAGCCTCCTGTG
GCCCTGGCCCCTGGTCCAGCTCCCACTCGCTGCCTCCCAGGCCACAAGGAAGAGG
AGGATGGGGAGGGGGCTGGGCCTGGCGAGCAGGGCGGTGGGAAGCTGGTGCTCAG
CTCCCTGCCCAAGCGCCTCTGCCTGGTCTGTGGGGACGTGGCCTCCGGCTACCAC
TATGGTGTGGCATCCTGTGAGGCCTGCAAAGCCTTCTTCAAGAGGACCATCCAGG
GTGAGCCCCCAGCCCACTCCCCTGTCCTTTGCCCTGCACCCTCTGGGTACACTGC
TGGGTGCAATAGGCCCCCTGATGGCTGTGGCACCGCTTGAGGCTAACAATCTGGT
GTTTCCAGTCCCTCTACCTCCCAGAGACACTCTTTCCCTGAGAAGTATGGTAAAA
GCACCGGGTGTGCTGATGCATTGCAGTGGATGTGAGTGAGTTCAGGGTACCACCT
GGGTACTCTAGGCCCAGCACCTTCTACAGTGGCTCTGAAAGAGTCCAAGGCAGCC
TCTGTCTGTTCCTAAGCTTTGTTCTTGTTTCTGGCAGCTTCTGACCTCTCCCCAG
CATAGAACATGTCCCCTTTTTGTTAATTTTCCCAAAGCAGCACCAACACAAGGCA
GATTTTAATTTTTTTTTTTTTGAGACAGAGTCTCACTCTGTTGTTCAGGCTAGAG
TGCAGTGGCACAATCTCTGCTCACTGCAACCTTTGCCCCTGGGTTCAAGAGATTC
TCCTGCCTCAGCCTCCTGAGTAGCTGAGACTGCAGGTGTGCACCACCACGCCCAG
CTAATTTTTGTATTTTTAGTAGAGACGACGTTTCACCATGTCGGCCAGGCTGGTC
TGGAATTCCTGACCACAAATGATCCACCTGCCTCGGCCTCCCAAAACAAGGCAGA
TTTTTATCAGTACTTGAGAGGGGCTACATCATAGTTTAGCACCCAACTTTAAAAA
GACTAACAGGCAAGGCCGGACACAGTTGCTCACACCTGTAATCCCAGCACTTTGG
GAGGCCAAGGTGGGCGGATCACCTGAGGTCAGGAGATCGAGACCAGCCTGGCCAG
GGTGGTGAAACCGCATCTCTACTAAAAATGCAAAAAATTAGCTGGGCATGGTGGC
TCGCGCCTGTAATCTCAGCTACTTGCTACTTGAGAGGCTGAGGCAGGAGAATTGC
TTGAACCCAGGAGGCAGAGGTTGCAGTGAGCCAAGATCACACCACTGTACTCCAG
CCTGGGTGACAGAGCGAGATTCCATCTCAAAAAAAAAAAAAAAAGGCCGGGCACT
GTGGCTCATGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTC
AGGAGATTGAGAACATCCTGGCTAACACGGTGAAACACTGTCTCTACTAAAAATA
CAAAAAATTAGCTGGGCATGGTGGCGGGCGCCTGTAATCCCAGCTACTTGGGAGG
CTGAGGCAGGAGAATGGCGTGAACCCAGGAGGCGGAGGTTGCAGTGAGCCAAGAT
CACGCCACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCCGTCTCAAAAAAAAA
AAAAAAAAGGCTGGGCGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGC
CGAGACGGGCGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGACCAATGTGA
TGAAACCCCGTCTCTACTGAAAATACAAAAATTAGCCAAGCATGGTGGCATGCGC
CTGTCATCCCACTCAAGAGGCTGAGACAGGAGAATTGCTTGAACCTGGGAGGCAG
AGGTTGCAATGAGCCCAGATCGCGCCATTGCACTCTAGCCTGCGCAACAAAAGTG
AAACTCCACCTCAAAAAACAAAAACAAAAACAAAAACAAAAAAACCCAAAAACGC
TGGGCTTGGTGGCTCATGGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCAGAC
GGATCACGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTAAAACCCCGTC
TCTACTAAAAATACAAAAATTAGCCGGGCGTGGTGGTGAGTGCCTGTAATCCCAC
TACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGGCAGAGGTTGCAG
TGAGCTGAGATCATGCCACAGCACTCTAGTCTGGGCAACAGAATGAGACACTCTC
ATCTCAAAAAAAAAAAAAAAAGGACTTACAGGCATGTCTGCTCTTAAAAGTCACT
AATTTTTTTCTCACTCAGGAAAGCTTATCAGAATTTGGGGGAATGAGCAAGATGC
TGACATTAAGCATTGCCTGGGAAGGGCCTATTATTTCCGTTATTTCTGCTTTTAT
GTAACCATTGGTTACTTTGGGGGCTATAACACGTATAATTAAAAAAAAAAAAAAA
AAGGCCAAGTGTGGTGGCTCACACCTGTAATCTCAGCACTTTCGGAGGCTAAGAT
GGGAGGATCACAAGGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCC
TGTCTGTACTAGAAATACAAAAATTAGCCAGGTGTCGTGGTGGGTGCCTGTAGTC
CCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTGGAACCCAGGAGGCAGAGGT
TGGAGTTAGCCAAGATCGTGCCACTGCACTCCCAGCCTGGGTGACAGAGTGAGAG
TTCGTATCAAAAAAAAAAAAAAAAAAAAAATCTTGAGTGCTTACCTTGTGCTAGG
CACTGTATTCTTTTATGATCTCAGTTAGTCCCCACAGCAACCCTATAAGGTGTCA
GTACTGTTATAACTGAAACTAAGAGAGGCATTTGAAACTTTGTTGAAGTCTCACA
ACTAGGAAATGGCAGAACCAAGATTTGAACTTGGGTCAGTATAGGTCCAGAGCTG
AGCTCTTCAATGTTAGACTGCTTCCTCTGCTTATTACTAATAACACCGAACTTTG
GACAGACGCTGAATGACTGATTGTGACATTCCAGCACGTTTTTTTTTTTTTTTTT
GAGACAGTCTCGTGTGGTCGCCCAGGCTGGAGTGCAGTGGCACGATCTCGGCTCA
CTGCAAGCTCCGCCTCCCGGGTTCACACCATTCTCCTGCCTCAGCCTCCTGAGTA
GCTGGGACTACAGGTGCCCGCCACCACGCCTGGCTAATTTTTTGTACTTTTAGTA
GAGACGGGGTTTCAGCGTGTTAGCCAAGATGGTCTTGATTTCCTGACCTCGAGAT
CCACCTGCCTTGGACTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGCTCCT
GGCCAGGTTTTTTTTTTTTTTTTTTTTTTTTTTGAGATGGAGTTTTGCTCTTGTT
GTCCAGGCTGGAGTGCAACGGCCTGCAGTCGTGGTTCACTGCAACCTCTGCCTCC
CGGGTTCAAGCCATTCACCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGGCGC
CTGCCACCATGCCCGGCTAATTTTTGTGTTTTTAGTAGGGATGGGGTTTCACCAT
GTTGGCCAGGCTGGCCTCAAACTCCTGACCTCAGGCGATCTGCCCTCCTCGGGCT
TCCAAAGTGCTGGGATTATAGGTGTGAGCCACTGCACCCCGCCAATCCAGCAAGT
TTTAACTTGGCCAAAATCCACCAATCTTAAACTTTGTGCACCCTTCCCACTCTGA
AGAACAGTGAGCCAGCCGGCCAGGGTGCGGGTATCTCCTACCTACCCTGGGGCCC
CTCACTGTATGTTGACTATTGACAAATATTTATTGTGTGCTGGCTGTGAATAGGA
CTTGTATATTGAGCACTTAGGTGTCATGAACCATGCTGGATGTTTTGACCATATT
ATCCCCTTTAATTCTCACGACCCAACTCTGTGGGGCACTTTTACAGCTGGGAAAC
TGAGGGTTCAAGGGGTTAGGTATGGGACTTGCCCAAGGTCATAAAGGTATGTGGT
AGCCAGAGTCCCTGTTCGGCACAGACCTGTTCTTTGCTGTCCTGGCCAGTGTTCC
AGGCCTTGGGGACATAGCTGGGGCTGAAGCAGGGCTGTTTCTGCCCTCAGGCAGT
TTACATCCTGGCAGAGGGGAGAGCTGGGCAACAGTGAGTTGCACAGACTTGTCTT
ATTACCGCTGTGGTATGTGCAGGAAGGGGAGGTGCTGGTTCTGAGGCTCCAGAGG
GCTTGTCTTTTTTTTTTTTTTTTTGAGACGGAGTCTCGCTTTGTTGCCCAGGCTA
GAGTCCAGTGGCGCGATCTCGGCTCAGTGCAAGCTCCGCCTCCCGGGTTCAAGCG
ATTCTCCTGCCTCAGCCTCCCCAATAGCTGGGATTACAGGCGCATGGCACCACGC
ACGGCTAATTTTGGTATTTTTAGTAGAGACTGGGTTTCACCATGTTAGCCAGGAT
GGTCTCGATCTCCTGACCTCGTGATCCACCCGCCTCGGCCTCCCAAAGTGCTGGG
ATTACGCTCCCGGCCTCTTTTTTTTTTTAGACAGAGTCTCACTCTGTTGCCAGGC
TATAGTACAGTGGCACGATCTCAGCTTACTGCAACCTCCGCCTCCCAGGTTCAAG
CGATTGTTCTCCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCACACGCCCAGCT
AATTTTTGTATTTTTAGTAGAGACAGGGTTTCACCGTGTTGGTCAGGCTGGTCTC
AAACTCCTCACCTCGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTATA
GGCGTGAGCCACTGCGCCTGGCCTTTTTTTTTTTTTGGTACAGAGTTTCGCTCTG
GTTGCCCAGGCTGGAGTGCAATGGCACGATCTTGGCTCACTGCAGCCTCTGCCTC
CCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCGGAGCAGCTGGGATTACAGACA
TGCACCACCATGTCCGGCTAATTTTTTTTTTTCGAGATGGAGTCTCACTGTGTCA
CCCAGGCTGGAGTGCAGTGGCACAATCTCGGCTCACTGCAACCTCTGCCTCCCGG
GTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGTGCCTG
CCACCACACCCAGCTAATTTTTGTACTTTTAGTAGAGACGGGGTTTTACCATGTT
GGCCAGGCTGGTCTTGAACTTCTGACCTCAGGTGATCCACCCACCTCGGTCTCCC
AAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCCGGCCGTGGTGTCTTGAGCT
GAGTGCAGAAGCGCAAATAGGGGGTAGGAGAAAATGCACCGCGAGGAGAAATGTG
CTGCGGGCCTGCTGTCTAGCTGTGTCATTTGGTCGTTGCGGGGCCCTGTGAGGCC
GGGAGGGCTGCCAGCACCCACCATGTGCCAGGCCTCGTTGCTAGTGCTGGGGCCA
GTTCCTGCCCCGGTGGAGCTGCCACTGAAGGGGGAGGCGTAATAAACAAGATAGG
TGAGTGCATATGCAGCGTGGTCTGTTGTGCTGAGGGCTGAAGAGAAACCAGAAGC
AGGGCTCAGAGGCCAGGAGGACTCTGCAAAGGGATTTGGCATTATCACAGGGTGG
CCAGGGAAGATCTTCAAGGTGACAGTGAGCAGAGGGAGGTGAGGGAGCCTGTGTG
GACTTCAGGACTAGAGCTCCAGGCAGGGCCTGTTTGAGGAACATGGAGGAGGCGA
GAGCAAGGAGTAGAGGTCAAAAGGAGGCAAGAAGCAGGGGCGTAGGCCTAGGAGG
ACATAGGTTCGCTTTGGCTTGGACTCAGAGAAGGGAAATCCCCAGAGGGTTTTGA
GAAGAGGAGGTACAGGATGTAATGGAGGCTTAATAGGACCCTCTTGGCTGCTGAG
TCGAGAACAGACTGGAGCAAGCAGGGACAGCCAAGCGAGGGGCGAGGTGACAGTG
ACTATCAGGTCAAGGGTGGAAGTAGTTGCCAGGGGCAGGAGGCGGATTCTGGACC
TTGGAGGAGGTAAAGCCCACCAGAATGTGTCGGTGGCTTGGATGTGGGGTGTGAG
AGGAACCAGAGATTCTGCCTAGGTTTCTTCTTGGGCAAGTGAACACGTGGAGTCC
ACGTAGGCTGTGTTCGGTCCGAGATGCCTTCTAGACATGCAGGATGTCAAGGAGG
CAGCTGGAGAGATGGGTCTGGAGCTCACAGCAAGTCCAGGCTAGAGGTAGAAACG
TGAGAGCCCCACGGCTGGGGAAGATTGCCATGGGATTGGAGATGAGCTCCAAGGA
CAGCCCTGGCAGTCTGGATGGAAGAGCTTGGGAAGATGCTCAGAAACCACAAAGT
GGCTGGTGCGGTGGGAGGAAAACCAGAGTGTATGCTGTCCTAGAAGCAAAAGAAG
AAAGTGTTTCAGTGTTTCTAGGAGCAGGAAGTGATCAACAGCCTTAGATCCTCCT
TTTAGGCCAAGTAACATGAGGACTAAGAATTGACCACTGGATTTAGCAATGCAGA
GGTCCTTGTGGCCCTTGATGTCGGCAGATGAGGGCAGTGTGGTCCAGAGATGAGG
CTTGGGGCTGAGATGCAGCCCCGCTGCCTGGTCCAGCTCCTCCCTCATCCAGGCA
GGGCTCCCCCGCCCAGCAGCCACTCCCCTCCCTGCCTGCTCATGGCCCCCTGCTC
TCCCTTTCCTCCCCATACCCCCAGACCTGTGCTTGCCCGGGGAGAGTCAGGGCTC
TCCTGTCAGCTGGGTCCCCTCCCAGCCCCGGGAGGCCGCCACTGGAGCCCTGCCT
CTTCCTGGCAGGGAGCATCGAGTACAGCTGTCCGGCCTCCAACGAGTGTGAGATC
ACCAAGCGGAGACGCAAGGCCTGCCAGGCCTGCCGCTTCACCAAGTGCCTGCGGG
TGGGCATGCTCAAGGAGGGTGAGCGCTGGGCAGGGGCTGGGCGAGGGCTGGGGGA
GTCGGGGACCCGGGCCAGGTGGGGGTGAGGCCTGGGAGTTCTGGTGAGTGGACTC
GGG

I purposely included a long chunk (though it's really a very tiny piece of chromosome 11) just to convey what this vast sea of unannotated sequence looks like. About all you could do with this raw sequence is use the genetic code to see if there is something that looks like a protein coding region in there. But actual protein coding regions are very sparse, and they are broken into fragments called exons, which are spliced together before the final protein is made.
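To make "use the genetic code" a little more concrete, here is a minimal Python sketch of the naive approach: scan each of the three forward reading frames for a start codon (ATG) followed, in frame, by a stop codon. The function name find_orfs and the min_codons cutoff are my own illustrative choices, not part of any published pipeline, and the sketch ignores the reverse strand and the exon/intron problem entirely, which is exactly why this approach falls short on a genome like ours.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(dna, min_codons=30):
    """Return (start, end, frame) tuples for forward-strand open reading frames.

    An ORF here is simply an ATG followed in the same frame by a stop codon.
    min_codons is an arbitrary illustrative cutoff (codons before the stop).
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Step through the sequence one codon at a time in this frame.
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None
    return orfs

# Toy example (not real genomic sequence): one short ORF in frame 2.
print(find_orfs("CCATGAAATTTGGGTAACC", min_codons=3))
```

On real human sequence a scan like this mostly turns up short chance hits, because genuine coding sequence is interrupted by introns and a single exon rarely contains both the start and the stop.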

What we really want to know is where the gene is (in this case, an estrogen receptor gene) and where its controlling elements are. You won't be able to see the details below, but here is the big picture of the same sequence, with important elements highlighted:



And here we're looking at only a few elements - I haven't included promoter regions, enhancers, non-coding RNAs, transposable elements... Finding these elements requires three things:

- computer tools to build models of these elements and search the sequence
- sequence from related species for comparison
- experiments to test your computer predictions.

We have all three of these for yeast, flies, and worms, but in the case of humans, we have sorely needed more sequence from an animal like the rhesus macaque.

I'll finish up with an example of how useful sequence comparisons are, from my own work in yeast. Certain proteins, which are master regulators of cell division, modify target proteins at the sequence 'TP..any letter..R or K'. (Now we're talking about protein sequence, so we don't just have A's, T's, G's, and C's.) To understand how these master regulators carry out their role, we would like to know exactly which proteins are their targets. How do we find those targets? Easy - just look for any protein that has 'TP..any letter..R or K' in it, and you have a candidate protein that you can test in the lab!
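The "just look for it" step can be sketched in a few lines of Python. This is only an illustration, not the code I actually used: the pattern TP.[RK] is my regular-expression rendering of 'TP..any letter..R or K', and the function name and the toy protein sequence are made up.

```python
import re

# 'T', 'P', any one amino acid, then 'R' or 'K'. The lookahead lets
# overlapping matches be reported as well.
MOTIF = re.compile(r"(?=(TP.[RK]))")

def find_motif_sites(protein):
    """Return (position, matched_text) for every motif hit in a protein sequence."""
    return [(m.start(), m.group(1)) for m in MOTIF.finditer(protein.upper())]

# Made-up sequence with two hits.
print(find_motif_sites("MSTPARKLTPEKQ"))
```

Every hit returned this way is only a candidate target, which is where the trouble starts.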

Well, it turns out it's not so easy - many proteins have this 'TP..any letter..R or K' just by chance - too many to test in the lab. So we want to choose the most likely targets - those whose sequence has been conserved throughout evolution. You can line up the sequences from different species, and easily see the 'TP..any letter..R or K' which has been conserved over 100 million years of evolution:



The sequence on the top line is from baker's yeast, and the sequence on the bottom is from a yeast that shared an ancestor with baker's yeast 100 million years back.
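As a rough sketch of the filtering idea, assume the sequences have already been aligned (same length, with '-' for gaps) by some alignment program. Then a candidate site counts as conserved if the motif appears at the same alignment column in every species. The function name and the toy sequences are hypothetical, and this simplification ignores motifs that are interrupted by gap characters.

```python
import re

MOTIF = re.compile(r"(?=(TP.[RK]))")

def conserved_sites(aligned_seqs):
    """Return alignment columns where the motif starts in every aligned sequence.

    aligned_seqs: list of equal-length, pre-aligned protein strings.
    """
    if len({len(s) for s in aligned_seqs}) != 1:
        raise ValueError("need at least one sequence, all of the same aligned length")
    # Collect the motif start columns separately for each species.
    site_sets = [{m.start() for m in MOTIF.finditer(s.upper())} for s in aligned_seqs]
    return sorted(set.intersection(*site_sets))

# Made-up two-species alignment: the first site is conserved, the second is not.
print(conserved_sites(["MSTPARKLTPEKQ",
                       "MATPSKKLAPEKQ"]))
```

Sites that survive this filter are the ones worth taking to the bench first.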

Comparative genomics really works. It has helped us learn a tremendous amount about flies, worms, and yeast. With the macaque genome, we'll hopefully have the same success learning about our own genome.

Comments

Cash's picture
I saw this yesterday and it looked long so I wanted to wait until I had some more time. I am glad I did!

Since you opened up the question about value I want to follow up because I am not sure you answered it. If we already have the human genome, and things close to human like the great ape, and farther away like a horse, what is the value of doing another one?

adaptivecomplexity's picture
Cash asked: "If we already have the human genome, and things close to human like the great ape, and farther away like a horse, what is the value of doing another one?"

That's a very good question, and one I did not address very clearly in this piece. The answer is that the other genomes we have, like the rat and mouse genomes, are too dissimilar to the human genome to really make the best use of the computational tools that have been pioneered in organisms like yeast, flies, and worms.

This is especially true of computational tools to find non-protein coding regions, like microRNAs and important regulatory regions - those regions tend to be both short and less well conserved. If you compare the human and mouse genomes, things have changed so much that many of these elements are hard to pick out. On the other hand, humans and chimps are so similar that it's hard to tell which similarities are due simply to chance, and which are due to the action of natural selection.

Macaques are at a good evolutionary distance - not too close, and not too far.

adaptivecomplexity's picture
One more comment on the evolutionary positioning of the macaque relative to chimps and humans:

In specific instances in the genome where humans and chimps differ, you would like to know which species' DNA changed - the chimp or the human? (Or both?) By comparing chimp and human sequence with the macaque, in many cases you can tell in which lineage the change occurred. Let's say you have a gene involved in brain development, where at one point in the DNA sequence, humans and chimps differ - we want to know, did the gene change during chimp evolution or human evolution? If the chimp gene sequence is the same as the macaque sequence, then you know the change occurred in the human lineage, and that change may be important in human brain development.

In one of the Science papers they use this technique. Again, we could not do this very well with the mouse genome, because it is too dissimilar to the human genome.
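The logic of using the macaque as a tie-breaker is simple enough to write down. This is only a sketch of the reasoning for a single aligned position, with a made-up function name; it is not the method used in the Science papers, which I haven't reproduced here.

```python
def lineage_of_change(human, chimp, macaque):
    """Guess which lineage changed at one aligned site, using macaque as the outgroup."""
    if human == chimp:
        return "no human-chimp difference"
    if chimp == macaque:
        return "human"      # chimp kept the ancestral state, so human changed
    if human == macaque:
        return "chimp"      # human kept the ancestral state, so chimp changed
    return "ambiguous"      # all three differ; the outgroup cannot decide

print(lineage_of_change("A", "G", "G"))
```

When all three species differ at a site, a single outgroup can't settle the question, which is one reason additional primate genomes help.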

That was actually done last year. See this story and this one.

Hank's picture
UC Davis does good stuff. We have scientists from other CA schools, Berkeley, Stanford, USC, UCLA, etc. and I invited some from UC Davis but they never responded.

adaptivecomplexity's picture
That's true - people use whatever sequence is available to do these kinds of analyses, and studies comparing human, chimp, mouse and rat genomes have been valuable. Pollard and Haussler were able to identify regions which exhibited fairly rapid evolutionary rates - regions that are changing dramatically enough that you could see it in a mouse-chimp-human comparison.

But these comparisons are still fairly low resolution when compared with what researchers are doing in yeast or flies. The macaque genome, and others like the orangutan genome, will help us perform higher resolution studies. If our long-term goal is to comprehensively identify all of the functional elements in the human genome, these higher resolution studies are critical.

Rhesus macaques are used to study a wide range of human diseases, including HIV/AIDS (similar viruses infect monkeys), asthma and lung diseases (their lungs develop in a similar way to ours), and autism and Alzheimer's disease. Of all the non-human primates that are research models, macaques are the species for which the most reagents are available. So having the genome available will accelerate research.

Cash's picture
I found this video as well: http://mediaserver.wustl.edu/mpa/macaque/small.wmv

But because it's on their media server I don't want to embed it and leech them into oblivion.

I basically know the embed command, which would make it run all of the time. Does anyone know a good tool that can allow embedding from a remote source like this but with play/stop controls so we don't clobber their bandwidth?

From a medical research point of view, macaques are a lot easier to work with than great apes such as chimpanzees, but they're also a lot more similar to humans than mice, which are probably the most important animal for medical research.

Hank's picture
We have a whole columnist devoted to mice so they must be important.

adaptivecomplexity's picture
Personally I'm partial to yeast (it's amazing how similar yeast are to humans!), but mice are indisputably the workhorses of mammalian genetics research.

Hank's picture
"it's amazing how similar yeast are to humans!"

That's an article I want to read.

How do you title something like that?

"Getting a rise out of the yeast-human connection"

I'll keep trying. Surely as a yeast expert you must know all of the good ones already.

adaptivecomplexity's picture
That one's pretty bad...

There are some pretty horrible titles on yeast and sex - yeast have two different mating types, and they sort of reproduce sexually.
