Now if we take sequence and the civilian
intended to be gibberish up here, from a chimp and a human, and
you put it through a computer, you're gonna find hidden messages.
This is hidden.
This is hidden.
Computers are good at finding this kind of thing, stuff that's better preserved
than it should've been just by chance and stuff that is different.
Before the ability of computers to extract useful information from genomes,
bioinformatics.
Of course, even big data, computers are crucial.
They enable us to ask questions to get answers in ways we never had before.
Applying bioinformatics to the human and
chimp sequence, we find about 98.5% of it is conserved.
At least 1.5% that's interesting, but it identifies a set of mutations
that accumulated in the DNA of the chimp lineage, and
some of these would have made a chimp a chimp, and
another set of mutations that accumulated in our lineage, making us humans.
So how do we differ from chimps?
Well, about one in a hundred base pairs is different
as compared to an average of one in a thousand between you and me, and
given that the genome is 3 billion base pairs long, that means there may
be 30 million human specific letters of code that are not shared by the chimp.
In addition, there are about 5 million INDELs,
which remember means assertions and deletions in the genome.
And they're mostly in regions that don't encode proteins.
And that means that not many of those 5 million INDELs will contain genes.
So there's not gonna be many copy number variations in us and chimps.
Now bare in mind that there are copy number variations between us.
So I may have more or less genes than you.
But if we look at those genes where modern humans all seem to have the same number,
then about 30 genes are replicated in us since we split from chimps.
That's not many, so we have a very similar number of genes as a chimp.
Interestingly, a lot of the extra genes we've got are involved in
plane development.
The best study of these is a gene called SRGAP2.