The Size of the Genome

In 2012 scientists of the University of Leicester printed the full human genome in books. A BBC news item shows the impressive 130 thick volumes that were required to achieve this. One heavy book is shown to exclusively contain four letters, ACTG, from cover to cover: no spaces, no paragraphs, no chapters.

genome1

My Dutch Copy of ‘Over het ontstaan van soorten’, to show the size.

Practically those books are completely useless, but beside that: wow! From the moment I saw it, I really wanted to have a full genome library like that!

Such a practical representation of the human genome, as it was deciphered in the Human Genome Project, gives an elegant insight in the dimensions, the complexity, and above all: the astonishing incomprehensibility of the genome. Sadly, printed genome library’s were sold nowhere. That is why I tried to make the genome materially concrete for myself in another way. I tried to visualize it, using a book that I, and many other people do have at home: Darwin’s On the Origin of Species. There’s two pictures of my personal Dutch copy beside to show its size.

genome2

Pages of my OTOOS copy, to show the letter size.

What if the human genome would be printed in books the size of my copy of Darwin’s ‘On the Origin of Species’?

Eukaryote_DNA-en.svg

DNA in Eukaryote cell (Wikipedia)

In order to find that out, it might be good to do some genome basics, as a reminder. Humans, like most other living organisms, are made up out of cells. These are quite small, and there’s a lot of them. Each cell has a small core, that contains DNA.

DNA is the molecule with the iconic ‘double helix’ form, that has become the general symbol for genetics. DNA is made up out of very long strings, with four reoccurring variations. These were named A, C, T and G.

The image that was depicted in the famous Watson and Crick 1953 Nature article

The image that was depicted in the famous Watson and Crick 1953 Nature article

Inside each cell there’s about three billion pairs of those letters. All humans share the vast majority of these. It was those three billion letters that were deciphered in the Human Genome Project, and that’s what was printed by the Leicester scientists. (Note that the ‘genome’ is not the same as ‘a gene’ or ‘genes’.) So all cells of our body’s contain roughly the same three billion letter text, and the scope of that, is what I would like to make concrete.

Bookcase

Visualization of seven shelf bookcase, with the OTOOS copy below.

When we take a look at the Darwin Online second edition version of Darwin’s book, we can easily determine that it contains about one million characters, spaces included. So in order to write the genome’s 3.000.000.000 letters in 1.000.000 character books, 3000 books would be required. So roughly three thousand times the text that’s inside my copy of On the Origin of Species, is inside each cell of our bodies.

My copy of The Origin of Species is about 2,4 cm thick, so a row of three thousand copies, would be about 70 meters long!

In order to visualize the amount of text, I imagined a bookcase with seven shelfs. In the university hallway I hung up seven papers, and that seemed reasonable. On the right there’s a photo of this imaginary ‘bookcase’ I made. Also note my copy of On the Origin of species on the lowest ‘shelf’.

A seven shelf bookcase, that were to be filled with 3000 copies of On the Origin of Species, would need to be ten meters long. So if the seven papers were the right hand side of this bookcase, the left hand side would be ten meters down the hallway. In order to visualize this, I took ten big steps, and put my office fitness ball there. A kind student was willing to support my visualization by pointing to the topmost corner of my imaginary bookcase. This is the result:

Bookcase

(Person in photograph: Matthijs Geleijnse.)

So if this were a seven shelf bookcase, from the papers in the front, to the ball and the student at the back, and if it were entirely filled with copies of the tiny book that’s visible in the middle, than that would be the space required to store a full printed version of the human genome.

On second thought, obtaining such a book collection might not be so desirable after all! I already find it difficult to find proper room for my current book collection, and those are books that are actually fun to read.

And furthermore, I supposedly already posses such a library in every single cell of my body. And since those are estimated to number 37.200.000.000.000, adding one more, wouldn’t really make a difference. Besides, this exercise got me a nice imaginary bookcase just outside my room, and that’s where I will leave it.

Leave a Comment

Your email address will not be published. Required fields are marked *