Ten years ago there were 8 billion base pairs of 'finished' sequence in the main databases. That number has now grown to more than 270 billion bases, yet it is dwarfed by the amount of raw sequence being created and stored.
Biologist and self-confessed bookworm Klemens Pichler thinks that he has found his ideal vocation. Pichler is a biocurator at the European Bioinformatics Institute (EBI) in Hinxton, UK, working on the Universal Protein Resource (UniProt) database.
When scientists opened up the human genome, they expected to find the genetic components of common traits and diseases. But they were nowhere to be seen. Brendan Maher shines a light on six places where the missing loot could be stashed away.