Geneticists lay foundations for human transcriptome database

Article metrics

Tokyo

Expressed interest: researchers gathered in Tokyo to assess the integrity of cDNA data. Credit: D. CYRANOSKI

Move over genome, here comes the transcriptome. Last week, 120 researchers from around the world gathered in Tokyo to assemble the core of a transcriptome database, which they hope will one day hold all of the expressed sequences in the human genome.

The database, which should be up and running by December, will be a universal resource for biological research and drug discovery, say the meeting's organizers. “We want to know exactly where the genes are and what they do,” says Sumio Sugano, a researcher from the University of Tokyo's Institute of Medical Science.

As the first step in producing proteins, information in genes is transcribed into messenger RNA (mRNA). This process separates the coding sequences of genes from the rest of the genome — often called 'junk' DNA. The transcriptome is the complete set of transcribed mRNA. For years, researchers have studied these transcripts in the form of complementary DNAs (cDNAs), which are made using the mRNA taken from cells as a template. cDNAs represent the mRNA present in the cell, but they are much easier to work with than mRNA itself.

Now researchers want to incorporate the sequences of all of the human cDNAs into a single database, to be run by the Japan Biological Information Research Center in Tokyo and the DNA Data Bank of Japan (DDBJ) in Mishima. At the Tokyo meeting, researchers analysed cDNA data representing over 20,000 genes — covering more than half of the transcriptome — for inclusion in the database.

Trying to find genes within the human genome sequence often means guessing at which parts are expressed by looking for certain patterns in the sequence. cDNAs, made from mRNA expressed in cells, offer a more direct route. “This will be a real human-gene catalogue — not predicted from the human genome sequence. These are real transcripts,” says meeting organizer Takashi Gojobori, director of the DDBJ in Mishima.

Most of the cDNAs are already publicly available — but many exist as fragments of the complete cDNA. In addition, the lack of proper categorization, and inconsistencies between the various databases, limits the usefulness of the sequences for research.

“The data will be well-defined and quality controlled through the checks and balances of over a hundred scientists,” says Ranajit Chakraborty, director of the Center for Genome Information at the University of Cincinnati Medical Center in Ohio.

To create the data set, the researchers mapped 42,000 cDNAs, collected from six databases around the world, to some 23,000 different regions on the human genome. The overlap of many cDNAs at the same regions will shed light on one of the mysteries of the genome — how so few genes can make the range of proteins that carry out the many functions in human development, and also produce so much variety in people's genetically determined features.

One explanation is that the genes undergo alternative splicing, whereby various mRNAs are produced from the same genomic sequence. By looking at many slightly different cDNAs that cover the same gene regions, researchers say that they will find many examples of these alternate forms of mRNA.

The meeting also offered a large data set, and a platform for debate, concerning non-coding RNA, which does not make protein. Some researchers believe such non-coding RNA has a major role in regulating gene expression, but the idea remains controversial (see Nature 418, 122–124; 2002).

http://www.jbirc.aist.go.jp/index_E.html

http://www.ddbj.nig.ac.jp

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Cyranoski, D. Geneticists lay foundations for human transcriptome database. Nature 419, 3–4 (2002) doi:10.1038/419003a

Download citation

Further reading

  • Investigation of protein functions through data-mining on integrated human transcriptome database, H-Invitational database (H-InvDB)

    • Chisato Yamasaki
    • , Kanako O. Koyanagi
    • , Yasuyuki Fujii
    • , Takeshi Itoh
    • , Roberto Barrero
    • , Takuro Tamura
    • , Yumi Yamaguchi-Kabata
    • , Motohiko Tanino
    • , Jun-ichi Takeda
    • , Satoshi Fukuchi
    • , Satoru Miyazaki
    • , Nobuo Nomura
    • , Sumio Sugano
    • , Tadashi Imanishi
    •  & Takashi Gojobori

    Gene (2005)

  • Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

    • Tadashi Imanishi
    • , Takeshi Itoh
    • , Yutaka Suzuki
    • , Claire O'Donovan
    • , Satoshi Fukuchi
    • , Kanako O Koyanagi
    • , Roberto A Barrero
    • , Takuro Tamura
    • , Yumi Yamaguchi-Kabata
    • , Motohiko Tanino
    • , Kei Yura
    • , Satoru Miyazaki
    • , Kazuho Ikeo
    • , Keiichi Homma
    • , Arek Kasprzyk
    • , Tetsuo Nishikawa
    • , Mika Hirakawa
    • , Jean Thierry-Mieg
    • , Danielle Thierry-Mieg
    • , Jennifer Ashurst
    • , Libin Jia
    • , Mitsuteru Nakao
    • , Michael A Thomas
    • , Nicola Mulder
    • , Youla Karavidopoulou
    • , Lihua Jin
    • , Sangsoo Kim
    • , Tomohiro Yasuda
    • , Boris Lenhard
    • , Eric Eveno
    • , Yoshiyuki Suzuki
    • , Chisato Yamasaki
    • , Jun-ichi Takeda
    • , Craig Gough
    • , Phillip Hilton
    • , Yasuyuki Fujii
    • , Hiroaki Sakai
    • , Susumu Tanaka
    • , Clara Amid
    • , Matthew Bellgard
    • , Maria de Fatima Bonaldo
    • , Hidemasa Bono
    • , Susan K Bromberg
    • , Anthony J Brookes
    • , Elspeth Bruford
    • , Piero Carninci
    • , Claude Chelala
    • , Christine Couillault
    • , Sandro J. de Souza
    • , Marie-Anne Debily
    • , Marie-Dominique Devignes
    • , Inna Dubchak
    • , Toshinori Endo
    • , Anne Estreicher
    • , Eduardo Eyras
    • , Kaoru Fukami-Kobayashi
    • , Gopal R. Gopinath
    • , Esther Graudens
    • , Yoonsoo Hahn
    • , Michael Han
    • , Ze-Guang Han
    • , Kousuke Hanada
    • , Hideki Hanaoka
    • , Erimi Harada
    • , Katsuyuki Hashimoto
    • , Ursula Hinz
    • , Momoki Hirai
    • , Teruyoshi Hishiki
    • , Ian Hopkinson
    • , Sandrine Imbeaud
    • , Hidetoshi Inoko
    • , Alexander Kanapin
    • , Yayoi Kaneko
    • , Takeya Kasukawa
    • , Janet Kelso
    • , Paul Kersey
    • , Reiko Kikuno
    • , Kouichi Kimura
    • , Bernhard Korn
    • , Vladimir Kuryshev
    • , Izabela Makalowska
    • , Takashi Makino
    • , Shuhei Mano
    • , Regine Mariage-Samson
    • , Jun Mashima
    • , Hideo Matsuda
    • , Hans-Werner Mewes
    • , Shinsei Minoshima
    • , Keiichi Nagai
    • , Hideki Nagasaki
    • , Naoki Nagata
    • , Rajni Nigam
    • , Osamu Ogasawara
    • , Osamu Ohara
    • , Masafumi Ohtsubo
    • , Norihiro Okada
    • , Toshihisa Okido
    • , Satoshi Oota
    • , Motonori Ota
    • , Toshio Ota
    • , Tetsuji Otsuki
    • , Dominique Piatier-Tonneau
    • , Annemarie Poustka
    • , Shuang-Xi Ren
    • , Naruya Saitou
    • , Katsunaga Sakai
    • , Shigetaka Sakamoto
    • , Ryuichi Sakate
    • , Ingo Schupp
    • , Florence Servant
    • , Stephen Sherry
    • , Rie Shiba
    • , Nobuyoshi Shimizu
    • , Mary Shimoyama
    • , Andrew J Simpson
    • , Bento Soares
    • , Charles Steward
    • , Makiko Suwa
    • , Mami Suzuki
    • , Aiko Takahashi
    • , Gen Tamiya
    • , Hiroshi Tanaka
    • , Todd Taylor
    • , Joseph D Terwilliger
    • , Per Unneberg
    • , Vamsi Veeramachaneni
    • , Shinya Watanabe
    • , Laurens Wilming
    • , Norikazu Yasuda
    • , Hyang-Sook Yoo
    • , Marvin Stodolsky
    • , Wojciech Makalowski
    • , Mitiko Go
    • , Kenta Nakai
    • , Toshihisa Takagi
    • , Minoru Kanehisa
    • , Yoshiyuki Sakaki
    • , John Quackenbush
    • , Yasushi Okazaki
    • , Yoshihide Hayashizaki
    • , Winston Hide
    • , Ranajit Chakraborty
    • , Ken Nishikawa
    • , Hideaki Sugawara
    • , Yoshio Tateno
    • , Zhu Chen
    • , Michio Oishi
    • , Peter Tonellato
    • , Rolf Apweiler
    • , Kousaku Okubo
    • , Lukas Wagner
    • , Stefan Wiemann
    • , Robert L Strausberg
    • , Takao Isogai
    • , Charles Auffray
    • , Nobuo Nomura
    • , Takashi Gojobori
    • , Sumio Sugano
    •  & Richard Roberts

    PLoS Biology (2004)

  • Outline of disease gene hunting approaches in the Millennium Genome Project of Japan.

    • Teruhiko YOSHIDA
    •  & Kimio YOSHIMURA

    Proceedings of the Japan Academy, Series B (2003)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.