1
|
Reboul J, Vaglio P, Tzellas N, Thierry-Mieg N, Moore T, Jackson C, Shin-i T, Kohara Y, Thierry-Mieg D, Thierry-Mieg J, Lee H, Hitti J, Doucette-Stamm L, Hartley JL, Temple GF, Brasch MA, Vandenhaute J, Lamesch PE, Hill DE, Vidal M. Open-reading-frame sequence tags (OSTs) support the existence of at least 17,300 genes in C. elegans. Nat Genet 2001; 27:332-6. [PMID: 11242119 DOI: 10.1038/85913] [Citation(s) in RCA: 135] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The genome sequences of Caenorhabditis elegans, Drosophila melanogaster and Arabidopsis thaliana have been predicted to contain 19,000, 13,600 and 25,500 genes, respectively. Before this information can be fully used for evolutionary and functional studies, several issues need to be addressed. First, the gene number estimates obtained in silico and not yet supported by any experimental data need to be verified. For example, it seems biologically paradoxical that C. elegans would have 50% more genes than Drosophilia. Second, intron/exon predictions need to be tested experimentally. Third, complete sets of open reading frames (ORFs), or "ORFeomes," need to be cloned into various expression vectors. To address these issues simultaneously, we have designed and applied to C. elegans the following strategy. Predicted ORFs are amplified by PCR from a highly representative cDNA library using ORF-specific primers, cloned by Gateway recombination cloning and then sequenced to generate ORF sequence tags (OSTs) as a way to verify identity and splicing. In a sample (n=1,222) of the nearly 10,000 genes predicted ab initio (that is, for which no expressed sequence tag (EST) is available so far), at least 70% were verified by OSTs. We also observed that 27% of these experimentally confirmed genes have a structure different from that predicted by GeneFinder. We now have experimental evidence that supports the existence of at least 17,300 genes in C. elegans. Hence we suggest that gene counts based primarily on ESTs may underestimate the number of genes in human and in other organisms.
Collapse
Affiliation(s)
- J Reboul
- Dana-Farber Cancer Institute and Department of Genetics, Harvard Medical School, Boston, Massachusetts, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
2
|
Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann Y, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowki J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J. Initial sequencing and analysis of the human genome. Nature 2001; 409:860-921. [PMID: 11237011 DOI: 10.1038/35057062] [Citation(s) in RCA: 14509] [Impact Index Per Article: 630.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The human genome holds an extraordinary trove of information about human development, physiology, medicine and evolution. Here we report the results of an international collaboration to produce and make freely available a draft sequence of the human genome. We also present an initial analysis of the data, describing some of the insights that can be gleaned from the sequence.
Collapse
Affiliation(s)
- E S Lander
- Whitehead Institute for Biomedical Research, Center for Genome Research, Cambridge, MA 02142, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Abstract
WormBase (http://www.wormbase.org) is a web-based resource for the Caenorhabditis elegans genome and its biology. It builds upon the existing ACeDB database of the C.elegans genome by providing data curation services, a significantly expanded range of subject areas and a user-friendly front end.
Collapse
Affiliation(s)
- L Stein
- Cold Spring Harbor Laboratory, 1 Bungtown Road, Cold Spring Harbor, NY 11724, USA.
| | | | | | | | | |
Collapse
|
4
|
Roubin R, Naert K, Popovici C, Vatcher G, Coulier F, Thierry-Mieg J, Pontarotti P, Birnbaum D, Baillie D, Thierry-Mieg D. let-756, a C. elegans fgf essential for worm development. Oncogene 1999; 18:6741-7. [PMID: 10597282 DOI: 10.1038/sj.onc.1203074] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In vertebrates, Fibroblast Growth Factors (FGFs) and their receptors are involved in various developmental and pathological processes, including neoplasia. The number of FGFs and their large range of activities have made the understanding of their precise functions difficult. Investigating their biology in other species might be enlightening. A sequence encoding a putative protein presenting 30-40% identity with the conserved core of vertebrate FGFs has been identified by the C. elegans sequencing consortium. We show here that this gene is transcribed and encodes a putative protein of 425 amino acids (aa). The gene is expressed at all stages of development beyond late embryogenesis, peaking at the larval stages. Loss-of-function mutants of the let-756 gene are rescued by the wild type fgf gene in germline transformation experiments. Two partial loss-of-function alleles, s2613 and s2809, have a mutation that replaces aa 317 by a stop. The truncated protein retains the FGF core but lacks a C-termins portion. These worms are small and develop slowly into clear and scrawny, yet viable and fertile adults. A third allele, s2887, is inactivated by an inversion that disrupts the first exon. It causes a developmental arrest early in the larval stages. Thus, in contrast to the other nematode fgf gene egl-17, let-756/fgf is essential for worm development.
Collapse
Affiliation(s)
- R Roubin
- Laboratoire d'Oncologie Moléculaire, U119 Inserm, Marseille, France
| | | | | | | | | | | | | | | | | | | |
Collapse
|
5
|
|
6
|
Abstract
Much of the world's genomic data are available to the community through networked databases that are accessed via Web interfaces. Although this paradigm provides browse-level access and has greatly facilitated linking between databases, it does not provide any convenient mechanism for programmatically fetching and integrating data from diverse databases. We have created a library and an application programming interface (API) named AcePerl that provides simple, direct access to ACEDB databases from the Perl programming language. With this library, programmers and computer-savvy biologists can write software to pose complex queries on local and remote ACEDB databases, retrieve the data, integrate the results, and move data objects from one database to another. In addition, a set of Web scripts running on top of AcePerl provides Web-based browsing of any local or remote ACEDB database. AcePerl and the AceBrowser Web browser run on Unix systems and are available under a license that allows for unrestricted use and redistribution. Both packages can be downloaded from URL. A Microsoft Windows port of AcePerl is in the planning stages.
Collapse
Affiliation(s)
- L D Stein
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA.
| | | |
Collapse
|
7
|
Stein LD, Cartinhour S, Thierry-Mieg D, Thierry-Mieg J. JADE: an approach for interconnecting bioinformatics databases. Gene 1998; 209:GC39-GC43. [PMID: 9583945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
To achieve the integration of biological data available on the World Wide Web and maintained in diverse sources such as GDB, Genbank or Acedb, we have developed a software called Jade. Jade allows programmers to create analytic tools and graphical user interfaces for one or more existing bioinformatics data sources. These tools can then be interchanged, compared and reused without making modifications in the data sources themselves. The system is implemented in the Java programming language and will run equally well on Macintosh, Windows or Unix workstations. Jade is free and can be used immediately by all interested parties.
Collapse
Affiliation(s)
- L D Stein
- MIT, WI/MIT Center for Genome, Cambridge, MA, USA
| | | | | | | |
Collapse
|
8
|
Abstract
Large-scale genomic sequencing requires a software infrastructure to support and integrate applications that are not directly compatible. We describe a suite of software tools built around the Common Assembly Format (CAF), a comprehensive representation of a sequence assembly as a text file. These tools form the backbone of sequencing informatics at the Sanger Centre and the Genome Sequencing Center. The CAF format is intentionally flexible, and our Perl and C libraries, which parse and manipulate it, provide powerful tools for creating new applications as well as wrappers to incorporate other software. The tools are available free by anonymous FTP from ftp://ftp.sanger.ac.uk/pub/badger/.
Collapse
Affiliation(s)
- S Dear
- Sanger Centre, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, CB10 1SA, UK
| | | | | | | | | | | |
Collapse
|
9
|
Sulston J, Du Z, Thomas K, Wilson R, Hillier L, Staden R, Halloran N, Green P, Thierry-Mieg J, Qiu L. The C. elegans genome sequencing project: a beginning. Nature 1992; 356:37-41. [PMID: 1538779 DOI: 10.1038/356037a0] [Citation(s) in RCA: 404] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The long-term goal of this project is the elucidation of the complete sequence of the Caenorhabditis elegans genome. During the first year methods have been developed and a strategy implemented that is amenable to large-scale sequencing. The three cosmids sequenced in this initial phase are surprisingly rich in genes, many of which have mammalian homologues.
Collapse
Affiliation(s)
- J Sulston
- MRC Laboratory of Molecular Biology, Cambridge, UK
| | | | | | | | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
A formally unitary Lagrangian model gauging an internal supersymmetry is proposed. The even subalgebra is gauged as a Yang-Mills theory, while the odd generators are gauged-according to Freedman's method-by skew tensor fields, equivalent dynamically to scalar Higgs fields. Chiral fermions are incorporated by following Townsend's construction and form irreducible supermultiplets graded by their helicity. The application to quantum asthenodynamics is discussed.
Collapse
Affiliation(s)
- J Thierry-Mieg
- Groupe d'Astrophysique Relativiste, Centre National de la Recherche Scientifique, Observatoire de Meudon, 92190 France
| | | |
Collapse
|
11
|
Ne'eman Y, Takasugi E, Thierry-Mieg J. Soft-group-manifold Becchi-Rouet-Stora transformations and unitarity for gravity, supergravity, and extensions. Int J Clin Exp Med 1980. [DOI: 10.1103/physrevd.22.2371] [Citation(s) in RCA: 23] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
12
|
Abstract
We provide a geometrical identification of the ghost fields, essential to the renormalization procedure in the non-Abelian (Yang-Mills) case. These are some of the local components of a connection on a principal bundle. They multiply the differentials of coordinates spanning directions orthogonal to those of a given section, whereas the Yang-Mills potential multiplies the coordinates in the section itself. In the case of a supergroup, the ghosts become commutative for the odd directions, and represent Nambu-Goldstone fields. We apply the results to chiral "flavor" SU(3)(L) x SU(3)(R) and to SU(2/1). The latter reproduces a highly constrained Weinberg-Salam model.
Collapse
Affiliation(s)
- Y Ne'eman
- Department of Physics and Astronomy, Tel-Aviv University, Tel-Aviv, Israel
| | | |
Collapse
|
13
|
Thierry-Mieg J. [Technics of vertebral manipulations used in the treatment of disk cruralgia. Indications, contra-indications and complications]. Sem Ther 1967; 43:401-5. [PMID: 4240941] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
14
|
Thierry-Mieg J. [Technics of vertebral manipulations used in the treatment of sciatica with discal hernia. Indications, contra-indications, and accidents]. Sem Ther 1966; 42:376-9. [PMID: 5984626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
|