scispace - formally typeset
Search or ask a question
Author

Erich M. Schwarz

Bio: Erich M. Schwarz is an academic researcher from Cornell University. The author has contributed to research in topics: Caenorhabditis elegans & WormBase. The author has an hindex of 32, co-authored 54 publications receiving 10134 citations. Previous affiliations of Erich M. Schwarz include Harvard University & Columbia University.


Papers
More filters
Journal ArticleDOI
Midori A. Harris, Jennifer I. Clark1, Ireland A1, Jane Lomax1, Michael Ashburner2, Michael Ashburner1, R. Foulger1, R. Foulger2, Karen Eilbeck1, Karen Eilbeck3, Suzanna E. Lewis1, Suzanna E. Lewis3, B. Marshall3, B. Marshall1, Christopher J. Mungall1, Christopher J. Mungall3, J. Richter3, J. Richter1, Gerald M. Rubin3, Gerald M. Rubin1, Judith A. Blake1, Carol J. Bult1, Dolan M1, Drabkin H1, Janan T. Eppig1, Hill Dp1, L. Ni1, Ringwald M1, Rama Balakrishnan4, Rama Balakrishnan1, J. M. Cherry1, J. M. Cherry4, Karen R. Christie1, Karen R. Christie4, Maria C. Costanzo1, Maria C. Costanzo4, Selina S. Dwight4, Selina S. Dwight1, Stacia R. Engel1, Stacia R. Engel4, Dianna G. Fisk1, Dianna G. Fisk4, Jodi E. Hirschman4, Jodi E. Hirschman1, Eurie L. Hong1, Eurie L. Hong4, Robert S. Nash4, Robert S. Nash1, Anand Sethuraman4, Anand Sethuraman1, Chandra L. Theesfeld4, Chandra L. Theesfeld1, David Botstein5, David Botstein1, Kara Dolinski1, Kara Dolinski5, Becket Feierbach5, Becket Feierbach1, Tanya Z. Berardini6, Tanya Z. Berardini1, S. Mundodi1, S. Mundodi6, Seung Y. Rhee1, Seung Y. Rhee6, Rolf Apweiler1, Daniel Barrell1, Camon E1, E. Dimmer1, Lee1, Rex L. Chisholm, Pascale Gaudet7, Pascale Gaudet1, Warren A. Kibbe1, Warren A. Kibbe7, Ranjana Kishore1, Ranjana Kishore8, Erich M. Schwarz8, Erich M. Schwarz1, Paul W. Sternberg8, Paul W. Sternberg1, M. Gwinn1, Hannick L1, Wortman J1, Matthew Berriman1, Matthew Berriman9, Wood1, Wood9, de la Cruz N10, de la Cruz N1, Peter J. Tonellato10, Peter J. Tonellato1, Pankaj Jaiswal11, Pankaj Jaiswal1, Seigfried T12, Seigfried T1, White R1, White R13 
TL;DR: The Gene Ontology (GO) project as discussed by the authors provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.
Abstract: The Gene Ontology (GO) project (http://www.geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

3,565 citations

Journal ArticleDOI
TL;DR: An algorithm has been developed which identifies alpha-helices involved in the interactions of membrane proteins with lipid bilayers and which distinguishes them from helices in soluble proteins, and suggests four transmembrane helices and a surface-seeking helix in fragment B, the moiety known to have trans Membrane function.

2,157 citations

Journal ArticleDOI
Midori A. Harris, Jennifer I. Deegan, Amelia Ireland, Jane Lomax, Michael Ashburner1, Susan Tweedie1, Seth Carbon2, Suzanna E. Lewis2, Christopher J. Mungall2, John Day Richter2, Karen Eilbeck, Judith A. Blake, Carol J. Bult, Alexander D. Diehl, Mary E. Dolan, Harold J. Drabkin, Janan T. Eppig, David P. Hill, Ni Li, Martin Ringwald, Rama Balakrishnan3, Gail Binkley3, J. Michael Cherry3, Karen R. Christie3, Maria C. Costanzo3, Qing Dong3, Stacia R. Engel3, Dianna G. Fisk3, Jodi E. Hirschman3, Benjamin C. Hitz3, Eurie L. Hong3, Cynthia J. Krieger3, Stuart R. Miyasato3, Robert S. Nash3, Julie Park3, Marek S. Skrzypek3, Shuai Weng3, Edith D. Wong3, Kathy K. Zhu3, David Botstein4, Kara Dolinski4, Michael S. Livstone4, Rose Oughtred4, Tanya Z. Berardini5, Li Donghui5, Seung Y. Rhee5, Rolf Apweiler6, Daniel Barrell6, Evelyn Camon6, Emily Dimmer6, Rachael P. Huntley, Nicola Mulder, Varsha K. Khodiyar, Ruth C. Lovering, Sue Povey, Rex L. Chisholm, Petra Fey, Pascale Gaudet, Warren A. Kibbe, Ranjana Kishore, Erich M. Schwarz, Paul W. Sternberg, Kimberly Van Auken, Michelle G. Giglio, Linda Hannick, Jennifer R. Wortman, Martin Aslett, Matthew Berriman, Valerie Wood, Howard J. Jacob, Stan Laulederkind, Victoria Petri, Mary Shimoyama, Jennifer L. Smith, Simon N. Twigger, Pankaj Jaiswal, Trent E. Seigfried, Doug Howe, Monte Westerfield, Candace Collmer, Trudy Torto Alalibo, Erika Feltrin, Giorgio Valle, Susan Bromberg, Shane C. Burgess, Fiona M. McCarthy 
TL;DR: The GO Consortium has launched a focused effort to provide comprehensive and detailed annotation of orthologous genes across a number of ‘reference’ genomes, including human and several key model organisms.
Abstract: The Gene Ontology (GO) project (http://www.geneontology.org) provides a set of structured, controlled vocabularies for community use in annotating genes, gene products and sequences (also see http://www.sequenceontology.org/). The ontologies have been extended and refined for several biological areas, and improvements to the structure of the ontologies have been implemented. To improve the quantity and quality of gene product annotations available from its public repository, the GO Consortium has launched a focused effort to provide comprehensive and detailed annotation of orthologous genes across a number of reference genomes, including human and several key model organisms. Software developments include two releases of the ontology-editing tool OBO-Edit, and improvements to the AmiGO browser interface.

726 citations

Journal ArticleDOI
TL;DR: The nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species.
Abstract: WormBase (http://www.wormbase.org/) is the central data repository for information about Caenorhabditis elegans and related nematodes. As a model organism database, WormBase extends beyond the genomic sequence, integrating experimental results with extensively annotated views of the genome. The WormBase Consortium continues to expand the biological scope and utility of WormBase with the inclusion of large-scale genomic analyses, through active data and literature curation, through new analysis and visualization tools, and through refinement of the user interface. Over the past year, the nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species. Extensive site-wide refinement of the user interface now provides quick access to the most frequently accessed resources and a consistent browsing experience across the site. Unified single-page views now provide complete summaries of commonly accessed entries like genes. These advances continue to increase the utility of WormBase for C.elegans researchers, as well as for those researchers exploring problems in functional and comparative genomics in the context of a powerful genetic system.

664 citations

01 Jan 2004
TL;DR: The Gene Ontology (GO) project provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences.
Abstract: The Gene Ontology (GO) project (http://www. geneontology.org/) provides structured, controlled vocabularies and classifications that cover several domains of molecular and cellular biology and are freely available for community use in the annotation of genes, gene products and sequences. Many model organism databases and genome annotation groups use the GO and contribute their annotation sets to the GO resource. The GO database integrates the vocabularies and contributed annotations and provides full access to this information in several formats. Members of the GO Consortium continually work collectively, involving outside experts as needed, to expand and update the GO vocabularies. The GO Web resource also provides access to extensive documentation about the GO project and links to applications that use GO data for functional analyses.

559 citations


Cited by
More filters
Journal ArticleDOI
TL;DR: The Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt), which is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces.
Abstract: To provide the scientific community with a single, centralized, authoritative resource for protein sequences and functional information, the Swiss-Prot, TrEMBL and PIR protein database activities have united to form the Universal Protein Knowledgebase (UniProt) consortium. Our mission is to provide a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase, with extensive cross-references and query interfaces. The central database will have two sections, corresponding to the familiar Swiss-Prot (fully manually curated entries) and TrEMBL (enriched with automated classification, annotation and extensive cross-references). For convenient sequence searches, UniProt also provides several non-redundant sequence databases. The UniProt NREF (UniRef) databases provide representative subsets of the knowledgebase suitable for efficient searching. The comprehensive UniProt Archive (UniParc) is updated daily from many public source databases. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). The scientific community is encouraged to submit data for inclusion in UniProt.

7,298 citations

Journal ArticleDOI
08 Sep 1989-Science
TL;DR: A deletion of three base pairs that results in the omission of a phenylalanine residue at the center of the first predicted nucleotide-binding domain was detected in CF patients.
Abstract: Overlapping complementary DNA clones were isolated from epithelial cell libraries with a genomic DNA segment containing a portion of the putative cystic fibrosis (CF) locus, which is on chromosome 7 Transcripts, approximately 6500 nucleotides in size, were detectable in the tissues affected in patients with CF The predicted protein consists of two similar motifs, each with (i) a domain having properties consistent with membrane association and (ii) a domain believed to be involved in ATP (adenosine triphosphate) binding A deletion of three base pairs that results in the omission of a phenylalanine residue at the center of the first predicted nucleotide-binding domain was detected in CF patients

6,731 citations

Journal ArticleDOI
TL;DR: The National Center for Biotechnology Information Reference Sequence (RefSeq) database provides a non-redundant collection of sequences representing genomic data, transcripts and proteins that pragmatically includes sequence data that are currently publicly available in the archival databases.
Abstract: The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) provides a non-redundant collection of sequences representing genomic data, transcripts and proteins. Although the goal is to provide a comprehensive dataset representing the complete sequence information for any given species, the database pragmatically includes sequence data that are currently publicly available in the archival databases. The database incorporates data from over 2400 organisms and includes over one million proteins representing significant taxonomic diversity spanning prokaryotes, eukaryotes and viruses. Nucleotide and protein sequences are explicitly linked, and the sequences are linked to other resources including the NCBI Map Viewer and Gene. Sequences are annotated to include coding regions, conserved domains, variation, references, names, database cross-references, and other features using a combined approach of collaboration and other input from the scientific community, automated annotation, propagation from GenBank and curation by NCBI staff.

4,229 citations

Journal ArticleDOI
TL;DR: During 2004, tens of thousands of Knowledgebase records got manually annotated or updated; the UniProt keyword list got augmented by additional keywords; the documentation of the keywords and are continuously overhauling and standardizing the annotation of post-translational modifications.
Abstract: The Universal Protein Resource (UniProt) provides the scientific community with a single, centralized, authoritative resource for protein sequences and functional information. Formed by uniting the Swiss-Prot, TrEMBL and PIR protein database activities, the UniProt consortium produces three layers of protein sequence databases: the UniProt Archive (UniParc), the UniProt Knowledgebase (UniProt) and the UniProt Reference (UniRef) databases. The UniProt Knowledgebase is a comprehensive, fully classified, richly and accurately annotated protein sequence knowledgebase with extensive cross-references. This centrepiece consists of two sections: UniProt/Swiss-Prot, with fully, manually curated entries; and UniProt/TrEMBL, enriched with automated classification and annotation. During 2004, tens of thousands of Knowledgebase records got manually annotated or updated; we introduced a new comment line topic: TOXIC DOSE to store information on the acute toxicity of a toxin; the UniProt keyword list got augmented by additional keywords; we improved the documentation of the keywords and are continuously overhauling and standardizing the annotation of post-translational modifications. Furthermore, we introduced a new documentation file of the strains and their synonyms. Many new database cross-references were introduced and we started to make use of Digital Object Identifiers. We also achieved in collaboration with the Macromolecular Structure Database group at EBI an improved integration with structural databases by residue level mapping of sequences from the Protein Data Bank entries onto corresponding UniProt entries. For convenient sequence searches we provide the UniRef non-redundant sequence databases. The comprehensive UniParc database stores the complete body of publicly available protein sequence data. The UniProt databases can be accessed online (http://www.uniprot.org) or downloaded in several formats (ftp://ftp.uniprot.org/pub). New releases are published every two weeks.

4,074 citations