Home
/
Authors
/
C. M. Sperberg-McQueen

Author

C. M. Sperberg-McQueen

Other affiliations: University of Illinois at Chicago

Bio: C. M. Sperberg-McQueen is an academic researcher from Massachusetts Institute of Technology. The author has contributed to research in topics: Markup language & SGML. The author has an hindex of 16, co-authored 33 publications receiving 9272 citations. Previous affiliations of C. M. Sperberg-McQueen include University of Illinois at Chicago.

Topics: Markup language, SGML, XML, PCDATA, RuleML ...read more

Papers

PDF

Open Access

More filters

Journal Article•

Extensible Markup Language (XML).

[...]

Tim Bray, Jean Paoli, C. M. Sperberg-McQueen

01 Jan 1997-World Wide Web

TL;DR: XML is an extremely simple dialect of SGML which is completely described in this document, to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML.

...read moreread less

Abstract: Extensible Markup Language (XML) is an extremely simple dialect of SGML which is completely described in this document. The goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML. For this reason, XML has been designed for ease of implementation, and for interoperability with both SGML and HTML. Note on status of this document: This is even more of a moving target than the typical W3C working draft. Several important decisions on the details of XML are still outstanding members of the W3C SGML Working Group will recognize these areas of particular volatility in the spec, but those who are not intimately familiar with the deliberative process should be careful to avoid actions based on the content of this document, until the notice you are now reading has been removed.

...read moreread less

5,749 citations

Journal Article•DOI•

Extensible markup language

[...]

Tim Bray, Jean Paoli, C. M. Sperberg-McQueen

01 Nov 1997-World Wide Web

TL;DR: XML is the lingua franca of the wireless Web and is already being used for a host of server-server communication applications, which make it possible for different data servers to easily exchange information.

...read moreread less

Abstract: XML is the lingua franca of the wireless Web. Its strength is in its generality: XML can describe virtually any kind of structured data. Once described, the data can be presented in other formats. Moreover, XML is already being used for a host of server-server communication applications, which make it possible for different data servers to easily exchange information. The trend toward a common format for representing data will doubtlessly present new opportunities for both Web and wireless Web clients.

...read moreread less

1,871 citations

Book•

Guidelines for electronic text encoding and interchange

[...]

C. M. Sperberg-McQueen, Lou Burnard, Linguistic Computing

01 Jan 1994

861 citations

Journal Article•DOI•

What is transcription

[...]

Claus Huitfeldt¹, C. M. Sperberg-McQueen²•Institutions (2)

University of Bergen¹, Massachusetts Institute of Technology²

01 Sep 2008-Literary and Linguistic Computing

TL;DR: Preliminary sketches for a formal account of transcription as it is performed in scholarly editing and in the creation of digital resources are described.

...read moreread less

Abstract: This paper describes preliminary sketches for a formal account of transcription as it is performed in scholarly editing and in the creation of digital resources. After a general outline of our approach, we present two formal models of transcription. The first addresses only the very simplest cases, the second addresses some but not all of the gaps in the first. Finally, we mention some less simple cases and discuss some elaborations of the model which we hope to develop in future work.

...read moreread less

253 citations

Book Chapter•DOI•

GODDAG: A Data Structure for Overlapping Hierarchies

[...]

C. M. Sperberg-McQueen¹, Claus Huitfeldt²•Institutions (2)

Massachusetts Institute of Technology¹, University of Bergen²

13 Sep 2000

TL;DR: In this article, the use of non-SGML notations with translation into SGML for processing has been proposed for document overlapping structures in which the structures of interest are not properly nested.

...read moreread less

Abstract: Notations like SGML and XML represent document structures using tree structures; while this is in general a step forward from earlier systems, it creates certain difficulties for the representation of documents in which the structures of interest are not properly nested. Overlapping structures, discontinuous structures, and material which occurs in different orders in different parts, views, or versions of a document are all problems for SGML and XML. Overlapping structures have received attention from a variety of authors on SGML and XML, who have proposed various solutions including the use of non-SGML notations with translation into SGML for processing, the use of the concur feature of SGML, exploitation of conditional marked sections in the DTD and document instance, the imposition of various kinds of unusual interpretations on SGML/XML elements as milestones or as fragments of some larger ‘virtual’ element, or the use of detailed annotation separate from the base text being annotated.

...read moreread less

95 citations

1
2
3
4
…
5
6
7

Collapse

Cited by

PDF

Open Access

More filters

Journal Article•DOI•

UCSF Chimera--a visualization system for exploratory research and analysis.

[...]

Eric F. Pettersen¹, Thomas D. Goddard¹, Conrad C. Huang¹, Gregory S. Couch¹, Daniel M. Greenblatt¹, Elaine C. Meng¹, Thomas E. Ferrin¹ - Show less +3 more•Institutions (1)

University of California, San Francisco¹

01 Oct 2004-Journal of Computational Chemistry

TL;DR: Two unusual extensions are presented: Multiscale, which adds the ability to visualize large‐scale molecular assemblies such as viral coats, and Collaboratory, which allows researchers to share a Chimera session interactively despite being at separate locales.

...read moreread less

Abstract: The design, implementation, and capabilities of an extensible visualization system, UCSF Chimera, are discussed. Chimera is segmented into a core that provides basic services and visualization, and extensions that provide most higher level functionality. This architecture ensures that the extension mechanism satisfies the demands of outside developers who wish to incorporate new features. Two unusual extensions are presented: Multiscale, which adds the ability to visualize large-scale molecular assemblies such as viral coats, and Collaboratory, which allows researchers to share a Chimera session interactively despite being at separate locales. Other extensions include Multalign Viewer, for showing multiple sequence alignments and associated structures; ViewDock, for screening docked ligand orientations; Movie, for replaying molecular dynamics trajectories; and Volume Viewer, for display and analysis of volumetric data. A discussion of the usage of Chimera in real-world situations is given, along with anticipated future directions. Chimera includes full user documentation, is free to academic and nonprofit users, and is available for Microsoft Windows, Linux, Apple Mac OS X, SGI IRIX, and HP Tru64 Unix from http://www.cgl.ucsf.edu/chimera/.

...read moreread less

35,698 citations

Journal Article•DOI•

Bioconductor: open software development for computational biology and bioinformatics

[...]

Robert Gentleman¹, Vincent J. Carey², Douglas M. Bates³, Benjamin M. Bolstad⁴, Marcel Dettling, Sandrine Dudoit⁴, Byron Ellis¹, Laurent Gautier⁵, Yongchao Ge⁶, Jeff Gentry¹, Kurt Hornik⁷, Torsten Hothorn⁸, Wolfgang Huber⁹, Stefano Maria Iacus¹⁰, Rafael A. Irizarry¹¹, Friedrich Leisch⁷, Cheng Li¹, Martin Maechler, A. J. Rossini¹², Günther Sawitzki, Colin A. Smith¹³, Gordon K. Smyth¹⁴, Luke Tierney¹⁵, Jean Yang, Jianhua Zhang¹ - Show less +21 more•Institutions (15)

Harvard University¹, Brigham and Women's Hospital², University of Wisconsin-Madison³, University of California, Berkeley⁴, Technical University of Denmark⁵, Icahn School of Medicine at Mount Sinai⁶, Vienna University of Technology⁷, University of Erlangen-Nuremberg⁸, German Cancer Research Center⁹, University of Milan¹⁰, Johns Hopkins University¹¹, University of Washington¹², Scripps Research Institute¹³, Walter and Eliza Hall Institute of Medical Research¹⁴, University of Iowa¹⁵

15 Sep 2004-Genome Biology

TL;DR: Details of the aims and methods of Bioconductor, the collaborative creation of extensible software for computational biology and bioinformatics, and current challenges are described.

...read moreread less

Abstract: The Bioconductor project is an initiative for the collaborative creation of extensible software for computational biology and bioinformatics. The goals of the project include: fostering collaborative development and widespread use of innovative software, reducing barriers to entry into interdisciplinary scientific research, and promoting the achievement of remote reproducibility of research results. We describe details of our aims and methods, identify current challenges, compare Bioconductor to other open bioinformatics projects, and provide working examples.

...read moreread less

12,142 citations

Book•

Foundations of Statistical Natural Language Processing

[...]

Christopher D. Manning¹, Hinrich Schütze²•Institutions (2)

Stanford University¹, PARC²

28 May 1999

TL;DR: This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear and provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations.

...read moreread less

Abstract: Statistical approaches to processing natural language text have become dominant in recent years This foundational text is the first comprehensive introduction to statistical natural language processing (NLP) to appear The book contains all the theory and algorithms needed for building NLP tools It provides broad but rigorous coverage of mathematical and linguistic foundations, as well as detailed discussion of statistical methods, allowing students and researchers to construct their own implementations The book covers collocation finding, word sense disambiguation, probabilistic parsing, information retrieval, and other applications

...read moreread less

9,295 citations

OWL Web ontology language overview

[...]

Deborah L. McGuinness

01 Jan 2004

TL;DR: This document provides an introduction to OWL by informally describing the features of each of the sublanguages of OWL, the Web Ontology Language by providing additional vocabulary along with a formal semantics.

...read moreread less

Abstract: The OWL Web Ontology Language is designed for use by applications that need to process the content of information instead of just presenting information to humans. OWL facilitates greater machine interpretability of Web content than that supported by XML, RDF, and RDF Schema (RDF-S) by providing additional vocabulary along with a formal semantics. OWL has three increasingly-expressive sublanguages: OWL Lite, OWL DL, and OWL Full. This document is written for readers who want a first impression of the capabilities of OWL. It provides an introduction to OWL by informally describing the features of each of the sublanguages of OWL. Some knowledge of RDF Schema is useful for understanding this document, but not essential. After this document, interested readers may turn to the OWL Guide for more detailed descriptions and extensive examples on the features of OWL. The normative formal definition of OWL can be found in the OWL Semantics and Abstract Syntax. Status of this document OWL Web Ontology Language Overview https://www.w3.org/TR/owl-features/ 1 de 14 09/05/2017 08:32 a.m. This document has been reviewed by W3C Members and other interested parties, and it has been endorsed by the Director as a W3C Recommendation. W3C's role in making the Recommendation is to draw attention to the specification and to promote its widespread deployment. This enhances the functionality and interoperability of the Web. This is one of six parts of the W3C Recommendation for OWL, the Web Ontology Language. It has been developed by the Web Ontology Working Group as part of the W3C Semantic Web Activity (Activity Statement, Group Charter) for publication on 10 February 2004. The design of OWL expressed in earlier versions of these documents has been widely reviewed and satisfies the Working Group's technical requirements. The Working Group has addressed all comments received, making changes as necessary. Changes to this document since the Proposed Recommendation version are detailed in the change log. Comments are welcome at public-webont-comments@w3.org (archive) and general discussion of related technology is welcome at www-rdf-logic@w3.org (archive). A list of implementations is available. The W3C maintains a list of any patent disclosures related to this work. This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.

...read moreread less

4,147 citations

The Physiology of the Grid An Open Grid Services Architecture for Distributed Systems Integration

[...]

Ian Foster, Carl Kesselman, Jeffrey M. Nick, Steven Tuecke

01 Jan 2002

TL;DR: This presentation complements an earlier foundational article, “The Anatomy of the Grid,” by describing how Grid mechanisms can implement a service-oriented architecture, explaining how Grid functionality can be incorporated into a Web services framework, and illustrating how the architecture can be applied within commercial computing as a basis for distributed system integration.

...read moreread less

Abstract: In both e-business and e-science, we often need to integrate services across distributed, heterogeneous, dynamic “virtual organizations” formed from the disparate resources within a single enterprise and/or from external resource sharing and service provider relationships. This integration can be technically challenging because of the need to achieve various qualities of service when running on top of different native platforms. We present an Open Grid Services Architecture that addresses these challenges. Building on concepts and technologies from the Grid and Web services communities, this architecture defines a uniform exposed service semantics (the Grid service); defines standard mechanisms for creating, naming, and discovering transient Grid service instances; provides location transparency and multiple protocol bindings for service instances; and supports integration with underlying native platform facilities. The Open Grid Services Architecture also defines, in terms of Web Services Description Language (WSDL) interfaces and associated conventions, mechanisms required for creating and composing sophisticated distributed systems, including lifetime management, change management, and notification. Service bindings can support reliable invocation, authentication, authorization, and delegation, if required. Our presentation complements an earlier foundational article, “The Anatomy of the Grid,” by describing how Grid mechanisms can implement a service-oriented architecture, explaining how Grid functionality can be incorporated into a Web services framework, and illustrating how our architecture can be applied within commercial computing as a basis for distributed system integration—within and across organizational domains. This is a DRAFT document and continues to be revised. The latest version can be found at http://www.globus.org/research/papers/ogsa.pdf. Please send comments to foster@mcs.anl.gov, carl@isi.edu, jnick@us.ibm.com, tuecke@mcs.anl.gov Physiology of the Grid 2

...read moreread less

3,455 citations

1
2
3
4
…
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse