scispace - formally typeset
Search or ask a question

Showing papers on "XML published in 2000"


Journal ArticleDOI
01 Jun 2000
TL;DR: The WebML language and its accompanying design method are fully implemented in a pre-competitive Web design tool suite, called ToriiSoft, supporting advanced features like multi-device access, personalization, and evolution management.
Abstract: Designing and maintaining Web applications is one of the major challenges for the software industry of the year 2000. In this paper we present Web Modeling Language (WebML), a notation for specifying complex Web sites at the conceptual level. WebML enables the high-level description of a Web site under distinct orthogonal dimensions: its data content (structural model), the pages that compose it (composition model), the topology of links between pages (navigation model), the layout and graphic requirements for page rendering (presentation model), and the customization features for one-to-one content delivery (personalization model). All the concepts of WebML are associated with a graphic notation and a textual XML syntax. WebML specifications are independent of both the client-side language used for delivering the application to users, and of the server-side platform used to bind data to pages, but they can be effectively used to produce a site implementation in a specific technological setting. WebML guarantees a model-driven approach to Web site development, which is a key factor for defining a novel generation of CASE tools for the construction of complex sites, supporting advanced features like multi-device access, personalization, and evolution management. The WebML language and its accompanying design method are fully implemented in a pre-competitive Web design tool suite, called ToriiSoft.

929 citations


Proceedings Article
10 Sep 2000
TL;DR: This paper has developed several index organizations and search algorithms for performing efficient filtering of XML documents for large-scale information dissemination systems and examines their performance across a range of document, workload, and scale scenarios.
Abstract: Information Dissemination applications are gaining increasing popularity due to dramatic improvements in communications bandwidth and ubiquity. The sheer volume of data available necessitates the use of selective approaches to dissemination in order to avoid overwhelming users with unnecessaryinformation. Existing mechanisms for selective dissemination typically rely on simple keyword matching or “bag of words” information retrieval techniques. The advent of XML as a standard for information exchangeand the development of query languages for XML data enables the development of more sophisticated filtering mechanisms that take structure information into account. We have developed several index organizations and search algorithms for performing efficient filtering of XML documents for large-scale information dissemination systems. In this paper we describe these techniques and examine their performance across a range of document, workload, and scale scenarios.

666 citations


Proceedings ArticleDOI
29 Feb 2000
TL;DR: The paper describes the methodology and the software development of XWRAP, an XML-enabled wrapper construction system for semi-automatic generation of wrapper programs, and introduces and develops a two-phase code generation framework.
Abstract: The paper describes the methodology and the software development of XWRAP, an XML-enabled wrapper construction system for semi-automatic generation of wrapper programs. By XML-enabled we mean that the metadata about information content that are implicit in the original Web pages will be extracted and encoded explicitly as XML tags in the wrapped documents. In addition, the query based content filtering process is performed against the XML documents. The XWRAP wrapper generation framework has three distinct features. First, it explicitly separates tasks of building wrappers that are specific to a Web source from the tasks that are repetitive for any source, and uses a component library to provide basic building blocks for wrapper programs. Second, it provides a user friendly interface program to allow wrapper developers to generate their wrapper code with a few mouse clicks. Third and most importantly, we introduce and develop a two-phase code generation framework. The first phase utilizes an interactive interface facility to encode the source-specific metadata knowledge identified by individual wrapper developers as declarative information extraction rules. The second phase combines the information extraction rules generated at the first phase with the XWRAP component library to construct an executable wrapper program for the given Web source. We report the initial experiments on performance of the XWRAP code generation system and the wrapper programs generated by XWRAP.

520 citations


Patent
12 Sep 2000
TL;DR: In this article, a system and method for searching for documents within spaces in a distributed computing environment are provided, where a client sends a lookup message to a space which stores documents, and a set of zero or more documents which match the lookup message are discovered.
Abstract: A system and method for searching for documents within spaces in a distributed computing environment are provided. A client sends a lookup message to a space which stores documents. The lookup message may specify desired characteristics, such as a name or partial XML schema, of the stored documents. The documents may include XML service advertisements and XML device advertisements as well as general-purpose XML documents. A set of zero or more documents which match the lookup message are discovered. In one embodiment, the lookup message may include a desired name. If the lookup message includes both a desired name and a desired schema, the set of discovered documents may include both discovered documents having a name that matches the desired name and discovered documents having a schema that matches the desired schema. If the lookup message includes neither a desired name nor a desired schema, the set of discovered documents may include substantially all the documents stored in the space. After the matching documents are found, the space may send a lookup response message to the client. For each discovered document, the lookup response message may include a name and an advertisement. Each advertisement may include information which is usable by the client to obtain the respective discovered document or access the resource (e.g., a service) that the document advertises. The advertisements and messages may be expressed in a data representation language such as XML.

458 citations


Proceedings Article
01 Jan 2000
TL;DR: This specification defines the XML Linking Language (XLink) Version 1.1, which allows elements to be inserted into XML documents in order to create and describe links between resources, which enhances the functionality and interoperability of the Web.
Abstract: This specification defines the XML Linking Language (XLink) Version 1.1, which allows elements to be inserted into XML documents in order to create and describe links between resources. It uses XML syntax to create structures that can describe links similar to the simple unidirectional hyperlinks of today's HTML, as well as more sophisticated links. Status of this document This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/. This document is a W3C Recommendation. It implements all of the XLink 1.1 requirements documented in [Extending XLink 1.0]. This document is a product of the XML Core Working Group as part of the W3C XML Activity. This edition supersedes the previous W3C Recommendation of 27 June 2001. Please report errors in this document to the public www-xml-linking-comments@w3.org mailing list; public archives are available. There is an Implementation Report for XLink 1.1. A Test Suite is maintained to help assessing the conformance to this specification. This document has been reviewed by W3C Members, by software developers, and by other W3C groups and interested parties, and is endorsed by the Director as a W3C Recommendation. It is a stable document and may be used as reference material or cited from another document. W3C's role in making the Recommendation is to draw attention to the specification and to promote its widespread deployment. This enhances the functionality and interoperability of the Web. This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy. XML Linking Language (XLink) ii

416 citations


Journal ArticleDOI
01 Jun 2000
TL;DR: This paper describes SilkRoute, a general, dynamic, and efficient tool for viewing and querying relational data in XML, which can express mappings of relational data into XML that conforms to arbitrary DTDs.
Abstract: XML is the standard format for data exchange between inter-enterprise applications on the Internet. To facilitate data exchange, industry groups define public document type definitions (DTDs) that specify the format of the XML data to be exchanged between their applications. In this paper, we address the problem of automating the conversion of relational data into XML. We describe SilkRoute, a general , dynamic , and efficient tool for viewing and querying relational data in XML. SilkRoute is general, because it can express mappings of relational data into XML that conforms to arbitrary DTDs. We call these mappings views . Applications express the data they need as an XML-QL query over the view. SilkRoute is dynamic, because it only materializes the fragment of an XML view needed by an application, and it is efficient, because it fully exploits the underlying RDBMs query engine whenever data items in an XML view need to be materialized.

354 citations


01 Jan 2000
TL;DR: XHTML 1.0 as discussed by the authors is a reformulation of HTML 4 as an XML1.0 application, and three DTDs corresponding to the ones defined by HTML 4 are defined in the W3C Recommendation for HTML 4.
Abstract: This specification defines XHTML 1.0, a reformulation of HTML 4 as an XML 1.0 application, and three DTDs corresponding to the ones defined by HTML 4. The semantics of the elements and their attributes are defined in the W3C Recommendation for HTML 4. These semantics provide the foundation for future extensibility of XHTML. Compatibility with existing HTML user agents is possible by following a small set of guidelines.

350 citations


Patent
22 Dec 2000
TL;DR: In this paper, a method for creating user interface (UI) to assemble a document that conforms to a particular document type definition is presented. The method hides the specific syntax of document type definitions such as DTDs and schemas from the user.
Abstract: A method on an information processing unit performing steps for creating a user interface (UI) to assemble a document that conforms to a particular document type definition. The method hides the specific syntax of document type definitions such as DTDs and schemas from the user. The method begins with a selection from a user for a document type or an existing document. Once the document type is selected or determined from the existing document the document type definitions are retrieved. The document type definitions include one or more elements. The method parses the elements which are subsequently mapped to one or more interface controls such as icons, pull-down menus, buttons, selection boxes, progress indicators, on-off checkmarks, scroll bars, windows, window edges for resizing the window, toggle buttons, forms, and UI widgets. UI can be GUIs or interactive voice response systems. A UI editor is presented by assembling the one or more interface controls without presenting specific document type definition syntax to a user. The UI editor permits the user to create and edit the content objects that are associated with the interface controls. The content objects are aggregated in an XML compatible format and ready to be checked in for further processing. The method permits specific UI interfaces to be created for specific publishing environments and at the same time permit the creation of reusable content objects.

333 citations


Patent
Mehran Moshfeghi1
29 Mar 2000
TL;DR: In this article, the authors present methods and apparatus for browsing markup language documents from within the context of a client-server application running on an end-user device, which is embedded in the application, and can be activated by application controls.
Abstract: This invention includes methods and apparatus for browsing markup language documents from within the context of a client-server application running on an end-user device. Browser functionality, which is configured according to user profile information specifying each user's authorization and preferences, is embedded in the application, and can be activated by application controls. While some users have unrestricted authorization and access, others are restricted to certain browser functions and to certain allowed network resources. This restriction is enforced by preventing the browser functionality from generating network addresses that are not on a list of allowed network addresses also present in the user profile information. Network access restriction is achieved, in part, by filtering markup language documents before display to delete linking information that is not allowed. Document filtering methods are presented for Hypertext Markup Language (HTML) and extensible Markup Language (XML) documents. The document filtering methods are extendable to additional markup languages.

316 citations


Proceedings ArticleDOI
01 Dec 2000
TL;DR: The feasibility of splitting the different tags in an XML Web page to different end users browsers, and of pushing updates from the browsing session to heterogeneous devices, including a laptop and a PDA are demonstrated.
Abstract: WebSplitter symbolizes the union of pervasive multi-device computing and collaborative multi-user computing. WebSplitter provides a unified XML framework that enables multi-device and multi-user Web browsing. WebSplitter splits a requested Web page and delivers the appropriate partial view of each page to each user, or more accurately to each user's set of devices. Multiple users can participate in the same browsing session, as in traditional conferencing groupware. Depending on the access privileges of the user to the different components of content on each page, WebSplitter generates a personalized partial view. WebSplitter further splits the partial view among the devices available to each user, e.g. laptop, wireless PDA, projection display, stereo speakers, orchestrating a composite presentation across the devices. A wireless PDA can browse while remotely controlling the multimedia capabilities of nearby devices. The architecture consists of an XML metadata policy file defining access privileges to XML tags on a Web page, a middleware proxy that splits XML Web content to create partial views, and a client-side component, e.g. applet, enabling user login and reception of pushed browsing data. Service discovery finds and registers proxies, browsing sessions, and device capabilities. We demonstrate the feasibility of splitting the different tags in an XML Web page to different end users browsers, and of pushing updates from the browsing session to heterogeneous devices, including a laptop and a PDA.

289 citations


Journal ArticleDOI
TL;DR: GEM is more comprehensive than existing models and is expressively adequate to represent the heterogeneous information contained in guidelines and contributes to a flexible, comprehensible, shareable, and reusable knowledge representation that is both readable by human beings and processible by computers.

Patent
16 Mar 2000
TL;DR: In this article, a preprocessor generates XML-structured request objects with unresolved links to the data sources that have information required by the clients, and a post-processor filters the XML response documents, and applies XSL stylesheets to transform the XML composite response documents into client-specific responses that conform to the format required by clients.
Abstract: A method and system for allowing clients to retrieve data from data sources that do not necessarily support the same protocols and formats as the clients. The clients issue service requests. A pre-processor responds to the requests by generating XML-structured request objects with unresolved links to the data sources that have information required by the clients. An XML processor resolves the links by issuing requests through one or more gateways. The gateways convert the responses received from the data sources into XML, which the XML processor uses to create XML composite response documents. A post-processor filters the XML response documents, and applies XSL stylesheets to transform the XML composite response documents into client-specific responses that conform to the format required by the clients. The clients-specific responses are then sent to the clients.

Book ChapterDOI
18 May 2000
TL;DR: Quilt as discussed by the authors is a query language that combines information from diverse data sources into a new query result with a new structure of its own, which can combine information from different data sources.
Abstract: The World Wide Web promises to transform human society by making virtually all types of information instantly available everywhere. Two prerequisites for this promise to be realized are a universal markup language and a universal query language. The power and flexibility of XML make it the leading candidate for a universal markup language. XML provides a way to label information from diverse data sources including structured and semi-structured documents, relational databases, and object repositories. Several XML-based query languages have been proposed, each oriented toward a specific category of information. Quilt is a new proposal that attempts to unify concepts from several of these query languages, resulting in a new language that exploits the full versatility of XML. The name Quilt suggests both the way in which features from several languages were assembled to make a new query language, and the way in which Quilt queries can combine information from diverse data sources into a query result with a new structure of its own.

Patent
14 Apr 2000
TL;DR: In this article, an XML-based system for synchronizing a configuration of resources across multiple computing devices is described. But it does not describe how to synchronize the XML lists generated by the master computing device with the additional computing devices.
Abstract: Extensible markup language (XML) methods and systems for synchronizing a configuration of resources across multiple computing devices are described. In the described embodiment, a computer network comprises a master computing device and a plurality of additional computing devices communicatively linked with the master computing device. The master computing device comprises a configuration of resources that are desired to be synchronized across the additional computing devices. The master computing device is configured to develop a XML list that describes aspects of its configuration of resources. Each additional computing device is configured to receive the XML list that is developed by the master computing device and evaluate the XML list to determine whether its particular configuration of resources are synchronized with those of the master computing device. If a particular additional computing device's configuration of resources is not synchronized with those of the master computing device, the additional computing device prepares a second XML list that describes its resource configuration needs and sends the second XML list to the master computing device. The master computing device receives the second XML list, evaluates it, and sends data to the corresponding additional computing device for use in synchronizing its configuration of resources. Each of the additional computing devices can send separate independent XML lists that describe their resource needs and the master computing device is capable of individually responding to each one. In one aspect, the computing devices are computer servers that are configured as Web servers.

Journal ArticleDOI
01 Sep 2000
TL;DR: A comparative analysis of six noteworthy XML schema languages is presented and it is shown that there is a substantial increase of the amount of data in XML format.
Abstract: As XML [5] is emerging as the data format of the internet era, there is an substantial increase of the amount of data in XML format. To better describe such XML data structures and constraints, several XML schema languages have been proposed. In this paper, we present a comparative analysis of six noteworthy XML schema languages.

Proceedings ArticleDOI
01 Sep 2000
TL;DR: The subtyping algorithm developed here is a variant of Aiken and Murphy's set-inclusion constraint solver, to which are added several optimizations and two new properties: the algorithm is provably complete, and it allows a useful "subtagging" relation between nodes with different labels in XML trees.
Abstract: We propose regular expression types as a foundation for XML processing languages. Regular expression types are a natural generalization of Document Type Definitions (DTDs), describing structures in XML documents using regular expression operators (i.e., *, ?, |, etc.) and supporting a simple but powerful notion of subtyping.The decision problem for the subtype relation is EXPTIME-hard, but it can be checked quite efficiently in many cases of practical interest. The subtyping algorithm developed here is a variant of Aiken and Murphy's set-inclusion constraint solver, to which are added several optimizations and two new properties: (1) our algorithm is provably complete, and (2) it allows a useful "subtagging" relation between nodes with different labels in XML trees.

Journal ArticleDOI
01 Jun 2000
TL;DR: This work proposes an extension to XML query languages that enables keyword search at the granularity of XML elements, that helps novice users formulate queries, and also yields new optimization opportunities for the query processor.
Abstract: Due to the popularity of the XML data format, several query languages for XML have been proposed, specially devised to handle data of which the structure is unknown, loose, or absent. While these languages are rich enough to allow for querying the content and structure of an XML document, a varying or unknown structure can make formulating queries a very difficult task. We propose an extension to XML query languages that enables keyword search at the granularity of XML elements, that helps novice users formulate queries, and also yields new optimization opportunities for the query processor. We present an implementation of this extension on top of a commercial RDBMS; we then discuss implementation choices and performance results.

Proceedings ArticleDOI
01 Feb 2000
TL;DR: NATHIX is introduced, an efficient, native repository for storing, retrieving and managing tree-structured large objects, preferably XML documents, that takes the semantics of the underlying tree structure of XML documents into account.
Abstract: We introduce NATIX, an efficient, native repository for storing, retrieving and managing tree-structured large objects, preferably XML documents. In contrast to traditionallarge object (LOB) managers, we do not split at arbitrary byte positions but take the semantics of the underlying tree structure of XML documents into account. Our parameterizable split algorithm dynamically maintains physical records of size smaller than a page which contain sets of connected tree nodes. This not only improves efficiency by clustering subtrees but also facilitates their compact representation. Existing approaches to store XML documents either use flat files or map every single tree node onto a separate physical record. The increased flexibility of our approach results in higher efficiency. Performance measurements validate this claim.

Book ChapterDOI
27 Mar 2000
TL;DR: This work presents an access control model to protect information distributed on the Web that, by exploiting XML's own capabilities, allows the definition and enforcement of access restrictions directly on the structure and content of XML documents.
Abstract: Web-based applications greatly increase information availability and ease of access, which is optimal for public information. The distribution and sharing by theWeb of information that must be accessed in a selective way requires the definition and enforcement of security controls, ensuring that information will be accessible only to authorized entities. Approaches proposed to this end level, independently from the semantics of the data to be protected and for this reason result limited. The eXtensible Markup Language (XML), a markup language promoted by the World Wide Web Consortium (W3C), represents an important opportunity to solve this problem. We present an access control model to protect information distributed on the Web that, by exploiting XML's own capabilities, allows the definition and enforcement of access restrictions directly on the structure and content of XML documents. We also present a language for the specification of access restrictions that uses standard notations and concepts and briefly describe a system architecture for access control enforcement based on existing technology.

Journal ArticleDOI
16 May 2000
TL;DR: The results of the experiments with real-life and synthetic DTDs demonstrate the effectiveness of XTRACT's approach in inferring concise and semantically meaningful DTD schemas for XML databases.
Abstract: XML is rapidly emerging as the new standard for data representation and exchange on the Web. An XML document can be accompanied by a Document Type Descriptor (DTD) which plays the role of a schema for an XML data collection. DTDs contain valuable information on the structure of documents and thus have a crucial role in the efficient storage of XML data, as well as the effective formulation and optimization of XML queries. In this paper, we propose XTRACT, a novel system for inferring a DTD schema for a database of XML documents. Since the DTD syntax incorporates the full expressive power of regular expressions, naive approaches typically fail to produce concise and intuitive DTDs. Instead, the XTRACT inference algorithms employ a sequence of sophisticated steps that involve: (1) finding patterns in the input sequences and replacing them with regular expressions to generate “general” candidate DTDs, (2) factoring candidate DTDs using adaptations of algorithms from the logic optimization literature, and (3) applying the Minimum Description Length (MDL) principle to find the best DTD among the candidates. The results of our experiments with real-life and synthetic DTDs demonstrate the effectiveness of XTRACT's approach in inferring concise and semantically meaningful DTD schemas for XML databases.

Journal ArticleDOI
TL;DR: This paper proposes access control policies and an associated model for XML documents, addressing peculiar protection requirements posed by XML, and allows the Security Administrator to choose different policies for documents not covered or only partially covered by the existingAccess control policies for document types.
Abstract: The Web is becoming the main information dissemination means in private and public organizations. As a consequence, several applications at both internet and intranet level need mechanisms to support a selective access to data available over the Web. In this context, developing an access control model, and related mechanisms, in terms of XML (eXtensible Markup Language) is an important step, because XML is increasingly used as the language for representing information exchanged over the Web. In this paper, we propose access control policies and an associated model for XML documents, addressing peculiar protection requirements posed by XML. A first requirement is that varying protection granularity levels should be supported to guarantee a differentiated protection of document contents. A second requirement arises from the fact that XML documents do not always conform to a predefined document type. To cope with these requirements, the proposed model supports varying protection granularity levels, ranging from a set of documents, to a single document or specific document portion(s). Moreover, it allows the Security Administrator to choose different policies for documents not covered or only partially covered by the existing access control policies for document types. An access control mechanism for the enforcement of the proposed model is finally described.

Patent
22 Dec 2000
TL;DR: In this article, a system for end-to-end content publishing using XML with an object dependency graph is proposed, which is based on the separation of content and style and reusability of information content.
Abstract: The system for end-to-end content publishing using XML with an object dependency graph is based on the following two design principles: First, separation of content and style: Information stored in the content management system is independent on how it is going to be presented. The presentation style is encapsulated elsewhere and can be used to customize the look and feel based on the end-user preferences as well as the delivery methods and devices. Second, reusability of information content: By encapsulating common information in fragments and subfragments and making these fragments insertable in other fragments, thereby avoid scattering and duplication of information. This enables a user to restrict the edit operations to a limited number of relevant fragments, to affect global changes. In addition, the present invention provides data consistency and data integrity in the content management.

Patent
19 May 2000
TL;DR: A general purpose application interaction protocol, referred to herein as the Commerce Exchange Interaction Protocol (CXIP), governs the exchange of data between applications resident in computers in a distributed network such as the Internet, providing for application interoperability as mentioned in this paper.
Abstract: A general purpose application interaction protocol, referred to herein as the Commerce Exchange Interaction Protocol (CXIP), governs the exchange of data between applications resident in computers in a distributed network such as the Internet, providing for application interoperability CXIP enables process communications among network-distributed software applications, particularly, but not exclusively, among software applications having dissimilar platforms, language dependencies or vendor dependencies Data structures, object and method invocation requests are exchanged between applications by means of XML documents based on CXIP semantics The application interaction protocol, which is not specific to any particular functional domain and specifies interactions that are independent of transactional content, includes four component parts: message formats, message types, exchange semantics and transportation assumptions CXIP-based interactions use TCP/IP as their underlying transport mechanism, although CXIP may be implemented using SMTP or FTP, or on top of any other standard application-layer protocol, including HTTP

Journal ArticleDOI
01 Jun 2000
TL;DR: The Millau algorithms for compression of XML structures and data are described and some applications of XML-based remote procedure calls and client-server applications based on Millau that take advantage of the compression and streaming technology defined by the system are described.
Abstract: XML is poised to take the World Wide Web to the next level of innovation. XML data, large or small, with or without associated schema, will be exchanged between increasing number of applications running on diverse devices. Efficient storage and transportation of such data is an important issue. We have designed a system called Millau for efficient encoding and streaming of XML structures. In this paper we describe the Millau algorithms for compression of XML structures and data. Millau compression algorithms, in addition to separating structure and text for compression, take advantage of the associated schema (if available) in compressing the structure. Millau also defines a programming model corresponding to XML DOM and SAX for XML APIs for Millau streams of XML documents. Our experiments have shown significant performance gains of our algorithms and APIs. We describe some of these results in this paper. We also describe some applications of XML-based remote procedure calls and client-server applications based on Millau that take advantage of the compression and streaming technology defined by the system.

Patent
21 Jan 2000
TL;DR: In this article, a portal server retrieves user requests through a computer network and looks up information stored in a metadata databases, which includes information including a classmark definition for each document.
Abstract: The system and method for searching and retrieving information stored in heterogeneous information repositories. A portal server retrieves user requests through a computer network and looks up information stored in a metadata databases. For example, the metadata may be encoded in an XML/RDF format and stored in a directory server to facilitate effective searching and retrieval of information from an information repository. Metadata includes information including a classmark definition for each document. The classmark is determined through an automated cataloguing process.

Journal ArticleDOI
01 Mar 2000
TL;DR: In this paper, a comparison of five representative query languages for XML, highlighting their common features and differences, is presented, with a focus on XML query languages. But no standard for XML query language has yet been decided, but discussion is ongoing within the World Wide Web Consortium and within many academic institutions and Internet-related major companies.
Abstract: XML is becoming the most relevant new standard for data representation and exchange on the WWW. Novel languages for extracting and restructuring the XML content have been proposed, some in the tradition of database query languages (i.e. SQL, OQL), others more closely inspired by XML. No standard for XML query language has yet been decided, but the discussion is ongoing within the World Wide Web Consortium and within many academic institutions and Internet-related major companies. We present a comparison of five, representative query languages for XML, highlighting their common features and differences.

Patent
27 Jun 2000
TL;DR: In this paper, a handheld device scans a line of information in the form of Internet URL's, Internet protocol addresses, Internet e-mail addresses, FTP sites, USENET news group addresses, DNS addresses and bar codes as they occur in print advertising and printed media.
Abstract: A handheld device scans a line of information in the form of Internet URL's, Internet protocol addresses, Internet e-mail addresses, FTP sites, USENET news group addresses, DNS addresses and bar codes as they occur in print advertising and printed media. The scanned information is decoded and displayed for user verification. The device utilizes onboard information retrieval software that establishes a connection to the Internet to retrieve HTML, XML, WML, and VRML documents, e-mail messages, USENET news, Java applets, ActiveX documents, Active Server Pages, or file transfers from the Internet locations interpreted by the handheld scanner.

Book
01 Dec 2000
TL;DR: This book is a printed version of the XML specifications and contains the complete text of the original document developed by the World Wide Web Consortium (W3C), an independent and international standards body devoted to furthering Web technologies.
Abstract: From the Publisher: Complete and up-to-date specifications of Extensible Markup Language (XML). A MUST HAVE resource for HTML Web page authors, Internet program developers, information technology managers, and anyone else who publishes documents on the World Wide Web. Describes how XML can be used to define the appearance of printed and displayed documents. Perfect companion for program developers, Web page authors, technology managers, and others who must keep up to date with the latest Web and shared document technology. Representing the future of the World Wide Web, the Extensible Markup Language (XML) provides for an almost limitless variety of text and formatting for Web pages and other electronic documentation. The XML 1.0 specifications are the industry standard recommendation for this important new technology. This book is a printed version of the XML specifications. It contains the complete text of the original document developed by the World Wide Web Consortium (W3C), an independent and international standards body devoted to furthering Web technologies. It's designed to be a handy desktop companion, saving you the time and expense of printing the documentation yourself. It is also useful when viewing the online version of the XML specification is inconvenient.

Patent
Neelakantan Sundaresan1
27 Jul 2000
TL;DR: In this paper, a profile matching system and associated method match the path expressions in a structured or semi-structured document, such as an XML document, to an indexed resource.
Abstract: A profile matching system and associated method match the path expressions in a structured or semi-structured document, such as an XML document, to an indexed resource. The system, having assigned weighting values to the elements in the index, maps the document path expressions and attempts to match them to the index elements according to a predetermined schema. The system is comprised of an indexing module that maps the document and identifies its content attributes, and a matching module that matches the document content attributes to weighted elements of an index. The system defines the attribute or set of attributes in the schema that result in a match. The matching criteria are identified in a map specification file that specifies the specific qualification criteria. The indexing module uses the map specification information to produce efficient indices from XML document. The matching module uses a match specification language (MSL) and a match operator. Each rule in the MSL is a pair of path expressions: one for the source document and one for target document or index.

Patent
06 Dec 2000
TL;DR: In this paper, a system and method for assigning attributes to XML document nodes to facilitate their storage in relational databases and the subsequent retrieval and re-construction of pertinent nodes and fragments in original document order is provided.
Abstract: A system and method for assigning attributes to XML document nodes to facilitate their storage in relational databases and the subsequent retrieval and re-construction of pertinent nodes and fragments in original document order is provided. Since these queries are performed using relational database query engines, the speed of their execution is significantly faster than that using more exotic systems such as object-oriented databases. Furthermore, this method is portable across all vendor platforms, and so can be deployed at client sites without additional investments in database software.