scispace - formally typeset
Patent

Method and apparatus for generating structured document

TLDR
In this paper, a structured document generating method and apparatus capable of easily generating structured documents matching the document structure of each non-structured document, by using a rule directly generated from a preset document structure definition for the conversion of the nonstructured documents into the structured documents.
Abstract
A structured document generating method and apparatus capable of easily generating a structured document matching the document structure of each non-structured document, by using a rule directly generated from a preset document structure definition for the conversion of the non-structured document into the structured document. A keyword extracting module extracts a keyword representative of the document structure from a non-structured document by using a keyword extracting rule, and a keyword/text model is generated which is described by two elements including keywords and other strings. A parsing module generated by a process of automatically parsing the document structure by referring to a parsing rule generated by modifying and converting DTD, performs a parsing process relative to the keyword/text model to generate an interim SGML document. An SGML document correcting module modifies the interim SGML document and generates a final output of an SGML document by referring to DTD different information generated when the parsing rule was generated.

read more

Citations
More filters
Patent

Systems, methods and computer program products for tailoring web page content in hypertext markup language format for display within pervasive computing devices using extensible markup language tools

TL;DR: In this article, the XML-based tools are used to tailor HTML-based Web page content for display within various client devices, such as mobile phones, tablets, and computers, by converting portions of a requested Web page to an XML format and then modifying them using an XML content-tailoring tool.
Patent

Automated creation of an XML dialect and dynamic generation of a corresponding DTD

TL;DR: In this paper, a method, system, and computer-readable code for translating an input document into an Extensible Markup Language (XML) dialect which is well-formed, such that automated, dynamically-selected transformations (such as those that will indicate a user's current context) can be applied to the document.
Patent

System and method of performing profile matching with a structured document

TL;DR: In this paper, a profile matching system and associated method match the path expressions in a structured or semi-structured document, such as an XML document, to an indexed resource.
Patent

Content management and transformation system for digital content

TL;DR: In this article, the authors propose a transformation engine that enables content and information to be transformed from one format, a source format, to a format that is compatible with the requesting device, a destination format.
Patent

Computer generation of documents using layout elements and content elements

TL;DR: In this article, the bindings are used to describe a document and a different document by associating content elements with layout elements, the layout elements defining layout features or placement information to be applied to the associated content elements in the document, the bindings being stored separately from both the content and layout elements.
References
More filters
Journal ArticleDOI

EXPRESS: a data EXtraction, Processing, and Restructuring System

TL;DR: This paper describes the design and implementation of EXPRESS, an experimental prototype data translation system which can access a wide variety of data and restructure it for new uses driven by two very high level nonprocedural languages.
Patent

System for automatically embedding or incorporating contents added to a document

TL;DR: A document-centered user interface architecture for a computer system employs parts as the fundamental building blocks of all documents as mentioned in this paper, and all data is stored in the system as a part, which is comprised of contents and associated editor.
Patent

Method and apparatus for document production using a common document database

TL;DR: In this article, a document is partitioned into a number of encapsulated data elements, and one or more classes of variations are defined and variation names are associated with each class.
Patent

Translating system for processing text with markup signs

TL;DR: In this paper, a method and translating machine for translating a source language with markup signs into a target language maintaining the markup signs is presented. But the system includes a separation module for separating an original text into markup signs and a text body exclusive of markup signs, a memory for storing each markup sign in association with a corresponding word or phrase, a module for producing a parsing tree in a target domain corresponding to the original text body, and a translated sentence producing module for translating target language text by attaching markup signs to translated words corresponding to original text based on the parsing tree.
Patent

Method and apparatus for structured document difference string extraction

TL;DR: In this article, a document difference extraction method and apparatus is used for extracting the difference between structured documents properly meeting the sense of a document editor taking the logical meaning and structure of the structured documents into consideration.