Showing papers on "Formal language published in 2010"

PDF

Open Access

Proceedings Article•DOI•

Rex: Symbolic Regular Expression Explorer

[...]

Margus Veanes¹, Peli de Halleux¹, Nikolai Tillmann¹•Institutions (1)

06 Apr 2010

TL;DR: A method and a tool, called Rex, for symbolically expressing and analyzing regular expression constraints, which is implemented using the SMT solver Z3 and provides experimental evaluation of Rex.

...read moreread less

Abstract: Constraints in form regular expressions over strings are ubiquitous. They occur often in programming languages like Perl and C#, in SQL in form of LIKE expressions, and in web applications. Providing support for regular expression constraints in program analysis and testing has several useful applications. We introduce a method and a tool called Rex, for symbolically expressing and analyzing regular expression constraints. Rex is implemented using the SMT solver Z3, and we provide experimental evaluation of Rex.

...read moreread less

149 citations

Proceedings Article•DOI•

Techne: Towards a New Generation of Requirements Modeling Languages with Goals, Preferences, and Inconsistency Handling

[...]

Ivan Jureta¹, Alexander Borgida², Neil A. Ernst³, John Mylopoulos³•Institutions (3)

Université de Namur¹, Rutgers University², University of Toronto³

27 Sep 2010

TL;DR: The need for Techne is motivated, the need is introduced through examples, and its formalization is sketched.

...read moreread less

Abstract: Techne is an abstract requirements modeling language that lays formal foundations for new modeling languages applicable during early phases of the requirements engineering process. During these phases, the requirements problem for the system-to-be is being structured, its candidate solutions described and compared in terms of how desirable they are to stakeholders. We motivate the need for Techne, introduce it through examples, and sketch its formalization.

...read moreread less

140 citations

Proceedings Article•

Controlled Natural Languages for Knowledge Representation

[...]

Rolf Schwitter¹•Institutions (1)

Macquarie University¹

23 Aug 2010

TL;DR: The most mature of these novel languages are presented, show how they can balance the disadvantages of natural languages and formal languages for knowledge representation, and discuss how domain specialists can be supported writing specifications in controlled natural language.

...read moreread less

Abstract: This paper presents a survey of research in controlled natural languages that can be used as high-level knowledge representation languages. Over the past 10 years or so, a number of machine-oriented controlled natural languages have emerged that can be used as high-level interface languages to various kinds of knowledge systems. These languages are relevant to the area of computational linguistics since they have two very interesting properties: firstly, they look informal like natural languages and are therefore easier to write and understand by humans than formal languages; secondly, they are precisely defined subsets of natural languages and can be translated automatically (and often deterministically) into a formal target language and then be used for automated reasoning. We present and compare the most mature of these novel languages, show how they can balance the disadvantages of natural languages and formal languages for knowledge representation, and discuss how domain specialists can be supported writing specifications in controlled natural language.

...read moreread less

129 citations

Proceedings Article•

A two-dimensional topic-aspect model for discovering multi-faceted topics

[...]

Michael J. Paul¹, Roxana Girju¹•Institutions (1)

University of Illinois at Urbana–Champaign¹

11 Jul 2010

TL;DR: The Topic-Aspect Model is presented, a Bayesian mixture model which jointly discovers topics and aspects and can generate token assignments in both of these dimensions, rather than assuming words come from only one of two orthogonal models.

...read moreread less

Abstract: This paper presents the Topic-Aspect Model (TAM), a Bayesian mixture model which jointly discovers topics and aspects. We broadly define an aspect of a document as a characteristic that spans the document, such as an underlying theme or perspective. Unlike previous models which cluster words by topic or aspect, our model can generate token assignments in both of these dimensions, rather than assuming words come from only one of two orthogonal models. We present two applications of the model. First, we model a corpus of computational linguistics abstracts, and find that the scientific topics identified in the data tend to include both a computational aspect and a linguistic aspect. For example, the computational aspect of GRAMMAR emphasizes parsing, whereas the linguistic aspect focuses on formal languages. Secondly, we show that the model can capture different viewpoints on a variety of topics in a corpus of editorials about the Israeli-Palestinian conflict. We show both qualitative and quantitative improvements in TAM over two other state-of-the-art topic models.

...read moreread less

128 citations

Book Chapter•DOI•

libalf: the automata learning framework

[...]

Benedikt Bollig¹, Joost-Pieter Katoen², Carsten Kern², Martin Leucker³, Daniel Neider², David R. Piegdon² - Show less +2 more•Institutions (3)

École normale supérieure de Cachan¹, RWTH Aachen University², Technische Universität München³

15 Jul 2010

TL;DR: This paper presents libalf, a comprehensive, open-source library for learning formal languages libalf covers various well-known learning techniques for finite automata as well as novel learning algorithms.

...read moreread less

Abstract: This paper presents libalf, a comprehensive, open-source library for learning formal languages libalf covers various well-known learning techniques for finite automata (eg Angluin's L*, Biermann, RPNI etc) as well as novel learning algorithms (such as for NFA and visibly one-counter automata) libalf is flexible and allows facilely interchanging learning algorithms and combining domain-specific features in a plug-and-play fashion Its modular design and C++ implementation make it a suitable platform for adding and engineering further learning algorithms for new target models (eg, Buchi automata)

...read moreread less

106 citations

Book•

A Course in Mathematical Logic for Mathematicians

[...]

Yuri I. Manin¹•Institutions (1)

Max Planck Society¹

29 Apr 2010

TL;DR: In this article, the Continuum Problem and Forcing were studied in the context of formal languages and computable sets, and they were shown to be computable and provenable.

...read moreread less

Abstract: PROVABILITY.- to Formal Languages.- Truth and Deducibility.- The Continuum Problem and Forcing.- The Continuum Problem and Constructible Sets.- COMPUTABILITY.- Recursive Functions and Church#x2019 s Thesis.- Diophantine Sets and Algorithmic Undecidability.- PROVABILITY AND COMPUTABILITY.- G#x00F6 del#x2019 s Incompleteness Theorem.- Recursive Groups.- Constructive Universe and Computation.- MODEL THEORY.- Model Theory.

...read moreread less

67 citations

Journal Article•DOI•

A large neighbourhood search approach to the multi-activity shift scheduling problem

[...]

Claude-Guy Quimper, Louis-Martin Rousseau¹•Institutions (1)

École Polytechnique de Montréal¹

01 Jun 2010-Journal of Heuristics

TL;DR: This paper shows how formal languages can be enhanced and used to model the complex regulations of the shift construction problem and can derive specialized graph structures that can be searched efficiently using a Large Neighbourhood Search.

...read moreread less

Abstract: The challenge in shift scheduling lies in the construction of a set of work shifts, which are subject to specific regulations, in order to cover fluctuating staff demands. This problem becomes harder when multi-skill employees can perform many different activities during the same shift. In this paper, we show how formal languages (such as regular and context-free languages) can be enhanced and used to model the complex regulations of the shift construction problem. From these languages we can derive specialized graph structures that can be searched efficiently. The overall shift scheduling problem can then be solved using a Large Neighbourhood Search. These approaches are able to return near optimal solution on traditional single activity problems and they scale well on large instances containing up to 10 activities.

...read moreread less

56 citations

Theorem-Proving by Resolution as a Basis for Question-Answering Systems

[...]

Cordell Green

01 Jan 2010

TL;DR: This paper shows how a question-answering system can be constructed using first-order logic as its language and a resolution-type theorem-prover as its deductive mechanism, and presents one particular approach in detail.

...read moreread less

Abstract: This paper shows how a question-answering system can be constructed using first-order logic as its language and a resolution-type theorem-prover as its deductive mechanism A working computer program, qa3, based on these ideas is described The performance of the program compares favorably with several other general question-answering systems 1 QUESTION ANSWERING A question-answering system accepts information about some subject areas and answers questions by utilizing this information The type of question-answering system considered in this paper is ideally one having the following features: 1 A language general enough to describe any reasonable question-answering subjects and express desired questions and answers 2 The ability to search efficiently the stored information and recognize items that are relevant to a particular query 3 The ability to derive an answer that is not stored explicitly, but that is derivable by the use of moderate effort from the stored facts 4 Interactions between subject areas; for example, if the system has facts about Subject a and Subject b, then it should be able to answer a question that requires the use of both sets of facts 5 Capability of allowing the user to add new facts or replace old facts conveniently This paper argues the case for formal methods to achieve such a system and presents one particular approach in detail A natural language facility is not one of the properties sought after or discussed (although Coles, 1968, has added to the program described here a translator from a subset of English to first-order logic) The name ‘question-answering system’ requires clarification The system described above might be named an ‘advice taker’ or a ‘multi-purpose problem-solving system’ or ‘general problem-solving system’ McCarthy (1958) proposed using formal languages and deduction to construct such a system, and suggested allowing the user to give hints or advice on how to answer a question; he referred to the proposed system as an ‘advice taker’ Research on ‘multi-purpose’ or ‘general problem-solving’ tends to differ from question-answering as described above by placing more emphasis on solving deeper, more difficult problems and less emphasis on user interaction, formality, and efficient retrieval of relevant facts from a large data base The situation is further confused by the use of ‘question-answering’ to refer sometimes to natural language systems, sometimes to information retrieval systems having little deductive ability, and sometimes to systems with deductive ability limited to the propositional calculus

...read moreread less

55 citations

Journal Article•DOI•

Towards Domain-specific Model Editors with Automatic Model Completion

[...]

Sagar Sen¹, Benoit Baudry¹, Hans Vangheluwe²•Institutions (2)

French Institute for Research in Computer Science and Automation¹, McGill University²

01 Feb 2010

TL;DR: This paper presents a methodology to synthesize model editors equipped with automatic completion from a modeling language’s declarative specification consisting of a meta-model with a visual syntax, powered by a first-order relational logic engine implemented in ALLOY.

...read moreread less

Abstract: Integrated development environments such as Eclipse allow users to write programs quickly by presenting a set of recommendations for code completion. Similarly, word processing tools such as Microsoft Word present corrections for grammatical errors in sentences. Both of these existing structure editors use a set of constraints expressed in the form of a natural language grammar to restrict/correct the user ( syntax-directed editing) or formal grammar (language-directed editing ) to aid document completion. Taking this idea further, in this paper we present an integrated software system capable of generating recommendations for model completion of partial models built in editors for domain-specific modeling languages. We present a methodology to synthesize model editors equipped with automatic completion from a modeling languageâs declarative specification consisting of a meta-model with a visual syntax. This meta-model directed completion feature is powered by a first-order relational logic engine implemented in ALLOY. We incorporate automatic completion in the generative tool AToM3. We use the finite state machines modeling language as a concise running example. Our approach leverages a correct by construction philosophy that renders subsequent simulation of models considerably less error-prone.

...read moreread less

54 citations

Proceedings Article•DOI•

Regular Cost Functions over Finite Trees

[...]

Thomas Colcombet¹, Christof Löding²•Institutions (2)

Paris Diderot University¹, RWTH Aachen University²

11 Jul 2010

TL;DR: The theory of regular cost functions over finite trees is developed, aquantitative extension to the notion of regular languages of trees, and nondeterministic and alternating finite tree cost automata for describing cost functions are introduced.

...read moreread less

Abstract: We develop the theory of regular cost functions over finite trees: aquantitative extension to the notion of regular languages of trees: Cost functions map each input (tree) to a value in~$\omega+1$, and are considered modulo an equivalence relation which forgets about specific values, but preserves boundedness of functions on all subsets of the domain. We introduce nondeterministic and alternating finite tree cost automata for describing cost functions. We show that all these forms of automata are effectively equivalent. We also provide decision procedures for them. Finally, following B\"uchi's seminal idea, we use cost automata for providing decision procedures for cost monadic logic, a quantitative extension of monadic second order logic.

...read moreread less

50 citations

Book Chapter•DOI•

A topological approach to recognition

[...]

Mai Gehrke¹, Serge Grigorieff², Jean-Eric Pin²•Institutions (2)

Radboud University Nijmegen¹, Paris Diderot University²

06 Jul 2010

TL;DR: The existence of a minimum recognizer is proved in a very general setting which applies in particular to any BA of subsets of a discrete space and an equational characterization of BA of languages closed under quotients is given, which extends the known results on regular languages to nonregular languages.

...read moreread less

Abstract: We propose a new approach to the notion of recognition, which departs from the classical definitions by three specific features. First, it does not rely on automata. Secondly, it applies to any Boolean algebra (BA) of subsets rather than to individual subsets. Thirdly, topology is the key ingredient.We prove the existence of a minimum recognizer in a very general setting which applies in particular to any BA of subsets of a discrete space. Our main results show that this minimum recognizer is a uniform space whose completion is the dual of the original BA in Stone-Priestley duality; in the case of a BA of languages closed under quotients, this completion, called the syntactic space of the BA, is a compact monoid if and only if all the languages of the BA are regular. For regular languages, one recovers the notions of a syntactic monoid and of a free profinite monoid. For nonregular languages, the syntactic space is no longer a monoid but is still a compact space. Further, we give an equational characterization of BA of languages closed under quotients, which extends the known results on regular languages to nonregular languages. Finally, we generalize all these results from BAs to lattices, in which case the appropriate structures are partially ordered.

...read moreread less

Book Chapter•DOI•

Optimized temporal monitors for SystemC

[...]

Deian Tabakov¹, Moshe Y. Vardi¹•Institutions (1)

Rice University¹

01 Nov 2010

TL;DR: This paper focuses on automated generation of runtime monitors from temporal properties, with a focus on minimizing runtime overhead, rather than monitor size or monitor-generation time.

...read moreread less

Abstract: SystemC is a modeling language built as an extension of C++. Its growing popularity and the increasing complexity of designs have motivated research efforts aimed at the verification of SystemC models using assertion-based verification (ABV), where the designer asserts properties that capture the design intent in a formal language such as PSL or SVA. The model then can be verified against the properties using runtime or formal verification techniques. In this paper we focus on automated generation of runtime monitors from temporal properties. Our focus is on minimizing runtime overhead, rather than monitor size or monitor-generation time. We identify four issues in monitor generation: state minimization, alphabet representation, alphabet minimization, and monitor encoding. We conduct extensive experimentation on a synthetic workload and identify a configuration that offers the best performance in terms of runtime overhead.

...read moreread less

Book Chapter•DOI•

Extended computation tree logic

[...]

Roland Axelsson¹, Matthew Hague², Stephan Kreutzer², Martin Lange³, Markus Latte¹ - Show less +1 more•Institutions (3)

Ludwig Maximilian University of Munich¹, University of Oxford², University of Kassel³

10 Oct 2010

TL;DR: A generic extension of the popular branching-time logic CTL is introduced which refines the temporal until and release operators with formal languages and shows that even with context-free languages on the until operator the logic still allows for polynomial time model-checking despite the significant increase in expressive power.

...read moreread less

Abstract: We introduce a generic extension of the popular branching-time logic CTL which refines the temporal until and release operators with formal languages. For instance, a language may determine the moments along a path that an until property may be fulfilled. We consider several classes of languages leading to logics with different expressive power and complexity, whose importance is motivated by their use in model checking, synthesis, abstract interpretation, etc. We show that even with context-free languages on the until operator the logic still allows for polynomial time model-checking despite the significant increase in expressive power. This makes the logic a promising candidate for applications in verification. In addition, we analyse the complexity of satisfiability and compare the expressive power of these logics to CTL* and extensions of PDL.

...read moreread less

Book•DOI•

Where Mathematics, Computer Science, Linguistics and Biology Meet

[...]

Carlos Martn-Vide, Victor Mitrana

03 Dec 2010

TL;DR: The book is a collection of papers going deep into classical topics in computer science inspired formal languages, as well as other ones showing new concepts and problems motivated in linguistics and biology.

...read moreread less

Abstract: There are not many interdisciplinary scientific fields as formal language theory In this volume, it is presented as the very intersection point between Mathematics, Computer Science, Linguistics and Biology The book is a collection of papers going deep into classical topics in computer science inspired formal languages, as well as other ones showing new concepts and problems motivated in linguistics and biology The papers are organized in four sections: Grammars and Grammar Systems, Automata, Languages and Combinatorics, and Models of Molecular Computing They clearly prove the power, wealth and vitality of the theory nowadays and sketch some trends for its future development The volume is intended for an audience of computer scientists, computational linguists, theoretical biologists and any other people interested in dealing with the problems and challenges of interdisciplinarity

...read moreread less

Journal Article•DOI•

Decision problems for language equations

[...]

Alexander Okhotin¹•Institutions (1)

University of Turku¹

01 May 2010-Journal of Computer and System Sciences

TL;DR: The families of languages defined by components of unique, least and greatest solutions of such systems are shown to coincide with the classes of recursive, recursively enumerable and co-recursive enumerable sets, respectively.

...read moreread less

Journal Article•DOI•

Symmetry and partial order reduction techniques in model checking Rebeca

[...]

Mohammad Mahdi Jaghoori, Marjan Sirjani¹, Mohammad Reza Mousavi², Ehsan Khamespanah¹, Ali Movaghar³ - Show less +1 more•Institutions (3)

University of Tehran¹, Eindhoven University of Technology², Sharif University of Technology³

12 Jan 2010-Acta Informatica

TL;DR: In this paper, the authors present two approaches for detecting symmetry in Rebeca models: one that detects symmetry in the topology of interconnections among objects and another one which exploits specific data structures to reflect internal symmetry.

...read moreread less

Abstract: Rebeca is an actor-based language with formal semantics which is suitable for modeling concurrent and distributed systems and protocols. Due to its object model, partial order and symmetry detection and reduction techniques can be efficiently applied to dynamic Rebeca models. We present two approaches for detecting symmetry in Rebeca models: One that detects symmetry in the topology of inter-connections among objects and another one which exploits specific data structures to reflect internal symmetry in the internal structure of an object. The former approach is novel in that it does not require any input from the modeler and can deal with the dynamic changes of topology. This approach is potentially applicable to a wide range of modeling languages for distributed and reactive systems. We have also developed a model checking tool that implements all of the above-mentioned techniques. The evaluation results show significant improvements in model size and model-checking time.

...read moreread less

Journal Article•DOI•

Rule-based composite event queries: the language XChange EQ and its semantics

[...]

Michael Eckert¹, François Bry¹•Institutions (1)

Ludwig Maximilian University of Munich¹

01 Dec 2010-Knowledge and Information Systems

TL;DR: The rule-based composite event query language XChangeEQ is described, designed to completely cover and integrate the four complementary querying dimensions: event data, event composition, temporal relationships, and event accumulation.

...read moreread less

Abstract: Web systems, Web services, and Web-based publish/subscribe systems communicate events as XML messages and in many cases, require composite event detection: it is not sufficient to react to single event messages, but events have to be considered in relation to other events that are received over time. This entails a need for expressive, high-level languages for querying composite events. Emphasizing language design and formal semantics, we describe the rule-based composite event query language XChangeEQ. XChangeEQ is designed to completely cover and integrate the four complementary querying dimensions: event data, event composition, temporal relationships, and event accumulation. Semantics are provided as a model theory with accompanying fixpoint theory, an approach that is established for rule languages but has not been applied to event queries so far. Because they are highly declarative, thus easy to understand and well suited for query optimization, such semantics are desirable for event queries.

...read moreread less

Journal Article•DOI•

Formal power series and regular operations on fuzzy languages

[...]

Jelena Ignjatović¹, Miroslav Ćirić¹•Institutions (1)

University of Niš¹

01 Apr 2010-Information Sciences

TL;DR: This paper studies formal power series over a quantale with coefficients in the algebra of all languages over a given alphabet, and representation of fuzzy languages by these formal powerseries, and shows that regular operations on fuzzy languages can be represented byRegular operations on power series are shown by means of operations on ordinary languages.

...read moreread less

Posted Content•

Noncommutative rational functions, their difference-differential calculus and realizations

[...]

Dmitry S. Kaliuzhnyi-Verbovetskyi¹, Victor Vinnikov²•Institutions (2)

Drexel University¹, Ben-Gurion University of the Negev²

02 Mar 2010-arXiv: Rings and Algebras

TL;DR: A survey of non-commutative rational functions, their realization theory and their applications can be found in this paper, where a difference-differential calculus is developed for further analysis.

...read moreread less

Abstract: Noncommutative rational functions appeared in many contexts in system theory and control, from the theory of finite automata and formal languages to robust control and LMIs. We survey the construction of noncommutative rational functions, their realization theory and some of their applications. We also develop a difference-differential calculus as a tool for further analysis.

...read moreread less

Proceedings Article•DOI•

Full satisfiability of UML class diagrams

[...]

Alessandro Artale¹, Diego Calvanese¹, Angélica Ibáñez-García¹•Institutions (1)

Free University of Bozen-Bolzano¹

01 Nov 2010

TL;DR: This investigation shows that the full satisfiability problem is ExpTime-complete in the full scenario, NP- complete if the authors drop isa between relationships, and NLogSpace-complete if they further drop covering over classes.

...read moreread less

Abstract: UML class diagrams (UCDs) are the de-facto standard formalism for the analysis and design of information systems. By adopting formal language techniques to capture constraints expressed by UCDs one can exploit automated reasoning tools to detect relevant properties, such as schema and class satisfiability and subsumption between classes. Among the reasoning tasks of interest, the basic one is detecting full satisfiability of a diagram, i.e., whether there exists an instantiation of the diagram where all classes and associations of the diagram are non-empty and all the constraints of the diagram are respected. In this paper we establish tight complexity results for full satisfiability for various fragments of UML class diagrams. This investigation shows that the full satisfiability problem is ExpTime-complete in the full scenario, NP-complete if we drop isa between relationships, and NLogSpace-complete if we further drop covering over classes.

...read moreread less

Proceedings Article•DOI•

Element Based Semantics in Multi Formalism Performance Models

[...]

Mauro Iacono, Marco Gribaudo

17 Aug 2010

TL;DR: This paper presents a novel approach to multiformalism compositional modeling, that is based on the possibility of freely specifying the dynamics of the elements of a formal modeling language in an open framework, by the application of consolidated metamodeling foundations to the description of models.

...read moreread less

Abstract: The design and the requirements of modern computer-based systems have reached a complexity level that calls for the use of models for the verification of non functional requirements since the beginning of their design cycle. Such systems are however too complex to be modeled directly in a simple unstructured formal language like Queueing Networks or Petri Nets. SIMTHESys (Structured Infrastructure for Multiformalism modeling and Testing of Heterogeneous formalisms and Extensions for SYStems) is a novel approach to multiformalism compositional modeling, that is based on the possibility of freely specifying the dynamics of the elements of a formal modeling language in an open framework. This is obtained by the application of consolidated metamodeling foundations to the description of models, together with the concept of behavior as a bridge between formalism dynamics and solution techniques. In this paper the main concepts of the SIMTHESys approach are presented, together with a running example of how SIMTHESys copes with performance evaluation of multiformalism models.

...read moreread less

Journal Article•DOI•

Logic and Rational Languages of Words Indexed by Linear Orderings

[...]

Nicolas Bedon¹, Alexis Bès², Olivier Carton², Chloé Rispal¹•Institutions (2)

Institut Gaspard Monge¹, University of Paris²

01 May 2010-Theory of Computing Systems \/ Mathematical Systems Theory

TL;DR: It is proved that every rational language of words indexed by linear orderings is definable in monadic second-order logic, and it is shown that the converse is true for the class of languages indexed by countable scattered linear ordering, but false in the general case.

...read moreread less

Abstract: We prove that every rational language of words indexed by linear orderings is definable in monadic second-order logic. We also show that the converse is true for the class of languages indexed by countable scattered linear orderings, but false in the general case. As a corollary we prove that the inclusion problem for rational languages of words indexed by countable linear orderings is decidable.

...read moreread less

Journal Article•DOI•

5′ → 3′ Watson-Crick Automata With Several Runs

[...]

Peter Leupold, Benedek Nagy¹•Institutions (1)

Balassi Institute¹

01 Jan 2010

TL;DR: It is proved that the expressive power of 5′ → 3′ WK-automata increases with every additional run that they can make, both for deterministic and non-deterministic machines.

...read moreread less

Abstract: 5′ → 3′ WK-automata are Watson-Crick automata whose two heads start on opposite ends of the input word and always run in opposite directions. One full reading in both directions is called a run. We prove that the expressive power of these automata increases with every additional run that they can make, both for deterministic and non-deterministic machines. This defines two incomparable infinite hierarchies of language classes between the regular and the context-sensitive languages. These hierarchies are complemented with classes defined by several restricted variants of 5′ → 3′ WK-automata like stateless automata. Finally we show that several standard problems are undecidable for languages accepted by 5′ → 3′ WK-automata in only one run, for example the emptiness and the finiteness problems.

...read moreread less

Journal Article•DOI•

Zsyntax: a formal language for molecular biology with projected applications in text mining and biological prediction.

[...]

Giovanni Boniolo¹, Marcello D'Agostino², Pier Paolo Di Fiore¹•Institutions (2)

University of Milan¹, University of Ferrara²

03 Mar 2010-PLOS ONE

TL;DR: A formal language that allows for transposing biological information precisely and rigorously into machine-readable information is proposed, which is grounded on a particular type of non-classical logic and can be used to write algorithms and computer programs.

...read moreread less

Abstract: We propose a formal language that allows for transposing biological information precisely and rigorously into machine-readable information. This language, which we call Zsyntax (where Z stands for the Greek word ζωή, life), is grounded on a particular type of non-classical logic, and it can be used to write algorithms and computer programs. We present it as a first step towards a comprehensive formal language for molecular biology in which any biological process can be written and analyzed as a sort of logical “deduction”. Moreover, we illustrate the potential value of this language, both in the field of text mining and in that of biological prediction.

...read moreread less

Book•

Scattered Context Grammars and Their Applications

[...]

Alexander Meduna, Jiří Techet

01 Jan 2010

TL;DR: This computer science book represents scattered information by formal languages and gives an in-depth discussion of scattered context grammars as formal means that process these languages with a focus on applications in linguistics.

...read moreread less

Abstract: This computer science book represents scattered information by formal languages and gives an in-depth discussion of scattered context grammars as formal means that process these languages. It is primarily meant as a monograph on these grammars, which represent an important trend of today's formal language theory. The text maintains a balance between fundamental concepts, theoretical results, and applications of these grammars. From a theoretical viewpoint, it introduces several variants of scattered context grammatical models. Based on these models, it demonstrates the concepts, methods, and techniques employed in handling scattered pieces of information with enough rigors to make them quite clear. It also explains a close relation between the subject of the book and several important mathematical fields, such as algebra and graph theory. From a more practical point of view, this book describes scattered information processing by fundamental information technologies. Throughout this book, several in-depth case studies and examples are carefully presented. Whilst discussing various methods concerning grammatical processing of scattered information, the text illustrates their applications with a focus on applications in linguistics.

...read moreread less

Journal Article•DOI•

The expressive power of the shuffle product

[...]

Jean Berstel¹, Luc Boasson², Olivier Carton², Jean-Eric Pin², Antonio Restivo - Show less +1 more•Institutions (2)

University of Paris¹, Paris Diderot University²

01 Nov 2010-Information & Computation

TL;DR: The smallest class of languages containing the singletons and closed under Boolean operations, product and shuffle is studied, including the smallest class containing the languages composed of a single word of length 2 which is closed under boolean operations and shuffle by a letter.

...read moreread less

Abstract: There is an increasing interest in the shuffle product on formal languages, mainly because it is a standard tool for modeling process algebras. It still remains a mysterious operation on regular languages. Antonio Restivo proposed as a challenge to characterize the smallest class of languages containing the singletons and closed under Boolean operations, product and shuffle. This problem is still widely open, but we present some partial results on it. We also study some other smaller classes, including the smallest class containing the languages composed of a single word of length 2 which is closed under Boolean operations and shuffle by a letter (resp. shuffle by a letter and by the star of a letter). The proof techniques have both an algebraic and a combinatorial flavor.

...read moreread less

Journal Article•DOI•

Learning difficulties experienced by students in a course on formal languages and automata theory

[...]

Nelishia Pillay¹•Institutions (1)

University of KwaZulu-Natal¹

18 Jan 2010

TL;DR: The findings of a study conducted to identify learning difficulties for some of the FLAT topics are reported on.

...read moreread less

Abstract: Students taking courses on formal languages and automata theory (FLAT) usually do not find these courses interesting and experience difficulty in grasping the different concepts. While there has been a vast amount of research into methodologies to assist students to conceptualize FLAT topics, there has been no research into the actual learning difficulties experienced by students with the different topics. This paper reports on the findings of a study conducted to identify these learning difficulties for some of the FLAT topics.

...read moreread less

Proceedings Article•

Efficient, Correct, Unsupervised Learning for Context-Sensitive Languages

[...]

Alexander Clark¹•Institutions (1)

Royal Holloway, University of London¹

15 Jul 2010

TL;DR: This paper presents a lattice-theoretic representation for natural language syntax, called Distributional Lattice Grammars, based on a generalisation of distributional learning, and are capable of representing all regular languages, some but not all context-free languages and some non-context- free languages.

...read moreread less

Abstract: A central problem for NLP is grammar induction: the development of unsupervised learning algorithms for syntax. In this paper we present a lattice-theoretic representation for natural language syntax, called Distributional Lattice Grammars. These representations are objective or empiricist, based on a generalisation of distributional learning, and are capable of representing all regular languages, some but not all context-free languages and some non-context-free languages. We present a simple algorithm for learning these grammars together with a complete self-contained proof of the correctness and efficiency of the algorithm.

...read moreread less

Journal Article•DOI•

OCL for formal modelling of topological constraints involving regions with broad boundaries

[...]

Lotfi Bejaoui¹, François Pinet, Michel Schneider², Yvan Bédard¹•Institutions (2)

Laval University¹, Blaise Pascal University²

01 Jul 2010-Geoinformatica

TL;DR: An extension of Spatial OCL is proposed based on a geometric model for objects with vague shapes, and an adverbial approach for modelling topological constraints involving regions with broad boundaries, which provides an easiness in the formal modelling of these complex constraints.

...read moreread less

Abstract: Integrity constraints can control topological relations of objects in spatial databases. These constraints can be modelled using formal languages such as the spatial extension of the Object Constraint Language (Spatial OCL). This language allows the expression of topological integrity constraints involving crisp spatial objects but it does not support constraints involving spatial objects with vague shapes (e.g. forest stand, pollution zone, valley or lake). In this paper, we propose an extension of Spatial OCL based on (1) a geometric model for objects with vague shapes, and (2) an adverbial approach for modelling topological constraints involving regions with broad boundaries. This new language provides an easiness in the formal modelling of these complex constraints. Our approach has been implemented in a code generator. A case study is also presented in the paper in the field of agriculture spreading activities. AOCL OVS takes account of the shape vagueness of spread parcel and improve spatial reasoning about them.

...read moreread less

Journal Article•DOI•

Grammar constraints

[...]

Serdar Kadioglu¹, Meinolf Sellmann¹•Institutions (1)

Brown University¹

01 Jan 2010

TL;DR: A time- and space-efficient incremental arc-consistency algorithm for context-free grammars, investigate when logic combinations of grammar constraints are tractable, and show how to exploit non-constant size Grammars and reorderings of languages.

...read moreread less

Abstract: With the introduction of the Regular Membership Constraint, a new line of research has opened where constraints are based on formal languages. This paper is taking the next step, namely to investigate constraints based on grammars higher up in the Chomsky hierarchy. We devise a time- and space-efficient incremental arc-consistency algorithm for context-free grammars, investigate when logic combinations of grammar constraints are tractable, show how to exploit non-constant size grammars and reorderings of languages, and study where the boundaries run between regular, context-free, and context-sensitive grammar filtering.

...read moreread less

Collapse