scispace - formally typeset
Search or ask a question

Showing papers on "Interval tree published in 2019"


Journal ArticleDOI
TL;DR: The results demonstrate that RTFC is an efficient data structure for querying large-scale datasets regarding Allen’s relations between genomic intervals, such as those required by interpreting genome-wide variation in large populations.
Abstract: Efficient large-scale annotation of genomic intervals is essential for personal genome interpretation in the realm of precision medicine. There are 13 possible relations between two intervals according to Allen's interval algebra. Conventional interval trees are routinely used to identify the genomic intervals satisfying a coarse relation with a query interval, but cannot support efficient query for more refined relations such as all Allen's relations. We design and implement a novel approach to address this unmet need. Through rewriting Allen's interval relations, we transform an interval query to a range query, then adapt and utilize the range trees for querying. We implement two types of range trees: a basic 2-dimensional range tree (2D-RT) and an augmented range tree with fractional cascading (RTFC) and compare them with the conventional interval tree (IT). Theoretical analysis shows that RTFC can achieve the best time complexity for interval queries regarding all Allen's relations among the three trees. We also perform comparative experiments on the efficiency of RTFC, 2D-RT and IT in querying noncoding element annotations in a large collection of personal genomes. Our experimental results show that 2D-RT is more efficient than IT for interval queries regarding most of Allen's relations, RTFC is even more efficient than 2D-RT. The results demonstrate that RTFC is an efficient data structure for querying large-scale datasets regarding Allen's relations between genomic intervals, such as those required by interpreting genome-wide variation in large populations.

5 citations


Journal ArticleDOI
TL;DR: In this paper, the odd integers are placed in a given interval on a perfect full binary tree, and the odd integer's divisors are investigated by combining the original properties of the integers with the properties of binary trees.
Abstract: The paper first puts forward a way to study odd integers by placing the odd integers in a given interval on a perfect full binary tree, then makes an investigation on the odd integers by means of combining the original properties of the integers with the properties of the binary trees and obtains several new results on how an odd integer's divisors distribute on a level of a binary tree. The newly discovered law of divisors' distribution that includes common divisors between two symmetric nodes, genetic divisors between an ancestor node and its descendant node can provide a new and simple approach to factorize odd composite integers. Based on the mathematical deductions, numerical experiments are designed and demonstrated in the Maple software. All the results of the experiments are conformance to expectation and validate the validity of the approach.

4 citations


Book
31 Jan 2019
TL;DR: The current work adapts the optimal tree pruning algorithm (BFOS) introduced by Breiman et al.(1984) and extended by Chou et al. (1989) to the multi-document summarization task and uses Hierarchical Agglomerative Clustering algorithm(HAC) to detect redundancy.
Abstract: The current work adapts the optimal tree pruning algorithm(BFOS) introduced by Breiman et al.(1984) and extended by Chou et al.(1989) to the multi-document summarization task. BFOS algorithm is used to eliminate redundancy which is one of the main issues in multi-document summarization. Hierarchical Agglomerative Clustering algorithm(HAC) is employed to detect the redundancy. The tree designed by HAC algorithm is successively pruned with the optimal tree pruning algorithm to optimize the distortion vs. rate cost of the resultant tree. Rate parameter is defined to be the number of the sentences in the leaves of the tree. Distortion is the sum of the distances between the representative sentence of the cluster at each node and the other sentences in the same cluster. The sentences assigned to the leaves of the resultant tree are included in the summary. The performance of the proposed system assessed with the Rouge-1 metric is seen to be better than the performance of the DUC-2002 winners on DUC-2002 data set.

2 citations


Proceedings ArticleDOI
04 Dec 2019
TL;DR: The searchable symmetric encryption scheme TIQSSE utilizes interval tree structure and it is proved that the scheme achieves efficient search and update time while also maintaining all important security properties such as forward privacy, backward privacy, and it does not leak information outside the desired temporal range.
Abstract: Visual logs become widely available via personal cameras, visual sensors in smart environments, or surveillance systems. Storing such data in public services is a common convenient solution, but it is essential to devise a mechanism to encrypt such data while enabling the capability to query visual content even in encrypted format at the services. This motivates our proposal to develop a smart secure service for visual logs with a temporal interval query. In our system, visual log data are analyzed to generate high-level contents, including entities, scenes, and activities happening in visual data. Then our system supports data owners to query these high-level contents from their visual logs at the server-side in a temporal interval while the data are still encrypted. Our searchable symmetric encryption scheme TIQSSE utilizes interval tree structure and we prove that our scheme achieves efficient search and update time while also maintaining all important security properties such as forward privacy, backward privacy, and it does not leak information outside the desired temporal range.

1 citations


Journal ArticleDOI
TL;DR: In this paper, the authors propose two parallel solutions to the data distribution management (DDM) problem that can operate effectively on shared-memory multiprocessors, based on a data structure (the Interval Tree) that allows concurrent computation of intersections between subscription and update regions, and a novel parallel extension of the Sort Based Matching algorithm, whose sequential version is considered among the most efficient solutions.
Abstract: The problem of identifying intersections between two sets of d-dimensional axis-parallel rectangles appears frequently in the context of agent-based simulation studies. For this reason, the High Level Architecture (HLA) specification -- a standard framework for interoperability among simulators -- includes a Data Distribution Management (DDM) service whose responsibility is to report all intersections between a set of subscription and update regions. The algorithms at the core of the DDM service are CPU-intensive, and could greatly benefit from the large computing power of modern multi-core processors. In this paper we propose two parallel solutions to the DDM problem that can operate effectively on shared-memory multiprocessors. The first solution is based on a data structure (the Interval Tree) that allows concurrent computation of intersections between subscription and update regions. The second solution is based on a novel parallel extension of the Sort Based Matching algorithm, whose sequential version is considered among the most efficient solutions to the DDM problem. Extensive experimental evaluation of the proposed algorithms confirm their effectiveness on taking advantage of multiple execution units in a shared-memory architecture.

1 citations


Patent
26 Feb 2019
TL;DR: In this article, a data storage method and apparatus and a computer readable storage medium, and relates to the technical field of databases, is presented, where the summary configuration metadata comprises a preset flag field interval and a preset summary function.
Abstract: The present disclosure relates to a data storage method and apparatus and a computer readable storage medium, and relates to the technical field of databases. The disclosed method comprises the following steps: acquiring summary configuration metadata, wherein the summary configuration metadata comprises a preset flag field interval and a preset summary function; summarizing the original data in different preset flag field intervals according to the preset summary function to obtain the summary data in different flag field intervals; generating a node according to the interval value of each flag field and the corresponding summary data, and generating an interval tree according to the relationship between each flag field interval to store the summary data, wherein the interval length of the flag field of the parent node in the interval tree is the sum of the interval lengths of the flag fields of each child node. The hierarchical structure based on interval tree can query the summary data in different interval lengths, which improves the query efficiency and saves the system resources.