
Showing papers on "Tree (data structure)" published in 2017


Journal ArticleDOI
TL;DR: An R package, ggtree, provides programmable visualization and annotation of phylogenetic trees; it can read more tree file formats than other software and supports visualization of phylo, multiPhylo, phylo4, phylo4d, obkData and phyloseq tree objects defined in other R packages.
Abstract: Summary: We present an R package, ggtree, which provides programmable visualization and annotation of phylogenetic trees. ggtree can read more tree file formats than other software, including Newick, NEXUS, NHX, PHYLIP and jplace formats, and supports visualization of phylo, multiPhylo, phylo4, phylo4d, obkData and phyloseq tree objects defined in other R packages. It can also extract the tree/branch/node-specific and other data from the analysis outputs of BEAST, EPA, HyPhy, PAML, phylodog, pplacer, r8s, RAxML and RevBayes software, and allows using these data to annotate the tree. The package allows colouring and annotation of a tree by numerical/categorical node attributes, manipulating a tree by rotating, collapsing and zooming out clades, highlighting user-selected clades or operational taxonomic units, and exploration of a large tree by zooming into a selected portion. A two-dimensional tree can be drawn by scaling the tree width based on an attribute of the nodes. A tree can be annotated with an associated numerical matrix (as a heat map), multiple sequence alignment, subplots or silhouette images. The package ggtree is released under the Artistic-2.0 license. The source code and documents are freely available through Bioconductor (http://www.bioconductor.org/packages/ggtree).
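ggtree itself is an R/Bioconductor package, so its API is not reproduced here. As a rough Python analogue only (assuming Biopython is installed; this is not the ggtree interface), the sketch below reads a Newick tree and prints a basic text rendering and per-taxon data, the kind of starting point that ggtree's programmable annotation builds on.

```python
from io import StringIO

from Bio import Phylo  # Biopython stand-in, not the ggtree API

# A tiny Newick tree with branch lengths.
newick = "((A:0.1,B:0.2):0.05,(C:0.3,D:0.4):0.1);"
tree = Phylo.read(StringIO(newick), "newick")

Phylo.draw_ascii(tree)                      # quick text visualization
for leaf in tree.get_terminals():           # iterate over taxa for annotation
    print(leaf.name, tree.distance(leaf))   # e.g. root-to-tip distance
```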

2,692 citations


Book ChapterDOI
01 Nov 2017
TL;DR: The authors describe S functions for tree-based modeling, an alternative to linear and additive models for regression problems and to linear logistic and additive logistic models for classification problems.
Abstract: This chapter describes S functions for tree-based modeling. Tree-based models provide an alternative to linear and additive models for regression problems and to linear logistic and additive logistic models for classification problems. Tree-based modeling is an exploratory technique for uncovering structure in data. Specifically, the technique is useful for classification and regression problems where one has a set of classification or predictor variables and a single response variable. Statistical inference for tree-based models is in its infancy and far behind that for logistic and linear regression analyses. This is partly because a particular type of variable selection underlies tree-based modeling. Our approach is not to have a single function for tree-based modeling, but rather a collection of functions which, together with existing S functions, form a basis for building and assessing this new class of models. Implementation centers around the idea of a tree object. A subtree of a tree object can be selected or deleted in a natural way through subscripting.
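The chapter's S functions are not reproduced here. As a hedged Python analogue (scikit-learn rather than the S tree object the chapter describes), a classification tree can be grown and inspected as a model object, loosely the kind of manipulation the chapter discusses.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

# Fit a classification tree to a standard dataset.
X, y = load_iris(return_X_y=True)
clf = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

# Inspect the fitted tree as text; nested split rules correspond to subtrees,
# loosely analogous to subscripting a tree object in S.
print(export_text(clf))
print("training accuracy:", clf.score(X, y))
```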

662 citations


Posted Content
TL;DR: Extensive experiments demonstrate that the proposed stacked generative adversarial networks significantly outperform other state-of-the-art methods in generating photo-realistic images.
Abstract: Although Generative Adversarial Networks (GANs) have shown remarkable success in various tasks, they still face challenges in generating high quality images. In this paper, we propose Stacked Generative Adversarial Networks (StackGAN) aiming at generating high-resolution photo-realistic images. First, we propose a two-stage generative adversarial network architecture, StackGAN-v1, for text-to-image synthesis. The Stage-I GAN sketches the primitive shape and colors of the object based on given text description, yielding low-resolution images. The Stage-II GAN takes Stage-I results and text descriptions as inputs, and generates high-resolution images with photo-realistic details. Second, an advanced multi-stage generative adversarial network architecture, StackGAN-v2, is proposed for both conditional and unconditional generative tasks. Our StackGAN-v2 consists of multiple generators and discriminators in a tree-like structure; images at multiple scales corresponding to the same scene are generated from different branches of the tree. StackGAN-v2 shows more stable training behavior than StackGAN-v1 by jointly approximating multiple distributions. Extensive experiments demonstrate that the proposed stacked generative adversarial networks significantly outperform other state-of-the-art methods in generating photo-realistic images.
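As a structural sketch only (PyTorch assumed; the layer sizes, the 128-dimensional text embedding, and the 64 to 128 pixel resolutions are illustrative choices, not the paper's architecture), the two-stage idea can be expressed as a low-resolution generator whose output is refined by a second generator conditioned on the same text embedding.

```python
import torch
import torch.nn as nn

class StageIGenerator(nn.Module):
    """Noise + text embedding -> coarse low-resolution image (sketch)."""
    def __init__(self, z_dim=100, t_dim=128):
        super().__init__()
        self.fc = nn.Linear(z_dim + t_dim, 128 * 8 * 8)
        self.up = nn.Sequential(
            nn.ConvTranspose2d(128, 64, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 3, 4, stride=2, padding=1), nn.Tanh(),
        )

    def forward(self, z, text_emb):
        h = self.fc(torch.cat([z, text_emb], dim=1)).view(-1, 128, 8, 8)
        return self.up(h)                              # (N, 3, 64, 64)

class StageIIGenerator(nn.Module):
    """Low-resolution image + text embedding -> refined higher-resolution image."""
    def __init__(self, t_dim=128):
        super().__init__()
        self.encode = nn.Sequential(nn.Conv2d(3 + t_dim, 64, 3, padding=1), nn.ReLU())
        self.up = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 3, 3, padding=1), nn.Tanh(),
        )

    def forward(self, low_res, text_emb):
        n, _, h, w = low_res.shape
        # Tile the text embedding spatially and concatenate it as extra channels.
        t = text_emb.view(n, -1, 1, 1).expand(n, text_emb.shape[1], h, w)
        return self.up(self.encode(torch.cat([low_res, t], dim=1)))  # (N, 3, 128, 128)

z, t = torch.randn(2, 100), torch.randn(2, 128)
img64 = StageIGenerator()(z, t)
img128 = StageIIGenerator()(img64, t)
print(img64.shape, img128.shape)
```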

431 citations


Journal ArticleDOI
TL;DR: Optimal classification trees are presented: a novel formulation of the decision tree problem using modern MIO techniques that yields the optimal decision tree for axes-aligned splits. Synthetic tests demonstrate that these methods recover the true decision tree more closely than heuristics, refuting the notion that optimal methods overfit the training data.
Abstract: State-of-the-art decision tree methods apply heuristics recursively to create each split in isolation, which may not capture well the underlying characteristics of the dataset. The optimal decision tree problem attempts to resolve this by creating the entire decision tree at once to achieve global optimality. In the last 25 years, algorithmic advances in integer optimization coupled with hardware improvements have resulted in an astonishing 800 billion factor speedup in mixed-integer optimization (MIO). Motivated by this speedup, we present optimal classification trees, a novel formulation of the decision tree problem using modern MIO techniques that yields the optimal decision tree for axes-aligned splits. We also show the richness of this MIO formulation by adapting it to give optimal classification trees with hyperplanes, which generate optimal decision trees with multivariate splits. Synthetic tests demonstrate that these methods recover the true decision tree more closely than heuristics, refuting the notion that optimal methods overfit the training data. We comprehensively benchmark these methods on a sample of 53 datasets from the UCI machine learning repository. We establish that these MIO methods are practically solvable on real-world datasets with sizes in the 1000s, and give average absolute improvements in out-of-sample accuracy over CART of 1–2% and 3–5% for the univariate and multivariate cases, respectively. Furthermore, we identify that optimal classification trees are likely to outperform CART by 1.2–1.3% in situations where the CART accuracy is high and we have sufficient training data, while the multivariate version outperforms CART by 4–7% when the CART accuracy or dimension of the dataset is low.
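The paper solves a mixed-integer program; the toy sketch below is not that formulation. It only illustrates the underlying contrast by constructing an entire depth-2 axis-aligned tree via exhaustive search, which is globally optimal on this toy XOR data where any single greedy split makes no progress.

```python
import itertools
import numpy as np

def stump_error(X, y, j, t):
    """Misclassification count when each side of the split predicts its majority class."""
    left, right = y[X[:, j] <= t], y[X[:, j] > t]
    err = 0
    for part in (left, right):
        if len(part):
            err += len(part) - np.bincount(part).max()
    return err

def best_depth2_tree(X, y):
    """Exhaustively search depth-2 axis-aligned trees for the global optimum (toy only)."""
    n, d = X.shape
    candidates = [(j, t) for j in range(d) for t in np.unique(X[:, j])]
    best = (None, np.inf)
    for j, t in candidates:                                   # root split
        mask = X[:, j] <= t
        for (jl, tl), (jr, tr) in itertools.product(candidates, candidates):
            err = (stump_error(X[mask], y[mask], jl, tl)
                   + stump_error(X[~mask], y[~mask], jr, tr))
            if err < best[1]:
                best = (((j, t), (jl, tl), (jr, tr)), err)
    return best

# XOR-style data: no single split helps, but a depth-2 tree separates it exactly.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y = np.array([0, 1, 1, 0])
tree, err = best_depth2_tree(X, y)
print(tree, err)   # err == 0
```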

390 citations


Proceedings ArticleDOI
25 Apr 2017
TL;DR: In this paper, abstract syntax trees (ASTs) are constructed by a decoder with a dynamically-determined modular structure paralleling the structure of the output tree, which achieves state-of-the-art results on the Hearthstone code-generation benchmark and competitive results on the Atis, Jobs, and Geo semantic parsing datasets.
Abstract: Tasks like code generation and semantic parsing require mapping unstructured (or partially structured) inputs to well-formed, executable outputs. We introduce abstract syntax networks, a modeling framework for these problems. The outputs are represented as abstract syntax trees (ASTs) and constructed by a decoder with a dynamically-determined modular structure paralleling the structure of the output tree. On the benchmark Hearthstone dataset for code generation, our model obtains 79.2 BLEU and 22.7% exact match accuracy, compared to previous state-of-the-art values of 67.1 and 6.1%. Furthermore, we perform competitively on the Atis, Jobs, and Geo semantic parsing datasets with no task-specific engineering.
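Independent of the neural decoder, Python's standard ast module shows what the target representation looks like: a typed tree whose nodes have named fields, which is what a structured decoder emits constructor by constructor (the indent argument to ast.dump assumes Python 3.9+).

```python
import ast

# A well-formed, executable output represented as an abstract syntax tree.
tree = ast.parse("def add(a, b):\n    return a + b")
print(ast.dump(tree, indent=2))   # nested constructor calls mirror the tree

# Each node has typed children, which a structured decoder must emit field by field.
func = tree.body[0]
print(type(func).__name__, [type(s).__name__ for s in func.body])
```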

354 citations


Journal ArticleDOI
TL;DR: To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, a new version of *BEAST called StarBEAST2 is developed, and species tree relaxed clocks are introduced to enable accurate estimates of per-species substitution rates.
Abstract: Fully Bayesian multispecies coalescent (MSC) methods like *BEAST estimate species trees from multiple sequence alignments. Today thousands of genes can be sequenced for a given study, but using that many genes with *BEAST is intractably slow. An alternative is to use heuristic methods which compromise accuracy or completeness in return for speed. A common heuristic is concatenation, which assumes that the evolutionary history of each gene tree is identical to the species tree. This is an inconsistent estimator of species tree topology, a worse estimator of divergence times, and induces spurious substitution rate variation when incomplete lineage sorting is present. Another class of heuristics directly motivated by the MSC avoids many of the pitfalls of concatenation but cannot be used to estimate divergence times. To enable fuller use of available data and more accurate inference of species tree topologies, divergence times, and substitution rates, we have developed a new version of *BEAST called StarBEAST2. To improve convergence rates we add analytical integration of population sizes, novel MCMC operators and other optimizations. Computational performance improved by 13.5× and 13.8× respectively when analyzing two empirical data sets, and an average of 33.1× across 30 simulated data sets. To enable accurate estimates of per-species substitution rates, we introduce species tree relaxed clocks, and show that StarBEAST2 is a more powerful and robust estimator of rate variation than concatenation. StarBEAST2 is available through the BEAUTi package manager in BEAST 2.4 and above.

346 citations



Journal ArticleDOI
TL;DR: The results show that the M5P model tree can be a better alternative approach for predicting the compressive strength of NC and HPC, using the amounts of concrete constituents as input parameters.

186 citations


Proceedings Article
06 Aug 2017
TL;DR: Bonsai can make predictions in milliseconds even on slow microcontrollers, can fit in KB of memory, has lower battery consumption than all other algorithms, and achieves prediction accuracies that can be as much as 30% higher than state-of-the-art methods for resource-efficient machine learning.
Abstract: This paper develops a novel tree-based algorithm, called Bonsai, for efficient prediction on IoT devices - such as those based on the Arduino Uno board, which has an 8-bit ATmega328P microcontroller operating at 16 MHz with no native floating point support, 2 KB RAM and 32 KB read-only flash. Bonsai maintains prediction accuracy while minimizing model size and prediction costs by: (a) developing a tree model which learns a single, shallow, sparse tree with powerful nodes; (b) sparsely projecting all data into a low-dimensional space in which the tree is learnt; and (c) jointly learning all tree and projection parameters. Experimental results on multiple benchmark datasets demonstrate that Bonsai can make predictions in milliseconds even on slow microcontrollers, can fit in KB of memory, and has lower battery consumption than all other algorithms while achieving prediction accuracies that can be as much as 30% higher than state-of-the-art methods for resource-efficient machine learning. Bonsai is also shown to generalize to other resource-constrained settings beyond IoT by generating significantly better search results as compared to Bing's L3 ranker when the model size is restricted to 300 bytes. Bonsai's code can be downloaded from (BonsaiCode).
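The sketch below is not Bonsai's joint training procedure. It only illustrates the two structural ingredients the abstract describes, a sparse low-dimensional projection followed by a single shallow tree, using scikit-learn and an arbitrary 10-dimensional random projection as stand-ins.

```python
import numpy as np
from sklearn.datasets import load_digits
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_digits(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Sparse random projection into a low-dimensional space (a stand-in for Bonsai's
# learned sparse projection), then a single shallow tree in that space.
rng = np.random.default_rng(0)
d_low = 10
P = rng.normal(size=(X.shape[1], d_low)) * (rng.random((X.shape[1], d_low)) < 0.1)

clf = DecisionTreeClassifier(max_depth=4, random_state=0)
clf.fit(X_tr @ P, y_tr)
print("accuracy:", clf.score(X_te @ P, y_te))
```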

184 citations


Journal ArticleDOI
TL;DR: A framework, implemented in the phangorn library in R, transfers information between trees and networks; this includes identifying and labelling equivalent tree branches and network edges and transferring tree branch support to network edges.
Abstract: Summary The fields of phylogenetic tree and network inference have dramatically advanced in the past decade, but independently with few attempts to bridge them. Here we provide a framework, implemented in the phangorn library in R, to transfer information between trees and networks. This includes: (i) identifying and labelling equivalent tree branches and network edges, (ii) transferring tree branch support to network edges, and (iii) mapping bipartition support from a sample of trees (e.g. from bootstrapping or Bayesian inference) onto network edges. The ability to readily combine tree and network information should lead to more comprehensive evolutionary comparisons and inferences.

174 citations


Journal ArticleDOI
TL;DR: In this article, a multi-step computer vision algorithm segments and quantifies the percentage of tree cover in streetscape images to a high degree of precision, and then models the relationship between neighbouring images along city street segments.

Journal ArticleDOI
TL;DR: STRIDE correctly identifies the root of the species tree in multiple large-scale molecular phylogenetic data sets spanning a wide range of timescales and taxonomic groups, and the novel probability model implemented in STRIDE can accurately represent the ambiguity in species tree root assignment for data sets where information is limited.
Abstract: The correct interpretation of any phylogenetic tree is dependent on that tree being correctly rooted. We present STRIDE, a fast, effective, and outgroup-free method for identification of gene duplication events and species tree root inference in large-scale molecular phylogenetic analyses. STRIDE identifies sets of well-supported in-group gene duplication events from a set of unrooted gene trees, and analyses these events to infer a probability distribution over an unrooted species tree for the location of its root. We show that STRIDE correctly identifies the root of the species tree in multiple large-scale molecular phylogenetic data sets spanning a wide range of timescales and taxonomic groups. We demonstrate that the novel probability model implemented in STRIDE can accurately represent the ambiguity in species tree root assignment for data sets where information is limited. Furthermore, application of STRIDE to outgroup-free inference of the origin of the eukaryotic tree resulted in a root probability distribution that provides additional support for leading hypotheses for the origin of the eukaryotes.

Journal ArticleDOI
TL;DR: In this paper, a Bayesian method for inferring the species phylogeny under the multispecies coalescent (MSC) model was developed, which integrates over gene trees, naturally taking account of the uncertainty of gene tree topology and branch lengths given the sequence data.
Abstract: We develop a Bayesian method for inferring the species phylogeny under the multispecies coalescent (MSC) model. To improve the mixing properties of the Markov chain Monte Carlo (MCMC) algorithm that traverses the space of species trees, we implement two efficient MCMC proposals: the first is based on the Subtree Pruning and Regrafting (SPR) algorithm and the second is based on a node-slider algorithm. Like the Nearest-Neighbor Interchange (NNI) algorithm we implemented previously, both new algorithms propose changes to the species tree, while simultaneously altering the gene trees at multiple genetic loci to automatically avoid conflicts with the newly proposed species tree. The method integrates over gene trees, naturally taking account of the uncertainty of gene tree topology and branch lengths given the sequence data. A simulation study was performed to examine the statistical properties of the new method. The method was found to show excellent statistical performance, inferring the correct species tree with near certainty when 10 loci were included in the dataset. The prior on species trees has some impact, particularly for small numbers of loci. We analyzed several previously published datasets (both real and simulated) for rattlesnakes and Philippine shrews, in comparison with alternative methods. The results suggest that the Bayesian coalescent-based method is statistically more efficient than heuristic methods based on summary statistics, and that our implementation is computationally more efficient than alternative full-likelihood methods under the MSC. Parameter estimates for the rattlesnake data suggest drastically different evolutionary dynamics between the nuclear and mitochondrial loci, even though they support largely consistent species trees. We discuss the different challenges facing the marginal likelihood calculation and transmodel MCMC as alternative strategies for estimating posterior probabilities for species trees. [Bayes factor; Bayesian inference; MCMC; multispecies coalescent; nodeslider; species tree; SPR.].

Journal ArticleDOI
04 May 2017 - PLOS ONE
TL;DR: Comparison with field data measurements showed no significant difference in measuring DBH or tree height using 3D Forest, although for DBH only the Randomized Hough Transform algorithm proved to be sufficiently resistant to noise and provided results comparable to traditional field measurements.
Abstract: Terrestrial laser scanning is a powerful technology for capturing the three-dimensional structure of forests with a high level of detail and accuracy. Over the last decade, many algorithms have been developed to extract various tree parameters from terrestrial laser scanning data. Here we present 3D Forest, an open-source non-platform-specific software application with an easy-to-use graphical user interface with the compilation of algorithms focused on the forest environment and extraction of tree parameters. The current version (0.42) extracts important parameters of forest structure from the terrestrial laser scanning data, such as stem positions (X, Y, Z), tree heights, diameters at breast height (DBH), as well as more advanced parameters such as tree planar projections, stem profiles or detailed crown parameters including convex and concave crown surface and volume. Moreover, 3D Forest provides quantitative measures of between-crown interactions and their real arrangement in 3D space. 3D Forest also includes an original algorithm of automatic tree segmentation and crown segmentation. Comparison with field data measurements showed no significant difference in measuring DBH or tree height using 3D Forest, although for DBH only the Randomized Hough Transform algorithm proved to be sufficiently resistant to noise and provided results comparable to traditional field measurements.
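3D Forest's noise-resistant DBH estimate uses a Randomized Hough Transform, which is not shown here. The sketch below uses a simpler algebraic least-squares circle fit (Kasa method) on a synthetic breast-height slice of stem points, just to show how a diameter falls out of a fitted circle.

```python
import numpy as np

def fit_circle(xy):
    """Algebraic least-squares circle fit (Kasa method) to a 2-D point slice."""
    x, y = xy[:, 0], xy[:, 1]
    A = np.column_stack([2 * x, 2 * y, np.ones(len(x))])
    b = x**2 + y**2
    (cx, cy, c), *_ = np.linalg.lstsq(A, b, rcond=None)
    r = np.sqrt(c + cx**2 + cy**2)
    return (cx, cy), r

# Synthetic stem slice at breast height: noisy points on a 0.15 m radius circle.
rng = np.random.default_rng(1)
theta = rng.uniform(0, 2 * np.pi, 200)
pts = np.column_stack([0.15 * np.cos(theta), 0.15 * np.sin(theta)])
pts += rng.normal(scale=0.005, size=pts.shape)

center, r = fit_circle(pts)
print("estimated DBH (m):", 2 * r)
```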

Journal ArticleDOI
TL;DR: An approach based on ensemble learning is proposed to infer hybrid transportation modes using only Global Positioning System (GPS) data, and tree-based ensemble models are used instead of traditional methods to classify the different transportation modes.
Abstract: Recognition of transportation modes can be used in different applications including human behavior research, transport management and traffic control. Previous work on transportation mode recognition has often relied on using multiple sensors or matching Geographic Information System (GIS) information, which is not possible in many cases. In this paper, an approach based on ensemble learning is proposed to infer hybrid transportation modes using only Global Positioning System (GPS) data. First, in order to distinguish between different transportation modes, we used a statistical method to generate global features and extract several local features from sub-trajectories after trajectory segmentation, before these features were combined in the classification stage. Second, to obtain a better performance, we used tree-based ensemble models (Random Forest, Gradient Boosting Decision Tree, and XGBoost) instead of traditional methods (K-Nearest Neighbor, Decision Tree, and Support Vector Machines) to classify the different transportation modes. The experimental results on the latter have shown the efficacy of our proposed approach. Among them, the XGBoost model produced the best performance with a classification accuracy of 90.77% obtained on the GEOLIFE dataset, and we used a tree-based ensemble method for feature selection to reduce the model complexity.
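A minimal, hedged version of the model comparison looks like this with scikit-learn; the per-segment features, their relationship to the labels, and the data itself are synthetic placeholders, not the GEOLIFE trajectories or the paper's feature set.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical per-segment features (e.g. mean speed, speed variance,
# acceleration, heading-change rate) with toy transportation-mode labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(600, 4))
y = np.digitize(X[:, 0] + 0.5 * X[:, 1], [-0.7, 0.0, 0.7])   # 4 synthetic classes

for name, clf in [("kNN", KNeighborsClassifier()),
                  ("Random Forest", RandomForestClassifier(random_state=0)),
                  ("Gradient Boosting", GradientBoostingClassifier(random_state=0))]:
    acc = cross_val_score(clf, X, y, cv=3).mean()
    print(f"{name}: {acc:.3f}")
```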

Journal ArticleDOI
TL;DR: In this paper, the authors presented an approach to quantify stand structural complexity based on fractal dimension derived from single terrestrial laser scans (TLS) that were made on 126 permanent forest research plots in Germany, representing major stand and management types.

Proceedings Article
27 Feb 2017
TL;DR: It is shown that the radix tree, another less popular indexing structure, can be more appropriate as an efficient PM indexing structure, because its structure is determined by the prefix of the inserted keys and it does not require tree rebalancing operations or node-granularity updates.
Abstract: Recent interest in persistent memory (PM) has stirred development of index structures that are efficient in PM. Recent such developments have all focused on variations of the B-tree. In this paper, we show that the radix tree, which is another less popular indexing structure, can be more appropriate as an efficient PM indexing structure. This is because the radix tree structure is determined by the prefix of the inserted keys and also does not require tree rebalancing operations and node granularity updates. However, the radix tree as-is cannot be used in PM. As another contribution, we present three radix tree variants, namely, WORT (Write Optimal Radix Tree), WOART (Write Optimal Adaptive Radix Tree), and ART+CoW. Of these, the first two are optimal for PM in the sense that they only use one 8-byte failure-atomic write per update to guarantee the consistency of the structure and do not require any duplicate copies for logging or CoW. Extensive performance studies show that our proposed radix tree variants perform considerably better than recently proposed B-tree variants for PM, such as NVTree, wB+Tree, and FPTree, for synthetic workloads as well as in implementations within Memcached.
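None of WORT's persistence machinery (8-byte failure-atomic writes, CoW) appears in the plain in-memory sketch below; it only illustrates the structural property the paper exploits: in a fixed-span radix tree the shape is fully determined by key prefixes, so inserts never trigger rebalancing.

```python
class RadixNode:
    __slots__ = ("children", "value")
    def __init__(self):
        self.children = [None] * 16   # one slot per 4-bit chunk of the key
        self.value = None

class RadixTree:
    """Fixed-span (4 bits per level) radix tree for 32-bit integer keys.

    The shape is determined entirely by key prefixes, so inserts never
    rebalance nodes -- the property the paper exploits for PM."""
    LEVELS = 8                        # 32 bits / 4 bits per level

    def __init__(self):
        self.root = RadixNode()

    def insert(self, key, value):
        node = self.root
        for level in range(self.LEVELS):
            chunk = (key >> (28 - 4 * level)) & 0xF
            if node.children[chunk] is None:
                node.children[chunk] = RadixNode()
            node = node.children[chunk]
        node.value = value

    def lookup(self, key):
        node = self.root
        for level in range(self.LEVELS):
            node = node.children[(key >> (28 - 4 * level)) & 0xF]
            if node is None:
                return None
        return node.value

t = RadixTree()
t.insert(0xDEADBEEF, "a"); t.insert(0xDEADBEE0, "b")
print(t.lookup(0xDEADBEEF), t.lookup(0x12345678))   # a None
```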

Proceedings ArticleDOI
01 May 2017
TL;DR: In this article, the authors focus on minimizing the total number of Virtual Network Function (VNF) instances to provide a specific service (possibly at different locations) to all the flows in a network.
Abstract: Network Function Virtualization (NFV) has the potential to significantly reduce the capital and operating expenses, shorten product release cycle, and improve service agility. In this paper, we focus on minimizing the total number of Virtual Network Function (VNF) instances to provide a specific service (possibly at different locations) to all the flows in a network. Certain network security and analytics applications may allow fractional processing of a flow at different nodes (corresponding to datacenters), giving an opportunity for greater optimization of resources. Through a reduction from the set cover problem, we show that this problem is NP-hard and cannot even be approximated within a factor of (1 − o(1)) ln m (where m is the number of flows) unless P = NP. Then, we design two simple greedy algorithms and prove that they achieve an approximation ratio of (1 − o(1)) ln m + 2, which is asymptotically optimal. For special cases where each node hosts multiple VNF instances (which is typically true in practice), we also show that our greedy algorithms have a constant approximation ratio. Further, for tree topologies we develop an optimal greedy algorithm by exploiting the inherent topological structure. Finally, we conduct extensive numerical experiments to evaluate the performance of our proposed algorithms in various scenarios.
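Under a toy model in which every flow must be fully processed at some node (ignoring fractional processing and capacities, so this is a sketch rather than the paper's algorithm), the greedy idea reduces to greedy set cover, which is what yields a (1 − o(1)) ln m-type guarantee.

```python
def greedy_cover(flows, candidate_nodes):
    """Greedy set cover: repeatedly place a VNF instance at the node that
    covers the most still-unprocessed flows (toy model, full processing only)."""
    uncovered = set(flows)
    placement = []
    while uncovered:
        node, covered = max(candidate_nodes.items(),
                            key=lambda kv: len(uncovered & kv[1]))
        if not (uncovered & covered):
            raise ValueError("some flows cannot be covered")
        placement.append(node)
        uncovered -= covered
    return placement

# Hypothetical topology: each node sees a subset of flows passing through it.
nodes = {"A": {1, 2, 3}, "B": {3, 4}, "C": {4, 5, 6}, "D": {2, 6}}
print(greedy_cover({1, 2, 3, 4, 5, 6}, nodes))   # -> ['A', 'C']
```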

Proceedings ArticleDOI
22 Mar 2017
TL;DR: This work integrates Monte Carlo Tree Search with hierarchical neural net policies trained on expressive LTL specifications to find deep neural networks representing both low-level control policies and task-level “option policies” that achieve high-level goals.
Abstract: Task and motion planning subject to Linear Temporal Logic (LTL) specifications in complex, dynamic environments requires efficient exploration of many possible future worlds. Model-free reinforcement learning has proven successful in a number of challenging tasks, but shows poor performance on tasks that require long-term planning. In this work, we integrate Monte Carlo Tree Search with hierarchical neural net policies trained on expressive LTL specifications. We use reinforcement learning to find deep neural networks representing both low-level control policies and task-level “option policies” that achieve high-level goals. Our combined architecture generates safe and responsive motion plans that respect the LTL constraints. We demonstrate our approach in a simulated autonomous driving setting, where a vehicle must drive down a road in traffic, avoid collisions, and navigate an intersection, all while obeying rules of the road.

Proceedings Article
24 Apr 2017
TL;DR: A novel neural network architecture specifically tailored to tree-structured decoding maintains separate depth and width recurrent states and combines them to obtain hidden states for every node in the tree, and exhibits desirable invariance properties over sequential architectures.
Abstract: We propose a neural network architecture for generating tree-structured objects from encoded representations. The core of the method is a doubly-recurrent neural network that models separately the width and depth recurrences across the tree, and combines them inside each cell to generate an output. The topology of the tree is explicitly modeled, allowing the network to predict both content and topology of the tree when decoding. That is, given only an encoded vector representation, the network is able to simultaneously generate a tree from it and predict labels for the nodes. We test this architecture in an encoder-decoder framework, where we train a network to encode a sentence as a vector, and then generate a tree structure from it. The experimental results show the effectiveness of this architecture at recovering latent tree structure in sequences and at mapping sentences to simple functional programs.

Journal ArticleDOI
TL;DR: Inspired by coarse-graining approaches used in physics, it is shown how similar algorithms can be adapted for data; the resulting algorithms are based on layered tree tensor networks and scale linearly with both the dimension of the input and the training set size.
Abstract: Inspired by coarse-graining approaches used in physics, we show how similar algorithms can be adapted for data. The resulting algorithms are based on layered tree tensor networks and scale linearly with both the dimension of the input and the training set size. Computing most of the layers with an unsupervised algorithm, then optimizing just the top layer for supervised classification of the MNIST and fashion-MNIST data sets gives very good results. We also discuss mixing a prior guess for supervised weights together with an unsupervised representation of the data, yielding a smaller number of features nevertheless able to give good performance.
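As a structural sketch only: the paper computes its isometries from the data with an unsupervised coarse-graining step and then trains the top layer, whereas below the isometries are random and nothing is learned. The point is just the layered binary-tree contraction of local feature vectors.

```python
import numpy as np

def feature_map(x):
    """Map each scalar in [0, 1] to a 2-component local feature vector."""
    return np.stack([np.cos(np.pi * x / 2), np.sin(np.pi * x / 2)], axis=-1)

def random_isometry(d_out, d_in, rng):
    q, _ = np.linalg.qr(rng.normal(size=(d_in, d_out)))
    return q.T                                  # (d_out, d_in), orthonormal rows

def coarse_grain(vectors, d_out, rng):
    """One tree layer: contract neighbouring pairs of site vectors with an isometry."""
    out = []
    for a, b in zip(vectors[0::2], vectors[1::2]):
        pair = np.kron(a, b)                    # tensor product of the two sites
        W = random_isometry(d_out, len(pair), rng)
        out.append(W @ pair)
    return out

rng = np.random.default_rng(0)
x = rng.random(8)                               # toy 8-pixel input
vecs = list(feature_map(x))                     # 8 local vectors of dimension 2
while len(vecs) > 1:                            # three layers of the binary tree
    vecs = coarse_grain(vecs, d_out=4, rng=rng)
print(vecs[0].shape)                            # single top-level feature vector
```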

Proceedings ArticleDOI
01 Jul 2017
TL;DR: This paper designs a pipelined two-stage parsing method for generating an RST tree from text and argues that relation labeling can benefit from naked tree structure and should be treated elaborately, with consideration of three kinds of relations: within-sentence, across-sentence and across-paragraph relations.
Abstract: Previous work introduced transition-based algorithms to form a unified architecture of parsing rhetorical structures (including span, nuclearity and relation), but did not achieve satisfactory performance. In this paper, we propose that transition-based model is more appropriate for parsing the naked discourse tree (i.e., identifying span and nuclearity) due to data sparsity. At the same time, we argue that relation labeling can benefit from naked tree structure and should be treated elaborately with consideration of three kinds of relations including within-sentence, across-sentence and across-paragraph relations. Thus, we design a pipelined two-stage parsing method for generating an RST tree from text. Experimental results show that our method achieves state-of-the-art performance, especially on span and nuclearity identification.

Proceedings ArticleDOI
01 Aug 2017
TL;DR: This work gives a theoretical framework for analyzing this decision-making process in a simplified setting, proposes an ML approach for modeling heuristic success likelihood, and designs practical rules that leverage the ML models to dynamically decide whether to run a heuristic at each node of the search tree.
Abstract: “Primal heuristics” are a key contributor to the improved performance of exact branch-and-bound solvers for combinatorial optimization and integer programming. Perhaps the most crucial question concerning primal heuristics is that of at which nodes they should run, to which the typical answer is via hard-coded rules or fixed solver parameters tuned, offline, by trial-and-error. Alternatively, a heuristic should be run when it is most likely to succeed, based on the problem instance’s characteristics, the state of the search, etc. In this work, we study the problem of deciding at which node a heuristic should be run, such that the overall (primal) performance of the solver is optimized. To our knowledge, this is the first attempt at formalizing and systematically addressing this problem. Central to our approach is the use of Machine Learning (ML) for predicting whether a heuristic will succeed at a given node. We give a theoretical framework for analyzing this decision-making process in a simplified setting, propose an ML approach for modeling heuristic success likelihood, and design practical rules that leverage the ML models to dynamically decide whether to run a heuristic at each node of the search tree. Experimentally, our approach improves the primal performance of a state-of-the-art Mixed Integer Programming solver by up to 6% on a set of benchmark instances, and by up to 60% on a family of hard Independent Set instances.
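The decision rule itself can be sketched very simply; the node features, their relationship to heuristic success, and the 0.6 threshold below are placeholders, not the paper's learned models or rules. The pattern is: fit a success-probability model offline, then gate the heuristic on it at each node.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Hypothetical per-node features logged from past runs (e.g. depth, optimality gap,
# fraction of fractional variables) with a label: did the heuristic find an incumbent?
rng = np.random.default_rng(0)
X_hist = rng.normal(size=(500, 3))
y_hist = (X_hist[:, 0] + 0.5 * X_hist[:, 1] + rng.normal(size=500) > 0).astype(int)

model = LogisticRegression().fit(X_hist, y_hist)

def should_run_heuristic(node_features, threshold=0.6):
    """Run the (expensive) primal heuristic only when predicted success is high."""
    p_success = model.predict_proba(node_features.reshape(1, -1))[0, 1]
    return p_success >= threshold

print(should_run_heuristic(np.array([1.2, 0.3, -0.1])))
```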

Proceedings ArticleDOI
01 Jul 2017
TL;DR: This paper proposes a bidirectional tree encoder that learns both sequential and tree-structured representations and a tree-coverage model that lets the attention depend on source-side syntax, and shows that the proposed models outperform the sequential attentional model as well as a stronger baseline with a bottom-up tree encoder and word coverage.
Abstract: Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees. More specifically, we propose (1) a bidirectional tree encoder which learns both sequential and tree structured representations; (2) a tree-coverage model that lets the attention depend on the source-side syntax. Experiments on Chinese-English translation demonstrate that our proposed models outperform the sequential attentional model as well as a stronger baseline with a bottom-up tree encoder and word coverage.

Proceedings Article
23 May 2017
TL;DR: Expert Iteration (ExIt) decomposes the problem into separate planning and generalisation tasks: planning is performed by tree search, while a deep neural network generalises those plans and is then used to guide the search, increasing the strength of new plans.
Abstract: Sequential decision making problems, such as structured prediction, robotic control, and game playing, require a combination of planning policies and generalisation of those plans. In this paper, we present Expert Iteration (ExIt), a novel reinforcement learning algorithm which decomposes the problem into separate planning and generalisation tasks. Planning new policies is performed by tree search, while a deep neural network generalises those plans. Subsequently, tree search is improved by using the neural network policy to guide search, increasing the strength of new plans. In contrast, standard deep Reinforcement Learning algorithms rely on a neural network not only to generalise plans, but to discover them too. We show that ExIt outperforms REINFORCE for training a neural network to play the board game Hex, and our final tree search agent, trained tabula rasa, defeats MoHex1.0, the most recent Olympiad Champion player to be publicly released.
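ExIt's tree search is guided by a learned neural network policy, which the self-contained sketch below drops: it runs plain UCT with random rollouts on a toy sequential decision problem (a hidden 5-bit target; all names and constants are illustrative), to show the planning half that the network would imitate and, in turn, guide.

```python
import math, random

TARGET = [1, 0, 1, 1, 0]          # hidden best action sequence (toy problem)
DEPTH = len(TARGET)

def rollout_reward(prefix):
    """Finish the prefix with random bits and score it against the target."""
    bits = prefix + [random.randint(0, 1) for _ in range(DEPTH - len(prefix))]
    return sum(int(a == b) for a, b in zip(bits, TARGET)) / DEPTH

class Node:
    def __init__(self, prefix):
        self.prefix, self.children, self.visits, self.total = prefix, {}, 0, 0.0

    def uct_child(self, c=1.4):
        return max(self.children.items(),
                   key=lambda kv: kv[1].total / kv[1].visits
                   + c * math.sqrt(math.log(self.visits) / kv[1].visits))

def mcts(iterations=2000):
    root = Node([])
    for _ in range(iterations):
        node, path = root, [root]
        # 1. Selection: descend through fully expanded nodes by UCT.
        while len(node.prefix) < DEPTH and len(node.children) == 2:
            node = node.uct_child()[1]
            path.append(node)
        # 2. Expansion: add one unexplored child (if not at a leaf).
        if len(node.prefix) < DEPTH:
            bit = 0 if 0 not in node.children else 1
            node.children[bit] = node = Node(node.prefix + [bit])
            path.append(node)
        # 3. Simulation + 4. Backpropagation.
        reward = rollout_reward(node.prefix)
        for n in path:
            n.visits += 1
            n.total += reward
    # Greedy extraction of the most-visited action sequence.
    node, best = root, []
    while node.children:
        bit, node = max(node.children.items(), key=lambda kv: kv[1].visits)
        best.append(bit)
    return best

print(mcts())                      # usually recovers TARGET = [1, 0, 1, 1, 0]
```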

Proceedings ArticleDOI
01 Oct 2017
TL;DR: The RNN Tree (RNN-T) is an adaptive learning framework for skeleton-based human action recognition that effectively addresses two main challenges of large-scale action recognition: distinguishing fine-grained action classes that are intractable using a single network, and adapting to new action classes by augmenting an existing model.
Abstract: In this work, we present the RNN Tree (RNN-T), an adaptive learning framework for skeleton-based human action recognition. Our method categorizes action classes and uses multiple Recurrent Neural Networks (RNNs) in a tree-like hierarchy. The RNNs in RNN-T are co-trained with the action category hierarchy, which determines the structure of RNN-T. Actions in skeletal representations are recognized via a hierarchical inference process, during which individual RNNs differentiate finer-grained action classes with increasing confidence. Inference in RNN-T ends when any RNN in the tree recognizes the action with high confidence, or a leaf node is reached. RNN-T effectively addresses two main challenges of large-scale action recognition: (i) it is able to distinguish fine-grained action classes that are intractable using a single network, and (ii) it is adaptive to new action classes by augmenting an existing model. We demonstrate the effectiveness of the RNN-T/ACH method and compare it with the state-of-the-art methods on a large-scale dataset and several existing benchmarks.


Journal ArticleDOI
TL;DR: Novel model transfer-learning methods are proposed that refine a decision forest model learned in a source domain using target-domain data, including combining the refined forests in an ensemble that contains the union of the two forests; the methods exhibit impressive experimental results over a range of problems.
Abstract: We propose novel model transfer-learning methods that refine a decision forest model $M$ learned within a “source” domain using a training set sampled from a “target” domain, assumed to be a variation of the source. We present two random forest transfer algorithms. The first algorithm searches greedily for locally optimal modifications of each tree structure by trying to locally expand or reduce the tree around individual nodes. The second algorithm does not modify structure, but only the parameter (thresholds) associated with decision nodes. We also propose to combine both methods by considering an ensemble that contains the union of the two forests. The proposed methods exhibit impressive experimental results over a range of problems.
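A crude stand-in for the third idea, the union ensemble, is sketched below on synthetic source and target data; no structure or threshold refinement is performed, unlike the paper's first two algorithms, and averaging scikit-learn class probabilities is simply an assumed way to combine the two forests.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
# Source domain and a small, slightly shifted target domain (synthetic stand-ins).
Xs = rng.normal(size=(1000, 5)); ys = (Xs[:, 0] > 0).astype(int)
Xt = rng.normal(loc=0.3, size=(80, 5)); yt = (Xt[:, 0] > 0.3).astype(int)

source_forest = RandomForestClassifier(random_state=0).fit(Xs, ys)
target_forest = RandomForestClassifier(random_state=0).fit(Xt, yt)

def union_predict(X):
    """Average the two forests' class probabilities -- a rough stand-in for an
    ensemble that contains the union of the source and target-refined forests."""
    p = (source_forest.predict_proba(X) + target_forest.predict_proba(X)) / 2
    return p.argmax(axis=1)

print("target-domain accuracy:", (union_predict(Xt) == yt).mean())
```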
