Journal ArticleDOI

AEGCN: An Autoencoder-Constrained Graph Convolutional Network

07 Apr 2021-Neurocomputing (Elsevier)-Vol. 432, pp 21-31
TL;DR: In this paper, an autoencoder-constrained graph convolutional network is proposed to solve the node classification task on graph domains, where the hidden layers are constrained by an autoencoder.

About: This article is published in Neurocomputing. The article was published on 2021-04-07 and is currently open access. It has received 14 citations to date. The article focuses on the topics: Autoencoder & Adjacency matrix.
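The TL;DR describes the core idea: a standard GCN for node classification whose hidden representation is additionally constrained by an autoencoder reconstruction term. A minimal, hypothetical sketch of that idea follows; the layer sizes, loss form, and module names are assumptions for illustration, not the authors' implementation.

```python
# Hypothetical sketch of an autoencoder-constrained GCN (not the AEGCN authors' code).
# Assumption: the hidden GCN representation H is fed through a small autoencoder and its
# reconstruction error is added to the node-classification loss as a constraint.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AEGCNSketch(nn.Module):
    def __init__(self, in_dim, hid_dim, n_classes):
        super().__init__()
        self.w1 = nn.Linear(in_dim, hid_dim)          # first graph-convolution weights
        self.w2 = nn.Linear(hid_dim, n_classes)       # second graph-convolution weights
        self.enc = nn.Linear(hid_dim, hid_dim // 2)   # autoencoder encoder
        self.dec = nn.Linear(hid_dim // 2, hid_dim)   # autoencoder decoder

    def forward(self, x, a_norm):
        # a_norm: normalized adjacency matrix D^-1/2 (A + I) D^-1/2, dense for simplicity
        h = F.relu(a_norm @ self.w1(x))               # hidden GCN layer
        recon = self.dec(F.relu(self.enc(h)))         # autoencoder reconstruction of h
        logits = a_norm @ self.w2(h)                  # output GCN layer
        return logits, F.mse_loss(recon, h)           # class logits + autoencoder constraint

# Usage (sketch): loss = cross_entropy(logits[train_mask], y[train_mask]) + lam * ae_loss
```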
Citations
Journal ArticleDOI
TL;DR: Wang et al. proposed a novel sub-label learning mechanism (SLLM) that exploits the structural connectivity of the original sample space to guide distribution alignment, thereby enhancing domain adaptability.

12 citations

Journal ArticleDOI
Mingyuan Ma, Sen Na, Hongyu Wang, Congzhou Chen, Jin Xu
TL;DR: A graph-based behavior-aware network is proposed that simultaneously considers six different types of behaviors as well as the user's demand for news diversity, and recommends news to different users according to their levels of concentration.
Abstract: Interactive news recommendation has been launched and has attracted much attention recently. In this scenario, user behavior evolves from single click behavior to multiple behaviors including like, comment, and share. However, most existing methods still use single click behavior as the sole criterion for judging user preferences. Further, although heterogeneous graphs have been applied in different areas, a proper way to construct a heterogeneous graph for interactive news data, with an appropriate learning mechanism on it, is still desired. To address these concerns, we propose a graph-based behavior-aware network, which simultaneously considers six different types of behaviors as well as the user's demand for news diversity. We have three main steps. First, we build an interaction behavior graph for multi-level and multi-category data. Second, we apply DeepWalk on the behavior graph to obtain entity semantics, then build a graph-based convolutional neural network called G-CNN to learn news representations, and an attention-based LSTM to learn behavior sequence representations. Third, we introduce core and coritivity features for the behavior graph, which measure the concentration degree of the user's interests. These features govern the trade-off between accuracy and diversity in our personalized recommendation system. Taking these features into account, our system recommends news to different users according to their levels of concentration.
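As a rough illustration of the second step (the attention-based LSTM over behavior sequences), the sketch below encodes a sequence of behavior embeddings with an LSTM and attention pooling. It is a hypothetical reading of the abstract, not the paper's G-CNN/LSTM implementation; dimensions and names are assumptions.

```python
# Hypothetical sketch of an attention-pooled LSTM encoder for behavior sequences
# (illustrative only; not the paper's implementation).
import torch
import torch.nn as nn

class AttentionLSTMEncoder(nn.Module):
    def __init__(self, emb_dim=64, hid_dim=128):
        super().__init__()
        self.lstm = nn.LSTM(emb_dim, hid_dim, batch_first=True)
        self.attn = nn.Linear(hid_dim, 1)              # scores each time step

    def forward(self, behavior_emb):                   # (batch, seq_len, emb_dim)
        h, _ = self.lstm(behavior_emb)                 # (batch, seq_len, hid_dim)
        w = torch.softmax(self.attn(h), dim=1)         # attention weights over time steps
        return (w * h).sum(dim=1)                      # (batch, hid_dim) sequence representation
```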

9 citations

Journal ArticleDOI
TL;DR: Wang et al. proposed a novel method for traffic flow prediction based on a Spatial-Temporal Complex Graph Convolution Network (ST-CGCN), which constructs a distance matrix, a data correlation matrix, and a comfort measurement matrix from the geographical locations, historical data records, and external interference between traffic nodes.
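One way to read the three matrices mentioned in the TL;DR is sketched below: a distance matrix from node coordinates, a correlation matrix from historical records, and a normalized external-interference matrix, fused into a single adjacency for graph convolution. The formulas and fusion weights are assumptions, not the paper's definitions.

```python
# Hypothetical construction of the distance, data-correlation, and comfort/interference
# matrices with a simple weighted fusion (not the ST-CGCN authors' formulas).
import numpy as np

def build_fused_adjacency(coords, history, interference, alpha=(1.0, 1.0, 1.0)):
    # coords: (n, 2) node locations; history: (t, n) traffic records; interference: (n, n)
    dist = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=-1)
    a_dist = np.exp(-dist / (dist.std() + 1e-8))         # closer nodes get larger weights
    a_corr = np.abs(np.corrcoef(history.T))              # correlation of historical series
    a_ext = interference / (interference.max() + 1e-8)   # normalized external factor
    fused = alpha[0] * a_dist + alpha[1] * a_corr + alpha[2] * a_ext
    np.fill_diagonal(fused, 0.0)                         # drop self-loops; add later if needed
    return fused
```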

6 citations

Journal ArticleDOI
TL;DR: In this paper, the authors proposed a graph-attention-based image classification model for brain tumor detection using MRI and X-ray images, which achieved accuracies of 98.27% on the first dataset, 99.83% on the second, and 98.78% on the third, outperforming state-of-the-art CNN models.

4 citations

Journal ArticleDOI
TL;DR: In this paper, the authors propose to limit the anomaly reconstruction capability of an autoencoder (AE) by incorporating pseudo anomalies during training, and show that the training mechanism is robust to the kind of pseudo anomaly used.
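A minimal sketch of the idea in the TL;DR appears below: alongside normal samples, the autoencoder is fed pseudo anomalies (here crudely generated by adding noise) and is trained to map them back toward normal data, so anomalous patterns cannot be reconstructed well at test time. The pseudo-anomaly generator and loss weighting are assumptions, not the paper's method.

```python
# Hypothetical training step for an autoencoder constrained with pseudo anomalies
# (illustrative only; the paper's pseudo-anomaly generation differs).
import torch
import torch.nn.functional as F

def training_step(ae, x_normal, lam=0.1):
    x_pseudo = x_normal + 0.5 * torch.randn_like(x_normal)   # crude pseudo anomaly
    loss_normal = F.mse_loss(ae(x_normal), x_normal)          # reconstruct normal data well
    loss_pseudo = F.mse_loss(ae(x_pseudo), x_normal)          # pull pseudo anomalies toward normal
    return loss_normal + lam * loss_pseudo
```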

1 citation

References
Proceedings Article
03 Dec 2012
TL;DR: This paper describes a deep convolutional neural network that achieved state-of-the-art performance on ImageNet classification; it consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Abstract: We trained a large, deep convolutional neural network to classify the 1.2 million high-resolution images in the ImageNet LSVRC-2010 contest into the 1000 different classes. On the test data, we achieved top-1 and top-5 error rates of 37.5% and 17.0% which is considerably better than the previous state-of-the-art. The neural network, which has 60 million parameters and 650,000 neurons, consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax. To make training faster, we used non-saturating neurons and a very efficient GPU implementation of the convolution operation. To reduce overfitting in the fully-connected layers we employed a recently-developed regularization method called "dropout" that proved to be very effective. We also entered a variant of this model in the ILSVRC-2012 competition and achieved a winning top-5 test error rate of 15.3%, compared to 26.2% achieved by the second-best entry.
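The architecture described above (five convolutional layers interleaved with max-pooling, three fully-connected layers with dropout, and a 1000-way output) can be sketched as follows. Layer sizes follow the commonly cited AlexNet layout and may differ in detail from the original two-GPU implementation.

```python
# Sketch of an AlexNet-style network as described in the abstract (single-GPU layout).
import torch.nn as nn

class AlexNetSketch(nn.Module):
    def __init__(self, num_classes=1000):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 96, kernel_size=11, stride=4, padding=2), nn.ReLU(inplace=True),
            nn.MaxPool2d(3, stride=2),
            nn.Conv2d(96, 256, kernel_size=5, padding=2), nn.ReLU(inplace=True),
            nn.MaxPool2d(3, stride=2),
            nn.Conv2d(256, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(384, 384, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(384, 256, kernel_size=3, padding=1), nn.ReLU(inplace=True),
            nn.MaxPool2d(3, stride=2),
        )
        self.classifier = nn.Sequential(
            nn.Dropout(0.5), nn.Linear(256 * 6 * 6, 4096), nn.ReLU(inplace=True),
            nn.Dropout(0.5), nn.Linear(4096, 4096), nn.ReLU(inplace=True),
            nn.Linear(4096, num_classes),   # softmax is applied inside the loss
        )

    def forward(self, x):                   # x: (batch, 3, 224, 224)
        return self.classifier(self.features(x).flatten(1))
```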

73,978 citations

Proceedings Article
12 Jun 2017
TL;DR: This paper proposes a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.
Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder and decoder configuration. The best performing such models also connect the encoder and decoder through an attention mechanism. We propose a novel, simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our single model with 165 million parameters achieves 27.5 BLEU on English-to-German translation, improving over the existing best ensemble result by over 1 BLEU. On English-to-French translation, we outperform the previous single state-of-the-art model by 0.7 BLEU, achieving a BLEU score of 41.1.
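At the core of the attention-only architecture described above is scaled dot-product attention; a minimal sketch is shown below. The full model adds multi-head projections, residual connections, layer normalization, and position-wise feed-forward layers.

```python
# Scaled dot-product attention, the building block of the attention-only architecture.
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    # q, k, v: (batch, heads, seq_len, d_k)
    scores = q @ k.transpose(-2, -1) / math.sqrt(q.size(-1))
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)   # attention distribution over key positions
    return weights @ v                        # weighted sum of value vectors
```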

52,856 citations

Journal Article
TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Abstract: We present a new technique called “t-SNE” that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map. The technique is a variation of Stochastic Neighbor Embedding (Hinton and Roweis, 2002) that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map. t-SNE is better than existing techniques at creating a single map that reveals structure at many different scales. This is particularly important for high-dimensional data that lie on several different, but related, low-dimensional manifolds, such as images of objects from multiple classes seen from multiple viewpoints. For visualizing the structure of very large datasets, we show how t-SNE can use random walks on neighborhood graphs to allow the implicit structure of all of the data to influence the way in which a subset of the data is displayed. We illustrate the performance of t-SNE on a wide variety of datasets and compare it with many other non-parametric visualization techniques, including Sammon mapping, Isomap, and Locally Linear Embedding. The visualizations produced by t-SNE are significantly better than those produced by the other techniques on almost all of the datasets.
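For reference, the two- or three-dimensional map described above can be produced with scikit-learn's t-SNE implementation; the dataset and parameter values below are illustrative, not the paper's settings.

```python
# Typical t-SNE usage via scikit-learn (Barnes-Hut implementation, not the paper's code).
from sklearn.datasets import load_digits
from sklearn.manifold import TSNE

X, y = load_digits(return_X_y=True)                        # 64-dimensional digit images
emb = TSNE(n_components=2, perplexity=30, init="pca",
           random_state=0).fit_transform(X)                 # (n_samples, 2) map coordinates
```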

30,124 citations

Posted Content
TL;DR: This paper proposed two novel model architectures for computing continuous vector representations of words from very large data sets, and the quality of these representations is measured in a word similarity task and the results are compared to the previously best performing techniques based on different types of neural networks.
Abstract: We propose two novel model architectures for computing continuous vector representations of words from very large data sets. The quality of these representations is measured in a word similarity task, and the results are compared to the previously best performing techniques based on different types of neural networks. We observe large improvements in accuracy at much lower computational cost, i.e. it takes less than a day to learn high quality word vectors from a 1.6 billion words data set. Furthermore, we show that these vectors provide state-of-the-art performance on our test set for measuring syntactic and semantic word similarities.
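The continuous word representations described above are commonly trained with gensim's Word2Vec; the toy corpus and hyperparameters below are illustrative only.

```python
# Training skip-gram word vectors with gensim (>= 4.0), as an illustration of the technique.
from gensim.models import Word2Vec

sentences = [["graph", "convolutional", "network"],
             ["autoencoder", "constrained", "network"]]
model = Word2Vec(sentences, vector_size=50, window=2, min_count=1, sg=1)
print(model.wv.most_similar("network", topn=2))            # nearest words by cosine similarity
```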

20,077 citations

Posted Content
TL;DR: A scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs which outperforms related methods by a significant margin.
Abstract: We present a scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs. We motivate the choice of our convolutional architecture via a localized first-order approximation of spectral graph convolutions. Our model scales linearly in the number of graph edges and learns hidden layer representations that encode both local graph structure and features of nodes. In a number of experiments on citation networks and on a knowledge graph dataset we demonstrate that our approach outperforms related methods by a significant margin.
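The layer-wise propagation rule behind the model described above is H' = sigma(D^-1/2 (A + I) D^-1/2 H W); a dense sketch follows (the reference implementation uses sparse operations).

```python
# Dense sketch of a single GCN layer: H' = ReLU(D^-1/2 (A + I) D^-1/2 H W).
import numpy as np

def gcn_layer(a, h, w):
    a_hat = a + np.eye(a.shape[0])                      # add self-loops
    d_inv_sqrt = np.diag(1.0 / np.sqrt(a_hat.sum(1)))   # D^-1/2
    return np.maximum(d_inv_sqrt @ a_hat @ d_inv_sqrt @ h @ w, 0.0)  # ReLU activation
```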

15,696 citations