
Showing papers on "Unsupervised learning" published in 2021


Journal ArticleDOI
TL;DR: An extensive review of deep learning-based self-supervised general visual feature learning methods from images and videos; as a subset of unsupervised learning, these methods learn general image and video features from large-scale unlabeled data without using any human-annotated labels.
Abstract: Large-scale labeled data are generally required to train deep neural networks in order to obtain good performance in visual feature learning from images or videos for computer vision applications. To avoid the extensive cost of collecting and annotating large-scale datasets, self-supervised learning methods, a subset of unsupervised learning methods, have been proposed to learn general image and video features from large-scale unlabeled data without using any human-annotated labels. This paper provides an extensive review of deep learning-based self-supervised general visual feature learning methods from images or videos. First, the motivation, general pipeline, and terminology of this field are described. Then the common deep neural network architectures used for self-supervised learning are summarized. Next, the schema and evaluation metrics of self-supervised learning methods are reviewed, followed by the commonly used datasets for images, videos, audio, and 3D data, as well as the existing self-supervised visual feature learning methods. Quantitative performance comparisons of the reviewed methods on benchmark datasets are then summarized and discussed for both image and video feature learning. Finally, the paper concludes with a set of promising future directions for self-supervised visual feature learning.

876 citations


Journal ArticleDOI
TL;DR: This paper uses unsupervised learning to train a deep contextual language model on 86 billion amino acids across 250 million protein sequences spanning evolutionary diversity; the resulting model contains information about biological properties in its representations.
Abstract: In the field of artificial intelligence, a combination of scale in data and model capacity enabled by unsupervised learning has led to major advances in representation learning and statistical generation. In the life sciences, the anticipated growth of sequencing promises unprecedented data on natural sequence diversity. Protein language modeling at the scale of evolution is a logical step toward predictive and generative artificial intelligence for biology. To this end, we use unsupervised learning to train a deep contextual language model on 86 billion amino acids across 250 million protein sequences spanning evolutionary diversity. The resulting model contains information about biological properties in its representations. The representations are learned from sequence data alone. The learned representation space has a multiscale organization reflecting structure from the level of biochemical properties of amino acids to remote homology of proteins. Information about secondary and tertiary structure is encoded in the representations and can be identified by linear projections. Representation learning produces features that generalize across a range of applications, enabling state-of-the-art supervised prediction of mutational effect and secondary structure and improving state-of-the-art features for long-range contact prediction.
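The core pretext task behind such protein language models is masked-token prediction over amino acid sequences. Below is a minimal PyTorch sketch of that idea; the toy sequences, two-layer transformer, and 15% masking rate are illustrative assumptions, not the authors' actual model or training setup.

    import torch
    import torch.nn as nn

    AA = "ACDEFGHIKLMNPQRSTVWY"                      # the 20 standard amino acids
    stoi = {a: i for i, a in enumerate(AA)}
    MASK = len(AA)                                   # extra token id used for masking
    VOCAB = len(AA) + 1

    class TinyProteinLM(nn.Module):
        def __init__(self, d=64, heads=4, layers=2):
            super().__init__()
            self.emb = nn.Embedding(VOCAB, d)
            enc = nn.TransformerEncoderLayer(d, heads, dim_feedforward=128, batch_first=True)
            self.encoder = nn.TransformerEncoder(enc, layers)
            self.head = nn.Linear(d, VOCAB)
        def forward(self, x):
            return self.head(self.encoder(self.emb(x)))

    def mask_tokens(seq_ids, p=0.15):
        # Replace ~15% of residues with MASK; the loss is computed only at masked positions.
        ids = seq_ids.clone()
        masked = torch.rand(ids.shape) < p
        targets = torch.where(masked, ids, torch.full_like(ids, -100))   # -100 is ignored by CE
        ids[masked] = MASK
        return ids, targets

    model = TinyProteinLM()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    seqs = ["MKTAYIAKQR", "GAVLIPFYWS"]              # toy stand-ins for real protein sequences
    batch = torch.tensor([[stoi[a] for a in s] for s in seqs])
    inp, tgt = mask_tokens(batch)
    loss = nn.functional.cross_entropy(model(inp).transpose(1, 2), tgt, ignore_index=-100)
    loss.backward()
    opt.step()

After training at scale, the hidden states of such an encoder serve as the learned representations that linear projections can probe for structural information.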

700 citations


Proceedings ArticleDOI
19 Apr 2021
TL;DR: This paper proposes a novel graph contrastive representation learning method with adaptive augmentation that incorporates various priors for the topological and semantic aspects of the graph; the method consistently outperforms existing state-of-the-art baselines and even surpasses some supervised counterparts.
Abstract: Recently, contrastive learning (CL) has emerged as a successful method for unsupervised graph representation learning. Most graph CL methods first perform stochastic augmentation on the input graph to obtain two graph views and maximize the agreement of representations in the two views. Despite the prosperous development of graph CL methods, the design of graph augmentation schemes—a crucial component in CL—remains rarely explored. We argue that the data augmentation schemes should preserve intrinsic structures and attributes of graphs, which will force the model to learn representations that are insensitive to perturbation on unimportant nodes and edges. However, most existing methods adopt uniform data augmentation schemes, like uniformly dropping edges and uniformly shuffling features, leading to suboptimal performance. In this paper, we propose a novel graph contrastive representation learning method with adaptive augmentation that incorporates various priors for topological and semantic aspects of the graph. Specifically, on the topology level, we design augmentation schemes based on node centrality measures to highlight important connective structures. On the node attribute level, we corrupt node features by adding more noise to unimportant node features, to enforce the model to recognize underlying semantic information. We perform extensive experiments of node classification on a variety of real-world datasets. Experimental results demonstrate that our proposed method consistently outperforms existing state-of-the-art baselines and even surpasses some supervised counterparts, which validates the effectiveness of the proposed contrastive framework with adaptive augmentation.
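As a rough illustration of the topology-level scheme described above, the following sketch drops edges with probability inversely related to the degree centrality of their endpoints, so edges attached to important nodes are more likely to survive the augmentation. The normalization, the cap p_max, and the toy graph are assumptions for illustration; this is not the authors' exact formulation.

    import numpy as np

    def adaptive_edge_drop(edges, num_nodes, p_max=0.7, seed=0):
        """Drop edges with probability inversely related to endpoint degree centrality,
        so that structurally important edges are more likely to survive the augmented view."""
        rng = np.random.default_rng(seed)
        edges = np.asarray(edges)
        deg = np.bincount(edges.ravel(), minlength=num_nodes).astype(float)
        # Edge importance: log-degree centrality averaged over the two endpoints.
        imp = 0.5 * (np.log1p(deg[edges[:, 0]]) + np.log1p(deg[edges[:, 1]]))
        # Normalize so unimportant edges get drop probabilities close to p_max.
        drop_prob = (imp.max() - imp) / (imp.max() - imp.mean() + 1e-9)
        drop_prob = np.minimum(drop_prob * p_max, p_max)
        keep = rng.random(len(edges)) > drop_prob
        return edges[keep]

    # Toy graph: node 0 is a hub, so edges incident to it tend to be preserved across views.
    edges = [(0, 1), (0, 2), (0, 3), (0, 4), (3, 4), (1, 2)]
    view_1 = adaptive_edge_drop(edges, num_nodes=5, seed=1)
    view_2 = adaptive_edge_drop(edges, num_nodes=5, seed=2)
    print(view_1, view_2, sep="\n")

Two such views would then be encoded and pulled together by a contrastive objective, which is the part this sketch omits.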

359 citations


Journal ArticleDOI
TL;DR: A convolutional autoencoder deep learning framework that supports unsupervised image feature learning for lung nodules from unlabeled data, requiring only a small amount of labeled data for efficient feature learning.
Abstract: At present, computed tomography (CT) is widely used to assist disease diagnosis. In particular, computer-aided diagnosis (CAD) based on artificial intelligence (AI) has recently demonstrated its importance in intelligent healthcare. However, it is a great challenge to establish an adequate labeled dataset for CT analysis assistance, due to privacy and security issues. Therefore, this paper proposes a convolutional autoencoder deep learning framework to support unsupervised image feature learning for lung nodules from unlabeled data, which needs only a small amount of labeled data for efficient feature learning. Comprehensive experiments show that the proposed scheme is superior to other approaches and effectively addresses the labor-intensive problem of manual image labeling. Moreover, the experiments verify that the proposed convolutional autoencoder approach can be extended to similarity measurement of lung nodule images. Notably, the features extracted through unsupervised learning are also applicable in other related scenarios.
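A minimal sketch of such a convolutional autoencoder is given below: it is trained on unlabeled patches with a reconstruction loss, and the bottleneck features can later feed a small classifier trained on the few labeled nodules. The patch size, channel counts, and random stand-in data are assumptions, not the paper's architecture.

    import torch
    import torch.nn as nn

    class ConvAE(nn.Module):
        def __init__(self):
            super().__init__()
            self.encoder = nn.Sequential(                 # 1x32x32 patch -> 32x8x8 feature map
                nn.Conv2d(1, 16, 3, stride=2, padding=1), nn.ReLU(),
                nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU())
            self.decoder = nn.Sequential(                 # mirror of the encoder
                nn.ConvTranspose2d(32, 16, 4, stride=2, padding=1), nn.ReLU(),
                nn.ConvTranspose2d(16, 1, 4, stride=2, padding=1), nn.Sigmoid())
        def forward(self, x):
            z = self.encoder(x)
            return self.decoder(z), z

    model = ConvAE()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    patches = torch.rand(8, 1, 32, 32)                    # stand-in for unlabeled CT nodule patches
    recon, features = model(patches)
    loss = nn.functional.mse_loss(recon, patches)          # reconstruction objective, no labels needed
    loss.backward()
    opt.step()
    # `features` (flattened) could then feed a small classifier trained on the few labeled nodules.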

345 citations


Journal ArticleDOI
TL;DR: A survey of MTL that classifies different MTL algorithms into several categories, including the feature learning approach, low-rank approach, task clustering approach, task relation learning approach, and decomposition approach, and then discusses the characteristics of each approach.
Abstract: Multi-Task Learning (MTL) is a learning paradigm in machine learning and its aim is to leverage useful information contained in multiple related tasks to help improve the generalization performance of all the tasks. In this paper, we give a survey for MTL from the perspective of algorithmic modeling, applications, and theoretical analyses. For algorithmic modeling, we give a definition of MTL and then classify different MTL algorithms into five categories, including feature learning approach, low-rank approach, task clustering approach, task relation learning approach, and decomposition approach, as well as discussing the characteristics of each approach. In order to improve the performance of learning tasks further, MTL can be combined with other learning paradigms including semi-supervised learning, active learning, unsupervised learning, reinforcement learning, multi-view learning, and graphical models. When the number of tasks is large or the data dimensionality is high, we review online, parallel, and distributed MTL models as well as dimensionality reduction and feature hashing to reveal their computational and storage advantages. Many real-world applications use MTL to boost their performance and we review representative works. Finally, we present theoretical analyses and discuss several future directions for MTL.

305 citations


Journal ArticleDOI
TL;DR: This work proposes an approach for semi-supervised semantic segmentation that learns from limited pixel-wise annotated samples while exploiting additional annotation-free images, and achieves significant improvement over existing methods, especially when trained with very few labeled samples.
Abstract: The ability to understand visual information from limited labeled data is an important aspect of machine learning. While image-level classification has been extensively studied in a semi-supervised setting, dense pixel-level classification with limited data has only drawn attention recently. In this work, we propose an approach for semi-supervised semantic segmentation that learns from limited pixel-wise annotated samples while exploiting additional annotation-free images. The proposed approach relies on adversarial training with a feature matching loss to learn from unlabeled images. It uses two network branches that link semi-supervised classification with semi-supervised segmentation including self-training. The dual-branch approach reduces both the low-level and the high-level artifacts typical when training with few labels. The approach attains significant improvement over existing methods, especially when trained with very few labeled samples. On several standard benchmarks—PASCAL VOC 2012, PASCAL-Context, and Cityscapes—the approach achieves new state-of-the-art in semi-supervised learning.

255 citations


Journal ArticleDOI
TL;DR: Online learning, as described in this article, is a family of machine learning methods in which a learner attempts to tackle some predictive (or other decision-making) task by learning from a sequence of data instances one at a time.

234 citations


Journal ArticleDOI
TL;DR: In this article, Artificial Neural Networks and deep learning algorithms have been implemented in several drug discovery processes such as peptide synthesis, structure-based virtual screening, ligand-based screening, toxicity prediction, drug monitoring and release, pharmacophore modeling, quantitative structure-activity relationship, drug repositioning, polypharmacology, and physiochemical activity.
Abstract: Drug design and development is an important area of research for pharmaceutical companies and chemical scientists. However, low efficacy, off-target delivery, time consumption, and high cost impose hurdles and challenges that impact drug design and discovery. Further, complex and big data from genomics, proteomics, microarray data, and clinical trials also pose an obstacle in the drug discovery pipeline. Artificial intelligence and machine learning technology play a crucial role in drug discovery and development; in particular, artificial neural networks and deep learning algorithms have modernized the area. Machine learning and deep learning algorithms have been implemented in several drug discovery processes such as peptide synthesis, structure-based virtual screening, ligand-based virtual screening, toxicity prediction, drug monitoring and release, pharmacophore modeling, quantitative structure-activity relationship, drug repositioning, polypharmacology, and physiochemical activity. Evidence from the past strengthens the case for implementing artificial intelligence and deep learning in this field. Moreover, novel data mining, curation, and management techniques provide critical support to recently developed modeling algorithms. In summary, advancements in artificial intelligence and deep learning provide an excellent opportunity for the rational drug design and discovery process, which will eventually impact mankind. The primary concerns associated with drug design and development are time consumption and production cost. Further, inefficiency, inaccurate target delivery, and inappropriate dosage are other hurdles that inhibit the process of drug delivery and development. With advancements in technology, computer-aided drug design integrating artificial intelligence algorithms can eliminate the challenges and hurdles of traditional drug design and development. Artificial intelligence can be regarded as a superset comprising machine learning, whereas machine learning comprises supervised learning, unsupervised learning, and reinforcement learning. Further, deep learning, a subset of machine learning, has been extensively implemented in drug design and development. Artificial neural networks, deep neural networks, support vector machines, classification and regression, generative adversarial networks, symbolic learning, and meta-learning are examples of the algorithms applied to the drug design and discovery process. Artificial intelligence has been applied to different areas of the drug design and development process, from peptide synthesis to molecule design, virtual screening to molecular docking, quantitative structure-activity relationship to drug repositioning, protein misfolding to protein-protein interactions, and molecular pathway identification to polypharmacology. Artificial intelligence principles have been applied to the classification of active and inactive compounds, monitoring of drug release, pre-clinical and clinical development, primary and secondary drug screening, biomarker development, pharmaceutical manufacturing, bioactivity identification, physiochemical property prediction, toxicity prediction, and identification of mode of action.

211 citations


Journal ArticleDOI
04 Mar 2021
TL;DR: In this paper, the authors discuss the potential of applying supervised/unsupervised deep learning and deep reinforcement learning in ultrareliable and low-latency communications (URLLCs) in future 6G networks.
Abstract: As one of the key communication scenarios in the fifth-generation and also the sixth-generation (6G) mobile communication networks, ultrareliable and low-latency communications (URLLCs) will be central for the development of various emerging mission-critical applications. State-of-the-art mobile communication systems do not fulfill the end-to-end delay and overall reliability requirements of URLLCs. In particular, a holistic framework that takes into account latency, reliability, availability, scalability, and decision-making under uncertainty is lacking. Driven by recent breakthroughs in deep neural networks, deep learning algorithms have been considered as promising ways of developing enabling technologies for URLLCs in future 6G networks. This tutorial illustrates how domain knowledge (models, analytical tools, and optimization frameworks) of communications and networking can be integrated into different kinds of deep learning algorithms for URLLCs. We first provide some background of URLLCs and review promising network architectures and deep learning frameworks for 6G. To better illustrate how to improve learning algorithms with domain knowledge, we revisit model-based analytical tools and cross-layer optimization frameworks for URLLCs. Following this, we examine the potential of applying supervised/unsupervised deep learning and deep reinforcement learning in URLLCs and summarize related open problems. Finally, we provide simulation and experimental results to validate the effectiveness of different learning algorithms and discuss future directions.

203 citations


Journal ArticleDOI
TL;DR: This study reviews the role of machine learning applications and algorithms in investigating and addressing various aspects of COVID-19.
Abstract: Today the world's attention is focused on coronavirus disease (COVID-19), even though this pandemic disease is not unique. The purpose of this study is to determine the role of machine-learning applications and algorithms in investigating and addressing various purposes related to COVID-19. We reviewed studies published during 2020 that were related to this topic by searching Science Direct, Springer, Hindawi, and MDPI, using COVID-19, machine learning, supervised learning, and unsupervised learning as keywords. A total of 16,306 articles were retrieved, but after screening, only 14 of them were included in this study. Our findings show that machine learning can play an important role in COVID-19 investigation, prediction, and discrimination. In conclusion, machine learning can be incorporated into health provider programs and plans to assess and triage COVID-19 cases. Supervised learning showed better results than unsupervised learning algorithms, achieving 92.9% testing accuracy. In the future, recurrent supervised learning can be utilized for superior accuracy.

202 citations


Journal ArticleDOI
TL;DR: This article proposes an ensemble classification model for fake news detection that achieves better accuracy than the state-of-the-art.

Journal ArticleDOI
TL;DR: In this article, the authors explore different integrative machine learning methods which have been used to provide an in-depth understanding of biological systems during normal physiological functioning and in the presence of a disease.

Journal ArticleDOI
TL;DR: In this paper, a deep autoencoder based energy method (DAEM) is proposed for the bending, vibration, and buckling analysis of Kirchhoff plates, where the objective is to minimize the total potential energy.
Abstract: In this paper, we present a deep autoencoder based energy method (DAEM) for the bending, vibration and buckling analysis of Kirchhoff plates. The DAEM exploits the higher order continuity of the DAEM and integrates a deep autoencoder and the minimum total potential principle in one framework, yielding an unsupervised feature learning method. The DAEM is a specific type of feedforward deep neural network (DNN) and can also serve as a function approximator. With robust feature extraction capacity, the DAEM can more efficiently identify patterns behind the whole energy system, such as the field variables, natural frequency, and critical buckling load factor studied in this paper. The objective function is to minimize the total potential energy. The DAEM performs unsupervised learning based on collocation points generated inside the physical domain so that the total potential energy is minimized at all points. For the vibration and buckling analysis, the loss function is constructed based on Rayleigh's principle, and the fundamental frequency and the critical buckling load are extracted. A scaled hyperbolic tangent activation function for the underlying mechanical model is presented, which meets the continuity requirement and alleviates the gradient vanishing/exploding problems under bending. The DAEM is implemented using PyTorch and the L-BFGS optimizer. To further improve the computational efficiency and enhance the generality of this machine learning method, we employ transfer learning. A comprehensive study of the DAEM configuration is performed for several numerical examples with various geometries, load conditions, and boundary conditions.
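To make the energy-minimization idea concrete, here is a heavily simplified PyTorch sketch using the L-BFGS optimizer mentioned above. It minimizes the total potential energy of a simply supported Euler-Bernoulli beam on collocation points, rather than the authors' autoencoder-based Kirchhoff plate formulation, and assumes EI, the load q, and the span are all equal to 1.

    import torch

    torch.manual_seed(0)
    net = torch.nn.Sequential(torch.nn.Linear(1, 32), torch.nn.Tanh(),
                              torch.nn.Linear(32, 32), torch.nn.Tanh(),
                              torch.nn.Linear(32, 1))
    x = torch.linspace(0.0, 1.0, 101).reshape(-1, 1)       # collocation points in the domain
    x.requires_grad_(True)

    def deflection(pts):
        # Multiplying by x(1 - x) enforces w(0) = w(1) = 0 exactly (simply supported ends).
        return pts * (1.0 - pts) * net(pts)

    def potential_energy():
        w = deflection(x)
        dw = torch.autograd.grad(w.sum(), x, create_graph=True)[0]
        d2w = torch.autograd.grad(dw.sum(), x, create_graph=True)[0]
        strain = 0.5 * d2w ** 2                            # bending strain energy density (EI = 1)
        work = 1.0 * w                                     # work of a uniform load q = 1
        return torch.mean(strain - work)                   # approximate total potential energy

    opt = torch.optim.LBFGS(net.parameters(), max_iter=200)

    def closure():
        opt.zero_grad()
        energy = potential_energy()
        energy.backward()
        return energy

    opt.step(closure)
    # Analytic midspan deflection of this beam is 5/384, roughly 0.0130.
    print(float(deflection(torch.tensor([[0.5]]))))

The same pattern, a network ansatz plus an energy functional evaluated on collocation points, carries over to the plate problems in the paper, where the energy involves second derivatives in two spatial dimensions.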

Journal ArticleDOI
TL;DR: This Review provides a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicates likely directions for further developments in the field.
Abstract: Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms for dimensionality reduction, density estimation, clustering, and kinetic modeling. We divide our discussion into self-contained sections, each discussing a specific method. In each section, we briefly touch upon the mathematical and algorithmic foundations of the method, highlight its strengths and limitations, and describe the specific ways in which it has been used, or can be used, to analyze molecular simulation data.
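A typical minimal pipeline of the kind surveyed, a linear dimensionality reduction followed by clustering of simulation frames, might look like the scikit-learn sketch below; the synthetic two-state data and the choice of PCA plus k-means are generic illustrative assumptions rather than a method recommended by the review.

    import numpy as np
    from sklearn.decomposition import PCA
    from sklearn.cluster import KMeans

    rng = np.random.default_rng(0)
    # Stand-in for per-frame features of a trajectory (e.g., pairwise distances or dihedrals):
    # two synthetic metastable "states" in a 40-dimensional feature space.
    frames = np.vstack([rng.normal(0.0, 0.3, size=(500, 40)),
                        rng.normal(1.0, 0.3, size=(500, 40))])

    embedding = PCA(n_components=2).fit_transform(frames)          # low-dimensional projection
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(embedding)
    print(np.bincount(labels))                                     # frames assigned to each putative state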

Journal ArticleDOI
TL;DR: Deep learning (DL), a new data-driven technique, has attracted significantly increasing attention in the geophysical community, and the collision between DL and traditional methods has had an impact on conventional approaches.
Abstract: Recently, deep learning (DL), a new data-driven technique compared with conventional approaches, has attracted increasing attention in the geophysical community, resulting in many opportunities and challenges. DL has been proven to have the potential to predict complex system states accurately and relieve the “curse of dimensionality” in large temporal and spatial geophysical applications. We address the basic concepts, state-of-the-art literature, and future trends by reviewing DL approaches in various geoscience scenarios. Exploration geophysics, earthquakes, and remote sensing are the main focuses. More applications, including Earth structure, water resources, atmospheric science, and space science, are also reviewed. Additionally, the difficulties of applying DL in the geophysical community are discussed. The trends of DL in geophysics in recent years are analyzed. Several promising directions are provided for future research involving DL in geophysics, such as unsupervised learning, transfer learning, multimodal DL, federated learning, uncertainty estimation, and active learning. A coding tutorial and a summary of tips for rapidly exploring DL are presented for beginners and interested readers of geophysics.

Journal ArticleDOI
TL;DR: Neural Image Compression (NIC) as discussed by the authors is a two-step method to build convolutional neural networks for gigapixel image analysis solely using weak image-level labels, avoiding the need for fine-grained manual annotations.
Abstract: We propose Neural Image Compression (NIC), a two-step method to build convolutional neural networks for gigapixel image analysis solely using weak image-level labels. First, gigapixel images are compressed using a neural network trained in an unsupervised fashion, retaining high-level information while suppressing pixel-level noise. Second, a convolutional neural network (CNN) is trained on these compressed image representations to predict image-level labels, avoiding the need for fine-grained manual annotations. We compared several encoding strategies, namely reconstruction error minimization, contrastive training and adversarial feature learning, and evaluated NIC on a synthetic task and two public histopathology datasets. We found that NIC can exploit visual cues associated with image-level labels successfully, integrating both global and local visual information. Furthermore, we visualized the regions of the input gigapixel images to which the CNN attended, and confirmed that they overlapped with annotations from human experts.
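The two-step structure can be sketched as follows: an unsupervised tile encoder produces one embedding per tile, the embeddings are re-assembled into a compressed grid, and a small CNN classifies that grid using only an image-level label. Tile size, embedding width, and the toy 8x8 grid are assumptions for illustration, not the NIC implementation.

    import torch
    import torch.nn as nn

    tile_encoder = nn.Sequential(                        # step 1: assumed pre-trained without labels
        nn.Conv2d(3, 16, 3, stride=2, padding=1), nn.ReLU(),
        nn.Conv2d(16, 32, 3, stride=2, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten())            # one 32-d code per tile

    def compress(tiles):
        # tiles: (rows, cols, 3, 64, 64) grid of RGB tiles cut from one gigapixel image
        r, c = tiles.shape[:2]
        with torch.no_grad():
            codes = tile_encoder(tiles.reshape(r * c, 3, 64, 64))
        return codes.reshape(r, c, -1).permute(2, 0, 1)   # (32, rows, cols) "compressed image"

    classifier = nn.Sequential(                           # step 2: trained with image-level labels only
        nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
        nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, 2))

    tiles = torch.rand(8, 8, 3, 64, 64)                   # toy 8x8 grid standing in for a whole slide
    logits = classifier(compress(tiles).unsqueeze(0))     # one prediction for the whole image
    print(logits.shape)                                   # torch.Size([1, 2])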

Journal ArticleDOI
TL;DR: This article shows that neural network models learned with deep unsupervised contrastive embedding methods achieve neural prediction accuracy in multiple ventral visual cortical areas that equals or exceeds that of models derived using today's best supervised methods, and that the mapping of these networks' hidden layers is neuroanatomically consistent across the ventral stream.
Abstract: Deep neural networks currently provide the best quantitative models of the response patterns of neurons throughout the primate ventral visual stream. However, such networks have remained implausible as a model of the development of the ventral stream, in part because they are trained with supervised methods requiring many more labels than are accessible to infants during development. Here, we report that recent rapid progress in unsupervised learning has largely closed this gap. We find that neural network models learned with deep unsupervised contrastive embedding methods achieve neural prediction accuracy in multiple ventral visual cortical areas that equals or exceeds that of models derived using today's best supervised methods and that the mapping of these neural network models' hidden layers is neuroanatomically consistent across the ventral stream. Strikingly, we find that these methods produce brain-like representations even when trained solely with real human child developmental data collected from head-mounted cameras, despite the fact that these datasets are noisy and limited. We also find that semisupervised deep contrastive embeddings can leverage small numbers of labeled examples to produce representations with substantially improved error-pattern consistency to human behavior. Taken together, these results illustrate a use of unsupervised learning to provide a quantitative model of a multiarea cortical brain system and present a strong candidate for a biologically plausible computational theory of primate sensory learning.

Proceedings ArticleDOI
14 Aug 2021
TL;DR: In this paper, an unsupervised pre-training scheme for multivariate time series representation learning based on the transformer encoder architecture is proposed, which can offer substantial performance benefits over fully supervised learning on downstream tasks, even without leveraging additional unlabeled data.
Abstract: We present a novel framework for multivariate time series representation learning based on the transformer encoder architecture. The framework includes an unsupervised pre-training scheme, which can offer substantial performance benefits over fully supervised learning on downstream tasks, both with and, notably, even without leveraging additional unlabeled data, i.e., by reusing the existing data samples. Evaluating our framework on several public multivariate time series datasets from various domains and with diverse characteristics, we demonstrate that it performs significantly better than the best currently available methods for regression and classification, even for datasets which consist of only a few hundred training samples. Given the pronounced interest in unsupervised learning for nearly all domains in the sciences and in industry, these findings represent an important landmark, presenting the first unsupervised method shown to push the limits of state-of-the-art performance for multivariate time series regression and classification.
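A common way to realize such unsupervised pre-training is masked-value reconstruction: hide a fraction of timesteps and train the transformer encoder to predict them. The sketch below follows that idea; the dimensions, masking rate, and model size are assumptions and do not reproduce the paper's exact scheme.

    import torch
    import torch.nn as nn

    class TSTransformer(nn.Module):
        def __init__(self, n_vars=6, d=64):
            super().__init__()
            self.inp = nn.Linear(n_vars, d)
            layer = nn.TransformerEncoderLayer(d, nhead=4, dim_feedforward=128, batch_first=True)
            self.encoder = nn.TransformerEncoder(layer, num_layers=2)
            self.out = nn.Linear(d, n_vars)
        def forward(self, x):                            # x: (batch, time, variables)
            return self.out(self.encoder(self.inp(x)))

    model = TSTransformer()
    opt = torch.optim.Adam(model.parameters(), lr=1e-3)
    x = torch.randn(16, 100, 6)                          # unlabeled multivariate segments
    mask = torch.rand(16, 100, 1) < 0.15                 # hide roughly 15% of timesteps
    pred = model(x * (~mask))                            # masked input
    loss = nn.functional.mse_loss(pred[mask.expand_as(x)], x[mask.expand_as(x)])
    opt.zero_grad()
    loss.backward()
    opt.step()
    # After pre-training, the encoder is reused with a small head for regression or classification.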

Journal ArticleDOI
TL;DR: In this article, the authors provide an overview of often used ideas and methods in image classification with fewer labels and compare 34 methods in detail based on their performance and their commonly used ideas rather than a fine-grained taxonomy.
Abstract: While deep learning strategies achieve outstanding results in computer vision tasks, one issue remains: The current strategies rely heavily on a huge amount of labeled data. In many real-world problems, it is not feasible to create such an amount of labeled training data. Therefore, it is common to incorporate unlabeled data into the training process to reach equal results with fewer labels. Due to a lot of concurrent research, it is difficult to keep track of recent developments. In this survey, we provide an overview of often used ideas and methods in image classification with fewer labels. We compare 34 methods in detail based on their performance and their commonly used ideas rather than a fine-grained taxonomy. In our analysis, we identify three major trends that lead to future research opportunities. 1. State-of-the-art methods are scalable to real-world applications in theory but issues like class imbalance, robustness, or fuzzy labels are not considered. 2. The degree of supervision which is needed to achieve comparable results to the usage of all labels is decreasing and therefore methods need to be extended to settings with a variable number of classes. 3. All methods share some common ideas but we identify clusters of methods that do not share many ideas. We show that combining ideas from different clusters can lead to better performance.

Journal ArticleDOI
TL;DR: An unsupervised anomaly detection method is presented, which combines Sub-Space Clustering (SSC) and One Class Support Vector Machine (OCSVM) to detect attacks without any prior knowledge.

Journal ArticleDOI
TL;DR: The purpose of this study is to solve data imbalance by using a Generative Adversarial Network (GAN), an unsupervised deep learning method that generates new virtual data similar to the existing data.
Abstract: With the development of deep learning technologies, a wide variety of research is being performed to detect intrusions using vast amounts of data. Although deep learning performs more accurately than machine learning algorithms when learning large amounts of data, its performance declines significantly when learning from imbalanced data. While there are many studies on imbalanced data, most have weaknesses that can result in data loss or overfitting. The purpose of this study is to solve data imbalance by using the Generative Adversarial Network (GAN) model, an unsupervised deep learning method that generates new virtual data similar to the existing data. The study also proposes a model in which, after the data imbalance is addressed with the GAN, a Random Forest classifier is used to measure detection performance. The experimental results showed that the performance of the proposed model was better than that of a model trained without addressing the data imbalance. In addition, the proposed model performed excellently when compared with other models that have previously been widely used for the data imbalance problem.
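The overall recipe, training a small GAN on the minority class, sampling synthetic records to balance the dataset, and then fitting a Random Forest, can be sketched as follows; the feature dimension, network sizes, training length, and random stand-in data are assumptions for illustration only.

    import numpy as np
    import torch
    import torch.nn as nn
    from sklearn.ensemble import RandomForestClassifier

    n_feat = 20                                               # number of traffic features per record
    minority = torch.randn(50, n_feat) + 2.0                  # stand-in for rare intrusion records
    majority = np.random.randn(2000, n_feat)                  # stand-in for normal records

    G = nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, n_feat))
    D = nn.Sequential(nn.Linear(n_feat, 64), nn.LeakyReLU(0.2), nn.Linear(64, 1))
    opt_g = torch.optim.Adam(G.parameters(), lr=1e-3)
    opt_d = torch.optim.Adam(D.parameters(), lr=1e-3)
    bce = nn.BCEWithLogitsLoss()

    for step in range(500):                                   # adversarial training on minority data only
        fake = G(torch.randn(50, 16))
        d_loss = bce(D(minority), torch.ones(50, 1)) + bce(D(fake.detach()), torch.zeros(50, 1))
        opt_d.zero_grad(); d_loss.backward(); opt_d.step()    # discriminator: real vs. generated
        g_loss = bce(D(fake), torch.ones(50, 1))
        opt_g.zero_grad(); g_loss.backward(); opt_g.step()    # generator: fool the discriminator

    synthetic = G(torch.randn(1950, 16)).detach().numpy()     # top up the minority class
    X = np.vstack([majority, minority.numpy(), synthetic])
    y = np.hstack([np.zeros(2000), np.ones(50 + 1950)])       # 0 = normal, 1 = intrusion
    clf = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)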

Journal ArticleDOI
TL;DR: A novel unsupervised deep learning model is proposed to address the multi-focus image fusion problem; it analyzes sharp appearance in deep features instead of the original images and achieves state-of-the-art fusion performance.
Abstract: Multi-focus image fusion is the extraction of focused regions from different images to create one all-in-focus fused image. The key point is that only objects within the depth-of-field have a sharp appearance in the photograph, while other objects are likely to be blurred. We propose an unsupervised deep learning model for multi-focus image fusion. We train an encoder–decoder network in an unsupervised manner to acquire deep features of the input images. Then, we utilize spatial frequency, a gradient-based method that measures sharp variation in these deep features, to reflect activity levels. We apply some consistency verification methods to adjust the decision map and draw out the fused result. Our method analyzes sharp appearances in deep features instead of the original images, which can be seen as another success story of unsupervised learning in image processing. Experimental results demonstrate that the proposed method achieves state-of-the-art fusion performance compared to 16 fusion methods in objective and subjective assessments, especially in gradient-based fusion metrics.
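The spatial-frequency activity measure mentioned above can be computed blockwise as sqrt(RF^2 + CF^2), where RF and CF are the root-mean-square row- and column-direction differences. The sketch below applies it to two feature maps to build a coarse decision map; the window size and random stand-in features are illustrative assumptions.

    import numpy as np

    def spatial_frequency(f, win=8):
        """Blockwise spatial frequency sqrt(RF^2 + CF^2) of a 2-D map."""
        rf = np.zeros_like(f)
        cf = np.zeros_like(f)
        rf[:, 1:] = (f[:, 1:] - f[:, :-1]) ** 2          # squared row-direction differences
        cf[1:, :] = (f[1:, :] - f[:-1, :]) ** 2          # squared column-direction differences
        h, w = f.shape
        sf = np.zeros((h // win, w // win))
        for i in range(h // win):
            for j in range(w // win):
                blk = slice(i * win, (i + 1) * win), slice(j * win, (j + 1) * win)
                sf[i, j] = np.sqrt(rf[blk].mean() + cf[blk].mean())
        return sf

    feat_a = np.random.rand(64, 64)                      # stand-in for deep features of source image A
    feat_b = np.random.rand(64, 64)                      # stand-in for deep features of source image B
    decision = spatial_frequency(feat_a) > spatial_frequency(feat_b)   # True where A looks sharper
    print(decision.shape)                                # (8, 8) blockwise decision map before refinement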

Journal ArticleDOI
TL;DR: In this article, a contrastive self-supervised learning framework for anomaly detection on attributed networks is proposed, which exploits the local information from network data by sampling a novel type of contrastive instance pair, which can capture the relationship between each node and its neighboring substructure.
Abstract: Anomaly detection on attributed networks attracts considerable research interest due to the wide applications of attributed networks in modeling a wide range of complex systems. Recently, deep learning-based anomaly detection methods have shown promising results over shallow approaches, especially on networks with high-dimensional attributes and complex structures. However, existing approaches, which employ a graph autoencoder as their backbone, do not fully exploit the rich information of the network, resulting in suboptimal performance. Furthermore, these methods do not directly target anomaly detection in their learning objective and fail to scale to large networks due to the full-graph training mechanism. To overcome these limitations, in this article, we present a novel Contrastive self-supervised Learning framework for Anomaly detection on attributed networks (CoLA for abbreviation). Our framework fully exploits the local information from network data by sampling a novel type of contrastive instance pair, which can capture the relationship between each node and its neighboring substructure in an unsupervised way. Meanwhile, a well-designed graph neural network (GNN)-based contrastive learning model is proposed to learn informative embeddings from high-dimensional attributes and local structure and to measure the agreement of each instance pair with its output scores. The multiround predicted scores from the contrastive learning model are further used to evaluate the abnormality of each node with statistical estimation. In this way, the learning model is trained with a specific anomaly detection-aware target. Furthermore, since the input of the GNN module is batches of instance pairs instead of the full network, our framework can adapt to large networks flexibly. Experimental results show that our proposed framework outperforms the state-of-the-art baseline methods on all seven benchmark data sets.

Journal ArticleDOI
TL;DR: In this article, an unsupervised learning model is presented that adopts a cycle-consistent generative adversarial network (CycleGAN), which can be trained with unpaired turbulence data for super-resolution reconstruction.
Abstract: Recent attempts to use deep learning for super-resolution reconstruction of turbulent flows have used supervised learning, which requires paired data for training. This limitation hinders more practical applications of super-resolution reconstruction. Therefore, we present an unsupervised learning model that adopts a cycle-consistent generative adversarial network (CycleGAN) that can be trained with unpaired turbulence data for super-resolution reconstruction. Our model is validated using three examples: (i) recovering the original flow field from filtered data using direct numerical simulation (DNS) of homogeneous isotropic turbulence; (ii) reconstructing full-resolution fields using partially measured data from the DNS of turbulent channel flows; and (iii) generating a DNS-resolution flow field from large-eddy simulation (LES) data for turbulent channel flows. In examples (i) and (ii), for which paired data are available for supervised learning, our unsupervised model demonstrates qualitatively and quantitatively similar performance as that of the best supervised learning model. More importantly, in example (iii), where supervised learning is impossible, our model successfully reconstructs the high-resolution flow field of statistical DNS quality from the LES data. Furthermore, we find that the present model has almost universal applicability to all values of Reynolds numbers within the tested range. This demonstrates that unsupervised learning of turbulence data is indeed possible, opening a new door for the wide application of super-resolution reconstruction of turbulent fields.
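A compact sketch of the loss structure behind such a CycleGAN setup is shown below: one generator upsamples low-resolution fields, a second downsamples high-resolution fields, and the generators are trained with an adversarial term plus a cycle-consistency term on unpaired data. The tiny networks, single 2x upscaling step, and cycle-loss weight of 10 are assumptions, not the authors' architecture.

    import torch
    import torch.nn as nn

    G_up = nn.Sequential(nn.Upsample(scale_factor=2, mode="bilinear"),          # LR -> HR generator
                         nn.Conv2d(2, 32, 3, padding=1), nn.ReLU(),
                         nn.Conv2d(32, 2, 3, padding=1))
    G_down = nn.Sequential(nn.Conv2d(2, 32, 3, stride=2, padding=1), nn.ReLU(), # HR -> LR generator
                           nn.Conv2d(32, 2, 3, padding=1))
    D_hr = nn.Sequential(nn.Conv2d(2, 32, 3, stride=2, padding=1), nn.LeakyReLU(0.2),
                         nn.Flatten(), nn.Linear(32 * 32 * 32, 1))              # HR discriminator

    lr_field = torch.randn(4, 2, 32, 32)          # unpaired LES-like velocity fields (u, v)
    hr_field = torch.randn(4, 2, 64, 64)          # unpaired DNS-like velocity fields

    opt = torch.optim.Adam(list(G_up.parameters()) + list(G_down.parameters()), lr=2e-4)
    fake_hr = G_up(lr_field)
    adv = nn.functional.binary_cross_entropy_with_logits(D_hr(fake_hr), torch.ones(4, 1))
    cycle = (nn.functional.l1_loss(G_down(fake_hr), lr_field)
             + nn.functional.l1_loss(G_up(G_down(hr_field)), hr_field))
    gen_loss = adv + 10.0 * cycle                 # adversarial term + weighted cycle-consistency term
    opt.zero_grad()
    gen_loss.backward()
    opt.step()                                    # discriminator and LR-side adversarial updates omitted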

Journal ArticleDOI
TL;DR: This paper provides a comprehensive overview of Machine Learning applications used in EEG analysis and gives an overview of each of the methods and general applications that each is best suited to.
Abstract: Electroencephalography (EEG) has been a staple method for identifying certain health conditions in patients since its discovery. Due to the many different types of classifiers available, the analysis methods are equally numerous. In this review, we examine machine learning methods that have been developed for EEG analysis with bioengineering applications. We reviewed literature from 1988 to 2018 to capture previous and current classification methods for EEG in multiple applications. From this information, we are able to determine the overall effectiveness of each machine learning method as well as its key characteristics. We found that all the primary methods used in machine learning have been applied in some form to EEG classification, ranging from Naive Bayes to Decision Trees/Random Forests to Support Vector Machines (SVM). Supervised learning methods, including SVM and KNN, are on average more accurate than their unsupervised counterparts. While each method individually is limited in its accuracy for its respective application, there is hope that combinations of methods, when implemented properly, will achieve higher overall classification accuracy. This paper provides a comprehensive overview of machine learning applications used in EEG analysis. It also gives an overview of each method and the general applications to which each is best suited.

Journal ArticleDOI
TL;DR: An imbalanced fault diagnosis method based on the conditional deep convolutional generative adversarial network (C-DCGAN) generative model is presented; it improves the accuracy of fault diagnosis and the generalization ability of the classifier in the case of small samples and displays better fault diagnosis performance.
Abstract: Because of real working conditions, collected mechanical fault datasets are in practice limited and often highly imbalanced, which restricts diagnosis accuracy and stability. To solve these problems, we present an imbalanced fault diagnosis method based on the generative model of the conditional deep convolutional generative adversarial network (C-DCGAN) and provide a detailed study of it. The deep convolutional generative adversarial network (DCGAN), based on the traditional generative adversarial network (GAN), introduces convolutional neural networks into the training for unsupervised learning to improve the effectiveness of the generative network. The conditional generative adversarial network (CGAN) is a conditional model obtained by introducing a conditioning extension into GAN. C-DCGAN is a combination of DCGAN and CGAN. In C-DCGAN, building on the feature extraction ability of convolutional networks and through structural optimization, conditionally generated auxiliary samples are used as augmented data and applied to machine fault diagnosis. Two datasets (a bearing dataset and a planetary gearbox dataset) are used for validation. The simulation experiments show that the improved performance is mainly due to the signals generated by C-DCGAN to balance the dataset. The proposed method can deal with the imbalanced fault classification problem much more effectively. This model can improve the accuracy of fault diagnosis and the generalization ability of the classifier in the case of small samples and displays better fault diagnosis performance.

Journal ArticleDOI
TL;DR: Generative adversarial networks are combined with a widely applied unsupervised method, autoencoders, to improve the performance of existing unsupervised learning methods and overcome one of the key difficulties in achieving automated structural health monitoring.
Abstract: Damage detection is one of the most important tasks for structural health monitoring of civil infrastructure. Before a damage detection algorithm can be applied, the integrity of the data must be e...

Journal ArticleDOI
TL;DR: A novel transfer learning algorithm studied in this paper, TrAdaBoost, is shown to have superior performance in dealing with data imbalance and different distributions, and provides important insights into unsupervised learning for wind turbine fault diagnosis.

Journal ArticleDOI
TL;DR: Although it is found that SML methods perform best in replicating hand-coded results, it is argued that content analysts in the social sciences would do well to keep all these approaches in their toolkit, deploying them purposefully according to the task at hand.
Abstract: Advances in computer science and computational linguistics have yielded new, and faster, computational approaches to structuring and analyzing textual data. These approaches perform well on tasks l...

Journal ArticleDOI
TL;DR: A deep adversarial domain adaptation (DADA) model is proposed for rolling bearing fault diagnosis; the experimental results demonstrate that the new method outperforms the existing machine learning and deep learning methods, in terms of classification accuracy and generalization ability.
Abstract: Fault diagnosis of rolling bearings is an essential process for improving the reliability and safety of the rotating machinery. It is always a major challenge to ensure fault diagnosis accuracy in particular under severe working conditions. In this article, a deep adversarial domain adaptation (DADA) model is proposed for rolling bearing fault diagnosis. This model constructs an adversarial adaptation network to solve the commonly encountered problem in numerous real applications: the source domain and the target domain are inconsistent in their distribution. First, a deep stack autoencoder (DSAE) is combined with representative feature learning for dimensionality reduction, and such a combination provides an unsupervised learning method to effectively acquire fault features. Meanwhile, domain adaptation and recognition classification are implemented using a Softmax classifier to augment classification accuracy. Second, the effects of the number of hidden layers in the stack autoencoder network, the number of neurons in each hidden layer, and the hyperparameters of the proposed fault diagnosis algorithm are analyzed. Third, comprehensive analysis is performed on real data to validate the performance of the proposed method; the experimental results demonstrate that the new method outperforms the existing machine learning and deep learning methods, in terms of classification accuracy and generalization ability.