
Showing papers on "Deep learning" published in 2022


Journal ArticleDOI
TL;DR: A three-track network produces structure predictions with accuracies approaching those of DeepMind in CASP14, enables the rapid solution of challenging X-ray crystallography and cryo-EM structure modeling problems, and provides insights into the functions of proteins of currently unknown structure.
Abstract: DeepMind presented remarkably accurate predictions at the recent CASP14 protein structure prediction assessment conference. We explored network architectures incorporating related ideas and obtained the best performance with a three-track network in which information at the 1D sequence level, the 2D distance map level, and the 3D coordinate level is successively transformed and integrated. The three-track network produces structure predictions with accuracies approaching those of DeepMind in CASP14, enables the rapid solution of challenging X-ray crystallography and cryo-EM structure modeling problems, and provides insights into the functions of proteins of currently unknown structure. The network also enables rapid generation of accurate protein-protein complex models from sequence information alone, short circuiting traditional approaches which require modeling of individual subunits followed by docking. We make the method available to the scientific community to speed biological research.

607 citations
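
A minimal, hypothetical PyTorch sketch of the three-track idea summarized above: per-residue (1D) features, residue-pair (2D, distance-map-like) features, and per-residue 3D coordinates are updated and exchanged within one block. Module names, dimensions, and update rules are illustrative assumptions, not the published architecture.

```python
import torch
import torch.nn as nn

class ThreeTrackBlock(nn.Module):
    """Toy block exchanging information between 1D, 2D, and 3D tracks (illustrative only)."""
    def __init__(self, d_seq=64, d_pair=32):
        super().__init__()
        self.seq_update = nn.MultiheadAttention(d_seq, num_heads=4, batch_first=True)
        self.pair_from_seq = nn.Linear(2 * d_seq, d_pair)   # pairwise features built from sequence features
        self.pair_update = nn.Sequential(nn.Linear(d_pair, d_pair), nn.ReLU(), nn.Linear(d_pair, d_pair))
        self.coord_head = nn.Linear(d_seq, 3)               # per-residue 3D coordinates

    def forward(self, seq, pair):
        # seq:  (B, L, d_seq)     1D track, per-residue features
        # pair: (B, L, L, d_pair) 2D track, residue-pair features
        seq = self.seq_update(seq, seq, seq)[0]
        B, L, D = seq.shape
        left = seq.unsqueeze(2).expand(B, L, L, D)
        right = seq.unsqueeze(1).expand(B, L, L, D)
        pair = pair + self.pair_update(self.pair_from_seq(torch.cat([left, right], dim=-1)))
        coords = self.coord_head(seq)                        # 3D track, (B, L, 3)
        return seq, pair, coords

seq, pair = torch.randn(1, 16, 64), torch.randn(1, 16, 16, 32)
seq, pair, coords = ThreeTrackBlock()(seq, pair)
print(coords.shape)  # torch.Size([1, 16, 3])
```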


Journal ArticleDOI
01 Dec 2022
TL;DR: In this article, the authors provide an overview of various convolutional neural network (CNN) models, offer several rules of thumb for functions and hyperparameter selection, and discuss open issues and promising directions for future work.
Abstract: A convolutional neural network (CNN) is one of the most significant networks in the deep learning field. Since CNN has made impressive achievements in many areas, including but not limited to computer vision and natural language processing, it has attracted much attention from both industry and academia over the past few years. The existing reviews mainly focus on CNN's applications in different scenarios without considering CNN from a general perspective, and some novel ideas proposed recently are not covered. In this review, we aim to provide some novel ideas and prospects in this fast-growing field. Besides 2-D convolution, 1-D and multidimensional convolutions are also covered. First, this review introduces the history of CNN. Second, we provide an overview of various convolutions. Third, some classic and advanced CNN models are introduced, especially the key points that enable them to reach state-of-the-art results. Fourth, through experimental analysis, we draw some conclusions and provide several rules of thumb for functions and hyperparameter selection. Fifth, the applications of 1-D, 2-D, and multidimensional convolution are covered. Finally, some open issues and promising directions for CNN are discussed as guidelines for future work.

308 citations
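
A short PyTorch illustration of the 1-D, 2-D, and multidimensional (here 3-D) convolutions the review covers; all shapes and channel counts are arbitrary example values.

```python
import torch
import torch.nn as nn

x1d = torch.randn(8, 4, 128)         # (batch, channels, length), e.g. a signal or text sequence
x2d = torch.randn(8, 3, 64, 64)      # (batch, channels, height, width), e.g. an image
x3d = torch.randn(8, 1, 16, 64, 64)  # (batch, channels, depth, height, width), e.g. a volume or video clip

conv1d = nn.Conv1d(4, 16, kernel_size=3, padding=1)
conv2d = nn.Conv2d(3, 16, kernel_size=3, padding=1)
conv3d = nn.Conv3d(1, 16, kernel_size=3, padding=1)

print(conv1d(x1d).shape)  # torch.Size([8, 16, 128])
print(conv2d(x2d).shape)  # torch.Size([8, 16, 64, 64])
print(conv3d(x3d).shape)  # torch.Size([8, 16, 16, 64, 64])
```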


Journal ArticleDOI
TL;DR: Zhang et al. as discussed by the authors conduct a comprehensive overview with in-depth analysis of closed-world person Re-ID from three different perspectives: deep feature representation learning, deep metric learning, and ranking optimization.
Abstract: Person re-identification (Re-ID) aims at retrieving a person of interest across multiple non-overlapping cameras. With the advancement of deep neural networks and increasing demand for intelligent video surveillance, it has gained significantly increased interest in the computer vision community. By dissecting the involved components in developing a person Re-ID system, we categorize it into the closed-world and open-world settings. The widely studied closed-world setting is usually applied under various research-oriented assumptions, and has achieved inspiring success using deep learning techniques on a number of datasets. We first conduct a comprehensive overview with in-depth analysis for closed-world person Re-ID from three different perspectives, including deep feature representation learning, deep metric learning and ranking optimization. With the performance saturation under the closed-world setting, the research focus for person Re-ID has recently shifted to the open-world setting, facing more challenging issues. This setting is closer to practical applications under specific scenarios. We summarize the open-world Re-ID in terms of five different aspects. By analyzing the advantages of existing methods, we design a powerful AGW baseline, achieving state-of-the-art or at least comparable performance on twelve datasets for four different Re-ID tasks. Meanwhile, we introduce a new evaluation metric (mINP) for person Re-ID, indicating the cost for finding all the correct matches, which provides an additional criterion to evaluate the Re-ID system for real applications. Finally, some important yet under-investigated open issues are discussed.

301 citations
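
A hedged NumPy sketch of the mINP idea mentioned above (the cost of finding all correct matches): for each query, the inverse negative penalty is taken here as the number of correct gallery matches divided by the rank of the hardest (last-retrieved) correct match, averaged over queries. The function and variable names are illustrative, not taken from the paper's code.

```python
import numpy as np

def mean_inp(ranked_gallery_labels, query_labels):
    """ranked_gallery_labels: (num_queries, gallery_size) gallery IDs sorted by decreasing similarity.
    query_labels: (num_queries,) query IDs. Returns the mean inverse negative penalty (mINP)."""
    inps = []
    for ranking, q in zip(ranked_gallery_labels, query_labels):
        matches = np.asarray(ranking) == q
        if not matches.any():
            continue                                         # queries with no correct match are skipped here
        hardest_rank = np.max(np.nonzero(matches)[0]) + 1    # 1-based rank of the last correct match
        inps.append(matches.sum() / hardest_rank)
    return float(np.mean(inps))

print(mean_inp([[1, 2, 1, 3], [2, 2, 4, 5]], [1, 2]))  # ((2/3) + (2/2)) / 2 ≈ 0.833
```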


Journal ArticleDOI
TL;DR: A comprehensive review of existing deep learning techniques for NER can be found in this article, where the authors systematically categorize existing works based on a taxonomy along three axes: distributed representations for input, context encoder, and tag decoder.
Abstract: Named entity recognition (NER) is the task of identifying mentions of rigid designators in text belonging to predefined semantic types such as person, location, and organization. NER always serves as the foundation for many natural language applications such as question answering, text summarization, and machine translation. Early NER systems achieved great success in attaining good performance, at the cost of human engineering in designing domain-specific features and rules. In recent years, deep learning, empowered by continuous real-valued vector representations and semantic composition through nonlinear processing, has been employed in NER systems, yielding state-of-the-art performance. In this paper, we provide a comprehensive review of existing deep learning techniques for NER. We first introduce NER resources, including tagged NER corpora and off-the-shelf NER tools. Then, we systematically categorize existing works based on a taxonomy along three axes: distributed representations for input, context encoder, and tag decoder. Next, we survey the most representative methods for recent applied techniques of deep learning in new NER problem settings and applications. Finally, we present readers with the challenges faced by NER systems and outline future directions in this area.

236 citations
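
A minimal PyTorch sketch of the three-axis taxonomy above as one concrete pipeline: a word embedding layer (distributed representation for input), a BiLSTM (context encoder), and a per-token linear/softmax layer (tag decoder). A CRF decoder, also common in this literature, is omitted for brevity; all sizes are illustrative assumptions.

```python
import torch
import torch.nn as nn

class SimpleNERTagger(nn.Module):
    def __init__(self, vocab_size=10000, emb_dim=100, hidden=128, num_tags=9):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)                                  # distributed input representation
        self.encoder = nn.LSTM(emb_dim, hidden, bidirectional=True, batch_first=True)   # context encoder
        self.decoder = nn.Linear(2 * hidden, num_tags)                                  # tag decoder

    def forward(self, token_ids):                     # token_ids: (B, T)
        h, _ = self.encoder(self.embed(token_ids))
        return self.decoder(h)                        # (B, T, num_tags) per-token tag scores

tokens = torch.randint(0, 10000, (2, 12))
logits = SimpleNERTagger()(tokens)
loss = nn.CrossEntropyLoss()(logits.reshape(-1, 9), torch.randint(0, 9, (24,)))
print(logits.shape, float(loss))
```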


Journal ArticleDOI
TL;DR: This study surveyed the current progress of XAI, in particular its advances in healthcare applications, and introduced solutions for XAI leveraging multi-modal and multi-centre data fusion, which were subsequently validated in two showcases following real clinical scenarios.

231 citations


Proceedings ArticleDOI
01 Jan 2022
TL;DR: UNETR as discussed by the authors utilizes a transformer encoder to learn sequence representations of the input volume and effectively capture the global multi-scale information, while also following the successful U-shaped network design for the encoder and decoder.
Abstract: Fully Convolutional Neural Networks (FCNNs) with contracting and expanding paths have shown prominence in the majority of medical image segmentation applications over the past decade. In FCNNs, the encoder plays an integral role by learning both global and local features and contextual representations which can be utilized for semantic output prediction by the decoder. Despite their success, the locality of convolutional layers in FCNNs limits the capability of learning long-range spatial dependencies. Inspired by the recent success of transformers for Natural Language Processing (NLP) in long-range sequence learning, we reformulate the task of volumetric (3D) medical image segmentation as a sequence-to-sequence prediction problem. We introduce a novel architecture, dubbed UNEt TRansformers (UNETR), that utilizes a transformer as the encoder to learn sequence representations of the input volume and effectively capture the global multi-scale information, while also following the successful "U-shaped" network design for the encoder and decoder. The transformer encoder is directly connected to a decoder via skip connections at different resolutions to compute the final semantic segmentation output. We have validated the performance of our method on the Multi Atlas Labeling Beyond The Cranial Vault (BTCV) dataset for multi-organ segmentation and the Medical Segmentation Decathlon (MSD) dataset for brain tumor and spleen segmentation tasks. Our benchmarks demonstrate new state-of-the-art performance on the BTCV leaderboard.

219 citations
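
A hedged sketch, in the spirit of the UNETR description above (not the official implementation), of how a 3D volume can be turned into a token sequence for a transformer encoder; patch size, embedding dimension, and layer counts are illustrative.

```python
import torch
import torch.nn as nn

class VolumePatchEmbed(nn.Module):
    """Split a (B, C, D, H, W) volume into non-overlapping 3D patches and project each to a token."""
    def __init__(self, in_ch=1, patch=16, dim=256):
        super().__init__()
        self.proj = nn.Conv3d(in_ch, dim, kernel_size=patch, stride=patch)

    def forward(self, x):
        tokens = self.proj(x)                          # (B, dim, D/p, H/p, W/p)
        return tokens.flatten(2).transpose(1, 2)       # (B, N, dim) sequence of patch tokens

volume = torch.randn(1, 1, 96, 96, 96)
seq = VolumePatchEmbed()(volume)                       # (1, 216, 256): 6*6*6 patch tokens
encoder = nn.TransformerEncoder(
    nn.TransformerEncoderLayer(d_model=256, nhead=8, batch_first=True), num_layers=2)
print(encoder(seq).shape)                              # torch.Size([1, 216, 256])
```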


Journal ArticleDOI
TL;DR: Deep learning has been shown to be successful in a number of domains, ranging from acoustics and images to natural language processing, as mentioned in this paper. However, applying deep learning to the ubiquitous graph data is non-trivial because of the unique characteristics of graphs.
Abstract: Deep learning has been shown to be successful in a number of domains, ranging from acoustics and images to natural language processing. However, applying deep learning to the ubiquitous graph data is non-trivial because of the unique characteristics of graphs. Recently, substantial research efforts have been devoted to applying deep learning methods to graphs, resulting in beneficial advances in graph analysis techniques. In this survey, we comprehensively review the different types of deep learning methods on graphs. We divide the existing methods into five categories based on their model architectures and training strategies: graph recurrent neural networks, graph convolutional networks, graph autoencoders, graph reinforcement learning, and graph adversarial methods. We then provide a comprehensive overview of these methods in a systematic manner mainly by following their development history. We also analyze the differences and compositions of different methods. Finally, we briefly outline the applications in which they have been used and discuss potential future research directions.

218 citations
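
A minimal sketch of one representative layer from the graph convolutional network category in this taxonomy: the symmetric-normalized propagation rule H' = ReLU(D^-1/2 (A + I) D^-1/2 H W). The graph, sizes, and data are illustrative.

```python
import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.weight = nn.Linear(in_dim, out_dim, bias=False)

    def forward(self, x, adj):
        # x: (N, in_dim) node features; adj: (N, N) adjacency matrix
        a_hat = adj + torch.eye(adj.size(0))                             # add self-loops
        deg_inv_sqrt = a_hat.sum(dim=1).pow(-0.5)
        norm_adj = deg_inv_sqrt.unsqueeze(1) * a_hat * deg_inv_sqrt.unsqueeze(0)
        return torch.relu(norm_adj @ self.weight(x))                     # H' = ReLU(D^-1/2 A_hat D^-1/2 H W)

adj = (torch.rand(5, 5) > 0.5).float()
adj = ((adj + adj.t()) > 0).float()                                      # make the toy graph undirected
x = torch.randn(5, 8)
print(GCNLayer(8, 16)(x, adj).shape)                                     # torch.Size([5, 16])
```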


Journal ArticleDOI
TL;DR: A comprehensive review of deep facial expression recognition (FER), including datasets and algorithms that provide insights into these intrinsic problems, can be found in this article, where the authors introduce the available datasets that are widely used in the literature and provide accepted data selection and evaluation principles for these datasets.
Abstract: With the transition of facial expression recognition (FER) from laboratory-controlled to challenging in-the-wild conditions and the recent success of deep learning techniques in various fields, deep neural networks have increasingly been leveraged to learn discriminative representations for automatic FER. Recent deep FER systems generally focus on two important issues: overfitting caused by a lack of sufficient training data and expression-unrelated variations, such as illumination, head pose, and identity bias. In this survey, we provide a comprehensive review of deep FER, including datasets and algorithms that provide insights into these intrinsic problems. First, we introduce the available datasets that are widely used in the literature and provide accepted data selection and evaluation principles for these datasets. We then describe the standard pipeline of a deep FER system with the related background knowledge and suggestions for applicable implementations for each stage. For the state-of-the-art in deep FER, we introduce existing novel deep neural networks and related training strategies that are designed for FER based on both static images and dynamic image sequences and discuss their advantages and limitations. Competitive performances and experimental comparisons on widely used benchmarks are also summarized. We then extend our survey to additional related issues and application scenarios. Finally, we review the remaining challenges and corresponding opportunities in this field as well as future directions for the design of robust deep FER systems.

209 citations


Journal ArticleDOI
Ravid Shwartz-Ziv
TL;DR: In this paper, the authors explore whether deep learning models should be a recommended option for tabular data by rigorously comparing the new deep models to XGBoost on various datasets.

201 citations


Journal ArticleDOI
04 Jun 2022-Science
TL;DR: The broad utility and high accuracy of ProteinMPNN are demonstrated using X-ray crystallography, cryoEM and functional studies by rescuing previously failed designs, made using Rosetta or AlphaFold, of protein monomers, cyclic homo-oligomers, tetrahedral nanoparticles, and target binding proteins.
Abstract: While deep learning has revolutionized protein structure prediction, almost all experimentally characterized de novo protein designs have been generated using physically based approaches such as Rosetta. Here we describe a deep learning based protein sequence design method, ProteinMPNN, with outstanding performance in both in silico and experimental tests. The amino acid sequence at different positions can be coupled between single or multiple chains, enabling application to a wide range of current protein design challenges. On native protein backbones, ProteinMPNN has a sequence recovery of 52.4%, compared to 32.9% for Rosetta. Incorporation of noise during training improves sequence recovery on protein structure models, and produces sequences which more robustly encode their structures as assessed using structure prediction algorithms. We demonstrate the broad utility and high accuracy of ProteinMPNN using X-ray crystallography, cryoEM and functional studies by rescuing previously failed designs, made using Rosetta or AlphaFold, of protein monomers, cyclic homo-oligomers, tetrahedral nanoparticles, and target binding proteins. One-sentence summary A deep learning based protein sequence design method is described that is widely applicable to current design challenges and shows outstanding performance in both in silico and experimental tests.

193 citations
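
A small helper illustrating the sequence-recovery figures quoted above (52.4% vs. 32.9%): the fraction of positions at which the designed amino acid matches the native one. This is a hypothetical convenience function, not code from the ProteinMPNN release.

```python
def sequence_recovery(designed: str, native: str) -> float:
    """Fraction of aligned positions where the designed residue equals the native residue."""
    assert len(designed) == len(native), "sequences must be the same length"
    matches = sum(d == n for d, n in zip(designed, native))
    return matches / len(native)

print(sequence_recovery("MKTAYIAKQR", "MKTAYLAKQE"))  # 0.8 (8 of 10 positions recovered)
```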


Journal ArticleDOI
TL;DR: Explainable Artificial Intelligence (XAI) is an emerging research topic of machine learning aimed at unboxing how AI systems' black-box choices are made, as mentioned in this paper; this opacity is particularly pronounced in the most popular deep neural network approaches currently in use.

Journal ArticleDOI
TL;DR: In this paper, a survey of recent developments in deep learning based object detectors is presented, along with some of the prominent backbone architectures used in recognition tasks, and the performances of these architectures are compared on multiple metrics.
Abstract: Object Detection is the task of classification and localization of objects in an image or video. It has gained prominence in recent years due to its widespread applications. This article surveys recent developments in deep learning based object detectors. A concise overview of benchmark datasets and evaluation metrics used in detection is also provided, along with some of the prominent backbone architectures used in recognition tasks. It also covers contemporary lightweight classification models used on edge devices. Lastly, we compare the performances of these architectures on multiple metrics.

Journal ArticleDOI
TL;DR: Physical Neural Networks, as discussed by the authors, automatically train the functionality of any sequence of real physical systems directly using backpropagation, the same technique used for modern deep neural networks; the approach is demonstrated with three diverse physical systems: optical, mechanical, and electrical.
Abstract: Deep neural networks have become a pervasive tool in science and engineering. However, modern deep neural networks' growing energy requirements now increasingly limit their scaling and broader use. We propose a radical alternative for implementing deep neural network models: Physical Neural Networks. We introduce a hybrid physical-digital algorithm called Physics-Aware Training to efficiently train sequences of controllable physical systems to act as deep neural networks. This method automatically trains the functionality of any sequence of real physical systems, directly, using backpropagation, the same technique used for modern deep neural networks. To illustrate their generality, we demonstrate physical neural networks with three diverse physical systems: optical, mechanical, and electrical. Physical neural networks may facilitate unconventional machine learning hardware that is orders of magnitude faster and more energy efficient than conventional electronic processors.
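
A hedged PyTorch sketch of the physics-aware training idea described above: the forward pass goes through the (here simulated, noisy) physical system, while gradients are taken through a differentiable digital model of it. The toy "physical" function, the surrogate, and the training target are all assumptions for illustration, not the authors' systems.

```python
import torch

def physical_system(x, theta):
    # Stand-in for a real optical/mechanical/electrical transformation measured in hardware;
    # treated as a non-differentiable black box, with noise mimicking experimental error.
    with torch.no_grad():
        return torch.tanh(x * theta) + 0.01 * torch.randn_like(x)

def digital_model(x, theta):
    # Differentiable digital approximation of the physical system, used only for backprop.
    return torch.tanh(x * theta)

class PhysicsAwareLayer(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, theta):
        ctx.save_for_backward(x, theta)
        return physical_system(x, theta)            # forward pass through the "physical" system

    @staticmethod
    def backward(ctx, grad_out):
        x, theta = ctx.saved_tensors
        with torch.enable_grad():                   # backward pass through the digital surrogate
            x_, t_ = x.detach().requires_grad_(True), theta.detach().requires_grad_(True)
            y = digital_model(x_, t_)
            return torch.autograd.grad(y, (x_, t_), grad_out)

theta = torch.randn(8, requires_grad=True)          # controllable physical parameters
opt = torch.optim.Adam([theta], lr=1e-2)
for _ in range(200):
    x = torch.randn(32, 8)
    y = PhysicsAwareLayer.apply(x, theta)
    loss = ((y - x.clamp(-1, 1)) ** 2).mean()       # toy training objective
    opt.zero_grad(); loss.backward(); opt.step()
print(float(loss))
```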

Journal ArticleDOI
TL;DR: Deep-learning-enabled optical metrology is a kind of data-driven approach, which has already provided numerous alternative solutions to many challenging problems in this field with better performance, as discussed by the authors.
Abstract: With the advances in scientific foundations and technological implementations, optical metrology has become a versatile problem-solving backbone in manufacturing, fundamental research, and engineering applications, such as quality control, nondestructive testing, experimental mechanics, and biomedicine. In recent years, deep learning, a subfield of machine learning, is emerging as a powerful tool to address problems by learning from data, largely driven by the availability of massive datasets, enhanced computational power, fast data storage, and novel training algorithms for deep neural networks. It is currently attracting increasing interest and gaining extensive attention for its utilization in the field of optical metrology. Unlike the traditional “physics-based” approach, deep-learning-enabled optical metrology is a kind of “data-driven” approach, which has already provided numerous alternative solutions to many challenging problems in this field with better performance. In this review, we present an overview of the current status and the latest progress of deep-learning technologies in the field of optical metrology. We first briefly introduce both traditional image-processing algorithms in optical metrology and the basic concepts of deep learning, followed by a comprehensive review of its applications in various optical metrology tasks, such as fringe denoising, phase retrieval, phase unwrapping, subset correlation, and error compensation. The open challenges faced by the current deep-learning approach in optical metrology are then discussed. Finally, the directions for future research are outlined.

Journal ArticleDOI
TL;DR: Deep Transfer Learning (DTL) is a new paradigm of machine learning, which can not only leverage the advantages of Deep Learning (DL) in feature representation, but also benefit from the superiority of transfer learning (TL) in knowledge transfer, as mentioned in this paper.

Journal ArticleDOI
TL;DR: Deep Ensemble Learning (DEL) as discussed by the authors combines several individual models to obtain better generalization performance, drawing on the advantages of both deep learning and ensemble learning.
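
A tiny sketch of the ensemble idea in that description: average the softmax outputs of several independently trained deep models. The models below are untrained placeholders purely for illustration; in practice each member would be trained (e.g. with different seeds or data splits).

```python
import torch
import torch.nn as nn

def make_model():
    return nn.Sequential(nn.Linear(20, 64), nn.ReLU(), nn.Linear(64, 5))

ensemble = [make_model() for _ in range(3)]          # placeholder members of the ensemble

x = torch.randn(4, 20)
with torch.no_grad():
    probs = torch.stack([torch.softmax(m(x), dim=-1) for m in ensemble]).mean(dim=0)
print(probs.argmax(dim=-1))                          # ensemble prediction for each example
```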

Journal ArticleDOI
01 Apr 2022-Irbm
TL;DR: In this paper, a deep transfer learning technique is used to classify COVID-19 infected patients from chest CT images, and a top-2 smooth loss function with cost-sensitive attributes is also utilized to handle noisy and imbalanced COVID-19 data.
Abstract: The COVID-19 infection is increasing at a rapid rate, with only a limited number of testing kits available. Therefore, the development of COVID-19 testing kits is still an open area of research. Recently, many studies have shown that chest Computed Tomography (CT) images can be used for COVID-19 testing, as chest CT images show a bilateral change in COVID-19 infected patients. However, the classification of COVID-19 patients from chest CT images is not an easy task, as predicting the bilateral change is an ill-posed problem. Therefore, in this paper, a deep transfer learning technique is used to classify COVID-19 infected patients. Additionally, a top-2 smooth loss function with cost-sensitive attributes is also utilized to handle noisy and imbalanced COVID-19 data. Experimental results reveal that the proposed deep transfer learning-based COVID-19 classification model provides efficient results compared to other supervised learning models.
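
A hedged sketch of the general deep transfer learning recipe the abstract describes: start from an ImageNet-pretrained backbone, replace the classification head for a two-class COVID/non-COVID task, and weight the loss against class imbalance. It uses a plain class-weighted cross-entropy rather than the paper's top-2 smooth loss, and all data, weights, and hyperparameters are illustrative.

```python
import torch
import torch.nn as nn
from torchvision import models

# ImageNet-pretrained backbone with a new 2-class head (COVID-19 vs. non-COVID CT slices).
model = models.resnet18(weights=models.ResNet18_Weights.IMAGENET1K_V1)
model.fc = nn.Linear(model.fc.in_features, 2)

# Class weights as a simple counter to an imbalanced dataset (values are illustrative).
criterion = nn.CrossEntropyLoss(weight=torch.tensor([1.0, 3.0]))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

# One toy training step on random tensors standing in for CT images and labels.
images, labels = torch.randn(8, 3, 224, 224), torch.randint(0, 2, (8,))
loss = criterion(model(images), labels)
optimizer.zero_grad(); loss.backward(); optimizer.step()
print(float(loss))
```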

Journal ArticleDOI
TL;DR: A survey of the application of deep learning techniques in NLP, with a focus on the various tasks where deep learning is demonstrating stronger impact.

Journal ArticleDOI
TL;DR: Wang et al. as mentioned in this paper provided a comprehensive survey covering various aspects, ranging from algorithm taxonomy to unsolved issues, including network architecture, level of supervision, learning paradigm, and object-/instance-level detection.
Abstract: As an essential problem in computer vision, salient object detection (SOD) has attracted an increasing amount of research attention over the years. Recent advances in SOD are predominantly led by deep learning-based solutions (named deep SOD). To enable in-depth understanding of deep SOD, in this paper, we provide a comprehensive survey covering various aspects, ranging from algorithm taxonomy to unsolved issues. In particular, we first review deep SOD algorithms from different perspectives, including network architecture, level of supervision, learning paradigm, and object-/instance-level detection. Following that, we summarize and analyze existing SOD datasets and evaluation metrics. Then, we benchmark a large group of representative SOD models, and provide detailed analyses of the comparison results. Moreover, we study the performance of SOD algorithms under different attribute settings, which has not been thoroughly explored previously, by constructing a novel SOD dataset with rich attribute annotations covering various salient object types, challenging factors, and scene categories. We further analyze, for the first time in the field, the robustness of SOD models to random input perturbations and adversarial attacks. We also look into the generalization and difficulty of existing SOD datasets. Finally, we discuss several open issues of SOD and outline future research directions. All the saliency prediction maps, our constructed dataset with annotations, and codes for evaluation are publicly available at https://github.com/wenguanwang/SODsurvey.

Journal ArticleDOI
06 Jun 2022-Scanning
TL;DR: Novel research on hyperspectral microscopic images using deep learning and effective unsupervised learning is explored, and the Kullback–Leibler divergence is used to test convergence of the objective function.
Abstract: This work applies hyperspectral microscopy to biological and mineral samples, using an unsupervised deep learning neural network to denoise SRS images and enhance hyperspectral resolution; a single hyperspectral image is enough to train the unsupervised method. An intuitive chemical species map for a lithium ore sample is produced using k-means clustering. Many researchers are now interested in biosignals, but uncertainty limits the algorithms' capacity to evaluate these signals for further information. Even though AI systems can solve puzzles, they remain limited. Deep learning is used when machine learning is inefficient and is vital in modern AI; however, supervised learning requires a large labeled dataset, and careful parameter selection is needed to prevent over- or underfitting. Unsupervised learning, performed by the clustering algorithm, is used to overcome the challenges outlined above. To accomplish this, two processing steps were used: (1) nonlinear deep learning networks turn the data into a latent feature space (Z), and (2) the data are clustered in that space, with the Kullback–Leibler divergence used to test convergence of the objective function. This article thus explores novel research on hyperspectral microscopic images using deep learning and effective unsupervised learning.
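
A hedged sketch of the kind of latent-space clustering objective the abstract alludes to: an encoder maps the data into a latent feature space (Z), soft assignments to learnable cluster centres are computed, and a Kullback–Leibler divergence between the assignment distribution and a sharpened target distribution is monitored/minimized (in the style of deep embedded clustering; not necessarily the paper's exact pipeline). Sizes and data are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

encoder = nn.Sequential(nn.Linear(100, 64), nn.ReLU(), nn.Linear(64, 10))  # data -> latent space Z
centroids = nn.Parameter(torch.randn(5, 10))                               # 5 cluster centres in Z

def soft_assignments(z, mu, alpha=1.0):
    # Student-t similarity between latent points and centroids; each row sums to 1.
    d2 = torch.cdist(z, mu) ** 2
    q = (1.0 + d2 / alpha) ** (-(alpha + 1) / 2)
    return q / q.sum(dim=1, keepdim=True)

def target_distribution(q):
    # Sharpened targets that emphasise confident assignments.
    p = q ** 2 / q.sum(dim=0)
    return p / p.sum(dim=1, keepdim=True)

x = torch.randn(32, 100)                              # stand-in for hyperspectral spectra/pixels
q = soft_assignments(encoder(x), centroids)
p = target_distribution(q).detach()
kl = F.kl_div(q.log(), p, reduction="batchmean")      # KL(p || q), tracked for convergence
print(float(kl))
```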

Journal ArticleDOI
TL;DR: In this paper , a programmable diffractive deep neural network based on a multi-layer digital-coding metasurface array is presented, which can handle various deep learning tasks for wave sensing, including image classification, mobile communication coding-decoding and real-time multi-beam focusing.
Abstract: The development of artificial intelligence is typically focused on computer algorithms and integrated circuits. Recently, all-optical diffractive deep neural networks have been created that are based on passive structures and can perform complicated functions designed by computer-based neural networks. However, once a passive diffractive deep neural network architecture is fabricated, its function is fixed. Here we report a programmable diffractive deep neural network that is based on a multi-layer digital-coding metasurface array. Each meta-atom on the metasurfaces is integrated with two amplifier chips and acts as an active artificial neuron, providing a dynamic modulation range of 35 dB (from −22 dB to 13 dB). We show that the system, which we term a programmable artificial intelligence machine, can handle various deep learning tasks for wave sensing, including image classification, mobile communication coding–decoding and real-time multi-beam focusing. We also develop a reinforcement learning algorithm for on-site learning and a discrete optimization algorithm for digital coding. Using a multi-layer metasurface array in which each meta-atom of the metasurface acts as an active artificial neuron, a programmable diffractive deep neural network can be created that directly processes electromagnetic waves in free space for wave sensing and wireless communications.

Journal ArticleDOI
TL;DR: A comprehensive review of the state-of-the-art deep learning-based approaches for vehicle behaviour prediction can be found in this article, where the authors give an overview of the generic problem of vehicle behaviour prediction and discuss its challenges.
Abstract: The behaviour prediction function of an autonomous vehicle predicts the future states of the nearby vehicles based on the current and past observations of the surrounding environment. This helps enhance its awareness of imminent hazards. However, conventional behaviour prediction solutions are applicable only in simple driving scenarios that require short prediction horizons. Most recently, deep learning-based approaches have become popular due to their superior performance in more complex environments compared to the conventional approaches. Motivated by this increased popularity, we provide a comprehensive review of the state of the art of deep learning-based approaches for vehicle behaviour prediction in this paper. We first give an overview of the generic problem of vehicle behaviour prediction and discuss its challenges, followed by classification and review of the most recent deep learning-based solutions based on three criteria: input representation, output type, and prediction method. The paper also discusses the performance of several well-known solutions, identifies the research gaps in the literature and outlines potential new research directions.

Journal ArticleDOI
TL;DR: In this paper , a comprehensive review of state-of-the-art robust training methods is presented, all of which are categorized into five groups according to their methodological difference, followed by a systematic comparison of six properties used to evaluate their superiority.
Abstract: Deep learning has achieved remarkable success in numerous domains with help from large amounts of big data. However, the quality of data labels is a concern because of the lack of high-quality labels in many real-world scenarios. As noisy labels severely degrade the generalization performance of deep neural networks, learning from noisy labels (robust training) is becoming an important task in modern deep learning applications. In this survey, we first describe the problem of learning with label noise from a supervised learning perspective. Next, we provide a comprehensive review of 62 state-of-the-art robust training methods, all of which are categorized into five groups according to their methodological difference, followed by a systematic comparison of six properties used to evaluate their superiority. Subsequently, we perform an in-depth analysis of noise rate estimation and summarize the typically used evaluation methodology, including public noisy datasets and evaluation metrics. Finally, we present several promising research directions that can serve as a guideline for future studies.
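
As one concrete example of the robust-loss family such surveys cover (not a method introduced in this particular paper), a short sketch of the generalized cross-entropy loss of Zhang and Sabuncu (2018), which interpolates between cross-entropy (q -> 0) and mean absolute error (q = 1) and is less sensitive to noisy labels.

```python
import torch
import torch.nn.functional as F

def generalized_cross_entropy(logits, targets, q=0.7):
    """GCE loss: L = (1 - p_y^q) / q, averaged over the batch."""
    probs = F.softmax(logits, dim=-1)
    p_y = probs.gather(1, targets.unsqueeze(1)).squeeze(1)   # predicted probability of the (possibly noisy) label
    return ((1.0 - p_y ** q) / q).mean()

logits = torch.randn(16, 10, requires_grad=True)
targets = torch.randint(0, 10, (16,))
loss = generalized_cross_entropy(logits, targets)
loss.backward()
print(float(loss))
```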

Journal ArticleDOI
TL;DR: A comprehensive and systematic review and analysis of medical image augmentation work is carried out in this paper, and its research status and development prospects are reviewed, drawing on literature mainly collected from ELSEVIER, IEEE Xplore, and Springer from 2018 to 2021.

Journal ArticleDOI
TL;DR: This paper proposed a bidirectional emotional recurrent unit for conversational sentiment analysis, where a generalized neural tensor block followed by a two-channel classifier is designed to perform context compositionality and sentiment classification, respectively.

Journal ArticleDOI
01 Feb 2022
TL;DR: The FAIR1M dataset as discussed by the authors is a large-scale dataset with more than 1 million instances and more than 40,000 images for fine-grained object detection in high-resolution remote sensing imagery.
Abstract: With the rapid development of deep learning, many deep learning-based approaches have made great achievements in object detection tasks. It is generally known that deep learning is a data-driven approach. Data directly impact the performance of object detectors to some extent. Although existing datasets include common objects in remote sensing images, they still have some scale, category, and image limitations. Therefore, there is a strong requirement for establishing a large-scale object detection benchmark for high-resolution remote sensing images. In this paper, we propose a novel benchmark dataset with more than 1 million instances and more than 40,000 images for Fine-grAined object recognItion in high-Resolution remote sensing imagery which is named as FAIR1M. We collected remote sensing images with a resolution of 0.3 m to 0.8 m from different platforms, which are spread across many countries and regions. All objects in the FAIR1M dataset are annotated with respect to 5 categories and 37 subcategories by oriented bounding boxes. Compared with existing detection datasets that are dedicated to object detection, the FAIR1M dataset has 4 particular characteristics: (1) it is much larger than other existing object detection datasets both in terms of the number of instances and the number of images, (2) it provides richer fine-grained category information for objects in remote sensing images, (3) it contains geographic information such as latitude, longitude and resolution attributes, and (4) it provides better image quality due to the use of a careful data cleaning procedure. Based on the FAIR1M dataset, we propose three fine-grained object detection and recognition tasks. Moreover, we evaluate several state-of-the-art approaches to establish baselines for future research. Experimental results indicate that the FAIR1M dataset effectively represents real remote sensing applications and is quite challenging for existing methods. Considering the fine-grained characteristics, we improve the evaluation metric and introduce the idea of hierarchy detection into the algorithms. We believe that the FAIR1M dataset will contribute to the earth observation community via fine-grained object detection in large-scale real-world scenes. FAIR1M Website: http://gaofen-challenge.com/.
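
A small helper sketch for working with the oriented bounding boxes used by FAIR1M-style annotations: converting a (cx, cy, w, h, angle) box into its four corner points. This parameterization is a common convention assumed here for illustration, not code from the dataset's toolkit.

```python
import numpy as np

def obb_to_corners(cx, cy, w, h, angle_rad):
    """Corners of an oriented box centred at (cx, cy), size (w, h), rotated counter-clockwise by angle_rad."""
    c, s = np.cos(angle_rad), np.sin(angle_rad)
    rot = np.array([[c, -s], [s, c]])
    half = np.array([[-w, -h], [w, -h], [w, h], [-w, h]]) / 2.0   # corner offsets before rotation
    return half @ rot.T + np.array([cx, cy])                       # (4, 2) corner coordinates

print(obb_to_corners(100.0, 50.0, 40.0, 20.0, np.deg2rad(30)))
```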

Journal ArticleDOI
TL;DR: An overview of explainable artificial intelligence (XAI) used in deep learning-based medical image analysis can be found in this article, where a framework of XAI criteria is introduced to classify deep learning-based medical image classification methods.

Journal ArticleDOI
TL;DR: A new approach for fault detection and diagnosis in rotating machinery is proposed, namely unsupervised classification and root cause analysis, together with a comparison between models used in machine learning explainability: SHAP and Local Depth-based Feature Importance for the Isolation Forest (Local-DIFFI).