scispace - formally typeset
Open AccessJournal ArticleDOI

Deep learning applications and challenges in big data analytics

Reads0
Chats0
TLDR
This study explores how Deep Learning can be utilized for addressing some important problems in Big Data Analytics, including extracting complex patterns from massive volumes of data, semantic indexing, data tagging, fast information retrieval, and simplifying discriminative tasks.
Abstract
Big Data Analytics and Deep Learning are two high-focus of data science. Big Data has become important as many organizations both public and private have been collecting massive amounts of domain-specific information, which can contain useful information about problems such as national intelligence, cyber security, fraud detection, marketing, and medical informatics. Companies such as Google and Microsoft are analyzing large volumes of data for business analysis and decisions, impacting existing and future technology. Deep Learning algorithms extract high-level, complex abstractions as data representations through a hierarchical learning process. Complex abstractions are learnt at a given level based on relatively simpler abstractions formulated in the preceding level in the hierarchy. A key benefit of Deep Learning is the analysis and learning of massive amounts of unsupervised data, making it a valuable tool for Big Data Analytics where raw data is largely unlabeled and un-categorized. In the present study, we explore how Deep Learning can be utilized for addressing some important problems in Big Data Analytics, including extracting complex patterns from massive volumes of data, semantic indexing, data tagging, fast information retrieval, and simplifying discriminative tasks. We also investigate some aspects of Deep Learning research that need further exploration to incorporate specific challenges introduced by Big Data Analytics, including streaming data, high-dimensional data, scalability of models, and distributed computing. We conclude by presenting insights into relevant future works by posing some questions, including defining data sampling criteria, domain adaptation modeling, defining criteria for obtaining useful data abstractions, improving semantic indexing, semi-supervised learning, and active learning.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Understanding the Role of Receptive Field of Convolutional Neural Network for Cloud Detection in Landsat 8 OLI Imagery

TL;DR: The authors explored the relationship between the receptive field size and performance of a cloud detection network and found that large receptive fields often leads to loss of spatial details and blurring of boundaries, therefore, it is crucial to understand the role of the receptive fields on the segmentation results, which has rarely been investigated for cloud detection tasks.
Journal ArticleDOI

Nature-Inspired Search Method and Custom Waste Object Detection and Classification Model for Smart Waste Bin

TL;DR: In this paper , a You Only Look Once (YOLO) based model was employed as the object detection algorithm to facilitate the classification of waste according to various categories at the point of waste collection.
Journal ArticleDOI

Sustainable response system building against insider-led cyber frauds in banking sector: a machine learning approach

TL;DR: In this article , the authors focus on the different types of insider-led cyber frauds that gained mainstream attention in recent large-scale fraud events involving prominent Indian banking institutions and propose a framework to ensure a sustainable cyber fraud mitigation ecosystem within the scope of the study.
Journal ArticleDOI

Industrial object and defect recognition utilizing multilevel feature extraction from industrial scenes with Deep Learning approach

TL;DR: In this article , a modified version of the Virtual Geometry Group (VGG) network, called Multipath VGG19, was proposed, which allows for extra local and global feature extraction (multi-level feature extraction) by making use of several processing paths.
Journal ArticleDOI

Assessing the predictive causality of individual based models using Bayesian inference intervention analysis: an application in epidemiology

TL;DR: A method for big data analytics (causal impact) that implements a Bayesian intervention approach to estimating the causal effect of a designed intervention on a time series is used to quantify the deviance between data and IBM outputs.
References
More filters
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Proceedings ArticleDOI

Histograms of oriented gradients for human detection

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.
Posted Content

Efficient Estimation of Word Representations in Vector Space

TL;DR: This paper proposed two novel model architectures for computing continuous vector representations of words from very large data sets, and the quality of these representations is measured in a word similarity task and the results are compared to the previously best performing techniques based on different types of neural networks.
Proceedings ArticleDOI

Object recognition from local scale-invariant features

TL;DR: Experimental results show that robust object recognition can be achieved in cluttered partially occluded images with a computation time of under 2 seconds.
Journal ArticleDOI

Reducing the Dimensionality of Data with Neural Networks

TL;DR: In this article, an effective way of initializing the weights that allows deep autoencoder networks to learn low-dimensional codes that work much better than principal components analysis as a tool to reduce the dimensionality of data is described.
Related Papers (5)