scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Beyond the hype

01 Apr 2015-International Journal of Information Management (Pergamon)-Vol. 35, Iss: 2, pp 137-144
TL;DR: The need to develop appropriate and efficient analytical methods to leverage massive volumes of heterogeneous data in unstructured text, audio, and video formats is highlighted and the need to devise new tools for predictive analytics for structured big data is reinforced.
About: This article is published in International Journal of Information Management.The article was published on 2015-04-01 and is currently open access. It has received 2962 citations till now. The article focuses on the topics: Analytics & Big data.
Citations
More filters
Journal ArticleDOI
TL;DR: In this article, the authors present a state-of-the-art review that presents a holistic view of the BD challenges and BDA methods theorized/proposed/employed by organizations to help others understand this landscape with the objective of making robust investment decisions.

1,267 citations


Cites background or methods from "Beyond the hype"

  • ...On the other hand, the challenges are significant such as data integration complexities (Gandomi & Haider, 2015), lack of skilled personal and sufficient resources (Kim, Trimi, & Chung, 2014), data security and privacy issues (Barnaghi, Sheth, & Henson, 2013), inadequate infrastructure and…...

    [...]

  • ...…say 3Vs [volume, velocity and variety] of data (e.g. Shah, Rabhi, & Ray, 2015), others reported 4Vs [volume, velocity, variety, and variability] of data (e.g. Liao, Yin, Huang, & Sheng, 2014) and 6Vs [volume, velocity, variety, veracity, variability, and value] of data (Gandomi & Haider, 2015)....

    [...]

  • ...Gandomi and Haider (2015) asserts the need to develop new solutions for predictive analytics for structured BD. Predictive analytics are principally based on statistical methods and seeks to uncover patterns and capture relationships in data....

    [...]

  • ...Big Data analytical methods – related to Q2 To facilitate evidence-based decision-making, organizations need efficient methods to process large volumes of assorted data into meaningful comprehensions (Gandomi & Haider, 2015)....

    [...]

  • ...Thus, the necessity to deal with inaccurate and ambiguous data is another facet of BD, which is addressed using tools and analytics developed for management and mining of unreliable data (Gandomi & Haider, 2015)....

    [...]

Journal ArticleDOI
TL;DR: A new CNN based on LeNet-5 is proposed for fault diagnosis which can extract the features of the converted 2-D images and eliminate the effect of handcrafted features and has achieved significant improvements.
Abstract: Fault diagnosis is vital in manufacturing system, since early detections on the emerging problem can save invaluable time and cost. With the development of smart manufacturing, the data-driven fault diagnosis becomes a hot topic. However, the traditional data-driven fault diagnosis methods rely on the features extracted by experts. The feature extraction process is an exhausted work and greatly impacts the final result. Deep learning (DL) provides an effective way to extract the features of raw data automatically. Convolutional neural network (CNN) is an effective DL method. In this study, a new CNN based on LeNet-5 is proposed for fault diagnosis. Through a conversion method converting signals into two-dimensional (2-D) images, the proposed method can extract the features of the converted 2-D images and eliminate the effect of handcrafted features. The proposed method which is tested on three famous datasets, including motor bearing dataset, self-priming centrifugal pump dataset, and axial piston hydraulic pump dataset, has achieved prediction accuracy of 99.79%, 99.481%, and 100%, respectively. The results have been compared with other DL and traditional methods, including adaptive deep CNN, sparse filter, deep belief network, and support vector machine. The comparisons show that the proposed CNN-based data-driven fault diagnosis method has achieved significant improvements.

1,240 citations


Cites methods from "Beyond the hype"

  • ...This provides new opportunities for the data-driven fault diagnosis methods to make full use of the massive mechanical data [5], and has received more and more attentions from the researchers and engineers....

    [...]

Journal ArticleDOI
TL;DR: This paper presents a comprehensive discussion on state-of-the-art big data technologies based on batch and stream data processing based on structuralism and functionalism paradigms and strengths and weaknesses of these technologies are analyzed.

964 citations

Journal ArticleDOI
TL;DR: The role of big data in supporting smart manufacturing is discussed, a historical perspective to data lifecycle in manufacturing is overviewed, and a conceptual framework proposed in the paper is proposed.

937 citations

Journal ArticleDOI
Qinglin Qi1, Fei Tao1
TL;DR: The similarities and differences between big data and digital twin are compared from the general and data perspectives and how they can be integrated to promote smart manufacturing are discussed.
Abstract: With the advances in new-generation information technologies, especially big data and digital twin, smart manufacturing is becoming the focus of global manufacturing transformation and upgrading. Intelligence comes from data. Integrated analysis for the manufacturing big data is beneficial to all aspects of manufacturing. Besides, the digital twin paves a way for the cyber-physical integration of manufacturing, which is an important bottleneck to achieve smart manufacturing. In this paper, the big data and digital twin in manufacturing are reviewed, including their concept as well as their applications in product design, production planning, manufacturing, and predictive maintenance. On this basis, the similarities and differences between big data and digital twin are compared from the general and data perspectives. Since the big data and digital twin can be complementary, how they can be integrated to promote smart manufacturing are discussed.

856 citations

References
More filters
Book ChapterDOI
01 Jan 2011
TL;DR: This chapter introduces the basics of data mining, reviews social media, discusses how to mine social media data, and highlights some illustrative examples with an emphasis on social networking sites and blogs.
Abstract: The rise of online social media is providing a wealth of social network data. Data mining techniques provide researchers and practitioners the tools needed to analyze large, complex, and frequently changing social media data. This chapter introduces the basics of data mining, reviews social media, discusses how to mine social media data, and highlights some illustrative examples with an emphasis on social networking sites and blogs.

165 citations

Book ChapterDOI
01 Jan 2012
TL;DR: This chapter provides a survey of the major work on named entity recognition and relation extraction in the past few decades, with a focus on work from the natural language processing community.
Abstract: Information extraction is the task of finding structured information from unstructured or semi-structured text. It is an important task in text mining and has been extensively studied in various research communities including natural language processing, information retrieval and Web mining. It has a wide range of applications in domains such as biomedical literature mining and business intelligence. Two fundamental tasks of information extraction are named entity recognition and relation extraction. The former refers to finding names of entities such as people, organizations and locations. The latter refers to finding the semantic relations such as FounderOf and HeadquarteredIn between entities. In this chapter we provide a survey of the major work on named entity recognition and relation extraction in the past few decades, with a focus on work from the natural language processing community.

158 citations


"Beyond the hype" refers background in this paper

  • ...Two sub-tasks in IE are Entity Recogniion (ER) and Relation Extraction (RE) (Jiang, 2012)....

    [...]

Book ChapterDOI
01 Jan 2011
TL;DR: Analysis of interaction networks in the social context can result in the discovery of important patterns and potentially shed light on important properties governing the growth of such networks.
Abstract: Data sets originating from many different real world domains can be represented in the form of interaction networks in a very natural, concise and meaningful fashion. This is particularly true in the social context, especially given recent advances in Internet technologies and Web 2.0 applications leading to a diverse range of evolving social networks. Analysis of such networks can result in the discovery of important patterns and potentially shed light on important properties governing the growth of such networks.

95 citations


"Beyond the hype" refers methods in this paper

  • ...In this regard, community detection is similar to clustering (Aggarwal, 2011), a data mining technique used to partition a data set into disjoint subsets based on the similarity of data points....

    [...]

Journal ArticleDOI
TL;DR: This research contributes to developing a new system for profiling company BI factors from news articles, to providing new empirical findings to enhance understanding in BI factor extraction and categorization, and to addressing an important yet under-explored concern of BI analysis.

57 citations

BookDOI
21 Sep 2010
TL;DR: This edited volume deals with different CI techniques for solving real world Power Industry problems and provides the power utilities with innovative solutions for efficient analysis, optimal operation and control and intelligent decision making.
Abstract: Computational Intelligence (CI) is one of the most important powerful tools for research in the diverse fields of engineering sciences ranging from traditional fields of civil, mechanical engineering to vast sections of electrical, electronics and computer engineering and above all the biological and pharmaceutical sciences. The existing field has its origin in the functioning of the human brain in processing information, recognizing pattern, learning from observations and experiments, storing and retrieving information from memory, etc. In particular, the power industry being on the verge of epoch changing due to deregulation, the power engineers require Computational intelligence tools for proper planning, operation and control of the power system. Most of the CI tools are suitably formulated as some sort of optimization or decision making problems. These CI techniques provide the power utilities with innovative solutions for efficient analysis, optimal operation and control and intelligent decision making. This edited volume deals with different CI techniques for solving real world Power Industry problems. The technical contents will be extremely helpful for the researchers as well as the practicing engineers in the power industry.

54 citations