About: Pune Institute of Computer Technology is a based out in . It is known for research contribution in the topics: Deep learning & Cloud computing. The organization has 525 authors who have published 386 publications receiving 2430 citations.
Papers published on a yearly basis
••01 Nov 2017
TL;DR: This paper explores the use of AutoRegressive Integrated Moving Average (ARIMA) forecasting on the time series data collected from various sensors from a Slitting Machine, to predict the possible failures and quality defects, thus improving the overall manufacturing process.
Abstract: The industrial Internet of Things (IIoT) is the use of Internet of Things (IoT) technologies in manufacturing which harnesses the machine data generated by various sensors and applies various analytics on it to gain useful information. The data captured by the machines is usually accompanied by a date time component which proves vital for predictive modelling. This paper explores the use of AutoRegressive Integrated Moving Average (ARIMA) forecasting on the time series data collected from various sensors from a Slitting Machine, to predict the possible failures and quality defects, thus improving the overall manufacturing process. The use of Machine Learning thus proves a vital component in IIoT having use cases in quality management and quality control, lowering the cost of maintenance and improving the overall manufacturing process.
••27 Jun 2011
TL;DR: Decision Tree algorithms can be used as a replacement for statistical procedures to find data, to extract text, to find missing data in a class, to improve search engines and it also finds various applications in medical fields.
Abstract: A decision tree is a tree whose internal nodes can be taken as tests (on input data patterns) and whose leaf nodes can be taken as categories (of these patterns). These tests are filtered down through the tree to get the right output to the input pattern. Decision Tree algorithms can be applied and used in various different fields. It can be used as a replacement for statistical procedures to find data, to extract text, to find missing data in a class, to improve search engines and it also finds various applications in medical fields. Many Decision tree algorithms have been formulated. They have different accuracy and cost effectiveness. It is also very important for us to know which algorithm is best to use. The ID3 is one of the oldest Decision tree algorithms. It is very useful while making simple decision trees but as the complications increases its accuracy to make good Decision trees decreases. Hence IDA (intelligent decision tree algorithm) and C4.5 algorithms have been formulated.
••01 Nov 2011
TL;DR: In this paper, the state of the art from 1970s of machine printed and handwritten Devanagari optical character recognition (OCR) is discussed in various sections of the paper.
Abstract: In India, more than 300 million people use Devanagari script for documentation. There has been a significant improvement in the research related to the recognition of printed as well as handwritten Devanagari text in the past few years. State of the art from 1970s of machine printed and handwritten Devanagari optical character recognition (OCR) is discussed in this paper. All feature-extraction techniques as well as training, classification and matching techniques useful for the recognition are discussed in various sections of the paper. An attempt is made to address the most important results reported so far and it is also tried to highlight the beneficial directions of the research till date. Moreover, the paper also contains a comprehensive bibliography of many selected papers appeared in reputed journals and conference proceedings as an aid for the researchers working in the field of Devanagari OCR.
TL;DR: Various feature extraction and classification techniques associated with the offline handwriting recognition of the regional scripts are discussed in this survey, which will serve as a compendium not only for researchers in India, but also for policymakers and practitioners in India.
Abstract: Offline handwriting recognition in Indian regional scripts is an interesting area of research as almost 460 million people in India use regional scripts. The nine major Indian regional scripts are Bangla (for Bengali and Assamese languages), Gujarati, Kannada, Malayalam, Oriya, Gurumukhi (for Punjabi language), Tamil, Telugu, and Nastaliq (for Urdu language). A state-of-the-art survey about the techniques available in the area of offline handwriting recognition (OHR) in Indian regional scripts will be of a great aid to the researchers in the subcontinent and hence a sincere attempt is made in this article to discuss the advancements reported in this regard during the last few decades. The survey is organized into different sections. A brief introduction is given initially about automatic recognition of handwriting and official regional scripts in India. The nine regional scripts are then categorized into four subgroups based on their similarity and evolution information. The first group contains Bangla, Oriya, Gujarati and Gurumukhi scripts. The second group contains Kannada and Telugu scripts and the third group contains Tamil and Malayalam scripts. The fourth group contains only Nastaliq script (Perso-Arabic script for Urdu), which is not an Indo-Aryan script. Various feature extraction and classification techniques associated with the offline handwriting recognition of the regional scripts are discussed in this survey. As it is important to identify the script before the recognition step, a section is dedicated to handwritten script identification techniques. A benchmarking database is very important for any pattern recognition related research. The details of the datasets available in different Indian regional scripts are also mentioned in the article. A separate section is dedicated to the observations made, future scope, and existing difficulties related to handwriting recognition in Indian regional scripts. We hope that this survey will serve as a compendium not only for researchers in India, but also for policymakers and practitioners in India. It will also help to accomplish a target of bringing the researchers working on different Indian scripts together. Looking at the recent developments in OHR of Indian regional scripts, this article will provide a better platform for future research activities.
01 Dec 2017
TL;DR: The authors applied term frequency-inverse document frequency (TF-IDF) of bi-grams and probabilistic context free grammar (PCFG) detection to a corpus of about 11,000 articles.
Abstract: This paper explores the application of natural language processing techniques for the detection of ‘fake news’, that is, misleading news stories that come from non-reputable sources. Using a dataset obtained from Signal Media and a list of sources from OpenSources.co, we apply term frequency-inverse document frequency (TF-IDF) of bi-grams and probabilistic context free grammar (PCFG) detection to a corpus of about 11,000 articles. We test our dataset on multiple classification algorithms — Support Vector Machines, Stochastic Gradient Descent, Gradient Boosting, Bounded Decision Trees, and Random Forests. We find that TF-IDF of bi-grams fed into a Stochastic Gradient Descent model identifies non-credible sources with an accuracy of 77.2%, with PCFGs having slight effects on recall.
Showing all 525 results
|Mahinder Pal Singh Bhatia
|Sheetal S. Sonawane
|A. N. Gaikwad
|Balwant A. Sonkamble
|Sheetal U. Bhandari
Related Institutions (5)
Amrita Vishwa Vidyapeetham
11K papers, 76.1K citations
Maulana Azad National Institute of Technology
3.3K papers, 52.6K citations
National Institute of Technology, Rourkela
10.7K papers, 150.1K citations
12.7K papers, 86K citations
Motilal Nehru National Institute of Technology Allahabad
5K papers, 61.8K citations