scispace - formally typeset
Open AccessJournal ArticleDOI

From Digitized Images to Online Catalogs Data Mining a Sky Survey

Usama M. Fayyad, +2 more
- 15 Mar 1996 - 
- Vol. 17, Iss: 2, pp 51-66
Reads0
Chats0
TLDR
This work focuses on the automation of the cataloging effort of a major sky survey and the availability of digital libraries in general, and the SKICAT system automates the reduction and analysis of the three terabytes worth of images.
Abstract
The value of scientific digital-image libraries seldom lies in the pixels of images For large collections of images, such as those resulting from astronomy sky surveys, the typical useful product is an online database cataloging entries of interest We focus on the automation of the cataloging effort of a major sky survey and the availability of digital libraries in general The SKICAT system automates the reduction and analysis of the three terabytes worth of images, expected to contain on the order of 2 billion sky objects For the primary scientific analysis of these data, it is necessary to detect, measure, and classify every sky object SKICAT integrates techniques for image processing, classification learning, database management, and visualization The learning algorithms are trained to classify the detected objects and can classify objects too faint for visual classification with an accuracy level exceeding 90 percent This accuracy level increases the number of classified objects in the final catalog threefold relative to the best results from digitized photographic sky surveys to date Hence, learning algorithms played a powerful and enabling role and solved a difficult, scientifically significant problem, enabling the consistent, accurate classification and the ease of access and analysis of an otherwise unfathomable data set

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

From Data Mining to Knowledge Discovery in Databases

TL;DR: An overview of this emerging field is provided, clarifying how data mining and knowledge discovery in databases are related both to each other and to related fields, such as machine learning, statistics, and databases.
Patent

Data mining for managing marketing resources

TL;DR: In this paper, a method for managing a marketing campaign includes the following: first, a data mining engine is trained with a set of training data comprising the user data base, and a predetermined characteristic pertaining to the market campaign is input to the engine such that, in response to the input, a subset of the users in the database is obtained that have the highest correlation to the characteristic.
Journal ArticleDOI

Knowledge Discovery Via Multiple Models

TL;DR: CMM, a meta-learner that seeks to retain most of the accuracy gains of multiple model approaches, while still producing a single comprehensible model, is proposed and evaluated.
Book

Machine learning in games: a survey

TL;DR: This paper provides a survey of previously published work on machine learning in game playing around a variety of problems that typically arise in gamePlaying and that can be solved with machine learning methods.
Patent

Method for apparatus for efficient mining of classification models from databases

TL;DR: In this paper, the authors proposed a method for the construction of a classification model from a large database by minimizing the number of database scans and making as much use of the computer's fast main memory (RAM) as possible.
References
More filters
Book

C4.5: Programs for Machine Learning

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.
Book

Classification and regression trees

Leo Breiman
TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
Journal ArticleDOI

Eigenfaces for recognition

TL;DR: A near-real-time computer system that can locate and track a subject's head, and then recognize the person by comparing characteristics of the face to those of known individuals, and that is easy to implement using a neural network architecture.