scispace - formally typeset
Open AccessJournal ArticleDOI

Principal Component Analysis: A Natural Approach to Data Exploration

Reads0
Chats0
TLDR
In this paper, the potential of using PCA for dimensionality reduction is illustrated on several real-world datasets, and several theoretical and practical aspects of PCA are reported in an accessible and integrated manner.
Abstract
Principal component analysis (PCA) is often applied for analyzing data in the most diverse areas. This work reports, in an accessible and integrated manner, several theoretical and practical aspects of PCA. The basic principles underlying PCA, data standardization, possible visualizations of the PCA results, and outlier detection are subsequently addressed. Next, the potential of using PCA for dimensionality reduction is illustrated on several real-world datasets. Finally, we summarize PCA-related approaches and other dimensionality reduction techniques. All in all, the objective of this work is to assist researchers from the most diverse areas in using and interpreting PCA.

read more

Citations
More filters
Journal ArticleDOI

A similarity approach to cities and features

TL;DR: In this article , the authors consider 20 European cities from 5 respective countries and with comparable populations, each of which characterized in terms of four topological as well as one geometrical feature, and then map into respective networks by considering their pairwise similarity as gauged by the coincidence methodology, which consists of combining the Jaccard and interiority indices.
Journal ArticleDOI

Coincidence complex networks

TL;DR: In this article , the authors describe the application of the real-valued Jaccard and realvalued coincidence similarity indices for translating generic datasets into networks, where two data elements are linked whenever the similarity between their respective features, gauged by some similarity index, is greater than a given threshold.
Journal ArticleDOI

Consumers’ Attitudes and Purchase Intention for a Vitamin-Enriched Extra Virgin Olive Oil

TL;DR: In this article , the authors examined Italian consumer preferences for extra virgin olive oil (EVOO) enriched with vitamins and analyzed the key drivers that affect consumer choices for this product, and assessed consumers' intention to purchase the enriched product compared to the conventional one.
Journal ArticleDOI

Computational Modeling of Stereotype Content in Text

TL;DR: This article presented a computational method to mine large, naturally occurring datasets of text for sentences that express perceptions of a social group of interest, and then map these sentences to the two-dimensional plane of perceived warmth and competence for comparison and interpretation.
Journal ArticleDOI

Spatiotemporal Evolution of the Urban Thermal Environment Effect and Its Influencing Factors: A Case Study of Beijing, China

TL;DR: In this paper , the authors analyzed the spatiotemporal evolution of the urban thermal environment effect within the sixth ring road of Beijing was analyzed by inversion of remote sensing data to obtain the LST in 2004, 2009, 2014, and 2019.
References
More filters
Journal Article

Visualizing Data using t-SNE

TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
Journal ArticleDOI

Regularization and variable selection via the elastic net

TL;DR: It is shown that the elastic net often outperforms the lasso, while enjoying a similar sparsity of representation, and an algorithm called LARS‐EN is proposed for computing elastic net regularization paths efficiently, much like algorithm LARS does for the lamba.
Journal ArticleDOI

Cluster analysis and display of genome-wide expression patterns

TL;DR: A system of cluster analysis for genome-wide expression data from DNA microarray hybridization is described that uses standard statistical algorithms to arrange genes according to similarity in pattern of gene expression, finding in the budding yeast Saccharomyces cerevisiae that clustering gene expression data groups together efficiently genes of known similar function.
Journal ArticleDOI

Eigenfaces for recognition

TL;DR: A near-real-time computer system that can locate and track a subject's head, and then recognize the person by comparing characteristics of the face to those of known individuals, and that is easy to implement using a neural network architecture.
Trending Questions (3)
What is purpose for principal component analysis?

The purpose of principal component analysis (PCA) is to assist researchers in analyzing and interpreting data in various fields.

What is pcc analysis?

PCA (Principal Component Analysis) is a technique used for analyzing data in various fields. It involves data standardization, visualization of results, outlier detection, and dimensionality reduction.

What's the principal component analysis?

Principal component analysis (PCA) is a technique used for analyzing data in various fields. It involves data standardization, visualization of results, outlier detection, and dimensionality reduction.