What future work do the authors propose in "A two level approach for scene recognition"?

In future work, the authors intend to investigate unsupervised clustering methods for low-level image patch classification. In particular, they plan to apply their unsupervised, iterative LDA-GMM algorithm [18]. They also plan to investigate a hybrid approach in which classified images are used as labeled data to compute an initial LDA projection, which is then refined with new, unlabeled images using iterative LDA-GMM. Finally, because LDA is only optimal when each class has a Gaussian density with a common covariance matrix, the non-parametric discriminant analysis proposed in [34] will be tested as a means to generalize their approach to a more comprehensive image database, which may contain thousands of various kinds of photos.

What contributions do the authors describe in "A two level approach for scene recognition"?

In this paper, the authors present a stratified approach to both binary (outdoor-indoor) and multiple-category scene classification. They first learn mixture models for 20 basic classes of local image content based on color and texture information; applied to a test image, these models produce 20 probability density response maps (PDRMs). The authors then extract some very simple features from those PDRMs and use them to train a bagged LDA classifier for 10 scene categories. To test this classification system, the authors created a labeled database of 1500 photos taken under very different environment and lighting conditions, using different cameras, and from 43 persons over 5 years.

How do the authors get a set of LDA scene classifiers over these feature vectors?

By employing the random subspace method [12, 28] and bootstrapping [31], the authors obtain a set of LDA scene classifiers over these feature vectors.

How do the authors evaluate the membership density of the image patches for each class?

Once the authors obtain 20 Gaussian mixture models {πik, P (z; θik), i = 1, 2, ..., 20} for 20 material classes, the authors can evaluate the membership density values of image patches for each material class.

How do the authors prepare the training data?

To prepare the training data, the authors manually crop image regions for each material in their database, and randomly draw dozens of 25 by 25 pixel patches from each rectangle.

What is the classification method for indoor-outdoor scenes?

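For indoor-outdoor recognition, the LDA-projected subspace has only one dimension, so the authors find the optimal decision boundary from the training data and apply it to the testing data. Moment features of PDRMs are useful in outdoor scenes, but reduce the recognition rate for indoor scenes.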

What is the common method used to find the marginal distributions of the basic filter banks?

For texture modeling, Zhu et al [35] pursue features to find the marginal distributions which are also the linear combinations of the basic filter banks, but they use a much more complex method (Monte Carlo Markov Chain) to stochastically search the space of linear coefficients.

What is the combination of LDA and Gaussian mixture models?

The authors describe a combination of LDA and Gaussian mixture models that achieves a good balance of discrimination and smoothness.

(Open Access) A two level approach for scene recognition (2005) | Le Lu

Q: How do the authors create multiple LDA classifiers?

To improve the classification rate, the authors have implemented variations on random subspace generation [12, 28] and bootstrapping [31] to create multiple LDA classifiers.

Q: What is the simplest way to compute the density of a material class?

For any given photo, the authors scan local image patches, extract their color-texture feature vector, normalize each of its components from 0 to 1 [1], project it to the lower dimensional subspace Z computed by LDA, and finally compute the density value given by equation (1) for all 20 material classes.

Q: How many mixtures are used for each class?

The number of mixtures gc and the model parameters {πck, θck} for each material class c are initialized by spectral clustering [21] and learned in an iterative Expectation-Maximization manner [31, 7] where gc ranged from 4 to 8 depending on the material class.

Q: What is the way to evaluate the texture discrimination performance of Haralick?

The authors evaluate their texture discrimination performances experimentally in section 4 and find Haralick features generally perform better.

A Two Level Approach for Scene Recognition∗

Le Lu, Computer Science Department, Johns Hopkins University, Baltimore, MD 21218

Kentaro Toyama, Microsoft Research, One Microsoft Way, Redmond, WA 98052

Gregory D. Hager, Computer Science Department, Johns Hopkins University, Baltimore, MD 21218

Abstract

Classifying pictures into one of several semantic cat-

egories is a classical image understanding problem. In

this paper, we present a stratiﬁed approach to both binary

(outdoor-indoor) and multiple category of scene classiﬁca-

tion. We ﬁrst learn mixture models for 20 basic classes of

local image content based on color and texture information.

Once trained, these models are applied to a test image, and

produce 20 probability density response maps (PDRM) in-

dicating the likelihood that each image region was produced

by each class. We then extract some very simple features

from those PDRMs, and use them to train a bagged LDA

classiﬁer for 10 scene categories. For this process, no ex-

plicit region segmentation or spatial context model are com-

puted.

To test this classiﬁcation system, we created a labeled

database of 1500 photos taken under very different envi-

ronment and lighting conditions, using different cameras,

and from 43 persons over 5 years. The classiﬁcation rate

of outdoor-indoor classiﬁcation is 93.8%, and the classiﬁ-

cation rate for 10 scene categories is 90.1%. As a byprod-

uct, local image patches can be contextually labeled into

the 20 basic material classes by using Loopy Belief Propa-

gation [33] as an anisotropic ﬁlter on PDRMs, producing

an image-level segmentation if desired.

1 Introduction

Classifying pictures into semantic types of scenes [24,

26, 22] is a classical image understanding problem which

requires the effective interaction of high level semantic in-

formation and low level image observations. Our goal is

to build a very practical prototype for scene classiﬁcation

of typical consumer photos, along the lines of the Kodak

system [22]. Thus, we are interested in systems that are ac-

curate, efﬁcient, and which can work with a wide range of

photos and photographic quality.

∗ The work was partially performed when the first author was a summer intern in Microsoft Research.

Given the extremely large within-category variations in typical photographs, it is usually simpler and thus easier to break the problem of scene classification into a two-step process. In this paper, we first train local, image patch

based color-texture Gaussian Mixture models (GMM) to

detect each of 20 materials in a local image patch. These

models are used to scan an image and generate 20 local re-

sponses for each pixel. Each response map, called a Prob-

ability Density Response Map (PDRM), can be taken as a

real-valued image indicating the relative likelihood of each

material at each image location. We then compute moments

from the response maps and form a feature vector for each

photo. By employing the random subspace method [12, 28]

and bootstrapping [31], we obtain a set of LDA scene clas-

siﬁers over these feature vectors. These classiﬁcation re-

sults are combined into the ﬁnal decision through bagging

[2]. After learning the local and global models, a typical

1200 × 800 image can be classiﬁed in less than 1 second

with our unoptimized Matlab implementation. Therefore

there is a potential to develop a real-time scene classiﬁer

upon our approach. A complete diagram of our approach is

shown in Figure 1.

There are several related efforts in this area. Luo et al.

[19, 22] propose a bottom-up approach to ﬁrst ﬁnd and label

well-segmented image regions, such as water, beach, sky,

and then to learn the spatial contextual model among re-

gions. A Bayesian network codes these relational depen-

dencies. By comparison, we do not perform an explicit

spatial segmentation, and we use relatively simple (LDA-

based) classiﬁcation methods. Perona et al. [8, 30] present

a constellation model of clustered feature components for

object recognition. Their method works well for detecting

single objects, but strongly depends on the performance and

reliability of the interest detector [13]. In the case of scene

classiﬁcation, we need to model more than one class of ma-

terial, where classes are non-structural and do not have significant features (such as foliage, rock, etc.) [13]. This

motivates our use of a GMM on the feature space. In order

to maintain good stability, we estimate the GMM in a lin-

ear subspace computed by LDA. These density models are

quite ﬂexible and can be used to model a wide variety of

image patterns with a good compromise between discrimi-

nation and smoothness.

Kumar et al. [14, 15] propose the use of Markov random

field (MRF)-based spatial contextual models to detect man-made buildings in a natural landscape. They build a multi-scale color and textural descriptor to capture the local dependence among building and non-building image blocks and

use MRF to model the prior of block labels. In our work,

we have found that simple local labeling sufﬁces to gener-

ate good classiﬁcation results; indeed regularization using

loopy belief propagation method [33] yields no signiﬁcant

improvement in performance. Thus, we claim that there is

no need to segment image regions explicitly for scene clas-

siﬁcation as other authors have done [22, 19, 15].

Linear discriminant analysis (LDA) is an optimization

method to compute linear combinations of features that

have more power to separate different classes. For texture

modeling, Zhu et al [35] pursue features to ﬁnd the mar-

ginal distributions which are also the linear combinations

of the basic ﬁlter banks, but they use a much more com-

plex method (Monte Carlo Markov Chain) to stochastically

search the space of linear coefﬁcients. In our case, the goal

is not to build a generative model for photos belonging to

different scenes, but simply to discriminate among them.

We show a simple method such as LDA, if designed prop-

erly, can be very effective and efﬁcient to build a useful clas-

siﬁer for complex scenes.

We organize the rest of the paper as follows. In section 2, we present the local image-level processing used

to create PDRMs. In section 3, we describe how PDRMs

are processed to perform scene classiﬁcation. Experimen-

tal results and analysis on the performance of patch based

material detector and image based scene classiﬁcation on a

database of 1500 personal photos taken by 43 users using

traditional or digital cameras over the last 5 years are given

in section 4. Finally we summarize the paper and discuss

the future work in section 5.

2 Local Image-Level Processing

The role of image-level processing is to roughly classify

local image content at each location in the image. The gen-

eral approach is to compute feature vectors of both color

and texture, and then develop classiﬁers for these features.

In our current implementation, we have chosen to perform

supervised feature classiﬁcation. Although arguably less

practical than corresponding unsupervised methods, super-

vised classiﬁcation permits us to control the structure of the

representations built at this level, and thereby to better un-

derstand the relationship between low-level representations

and overall system performance.

In this step, we compute 20 data driven probabilistic density models to describe the color-texture properties of image patches of 20 predefined materials.¹ These 20 categories are: building, blue sky, bush, other (mostly trained with human clothes), cloudy sky, dirt, mammal, pavement, pebble, rock, sand, skin, tree, water, shining sky, grass, snow, carpet, wall and furniture.

¹ The vocabulary of materials to be detected is designed by considering their popularity in the usual family photos. This definition is, of course, not unique or optimized.

To prepare the training data, we manually crop image re-

gions for each material in our database, and randomly draw

dozens of 25 by 25 pixel patches from each rectangle. Al-

together, we have 2000 image patches for each material.

Some examples of the cropped images and sampled image

patches are shown in Figure 2. For simplicity, we do not

precisely follow the material boundaries in the photos while

cropping. Some outlier features are thus included in the

training patches. Fortunately these outliers are smoothed

nicely by learning continuous mixture density models.

Multi-scale image representation and automatic scale selection have been topics of intense discussion over the last decade [17, 20, 13, 6, 14]. In general, the approach of most authors has been to first normalize images with respect to the estimated scale of local image regions before learning. However, it is not a trivial problem to reliably recover the local image scales for a collection of 1500 family photos. We instead choose to train the GMM using the raw image patches extracted directly from the original pictures. For the labeled image patches with closer and coarser views, their complex color-texture distributions can be well approximated by a multi-modal Gaussian mixture model during clustering.

2.1 Color-Texture Descriptor for Image Patches

Our ﬁrst problem is to extract a good color-texture de-

scriptor which effectively allows us to distinguish the ap-

pearance of different materials. In the domain of color, ex-

perimental evaluation of several color models has not indi-

cated signiﬁcant performance differences among color rep-

resentations. As a result, we simply represent the color of

an image patch as the mean color in RGB space.

There are also several methods to extract texture feature

vectors for image patches. Here we consider two: ﬁlter

banks, and the Haralick texture descriptor. Filter banks have

been widely used for 2 and 3 dimensional texture recognition [16, 5, 27]. We apply the Leung-Malik (LM) filter

bank [16] which consists of 48 isotropic and anisotropic

ﬁlters with 6 directions, 3 scales and 2 phases. Thus, each

patch is represented by a 48 component feature vector.

The Haralick texture descriptor [10] is designed for im-

age classiﬁcation and has been adopted in the area of im-

age retrieval [1]. Haralick texture measurements are de-

rived from the Gray Level Co-occurrence Matrix (GLCM).

GLCM is also called the Grey Tone Spatial Dependency

Matrix which is a tabulation of how often different combi-

nations of pixel brightness values (grey levels) occur in an

image region. GLCM texture considers the relation between

two pixels at a time, called the reference and the neighbor

pixel. Their spatial relation can be decided by two factors, the orientation and offset. Given any image patch, we search all the pixel pairs satisfying a certain spatial relation and record their second order gray level distributions with a 2 dimensional histogram indexed by their brightness values.² Haralick also designed 14 different texture features [10] based on the GLCM. We selected 5 texture features including dissimilarity, Angular Second Moment (ASM), mean, standard deviation (STD) and correlation. Definitions for these can be found in Appendix A.

There is no general argument that either the filter bank features or the Haralick features are the better texture descriptor. We evaluate their texture discrimination performances experimentally in section 4 and find Haralick features generally perform better.

² The reference and neighbor pixel intensities normally need to be quantized into 16 or fewer levels instead of 256, which results in a GLCM that is not too sparse.

Figure 1: The diagram of our two level approach for scene recognition. Patch-level processing: labeled image patches for each of 20 materials (e.g. blue sky, tree, pavement), patch based color-texture feature extraction, LDA projection for material classes, and a patch-based discriminative Gaussian mixture density model for each of 20 materials (e.g. GMM for blue sky, GMM for tree, GMM for pavement). Image-level processing: probability density response maps for each of 20 materials (e.g. PDRM for blue sky, PDRM for tree, PDRM for pavement), moments feature extraction and vectorization of each PDRM, LDA projection for scene categories (bootstrapping + random subspace sampling), and bagging of LDA classifiers for scene categories. The dashed line boxes are the input data or output learned models; the solid line boxes represent the functions of our algorithm.

Figure 2: (a, c, e, g) Examples of cropped subimages of building, building under closer view, human skin, and grass respectively. (b, d, f, h) Examples of image patches of these materials including local patches sampled from the above subimages. Each local image patch is 25 by 25 pixels.
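The GLCM construction and the five selected Haralick features can be sketched as follows. This is an illustrative sketch, not the authors' implementation: the formulas are the commonly used GLCM definitions (the paper's own definitions are in its Appendix A, which is not reproduced here), and the function names `glcm` and `haralick_features`, the symmetric counting, and the single horizontal offset are assumptions made for the example.

```python
import numpy as np

def glcm(patch, levels=16, offset=(0, 1)):
    """Normalized, symmetric gray level co-occurrence matrix of a 2-D patch.

    patch:  2-D array of intensities in [0, 255].
    offset: (row, col) displacement from the reference to the neighbor pixel.
    Quantizing to 16 levels follows the footnote in the text; symmetric
    counting is an assumption of this sketch.
    """
    q = (np.asarray(patch, dtype=np.float64) * levels / 256.0).astype(int)
    q = np.clip(q, 0, levels - 1)
    dr, dc = offset
    rows, cols = q.shape
    # Reference pixels and their neighbors at the given offset.
    ref = q[max(0, -dr):rows - max(0, dr), max(0, -dc):cols - max(0, dc)]
    nbr = q[max(0, dr):rows - max(0, -dr), max(0, dc):cols - max(0, -dc)]
    m = np.zeros((levels, levels))
    np.add.at(m, (ref.ravel(), nbr.ravel()), 1.0)
    m = m + m.T  # count each pixel pair in both directions
    return m / m.sum()

def haralick_features(p):
    """Dissimilarity, ASM, mean, STD and correlation of a normalized GLCM."""
    levels = p.shape[0]
    i, j = np.indices((levels, levels))
    dissimilarity = np.sum(p * np.abs(i - j))
    asm = np.sum(p ** 2)
    mean = np.sum(p * i)
    std = np.sqrt(np.sum(p * (i - mean) ** 2))
    # For a symmetric GLCM the row and column means and STDs coincide.
    if std > 0:
        correlation = np.sum(p * (i - mean) * (j - mean)) / std ** 2
    else:
        correlation = 1.0  # convention chosen here for a perfectly flat patch
    return np.array([dissimilarity, asm, mean, std, correlation])
```

A texture descriptor for a patch could then be built by concatenating `haralick_features(glcm(patch, offset=o))` over a few offsets and appending the mean RGB color; which offsets and orientations the paper uses is not stated in this section.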

2.2 Discriminative Mixture Density Models for 20 Materials

The color and texture features for 2000 image patches

form, in principle, an empirical model for each material.

However, classifying new patches against the raw features

would require the solution to a high-dimensional nearest-

neighbor problem, and the result would be sensitive to noise

and outliers. Instead, we compute a continuous membership

function using a Gaussian mixture model.

Although we have 2000 training samples, our feature

vectors have 40 dimensions, so the training set is still too

sparse to learn a good mixture model without dimensional

reduction. Because one of our purposes is to maximize the

discrimination among different materials, Linear Discrim-

inant Analysis (LDA) [31] was chosen to project the data

into a subspace where each class is well separated. The

LDA computation is reviewed in appendix B.

When each class has a Gaussian density with a common

covariance matrix, LDA is the optimal transform to sepa-

rate data from different classes. Unfortunately the material

color-texture distributions all have multiple modes because

the training image patches are sampled from a large variety

of photos. Therefore we have two options: employ LDA to

discriminate among 20 material classes; or use LDA to sep-

arate all the modes of materials. Although the latter seems

closer to the model for which LDA was designed, we found

its material classiﬁcation rate is worse because the optimal

separation among the multiple modes within the same ma-

terial class is irrelevant. Therefore we choose the former.

The LDA computation provides a projection of the origi-

nal feature space into a lower-dimensional feature space Z.

We assume that the color-texture features of each material class are described by a finite mixture distribution on Z of the form

$$P(z|c) = \sum_{k=1}^{g_c} \pi_{ck} \, G(z; \mu_{ck}, \Sigma_{ck}), \quad c = 1, 2, ..., 20 \tag{1}$$

where the $\pi_{ck}$ are the mixing proportions ($\sum_{k=1}^{g_c} \pi_{ck} = 1$) and $G(z; \mu_{ck}, \Sigma_{ck})$ is a multivariate Gaussian function depending on a parameter vector $\theta_{ck}$. The number of mixtures $g_c$ and the model parameters $\{\pi_{ck}, \theta_{ck}\}$ for each material class $c$ are initialized by spectral clustering [21] and learned in an iterative Expectation-Maximization manner [31, 7] where $g_c$ ranged from 4 to 8 depending on the material class. As a summary, discriminative Gaussian mixture models are obtained by applying LDA across the material classes and learning the GMM within each material class, respectively.
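As a hedged illustration of equation (1), the sketch below evaluates a Gaussian mixture density for a batch of LDA-projected patch vectors and picks the class with the largest density. It assumes the mixture parameters are already fitted (the spectral clustering initialization and EM training are not shown); the function names and the `(weights, means, covs)` tuple layout are hypothetical.

```python
import numpy as np

def gaussian_density(z, mean, cov):
    """Multivariate Gaussian G(z; mean, cov) for each row of z (shape n x d)."""
    d = mean.shape[0]
    diff = z - mean
    solved = np.linalg.solve(cov, diff.T).T          # cov^{-1} (z - mean)
    expo = -0.5 * np.sum(diff * solved, axis=1)
    norm = np.sqrt((2.0 * np.pi) ** d * np.linalg.det(cov))
    return np.exp(expo) / norm

def mixture_density(z, weights, means, covs):
    """P(z|c) of equation (1): weighted sum of the class's Gaussian components."""
    z = np.atleast_2d(z)
    return sum(w * gaussian_density(z, m, c)
               for w, m, c in zip(weights, means, covs))

def classify_patches(z, class_models):
    """Density of every patch under every class model, plus the argmax class.

    class_models: list of (weights, means, covs) tuples, one per material class
    (a hypothetical layout chosen for this sketch).
    """
    densities = np.stack([mixture_density(z, *model) for model in class_models],
                         axis=1)
    return densities, np.argmax(densities, axis=1)
```

Each column of `densities` corresponds to one material class, so evaluating it over a grid of patches yields one response map per class.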

3 Global Image Processing

Once we obtain 20 Gaussian mixture models $\{\pi_{ik}, P(z; \theta_{ik}), i = 1, 2, ..., 20\}$ for 20 material classes,

we can evaluate the membership density values of image

patches for each material class. For any given photo, we

scan local image patches, extract their color-texture feature

vector, normalize each of its components from 0 to 1 [1],

project it to the lower dimensional subspace Z computed

by LDA, and ﬁnally compute the density value given by

equation (1) for all 20 material classes. The result is 20 real-valued grid maps³ representing membership support for each of the 20 classes. An example is shown in Figure 3. Two examples of the local patch labeling for indoor and outdoor photos are shown in Figure 4.

³ The size of the map depends on the original photo size and the patches' spatial sampling intervals.

Figure 4: (a) The local patch material labeling results of an indoor photo. (b) The local patch material labeling results of an outdoor photo. Loopy belief propagation is used for enhancement. The colored dots represent the material label and the boundaries are manually overlaid for illustration purposes only. Material labels shown in the panels: skin, bush, furniture, wall & curtain, pebble, blue sky, water, pavement, sand, rock, grass, other, building.

Our next goal is to classify the photos into one of ten categories: cityscape, landscape, mountain, beach, snow,

other outdoors, portrait, party, still life and other indoor. In

order to classify photos, we must still reduce the dimension

of the PDRMs to a manageable size. To do this, we compute

the zeroth, ﬁrst, and second order moments of each PDRM.

Intuitively, the zeroth moment describes the prevalence of a

given material class in an image; the ﬁrst moment describes

where it occurs, and the second moment its spatial "spread".

The moment features from the 20 PDRMs are combined in

a global feature vector Y.
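The moment features can be sketched as below. The text does not give exact formulas, so this example makes its own assumptions: the zeroth moment is the mean response, the first moment is the response-weighted centroid in normalized [0, 1] coordinates, and the second moment is the variance about that centroid along each axis. The names `pdrm_moments` and `global_feature_vector` are hypothetical.

```python
import numpy as np

def pdrm_moments(pdrm):
    """Zeroth, first and second order moments of one response map.

    Returns [m0, cx, cy, vx, vy]. Coordinates are normalized to [0, 1] so the
    features do not depend on the map size (an assumption of this sketch).
    """
    pdrm = np.asarray(pdrm, dtype=np.float64)
    rows, cols = pdrm.shape
    y, x = np.meshgrid(np.linspace(0, 1, rows), np.linspace(0, 1, cols),
                       indexing="ij")
    m0 = pdrm.mean()                      # prevalence of the material
    total = pdrm.sum()
    if total <= 0:
        # No response anywhere: fall back to the map center with zero spread.
        return np.array([0.0, 0.5, 0.5, 0.0, 0.0])
    w = pdrm / total
    cx, cy = np.sum(w * x), np.sum(w * y)                # where it occurs
    vx = np.sum(w * (x - cx) ** 2)                       # horizontal spread
    vy = np.sum(w * (y - cy) ** 2)                       # vertical spread
    return np.array([m0, cx, cy, vx, vy])

def global_feature_vector(pdrms):
    """Concatenate the moment features of all response maps into one vector."""
    return np.concatenate([pdrm_moments(p) for p in pdrms])
```

With 20 maps and 5 moment values per map, this particular choice gives a 100-dimensional vector; the dimensionality of the authors' vector Y is not stated in this section.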

Using the scene category labels of the training photos,

we now compute the LDA transform that attempts to sep-

arate the training feature vectors of different categories.

For the indoor-outdoor recognition, the LDA projected sub-

space has only one dimension. As a typical pattern classiﬁ-

cation problem, we can ﬁnd the optimal decision boundary

from the training data and apply it to the other testing data.
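For the two-class indoor-outdoor case, the one-dimensional LDA projection and decision boundary can be illustrated with a standard Fisher discriminant. This is a minimal sketch under stated assumptions, not the authors' code: the small ridge term, the exhaustive threshold search over midpoints, and all function names are choices made for the example.

```python
import numpy as np

def fit_two_class_lda(X, y):
    """Fisher direction w proportional to Sw^{-1} (m1 - m0) for labels 0 and 1."""
    X0, X1 = X[y == 0], X[y == 1]
    m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
    Sw = (X0 - m0).T @ (X0 - m0) + (X1 - m1).T @ (X1 - m1)
    # Small ridge term (an assumption) keeps the solve stable if Sw is
    # close to singular.
    Sw = Sw + 1e-6 * np.trace(Sw) / Sw.shape[0] * np.eye(Sw.shape[0])
    w = np.linalg.solve(Sw, m1 - m0)
    return w / np.linalg.norm(w)

def best_threshold(scores, y):
    """Threshold on the 1-D projection with the lowest training error."""
    order = np.sort(scores)
    candidates = (order[:-1] + order[1:]) / 2.0
    errors = [np.mean((scores > t).astype(int) != y) for t in candidates]
    return candidates[int(np.argmin(errors))]

def train_indoor_outdoor(X, y):
    """Learn the projection and decision boundary from training vectors."""
    w = fit_two_class_lda(X, y)
    return w, best_threshold(X @ w, y)

def predict(X, w, threshold):
    """Label 1 where the projected value exceeds the learned boundary."""
    return (X @ w > threshold).astype(int)
```

Because `w` points from the class-0 mean toward the class-1 mean, larger projected values correspond to class 1, which is why a single threshold suffices.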

Finding decision boundaries for 10 scene category recog-

nition is more complex. In practice, it is very difﬁcult to

train a GMM classifier because the data is too sparse over

the 10 categories. As a result, we have used both the near-

est neighbor and Kmeans [31] classiﬁers for this decision

problem.

We have found that the standard method for creating an

LDA classifier works well for indoor-outdoor scene classification, but the classification results for 10 scene categories are not good enough to constitute a practical prototype. To improve the classification rate, we have implemented variations on random subspace generation [12, 28]

and bootstrapping [31] to create multiple LDA classiﬁers.

These classifiers are combined using bagging [2]. Recall that LDA is a two step process that first computes the singular value decomposition (SVD) [9] of the within-class scatter matrix $S_w$, then, after normalization, computes SVD on the between-class scatter matrix $S_b$. After the first step, $S_w$ is divided into the principal subspace $S_p$ of the nonzero eigenvalues $\Lambda_p$ and their associated eigenvectors $U_p$, and the null subspace $S_n$ with the zero eigenvalues $\Lambda_n$ and corresponding eigenvectors $U_n$. In the traditional LDA transform, only $S_p$ is used for the whitening of $S_w$ and normalization of $S_b$ while $S_n$ is discarded (see equation 10 in Appendix B). Chen et al. [4] have found that the null subspace $S_n$ satisfying $U_n^T S_w U_n = 0$ also contains important discriminatory information. Here we make use of this observation by uniformly sampling an eigenvector matrix $U_r$ from $\{U_p \cup U_n\}$ and use it in place of $U$ in the initial LDA projection step. Several projections (including the original LDA projection matrix) are thus created.

Figure 3: (a) Photo 1459#. (b) Its confidence map. (c, d, e, f, g) Its support maps of blue sky, cloudy sky, water, building and skin. Only the material classes with significant membership support are shown.

In the second step of LDA, the subset $V_p$ of the full eigenvector matrix $V$ with the largest eigenvalues normally replaces $V$ in equation (10). It is also possible that there is useful discriminative information in the subspace $\{V - V_p\}$. Therefore we employ a similar sampling strategy as [28] in the context of PCA by first sampling a small subset of eigenvectors $V_r$ of $\{V - V_p\}$, then replacing $V$ with the joint subspace $\{V_p \cup V_r\}$ in equation 10.

Finally we also perform bootstrapping [31] by sampling

subsets of the training set and creating LDA classifiers

for these subsets. By the above three random sampling

processes, we learn a large set of LDA subspaces and classi-

ﬁers which we combine using the majority voting (bagging)

methods [2]. In Section 4, we show the bagged recognition

rates of 20 classiﬁers from bootstrapping replicates and 20

from random subspace sampling.
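The overall bagging scheme (several classifiers trained on resampled data, combined by majority vote) can be sketched as below. Several simplifications are assumptions of this example and differ from the paper: a nearest-centroid rule stands in for each LDA classifier, the bootstrap resamples within each class so that no class disappears from a replicate, and the optional random subspace step samples raw feature dimensions rather than eigenvectors. All names are hypothetical.

```python
import numpy as np

def stratified_bootstrap(y, rng):
    """Row indices of a bootstrap replicate, resampled within each class."""
    parts = []
    for c in np.unique(y):
        members = np.flatnonzero(y == c)
        parts.append(rng.choice(members, size=members.size, replace=True))
    return np.concatenate(parts)

def fit_centroids(X, y, n_classes):
    """Stand-in base classifier: one mean vector per class (labels 0..n-1)."""
    return np.stack([X[y == c].mean(axis=0) for c in range(n_classes)])

def predict_centroids(centroids, X):
    """Assign each row of X to the class with the nearest centroid."""
    d = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return np.argmin(d, axis=1)

def bagged_predict(X_train, y_train, X_test, n_classes,
                   n_models=20, subspace_dim=None, seed=0):
    """Majority vote over classifiers trained on resampled data."""
    rng = np.random.default_rng(seed)
    n_features = X_train.shape[1]
    votes = np.zeros((X_test.shape[0], n_classes), dtype=int)
    for _ in range(n_models):
        rows = stratified_bootstrap(y_train, rng)
        if subspace_dim is None:
            dims = np.arange(n_features)
        else:
            # Random subspace: keep a random subset of feature dimensions.
            dims = rng.choice(n_features, size=subspace_dim, replace=False)
        centroids = fit_centroids(X_train[rows][:, dims], y_train[rows],
                                  n_classes)
        pred = predict_centroids(centroids, X_test[:, dims])
        votes[np.arange(X_test.shape[0]), pred] += 1
    return np.argmax(votes, axis=1)
```

Swapping `fit_centroids` and `predict_centroids` for an LDA projection followed by a nearest neighbor or K-means decision would bring the sketch closer to the procedure described above.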

4 Experiments

Our photo collection currently consists of 540 indoor

and 860 outdoor customer photos. We randomly select half

of them as the training data and use other photos as the

testing data. We have also intentionally minimized redun-

dancy when collecting photos, i.e., only one photo is se-

lected when there are several similar pictures.

We ﬁrst address the problem of the image patch based

color-texture feature description and classiﬁcation. Com-

parison of the recognition rates of 1200 testing image

patches for each material class for different color-texture

descriptors, different numbers of training patches and dif-

ferent classiﬁers is provided in Figure 6 (a,b). In partic-

ular, we have also benchmarked the LDA+GMM model

against a brute-force nearest neighbor classifier. Let $x$ and $z$ represent an image patch feature vector before and after the LDA projection, respectively. The nearest neighbor classifier computes the class label of a testing patch $j$ as the label of that training patch $l$ such that $\|x_j - x_l\| = \min_i \{\|x_j - x_i\|\}$ where $i$ ranges over the training image patches of all material classes. The GMM classifier simply chooses the maximal class density, i.e. the class $c^*$ such that $P(z_j|c^*) = \max_{c=1,2,...,20} \{P(z_j|c)\}$.

Figure 5: The pairwise confusion matrix of 20 material classes. The indexing order of the confusion matrix is shown on the left of the matrix. The indexing order is symmetrical. Index order: building, blue sky, bush, other, c-sky, dirt, mammal, pavement, pebble, rock, sand, skin, tree, water, s-sky, grass, snow, carpet, furniture, wall.
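The brute-force nearest neighbor baseline described above can be sketched in a few lines. This is an illustration only; the function name and the vectorized squared-distance computation are choices made for the example.

```python
import numpy as np

def nearest_neighbor_labels(x_train, labels_train, x_test):
    """Label each test vector with the label of its closest training vector."""
    # Squared Euclidean distances between every test and training vector,
    # using ||a - b||^2 = ||a||^2 - 2 a.b + ||b||^2.
    d2 = (np.sum(x_test ** 2, axis=1)[:, None]
          - 2.0 * x_test @ x_train.T
          + np.sum(x_train ** 2, axis=1)[None, :])
    return labels_train[np.argmin(d2, axis=1)]
```

Every test patch is compared against all training patches of all classes, which is why this baseline becomes costly as the training set grows, whereas the GMM classifier only evaluates a fixed number of mixture components per class.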

Comparing the plots shown in Figure 6, the classiﬁer

based on the Maximum Likelihood of GMM density func-

tions outperforms the Nearest Neighbor classiﬁer, thus val-

idating the use of the LDA+GMM method. We also com-

pared the recognition rates of 4 different feature combina-

tions and found that the Haralick texture descriptor com-

bined with the mean color of the image patch yields the best

results. Finally, in Figure 6 (b), we see that the LDA+GMM

method improves the recognition rate significantly as the number of training image patches increases from 500, becoming stable after 2000 patches.

Figure 5 shows the confusion rate using the GMM clas-

siﬁers learned from 2000 training image patches per class.

The size of the white rectangle in each grid is proportional

to the pairwise recognition error ratio. The largest and

smallest confusion rates are 23.6% and 0.24%, respectively.

From Figure 5, we see that pebble, rock and sand classes

are well separated which shows that our patch-level learn-

ing process achieves a good balance of Haralick texture

and color cues by ﬁnding differences of the material classes

with the similar color. There is signiﬁcant confusion among

grass, bush and tree due to their similar color and texture

distribution. For some material classes, such as furniture,

carpet, and other, the overall confusion rates are also high.

For global classiﬁcation, we have found that ﬁrst order


References

Maximum likelihood from incomplete data via the EM algorithm

Matrix computations

Textural Features for Image Classification

Bagging predictors

Machine learning
