CS-Homework 2 Solution


1 (10 points) PCA algorithm

Give at least two algorithms that could take the data set $X = \{x_1, \ldots, x_N\}$, $x_t \in \mathbb{R}^{n \times 1}\ \forall t$, as input, and output the first principal component $w$. Specify the computational details of the algorithms, and discuss their advantages or limitations.
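Below is a minimal sketch, assuming a NumPy implementation, of two candidate algorithms: (a) eigendecomposition of the sample covariance matrix, which is exact but costs $O(n^3)$, and (b) power iteration, which needs only matrix-vector products and suits large $n$ when just the top component is required. The function names and the convention that $X$ is stored as an $N \times n$ array are illustrative assumptions.

```python
import numpy as np

def first_pc_eig(X):
    """First principal component via eigendecomposition of the
    n x n sample covariance matrix.  X has shape (N, n)."""
    Xc = X - X.mean(axis=0)                 # center the data
    C = Xc.T @ Xc / (len(X) - 1)            # sample covariance
    eigvals, eigvecs = np.linalg.eigh(C)    # eigenvalues in ascending order
    return eigvecs[:, -1]                   # eigenvector of the largest eigenvalue

def first_pc_power(X, n_iter=1000, tol=1e-10):
    """First principal component via power iteration; avoids a full
    eigendecomposition but converges slowly when the top two
    eigenvalues are close."""
    Xc = X - X.mean(axis=0)
    C = Xc.T @ Xc / (len(X) - 1)
    w = np.random.default_rng(0).standard_normal(C.shape[0])
    w /= np.linalg.norm(w)
    for _ in range(n_iter):
        w_next = C @ w
        w_next /= np.linalg.norm(w_next)
        if np.linalg.norm(w_next - w) < tol:
            break
        w = w_next
    return w
```

Note that $w$ is only defined up to sign, so the two algorithms may return vectors differing by a factor of $-1$.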

2 (10 points) Factor Analysis (FA)

Calculate the Bayesian posterior $p(y|x)$ of the Factor Analysis model $x = Ay + \mu + e$, with $p(x|y) = G(x|Ay + \mu, \Sigma_e)$ and $p(y) = G(y|0, \Sigma_y)$, where $G(z|\mu, \Sigma)$ denotes the Gaussian distribution density with mean $\mu$ and covariance matrix $\Sigma$.
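For reference, a sketch of the standard Gaussian conditioning result that the derivation should arrive at (the subscripted symbols $\mu_{y|x}$ and $\Sigma_{y|x}$ are notation introduced here):

```latex
p(y \mid x) = G\!\left(y \mid \mu_{y|x},\, \Sigma_{y|x}\right), \quad\text{where}\quad
\Sigma_{y|x} = \left(\Sigma_y^{-1} + A^{\top}\Sigma_e^{-1}A\right)^{-1}, \quad
\mu_{y|x} = \Sigma_{y|x}\, A^{\top}\Sigma_e^{-1}\,(x - \mu).
```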

3 (10 points) Independent Component Analysis (ICA)

Explain why maximizing non-Gaussianity could be used as a principle for ICA estimation.
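The underlying intuition is the central limit theorem: a linear mixture of independent non-Gaussian sources is closer to Gaussian than the sources themselves, so a demixing direction that maximizes non-Gaussianity tends to recover a single source. A minimal sketch illustrating this with excess kurtosis as the non-Gaussianity measure (the particular source distributions and mixing matrix are illustrative assumptions):

```python
import numpy as np
from scipy.stats import kurtosis

rng = np.random.default_rng(0)
N = 100_000

# Two independent non-Gaussian sources: uniform (sub-Gaussian,
# negative excess kurtosis) and Laplacian (super-Gaussian, positive).
S = np.vstack([rng.uniform(-1, 1, N),
               rng.laplace(0, 1, N)])

A = np.array([[0.7, 0.3],
              [0.4, 0.6]])   # an arbitrary mixing matrix
X = A @ S                    # observed mixtures

# Excess kurtosis is 0 for a Gaussian; the mixtures' values should
# move toward 0 relative to the sources' (the CLT effect).
print("sources :", kurtosis(S, axis=1))
print("mixtures:", kurtosis(X, axis=1))
```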

4 (50 points) Dimensionality Reduction by FA

Consider the following Factor Analysis (FA) model,

$x = Ay + \mu + e$,   (1)

$p(x|y) = G(x|Ay + \mu, \sigma^2 I)$,   (2)

$p(y) = G(y|0, I)$,   (3)

where the observed variable $x \in \mathbb{R}^n$, the latent variable $y \in \mathbb{R}^m$, and $G(z|\mu, \Sigma)$ denotes the Gaussian distribution density with mean $\mu$ and covariance matrix $\Sigma$. Write a report experimentally comparing the model selection performance of BIC and AIC in selecting the number of latent factors, i.e., $\dim(y) = m$.


Specifically, you need to randomly generate datasets based on FA by varying some setting values, e.g., sample size $N$, dimensionalities $n$ and $m$, noise level $\sigma^2$, and so on. For example, set $N = 100$, $n = 10$, $m = 3$, $\sigma^2 = 0.1$, $\mu = 0$, and assign values for $A \in \mathbb{R}^{n \times m}$. The generation process is as follows:

  1. Randomly sample $y_t$ from the Gaussian density $G(y|0, I)$, with $\dim(y) = m = 3$;

  2. Randomly sample a noise vector $e_t$ from the Gaussian density $G(e|0, \sigma^2 I)$, with $\sigma^2 = 0.1$, $e_t \in \mathbb{R}^n$;

  3. Get $x_t = Ay_t + \mu + e_t$.

Collect all the $x_t$ as the dataset $X = \{x_t\}_{t=1}^{N}$.
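A minimal sketch of this generation process in NumPy, under the example setting above (the random choice of $A$ is an arbitrary assumption):

```python
import numpy as np

rng = np.random.default_rng(0)
N, n, m = 100, 10, 3               # sample size, observed dim, latent dim
sigma2, mu = 0.1, np.zeros(10)     # noise level sigma^2 and mean mu = 0
A = rng.standard_normal((n, m))    # assign values for A (arbitrary choice)

Y = rng.standard_normal((N, m))                    # step 1: y_t ~ G(y|0, I)
E = rng.normal(0.0, np.sqrt(sigma2), size=(N, n))  # step 2: e_t ~ G(e|0, sigma^2 I)
X = Y @ A.T + mu + E                               # step 3: x_t = A y_t + mu + e_t
```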

The two-stage model selection process for BIC and AIC is as follows:

Stage 1: Run the EM algorithm on each dataset $X$ for $m = 1, \ldots, M$, and calculate the log-likelihood value $\ln[p(X|\hat{\Theta}_m)]$, where $\hat{\Theta}_m$ is the maximum likelihood estimate of the parameters;

Stage 2: Select the optimal $m$ by

$\hat{m} = \arg\max_{m=1,\ldots,M} J(m)$,   (4)

$J_{AIC}(m) = \ln[p(X|\hat{\Theta}_m)] - d_m$,   (5)

$J_{BIC}(m) = \ln[p(X|\hat{\Theta}_m)] - \frac{d_m}{2} \ln N$,   (6)

where $d_m$ is the number of free parameters of the $m$-factor model.

You may set $M = 5$ if you generate the dataset $X$ based on $n = 10$, $m = 3$.

The following code might be useful.

Python: https://scikit-learn.org/stable/modules/generated/sklearn.decomposition.FactorAnalysis.html#sklearn.decomposition.FactorAnalysis
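A minimal sketch of the two-stage selection on top of sklearn's FactorAnalysis. Two assumptions to note: sklearn's `score` returns the average per-sample log-likelihood, so it is multiplied by $N$ to obtain $\ln[p(X|\hat{\Theta}_m)]$; and the free-parameter count $d_m = nm + n$ (loadings plus noise variances) is one common convention, which you may refine in your report. Also note that sklearn fits per-feature noise variances rather than the isotropic $\sigma^2 I$ of Eq. (2).

```python
import numpy as np
from sklearn.decomposition import FactorAnalysis

def select_m(X, M=5):
    """Stage 1: fit FA for m = 1..M; Stage 2: pick the m maximizing
    J_AIC(m) and J_BIC(m).  Returns (m_AIC, m_BIC)."""
    N, n = X.shape
    j_aic, j_bic = [], []
    for m in range(1, M + 1):
        fa = FactorAnalysis(n_components=m).fit(X)
        loglik = N * fa.score(X)   # score() is the mean log-likelihood per sample
        d_m = n * m + n            # assumed parameter count: A plus noise variances
        j_aic.append(loglik - d_m)
        j_bic.append(loglik - 0.5 * d_m * np.log(N))
    return int(np.argmax(j_aic)) + 1, int(np.argmax(j_bic)) + 1
```

Repeating this over many randomly generated datasets and settings ($N$, $n$, $m$, $\sigma^2$) gives the selection-accuracy comparisons the report asks for.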

5 (20 points) Spectral clustering

Use experiments to demonstrate when spectral clustering works well and when it would fail. Summarize your results.

The following code might be helpful.

Python: https://scikit-learn.org/stable/modules/generated/sklearn.cluster.SpectralClustering.html
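A minimal sketch contrasting a case where spectral clustering typically works well (two moons at low noise, a non-convex structure) with one where it tends to fail (high noise that blurs the affinity structure); the noise levels and the RBF scale `gamma` are illustrative assumptions.

```python
from sklearn.cluster import SpectralClustering
from sklearn.datasets import make_moons
from sklearn.metrics import adjusted_rand_score

for noise in (0.05, 0.3):   # low noise: clear manifolds; high noise: overlapping clusters
    X, y = make_moons(n_samples=500, noise=noise, random_state=0)
    labels = SpectralClustering(n_clusters=2, affinity="rbf", gamma=20,
                                random_state=0).fit_predict(X)
    print(f"noise={noise}: ARI = {adjusted_rand_score(y, labels):.2f}")
```

An adjusted Rand index (ARI) near 1 indicates the true two-moon partition was recovered; a value near 0 indicates failure.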
