Statistical Inference and Machine Learning Homework 2 Solution

This assignment can be solved in groups of 1 up to 5 students. You must mention the name of all the participants. Note that all the students in a group will get the same grade.

Deadline: 25 November 2020, 23:59 (No late submissions will be accepted)

Upload a single pdf file on Moodle containing your solution.

1 Feature Selection [60 pts]

Algorithm:

Given a dataset $S = \{(Y^i, X^i)\}_{i=1}^{n}$ of $n$ instances, where features $X = (X_1, \dots, X_d) \in \mathbb{R}^d$, and labels $Y \in \mathcal{Y} = \{1, \dots, K\}$.

- For each value of the label Y = k

– Estimate density p(Y = k)

- For each feature X_i, i = {1, . . . , d}

– Estimate its density p(X_i)

– For each value of the label Y = k, estimate the density p(X_i|Y = k)

– Score feature X_i, i = {1, . . . , d}, using

$$I(X_i, Y) = \sum_{x_i \in \mathcal{X}} \sum_{y \in \mathcal{Y}} p(x_i, y) \log_2\left(\frac{p(x_i, y)}{p(x_i)\,p(y)}\right) \qquad (1)$$

where $\mathcal{X}$ and $\mathcal{Y}$ denote the support sets of $X_i$ and $Y$.

Choose the features X_i with the highest scores I(X_i, Y).
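
The scoring procedure above can be sketched in Python as follows. This is only a minimal illustration, assuming every feature is discrete (or has already been binned) so that each density can be estimated by empirical frequencies. The function names `mutual_information` and `score_features` and the toy data are hypothetical and not part of the assignment.

```python
from collections import Counter
from math import log2


def mutual_information(xs, ys):
    """Estimate I(X_i, Y) in bits from paired samples of a discrete feature and label.

    Densities are estimated by empirical frequencies (an assumption of this sketch).
    """
    n = len(xs)
    count_x = Counter(xs)            # counts used for p(x_i)
    count_y = Counter(ys)            # counts used for p(y)
    count_xy = Counter(zip(xs, ys))  # counts used for p(x_i, y)

    score = 0.0
    for (x, y), c in count_xy.items():
        p_xy = c / n
        p_x = count_x[x] / n
        p_y = count_y[y] / n
        # Pairs that never occur are absent from count_xy, so 0 * log 0 terms are skipped.
        score += p_xy * log2(p_xy / (p_x * p_y))
    return score


def score_features(X, y):
    """Return one score per feature, where X is a list of rows of length d."""
    d = len(X[0])
    return [mutual_information([row[i] for row in X], y) for i in range(d)]


# Toy data: feature 0 copies the label, feature 1 is independent of it.
X = [[1, 0], [1, 1], [2, 0], [2, 1]]
y = [1, 1, 2, 2]
scores = score_features(X, y)
# Feature indices sorted from highest to lowest score.
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
print(scores, ranking)
```

On this toy data the feature that determines the label scores 1 bit and the independent feature scores 0 bits, so the first feature is ranked ahead of the second. For continuous features the frequency counts would have to be replaced by some other density estimate, which this sketch does not attempt.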

Insight: Informativeness of a feature

We are uncertain about label Y before seeing any input.

– Suppose we quantify this uncertainty using the entropy H(Y), defined as

$$H(Y) = -\sum_{y \in \mathcal{Y}} p(y) \log_2 p(y) \qquad (2)$$

where $\mathcal{Y}$ denotes the support set of $Y$.
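
As a small illustration of equation (2), the entropy of a label sample can be estimated from empirical frequencies. The sketch below assumes discrete labels; the function name `entropy` and the example label lists are hypothetical.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Estimate H(Y) in bits from a sample of labels using empirical frequencies."""
    n = len(labels)
    # Only observed labels appear in the Counter, so p(y) > 0 for every term.
    return sum(-(c / n) * log2(c / n) for c in Counter(labels).values())


h_uniform = entropy([1, 2, 3, 4])   # four equally likely labels
h_constant = entropy([1, 1, 1, 1])  # a single repeated label
print(h_uniform, h_constant)
```

With four equally likely labels the estimate is 2 bits, the largest value possible for K = 4, while a sample containing a single label gives 0 bits, meaning there is no uncertainty about Y.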
