HW1 Solution


Problem 1 (10pt): Independence and un-correlation

  1. (5pt) Suppose $X$ and $Y$ are two continuous random variables. Show that if $X$ and $Y$ are independent, then they are uncorrelated.

  2. (5pt) Suppose $X$ and $Y$ are uncorrelated. Can we conclude that $X$ and $Y$ are independent? If so, prove it; otherwise, give one counterexample. (Hint: consider $X \sim \mathrm{Uniform}[-1, 1]$ and $Y = X^2$; a quick numerical illustration follows below.)
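
A quick numerical illustration of the hinted counterexample (not part of the required proof; it assumes NumPy is available): with $X \sim \mathrm{Uniform}[-1,1]$ and $Y = X^2$, the sample covariance is approximately zero even though $Y$ is a deterministic function of $X$.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=1_000_000)   # X ~ Uniform[-1, 1]
y = x ** 2                                    # Y = X^2, fully determined by X

# Sample covariance is ~0 (E[XY] = E[X^3] = 0 and E[X] = 0),
# yet X and Y are clearly dependent.
print("cov(X, Y)  ~", np.cov(x, y)[0, 1])
print("corr(X, Y) ~", np.corrcoef(x, y)[0, 1])
```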

Problem 2 (15pt): [Minimum Error Rate Decision] Let $\omega_{\max}(x)$ be the state of nature for which

$$P(\omega_{\max} \mid x) \ge P(\omega_i \mid x) \quad \text{for all } i = 1, \dots, c.$$

  1. (5pt) Show that $P(\omega_{\max} \mid x) \ge \frac{1}{c}$.

  2. (5pt) Show that for the minimum-error-rate decision rule, the average probability of error is given by

     $$P(\text{error}) = 1 - \int P(\omega_{\max} \mid x)\, p(x)\, dx$$

  3. (5pt) Show that $P(\text{error}) \le \frac{c-1}{c}$. (A quick numerical sanity check of parts 1 and 3 follows below.)
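
As a quick numerical sanity check of parts 1 and 3 (not a proof, and not part of the assignment; it assumes NumPy, and the choice $c = 5$ is arbitrary), one can draw random valid posterior vectors and verify that the maximum posterior is always at least $1/c$, so the pointwise error $1 - P(\omega_{\max} \mid x)$ never exceeds $(c-1)/c$:

```python
import numpy as np

rng = np.random.default_rng(0)
c = 5                                                  # arbitrary number of classes
posteriors = rng.dirichlet(np.ones(c), size=100_000)   # random valid posterior vectors

max_post = posteriors.max(axis=1)
print("smallest max-posterior:", max_post.min())       # always >= 1/c
print("largest pointwise error:", (1 - max_post).max())  # always <= (c-1)/c
print("bounds 1/c and (c-1)/c:", 1 / c, (c - 1) / c)
```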

Problem 3 (10pt): [Likelihood Ratio] Suppose we consider a two-category classification problem in which the class conditionals are assumed to be Gaussian, i.e., $p(x \mid \omega_1) = \mathcal{N}(4, 1)$ and $p(x \mid \omega_2) = \mathcal{N}(8, 1)$. Based on prior knowledge, we have $P(\omega_2) = \frac{1}{4}$. We do not penalize correct classification, while for misclassification we put a 1-unit penalty on misclassifying $\omega_1$ as $\omega_2$ and a 3-unit penalty on misclassifying $\omega_2$ as $\omega_1$. Derive the Bayesian decision rule using the likelihood ratio.
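
The following sketch is not the requested derivation; it only illustrates the general minimum-risk likelihood-ratio machinery the problem refers to, comparing $p(x \mid \omega_1)/p(x \mid \omega_2)$ against the standard threshold $\frac{(\lambda_{12}-\lambda_{22})P(\omega_2)}{(\lambda_{21}-\lambda_{11})P(\omega_1)}$. It assumes SciPy is available, and the loss-matrix indexing convention and all names are illustrative choices.

```python
from scipy.stats import norm

# Problem setup: p(x|w1) = N(4,1), p(x|w2) = N(8,1), P(w2) = 1/4, so P(w1) = 3/4.
# Loss convention (assumed): lam[i][j] = loss for deciding w_{i+1} when truth is w_{j+1}.
P1, P2 = 3 / 4, 1 / 4
lam = [[0.0, 3.0],   # decide w1: 0 if truth is w1, 3 if truth is w2
       [1.0, 0.0]]   # decide w2: 1 if truth is w1, 0 if truth is w2

def decide(x):
    """General minimum-risk rule: decide w1 iff the likelihood ratio exceeds the threshold."""
    lr = norm.pdf(x, loc=4, scale=1) / norm.pdf(x, loc=8, scale=1)
    threshold = (lam[0][1] - lam[1][1]) * P2 / ((lam[1][0] - lam[0][0]) * P1)
    return "w1" if lr > threshold else "w2"

for x in (4.0, 6.0, 8.0):
    print(x, decide(x))
```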

Problem 4 (15pt): [Minimum Risk, Reject Option] In many machine learning applications, one has the option either to assign the pattern to one of $c$ classes, or to reject it as being unrecognizable. If the cost of rejection is not too high, rejection may be a desirable action. Let

$$\lambda(\alpha_i \mid \omega_j) = \begin{cases} 0, & i = j \text{ and } i, j = 1, \dots, c \\ \lambda_r, & i = c + 1 \\ \lambda_s, & \text{otherwise} \end{cases}$$

where $\lambda_r$ is the loss incurred for choosing the $(c+1)$-th action, rejection, and $\lambda_s$ is the loss incurred for making any substitution error.

  1. (5pt) Derive the decision rule with minimum risk. (A sketch of evaluating the conditional risks is given after this list.)

  2. (5pt) What happens if $\lambda_r = 0$?

  3. (5pt) What happens if $\lambda_r > \lambda_s$?
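
The sketch below is not the requested derivation; it only shows how, given posteriors $P(\omega_j \mid x)$, the conditional risk $R(\alpha_i \mid x) = \sum_j \lambda(\alpha_i \mid \omega_j) P(\omega_j \mid x)$ of each of the $c+1$ actions under the loss above can be evaluated directly and the minimum-risk action selected. The function name and the example numbers are assumptions for illustration only.

```python
import numpy as np

def min_risk_action(posteriors, lam_r, lam_s):
    """Return the minimum-risk action index: 0..c-1 are the classes, c is the reject action."""
    posteriors = np.asarray(posteriors, dtype=float)
    # R(alpha_i | x) for a class i: loss 0 on w_i and lam_s on every other class,
    # which sums to lam_s * (1 - P(w_i | x)).
    class_risks = lam_s * (1.0 - posteriors)
    reject_risk = lam_r                      # constant loss lam_r for rejecting
    risks = np.append(class_risks, reject_risk)
    return int(np.argmin(risks))

# Made-up posteriors: rejection wins when no single class is confident enough.
print(min_risk_action([0.55, 0.25, 0.20], lam_r=0.3, lam_s=1.0))  # prints 3, the reject action
```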

Problem 5 (25pt): [Maximum Likelihood Estimation (MLE)] A general representation of an exponential family is given by the following probability density:

$$p(x \mid \eta) = h(x) \exp\{\eta^T T(x) - A(\eta)\}$$

where

$\eta$ is the natural parameter.

$h(x)$ is the base density, which ensures $x$ is in the right space.

$T(x)$ is the sufficient statistic.

$A(\eta)$ is the log normalizer, which is determined by $T(x)$ and $h(x)$.

$\exp(\cdot)$ represents the exponential function.

  1. (5pt) Write down the expression for $A(\eta)$ in terms of $T(x)$ and $h(x)$.

  2. (10pt) Show that $\frac{\partial}{\partial \eta} A(\eta) = E_\eta[T(x)]$, where $E_\eta(\cdot)$ is the expectation w.r.t. $p(x \mid \eta)$.

  3. (10pt) Suppose we have $n$ i.i.d. samples $x_1, x_2, \dots, x_n$; derive the maximum likelihood estimator for $\eta$. (You may use the result from part (b) to obtain your final answer. A concrete example of the exponential-family notation follows below.)
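
For concreteness (this example is not part of the problem; it just instantiates the $\eta$, $h$, $T$, $A$ notation above), a standard worked instance of this representation is the Bernoulli distribution:

```latex
% Bernoulli(\pi) written in exponential-family form p(x|\eta) = h(x)\exp\{\eta^T T(x) - A(\eta)\}
\begin{align*}
p(x \mid \pi) &= \pi^{x}(1-\pi)^{1-x}
  = \exp\Big\{ x \log\tfrac{\pi}{1-\pi} + \log(1-\pi) \Big\}, \qquad x \in \{0, 1\},\\
h(x) &= 1, \qquad T(x) = x, \qquad \eta = \log\tfrac{\pi}{1-\pi}, \qquad
A(\eta) = \log\bigl(1 + e^{\eta}\bigr).
\end{align*}
```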

Problem 6 (25pt): [Logistic Regression, MLE] In this problem, you need to use MLE to derive and build a logistic regression classifier (suppose the target/response $y \in \{0, 1\}$):

  1. (5pt) Suppose the classifier is $y = x^T \theta$, where $\theta$ contains the weight as well as the bias parameters. The log-likelihood function is $LL(\theta)$; what is $\frac{\partial LL(\theta)}{\partial \theta}$?

  2. (20pt) Write the code to build and train the classifier on the Iris plant dataset (https://archive.ics.uci.edu/ml/datasets/iris). The Iris dataset contains 150 samples with 4 features for 3 classes. To simplify the problem, we only consider: (a) two classes, i.e., virginica and non-virginica; (b) the first 2 features for training, i.e., sepal length and sepal width. Based on these simplified settings, train the model using gradient descent and show the classification results. (Note that (1) you could split the Iris dataset into train/test sets; (2) you could visualize the results by showing the trained classifier overlaid on the train/test data; (3) you could tune several hyperparameters, e.g., learning rate, weight initialization method, etc., to see their effects. You could use sklearn or other packages to load and process the data, but you cannot use these packages to train the model.) A minimal training sketch follows below.
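
A minimal sketch of part 2, assuming scikit-learn is used for data loading only (as the problem allows) and that plain batch gradient descent on the negative log-likelihood is acceptable; the 80/20 split, learning rate, iteration count, zero initialization, and all names are illustrative choices, not requirements of the assignment:

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split

# Load Iris; keep only sepal length & sepal width, with a binary target: virginica vs. not.
iris = load_iris()
X = iris.data[:, :2]
y = (iris.target == 2).astype(float)          # 1 = virginica, 0 = non-virginica
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)

def add_bias(X):
    """Append a constant column so the bias is folded into theta."""
    return np.hstack([X, np.ones((X.shape[0], 1))])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Batch gradient descent on the negative log-likelihood.
Xb = add_bias(X_train)
theta = np.zeros(Xb.shape[1])                 # zero initialization (one possible choice)
lr, n_iters = 0.05, 20_000                    # illustrative hyperparameters
for _ in range(n_iters):
    grad = Xb.T @ (sigmoid(Xb @ theta) - y_train) / len(y_train)
    theta -= lr * grad

def predict(X, theta):
    return (sigmoid(add_bias(X) @ theta) >= 0.5).astype(float)

print("train accuracy:", np.mean(predict(X_train, theta) == y_train))
print("test accuracy: ", np.mean(predict(X_test, theta) == y_test))
```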

