Name: Data Science Lab Exercise (Decision Tree) Solution
SKU: 2360
Price: 30.00 USD
Availability: InStock

Data Science Lab Exercise (Decision Tree) Solution

~~$30.00~~ $24.00

UCI ML Repository contains many datasets for classification. You need to find 5 datasets with at least 10 attributes https://archive.ics.uci.edu/ml/datasets.php Complete the following tables and calculate accuracy using (1) Use 10 x 10 Fold CV (ii) 70% Holdout approach repeated 100 times Show all the standard deviations in the table. Briefly discuss advantages / disadvantages…

Description

Rate this product

UCI ML Repository contains many datasets for classification. You need to find 5 datasets with at least 10 attributes

https://archive.ics.uci.edu/ml/datasets.php

Complete the following tables and calculate accuracy using (1) Use 10 x 10 Fold CV (ii) 70% Holdout approach repeated 100 times

Show all the standard deviations in the table. Briefly discuss advantages / disadvantages of hold out and cross validation approach. Analysis the result. Which approach is good and why? Why some approaches unable to perform well in some data sets.

Dataset1 Dataset2

Dataset3

Dataset4

Dataset5

DT using gini

(without pruning)

DT using gini

(with pruning)

DT using entropy

(without pruning)

DT using entropy

(with pruning)

Hint: Check ccp_alpha parameter for pruning. Use ccp_alpha = 0.015 for pruning

Data Science Lab Exercise (Decision Tree) Solution

Share this:

Share this:

Description

Share this:

Related products

Programming II Assignment 4: Patient Location

Programming II Assignment 2: Collections Solution

Lab 2 File Management System Calls Solution

Assignment 2 Solution

One of the philosophies behind Unix is the motto – Assignment 1 Solution