# Data science | Computer Science homework help

Classification: Basic Concepts

Decision Tree Induction

Bayes Classification Methods

Rule-Based Classification

Assignment 4

1- Consider the data in the following table:

TID

Home Owner

Marital Status

Annual Income

Defaulted   Borrower

1

Yes

Single

[120 – < 150K]

No

2

No

Married

[90 – < 120K]

No

3

No

Single

[60 – < 90K]

No

4

Yes

Married

[120 – < 150K]

No

5

No

Divorced

[90 – < 120K]

Yes

6

No

Married

[60 – < 90K]

No

7

Yes

Divorced

[120 – < 150K]

No

8

No

Single

[90 – < 120K]

Yes

9

No

Married

[60 – < 90K]

No

10

No

Single

[90 – < 120K]

Yes

Let Defaulted Borrower be the class label attribute.

a) Given a data tuple X = (Home Owner= No, Marital Status= Married, Income= \$120K). What would a naive Bayesian classification of the Defaulted Borrower for the tuple be?

2- Consider the training example in the following table for a binary classification problem.

Customer ID

Gender

Car Type

Shirt Size

Class

1

M

Family

S

C0

2

M

Sports

M

C0

3

M

Sports

M

C0

4

M

Sports

L

C0

5

M

Sports

XL

C0

6

M

Sports

XL

C0

7

F

Sports

S

C0

8

F

Sports

S

C0

9

F

Sports

M

C0

10

F

Luxury

L

C0

11

M

Family

L

C1

12

M

Family

XL

C1

13

M

Family

M

C1

14

M

Luxury

XL

C1

15

F

Luxury

S

C1

16

F

Luxury

S

C1

17

F

Luxury

M

C1

18

F

Luxury

M

C1

19

F

Luxury

M

C1

20

F

Luxury

L

C1

a) Find the gain for Gender, Car Type, and Shirt Size.

b) Which attribute will be selected as the splitting attribute?