机器学习代写 | SIT720 Machine Learning Assessment Task 4: Problem solving task

本次澳洲代写是机器学习的一个assessment task,需要输出程序和报告

Questions

1. What is an ensemble classifier? Name some of the popular ensemble methods (at least three) and which
one you prefer and why? (2 marks)

2. Let’s assume we have a noisy dataset. You want to build a classifier model. Which classifier is appropriate
for your dataset and why? (2 marks)

Background

In the modern world, customer details are very important to suggest any product for buying. Gender, age and
education have impact on level of consumption of different products. So, it is essential for businesses to
analyse their customer details to better understand consumer behaviour and their impact on various products.

Dataset filename: Customer relationship marketing (CRM).csv

Dataset description: This dataset includes data on customer details and their response to buy any products.

The data contains 20 attributes and 9134 records.

Features and labels: The attribute names are listed below.

I. State
II. Customer Lifetime Value
III. Response
IV. Coverage
V. Education
VI. Effective To Date
VII. EmploymentStatus
VIII. Gender
IX. Income
X. Location Code
XI. Marital Status
XII. Monthly Premium Auto
XIII. Months Since Last Claim
XIV. Number of Open Complaints
XV. Number of Policies * Policy
XVI. Renew Offer Type
XVII. Sales Channel
XVIII. Total Claim Amount
XIX. Vehicle Class

Questions

4. Analyse the importance of the features for predicting customer response using two different approaches.
Explain the similarity/difference between outcomes. (5 marks)

5. Create three supervised machine learning (ML) models except any ensemble approach for predicting
customer response. (10 Marks)

a. Report performance score using a suitable metric. Is it possible that the presented result is an
overfitted one? Justify.

b. Justify different design decisions for each ML model used to answer this question.

c. Have you optimised any hyper-parameters for each ML model? What are they? Why have you
done that? Explain.

d. Finally, make a recommendation based on the reported results and justify it.

6. Build three ensemble models for predicting customer response. (6 Marks)

a. When do you want to use ensemble models over other ML models?

b. What are the similarities or differences between these models?

c. Is there any preferable scenario for using any specific model among set of ensemble models?

d. Write a report comparing performances of models built in question 5 and 6. Report the best
method based on model complexity and performance.

e. Is it possible to build ensemble model using ML classifiers other than decision tree? If yes, then
explain with an example.