[Get it solved] Using the data in the Excel file, Sales Data, perform K-M...

Check Out Our Work & Get Yours Done

Submit Work

Download Sample

Enroll in the complete course for only $250 USD*

Order Now

Submit work Offers

Using the data in the Excel file, Sales Data, perform K-Means Clustering on the data.

others

Description

ITEC 320 Homework 3: Clustering and Classification

Due: November 17

1) Using the data in the Excel file, Sales Data, perform K-Means Clustering on the data. Use the all the attributes as input, except the Customer and Percent Gross Profit attributes. Review the clusters and create various plot variable combos.

Do a detailed interpretation of your results. Do you see any interesting patterns?

2) You are on an analytics team at the Really Big Financial Corporation. You market specialized financial products geared for different income levels of potential customers.

You do not want to waste time and money to market the products to individuals that are not a good fit based on their income level.

You have downloaded and cleaned up a set of data from the US Census Department. The data includes demographic and other data from a Census survey.

· From the dataset “Census Data” try to predict who makes more the $50,000.00 dollars per year and who makes less.

· Use Training and Testing Sets

· The Target attribute is “Income”: <= 50K, >50K.

· Try both Decision Trees and K-NN.

· How accurate are your predictions?

Note: This is a large dataset (over 32,000 rows), so K-NN runs a little slow, it may take 30 seconds or so for K-NN to run.

3) For the Titanic problem performed in the Lab, now try to use Support Vector Machine (SVM) Classification. How does this compare to the accuracy of Decision Trees and K-NN?

· Caution: SVM only works with numeric dependent attributes, so use the Dataset: Titanic passengers numeric, where sex is converted to 1 – Female, 0- Male.

· Note: SVM does not like missing values, so you will need to use the Replace Missing Values Operator, before the Select Attributes Operator to replace missing values for Age to the Average.

For each of the above problems, please interpret your results and provide all supporting model output and diagrams. (e.g. Clusters, Clusters diagrams, decision trees, performance matrix results etc.) If you feel ambitious try Naïve Bayes and Neural Networks as well.

Related Questions in others category

Applying psychology skills training (PST) principles to a practical setting, design a program or intervention to prevent or improve a particular situation.

To use a function, you must start with an equal sign, then reference the function by its name: IF. Next, inside a set of parentheses, you need to provide three arguments:

Mass media is a powerful cultural construct. This is where we get ideas about who we are and should be and we form impressions of people, events, issues, and cultural life.

The goal of this assignment is to create a text based word guessing game. The project is completely described in this file,

A block of mass 5 kg sits at rest on an incline making an angle of 30 degrees to the horizontal.

Describe and use entity relationships in database design

What barriers and gaps affect learners and employers when educationally disadvantaged adults try to improve their labour market situation by pursuing more education and training?

We've Called This A "Reflective Application Assignment" With The Intention That Your Deliverable Will Be Based On Applying The Ideas

In 2018, almost 11% of the world’s population was undernourished. Over the years, there had been a trend of decline in hunger.

Discuss how exercise such as those discussed in this weeks reading related to aerobic, muscular fitness, flexibility and functional fitness can improve your health and be used as preventative medicine for many of today’s chronic diseases.

Get Higher Grades Now

Tutors Online

Description

Drop Files Here Or Click to Upload

Get Free Quote!

421 Experts Online

Connect With Us

We Provide Services Across The Globe

Disclaimer: The reference papers or solutions provided by Calltutors.com serve as model papers or solutions for students or professionals and are not to be submitted as it is to any institutions. These documents are intended to be used for research and reference purposes only. University and company's logo's are the property of respected owners. We don't have affiliation with the mentioned universities. By using our services means, you agree to our Honor Code , Privacy Policy , Terms & Conditions , Payment , Refund & Cancellation Policy.

Enroll in the complete course for only $250 USD*

Using the data in the Excel file, Sales Data, perform K-Means Clustering on the data.

others

Description

Get instant assignment help service

Related Questions in others category

Policy

Exploring

Other

Connect With Us

We Provide Services Across The Globe