Your organization, a consumer automobile research firm, wishes to analyze data from a study of fuel economy among the major automobile models to determine how the variables in the data set correlate with fuel economy.


Description

Option #1: Statistical Analysis for an Automobile Research Firm

Your organization, a consumer automobile research firm, wishes to analyze data from a study of fuel economy among the major automobile models to determine how the variables in the data set correlate with fuel economy. You are tasked with developing a better understanding of the variables in the CARS data set. Management wants you to explore this data set to determine if the data is suitable for use in the next phase of their upcoming analytics project.

You are required to conduct two analyses for this assignment.

A.     Statistical Analysis:

o    Use SAS University Edition to conduct these statistical tasks:

     -  Summary statistics:

        -  Use MSRP, Invoice, MPG-City, and MPG-Highway as your analysis variables.

        -  Use Make as your classification variable.

     -  Distribution analysis:

        -  Use MSRP, Invoice, MPG-City, and MPG-Highway as your analysis variables.

·         Important Notes: PLEASE READ!

o    Statistical tasks are located under Tasks and Utilities > Tasks > Statistics.

o    The data set can be found in Libraries > SASHELP > CARS.

o    To do this assignment, you will need SAS University Edition on your computer. The user name (UN) and password (PW) are provided:

o    UN# [email protected]

o    PW# [email protected] (C with Caps on).

o    Once SAS Studio is open, you can find the Statistics tasks under Tasks and Utilities and the CARS data set under Libraries, both in the left-hand pane of SAS Studio.

B.     Cluster Analysis

o    Conduct the following cluster analysis task:

     -  Cluster variables:

        -  Determine which variables, if any, can appropriately serve as cluster variables to account for variability.

        -  Limit your analysis to 10 clusters.

Submit an analysis of each of the variables used (MSRP, Invoice, MPG-City, and MPG-Highway). Include any tables, histograms, or scatterplot graphs necessary to support your analysis. Also, based on the cluster variables analysis, which variables, if any, can function as cluster variables? Provide tables, histograms, and other graphs to support your conclusion.
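
As a rough sketch, the three point-and-click tasks described above correspond approximately to the SAS procedures below, which can be run from a program tab in SAS Studio. This assumes the SASHELP.CARS columns are named MPG_City and MPG_Highway (the assignment writes them as MPG-City and MPG-Highway) and that the cluster variables analysis is restricted to the four analysis variables. The statistics keywords chosen for PROC MEANS are an illustrative choice, not something the assignment specifies.

```sas
/* A. Summary statistics: four analysis variables, classified by Make. */
/* The keyword list (n mean std min max) is an assumption.             */
proc means data=sashelp.cars n mean std min max maxdec=2;
    class Make;
    var MSRP Invoice MPG_City MPG_Highway;
run;

/* A. Distribution analysis: moments, quantiles, and one histogram */
/* per analysis variable.                                          */
proc univariate data=sashelp.cars;
    var MSRP Invoice MPG_City MPG_Highway;
    histogram MSRP Invoice MPG_City MPG_Highway;
run;

/* B. Cluster variables: variable clustering capped at 10 clusters. */
/* Restricting VAR to these four variables is an assumption.        */
proc varclus data=sashelp.cars maxclusters=10;
    var MSRP Invoice MPG_City MPG_Highway;
run;
```

With only four variables in the VAR statement, PROC VARCLUS cannot produce more than four clusters, so MAXCLUSTERS=10 acts as an upper bound rather than a target.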

The final analysis report must meet the following requirements:

·         Be 4-6 pages in length, not including the cover and references pages.

·         Include an introduction, a body with at least two fully developed paragraphs, and a conclusion.

·         Be supported with at least three peer-reviewed, scholarly references and one citation from the course textbooks.

 Thank you.

