Data Mining

Total Assignments: 235

“Upcoming delivery” is coming replenishments data. The Qty will be delivered to each store this week (around Wednesday) accordingly.

Background: There are some pages in the excel file.   “Sheet 1” is last week’s inventory data of each store (total 11 stores till now: 15001~15011), including “existing stock Qty”(till last Sunday) & total “Sales Qty”(last Monday ...

you will implement a simple k-means clustering algorithm. K-means is a popular iterative clustering

In this assignment, you will implement a simple k-means clustering algorithm. K-means is a popular iterative clusteringalgorithm which finds k groups in a given training dataset. Inputs of the algorithm are: 1) k= Number of clusters to befound, and 2) Number of iterations to be perform...

(Data is flexible because I don't a set data, but company has to be Adobe Inc, the topic need to be related to how to help Adobe Inc.

Tableau Assignment: Interactive Application(Data is flexible because I don't a set data, but company has to be Adobe Inc, the topic need to be related to how to help Adobe Inc. expend sales market share from  difficult challenge, price strategy,  operations action, etc.) Tableau Packaged Workbook (.twbx) because instructor and I am able to run it and do presentation. ...

The project is largely focused on the process, and less on the findings. I will be looking for your adherence to the process of conducting a data analytic project, as well as your reflections on what you learned from going through the process.

PROJECT INSTRUCTIONS You have gotten a taste of what it takes to be a data professional. Now you get a chance to put it all together! Your Final Project is to choose a subject area (domain) that you are interested in and conduct you...

Apply dimensionality reduction feature selection in R using the attached dataset.

Apply dimensionality reduction feature selection in R using the attached dataset. interpret the result....

The candidate will have to attempt five questions in all, selecting atleast one question from each unit.

Instruction to the Candidate: The candidate will have to attempt five questions in all, selecting atleast one question from each unit.   ...

Provide the summary statistics for all the variables from the dataset. Explain some of the key aspects of the dataset.

Using SAS, build a k-nearest neighbor algorithm to predict the quality of wine from different factors. (input files are provided):1 - Provide the summary statistics for all the variables from the dataset. Explain some of the key aspects of the dataset.2 - Review the SAS code in the file knn.sas and for each SAS statement, provide explanation of the code as comments3 - Perform the k-NN using k = 1, 2, and 3. For each case, provid...

Many labor-intensive production operations experience a learning curve effect. The learning curve specifies that the cost to produce a unit is a function of cumulative production, that is, as production volume increases, the cost to produce each unit drop

1. Learning Curve Based Production Planning Open the file Learning.xlsx. Many labor-intensive production operations experience a learning curve effect. The learning curve specifies that the cost to produce a unit is a function of cumulative production, that is, as production ...

This is a short exercise to test your ability to construct and evaluate machine learning models.

  Introduction: This is a short exercise to test your ability to construct and evaluate machine learning models. You should produce a machine learning model and an evaluation metric. The goal of this exercise is not to produce a state-of-the-art machine learning model. Which model you use, and how you evaluate, is up to you. ...

What are the Objectives for the Theme that you selected.

Trends around healthcare: particularly child mortality in the united states, with a breakdown of causes, and by income level What are the Objectives for the Theme that you selected.What are the KPIs that will help tell your story.Create your Story which has Dashboards and Charts.What is the Actionable information that you would want your audience to wa...

The primary objective is to use classification techniques learnt so far. Each loan is graded (A to G) based on the risk, with A being least risky and G being the highest risk category.

We will revisit the Lending Club data for this week’s assignment. The company has existed since 2007 and have provided millions of personal loans since then. ...

The section should include history and background of organization’s name, and the industry associated with the organization.

For this project, you will write a 2-3 page APA formatted paper on a how your job/occupation or school major connects to Data Mining. You will select a particular industry you may be associated with and describe how you personally connect with that industry and Data Mining. You should provide discussion, references, and so on, in sufficient details. The paper should inclu...

The assignment guides you through the feature selection techniques.

The Third Part of the Assignment of DM 2020-2021 Introduction The assignment guides you through the feature selection techniques. It is recommended to follow the assignment in the given order since the result of some questions might depend on answers to previous steps. The questions are detailed in the provided Jupyter notebook. The Dataset Dataset: First, visit UCI machine learning repository. The link is provided here: https://archive.ics.uci.edu/ml/i...

RGV, Inc., sells international specialty food items to buyers from around the world. Management is currently realizing that issues such as rising shipping costs due to gas prices as well as competition are cutting into the company's profit margins.

RGV, Inc., sells international specialty food items to buyers from around the world. Management is currently realizing that issues such as rising shipping costs due to gas prices as well as competition are cutting into the company's profit margins. In an effort to reduce cost and increase profits, they ...

Construct a Horizontal Bar Chart to contrast US GDP per capita with the rest of the world for the year 2017. To provide context for the numbers, create a Distribution band to enclose between 50% and 150% of the average GDP per capita.

Questions 1-4 are based on the dataset gdp_per_capita.csv. The data was obtained from the World Indicators Dataset from WorldBank. Original data was filtered, cleaned and restructured. 1. Construct a Horizontal Bar Chart to contrast US GDP per capita with the rest of the world for the year 2017. To provide context for the numbers, create a Distribution band to enclose between 50% and 150% of the average GDP per capita. Lastly, highlight United States by creating a Group. The first few rows...

Your organization, a consumer automobile research firm, wishes to analyze data from a study of fuel economy among the major automobile models to determine how the variables in the data set correlate with fuel economy.

Option #1: Statistical Analysis for an Automobile Research FirmYour organization, a consumer automobile research firm, wishes to analyze data from a study of fuel economy among the major automobile models to determine how the variables in the data set correlate with fuel economy. You are tasked with developing a better understanding of the var...