PROF-740 Fundamentals of Data Analytics Assignment 1 FALL 2021 Submission Date: October 20st, 2021, 23:00 STUDENT NAME: STUDENT ID: INSTRUCTIONS: Read carefully the instructions: Submit the answers to the questions in PDF document to myCourses by the due date. Other formats are not accepted and are not graded.Use the same format of the assignment. Don’t change or delete any part of the assignment.Late submission for each day will penalize 2% of the total grade. After one week, the assignment will be not graded.Assignment without a name is not gradedThe file submitted in Mycourses should have the following format: “PROF740_firstname_lastname_hw_1.pdf”.The maximum word limit per essay question is 300 words.Short essay questions will be checked for plagiarism using Turnitin software.The two datasets used in this assignment are uploaded in Mycourses. BACKGROUND The purpose of this assignment is to understand the discipline of the data mining and the data exploration using statistics. QUESTIONS PART A: SHORT ESSAY QUESTIONS- (30points): Question 1 (6 points): Provide the definition of data mining Question 2 (6 points): How can outliers be identified in a dataset? Question 3 (6 points): Provide the definition for descriptive and predictive analytics. Question 4 (6 points): List and briefly define the dispersion measures of descriptive statistics. Question 5 (6 points): What is the relationship between statistics and data mining? PART B: PRACTICE EXERCISES Question 6: (40points) The average time that Americans commute to work is 27.7 minutes. The average commute times in minutes for 48 cities are as presented in the dataset with name “USCITIES_COMMUTE.xls”. You need answer the following questions and present your work using R scripts. a. What is the mean commute time for these 48 cities? (8pts) b. What is the median commute time for these 48 cities? (8pts) c. What is the mode for these 48 cities? (8pts) d. What is the variance and standard deviation of commute times for these 48 cities? (8pts) e. What is the third quartile of commute times for these 48 cities? (8pts) Question 7: (30points) AVON PERFUMES ON-LINE COMMERCE AVON manufactures and sells quality perfumes products at its plant and retail store located in Edinburgh, Scotland. Three years ago, the company developed a web site and began selling its products over the Internet. Web site sales have exceeded the company’s expectations, and management is now considering strategies to increase sales even further. To learn more about the web site customers, a sample of 50 AVON online transactions was selected from the previous month’s sales. Data showing the day of the week each transaction was made, the type of browser the customer used, the time spent on the web site, the number of web pages viewed, and the amount spent by each of the 50 customers are contained in the file named “AvonPerfumesSales”. Managerial Report Use the methods of descriptive statistics to learn about the customers who visit the AVON Perfumes web site. Include the following in your report. 1. Graphical and numerical summaries for the length of time the shopper spends on the web site, the number of pages viewed, and the mean amount spent per transaction. Discuss what you learn about AVON perfumes’ online shoppers from these numerical summaries. (10pts) 2. Summarize the frequency, the total dollars spent, and the mean amount spent per transaction for each day of week. Discuss the observations you can make about AVON perfumes’ business based on the day of the week? (10pts) 3. Summarize the frequency, the total dollars spent, and the mean amount spent per transaction for each type of browser. Discuss the observations you can make about AVON perfumes’ business based on the type of browser? (10pts)