Wk 7
Complete the problems below from the textbook. You will need the “Baseball 2016 Data,” “Lincolnville School District Bus Data,” and “Century National Bank Data” files for this assignment. The files are located in the topic materials.
- Chapter 13 – Problem 63
- Chapter 13 – Problem 64
- Chapter 14 – Problem 35
- Chapter 14 Case A – Century National Bank
For problems requiring computations, please ensure that your Excel file includes the associated cell computations and/or statistics output. This information is needed in order to receive full credit on these problems.
Submit output in one Excel file.
Chapter 13 – Problem 63
Refer to the Baseball 2016 data, which reports information on the 2016 Major League Baseball season. Let attendance be the dependent variable and total team salary, in millions of dollars, be the independent variable. Determine the regression equation and answer the following questions.
a. Draw a scatter diagram. From the diagram, does there seem to be a direct relationship between the two variables?
b. What is the expected attendance for a team with a salary of $80.0 million?
c. If the owners pay an additional $30 million, how many more people could they expect to attend?
d. At the .05 significance level, can we conclude that the slope of the regression line is positive? Conduct the appropriate test of hypothesis.
e. What percentage of the variation in attendance is accounted for by salary?
f. Determine the correlation between attendance and team batting average and between attendance and team ERA. Which is stronger? Conduct an appropriate test of hypothesis for each set of variables.
Chapter 13 – Problem 64
Refer to the Lincolnville School District bus data. Develop a regression equation that expresses the relationship between age of the bus and maintenance cost. The age of the bus is the independent variable.
a. Draw a scatter diagram. What does this diagram suggest as to the relationship between the two variables? Is it direct or indirect? Does it appear to be strong or weak?
b. Develop a regression equation. How much does an additional year add to the maintenance cost? What is the estimated maintenance cost for a 10-year-old bus?
c. Conduct a test of hypothesis to determine whether the slope of the regression line is greater than zero. Use the .05 significance level.
Interpret your findings from parts a, b, and c in a brief report.
Chapter 14 – Problem 35
Refer to the Lincolnville School District bus data. First, add a variable to change the type of engine (diesel or gas) to a qualitative variable. If the engine type is diesel, then set the qualitative variable to 0. If the engine type is gasoline, then set the qualitative variable to 1. Develop a regression equation using statistical software with maintenance cost as the dependent variable and age, odometer miles, miles since last maintenance, and engine type as the independent variables.
a. Develop a correlation matrix. Which independent variables have strong or weak correlations with the dependent variable? Do you see any problems with multicollinearity?
b. Use a statistical software package to determine the multiple regression equation. How did you select the variables to include in the equation? How did you use the information from the correlation analysis? Show that your regression equation shows a significant relationship. Write out the regression equation and interpret its practical application. Report and interpret R-square.
c. Develop a histogram or a stem-and-leaf display of the residuals from the final regression equation developed in parts a and b for residual analysis. Is it reasonable to conclude that the normality assumption has been met?
d. Plot the residuals from the final regression equation analyzed in part c against the fitted values of Y. Plot the residuals on the vertical axis and the fitted values on the horizontal axis.
Chapter 14 Case A – Century National Bank
Refer to the Century National Bank data. Using checking account balance as the dependent variable and, as independent variables, the number of ATM transactions, the number of other services used, whether the individual has a debit card, and whether interest is paid on the particular account, write a report indicating which of the variables seem related to the account balance and how well they explain the variation in account balance. Should all of the independent variables proposed be used in the analysis, or can some be dropped?
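The simple regression questions in Problems 63 and 64 (regression equation, prediction, effect of a change in the independent variable, slope test, and percentage of variation explained) all come from one least-squares fit. The following is a minimal Python sketch of that computation, offered only as a way to cross-check the Excel output the assignment requires. The function name `simple_regression` and the values in `x` and `y` are hypothetical toy numbers, not taken from the Baseball or bus data files.

```python
import math

def simple_regression(x, y):
    """Least-squares fit y = a + b*x, with the slope's t statistic and R-square."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    syy = sum((yi - mean_y) ** 2 for yi in y)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    # Sum of squared residuals around the fitted line
    sse = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    std_error_estimate = math.sqrt(sse / (n - 2))
    std_error_slope = std_error_estimate / math.sqrt(sxx)
    return {
        "intercept": intercept,
        "slope": slope,
        "t_slope": slope / std_error_slope,   # tests H0: slope <= 0
        "r_squared": 1 - sse / syy,           # share of variation explained
        "df": n - 2,
    }

# Toy values for illustration only (assumption: not from the assignment files)
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
fit = simple_regression(x, y)

predicted = fit["intercept"] + fit["slope"] * 10   # estimate at x = 10
extra = fit["slope"] * 30                          # change in y for 30 more units of x
# One-tailed critical t at the .05 level for df = 3, read from a t table
reject_h0 = fit["t_slope"] > 2.353
print(fit, predicted, extra, reject_h0)
```

With the real data, the critical value changes with the degrees of freedom (n - 2), so it should be looked up again for the actual sample size.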
Data Set 3 – Lincolnville School District Bus Data
Columns: ID, Manufacturer, Engine Type (0=diesel), Capacity, Maintenance Cost, Age, Odometer Miles, Miles
Variables
- ID = Bus identification number
- Manufacturer = Source of the bus (Bluebird, Keiser, or Thompson)
- Engine type = If the engine is diesel, then engine type = 0; if the engine is gasoline, then engine type = 1
- Capacity = Number of seats on the bus
- Maintenance cost = Dollars spent to maintain a bus last year
- Age = Number of years since the bus left the manufacturer
- Odometer Miles = Total number of miles traveled by a bus
- Miles = Number of miles traveled since last maintenance
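For Problem 35, the engine type coding (0 = diesel, 1 = gasoline) and the correlation matrix in part a can be sketched as follows. This is an illustrative Python sketch only: the helper names `pearson` and `correlation_matrix`, the text labels in `engine_labels`, and all numeric values are hypothetical and do not come from the bus data file.

```python
import math

def pearson(u, v):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(u)
    mean_u = sum(u) / n
    mean_v = sum(v) / n
    suv = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    suu = sum((a - mean_u) ** 2 for a in u)
    svv = sum((b - mean_v) ** 2 for b in v)
    return suv / math.sqrt(suu * svv)

def correlation_matrix(columns):
    """Return a nested dict of pairwise correlations for named columns."""
    names = list(columns)
    return {a: {b: pearson(columns[a], columns[b]) for b in names} for a in names}

# Hypothetical rows for illustration (assumption: not from the data file)
engine_labels = ["diesel", "gasoline", "diesel", "gasoline"]
columns = {
    "cost": [500, 1500, 3000, 4200],
    "age": [2, 5, 8, 10],
    # Qualitative (dummy) variable: 0 for diesel, 1 for gasoline
    "engine": [0 if label == "diesel" else 1 for label in engine_labels],
}

corr = correlation_matrix(columns)
for name, row in corr.items():
    print(name, {other: round(r, 3) for other, r in row.items()})
```

Reading the matrix, the row for the dependent variable shows which independent variables are strongly or weakly related to it, while large correlations between two independent variables are the usual warning sign for multicollinearity.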
Century National Bank Data
Columns: Balance, ATM, Services, Debit, Interest, City
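For Case A, deciding whether an independent variable can be dropped usually rests on the t statistic of its regression coefficient. The sketch below fits a multiple regression by solving the normal equations in plain Python and reports a t statistic per coefficient. It is an illustration under stated assumptions, not a substitute for the Excel or statistical software output: the function names `invert` and `ols`, and the values in `atm`, `debit`, and `balance`, are hypothetical and are not taken from the Century National Bank file.

```python
import math

def invert(a):
    """Gauss-Jordan inverse (with partial pivoting) of a small square matrix."""
    k = len(a)
    m = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(k)]
         for i, row in enumerate(a)]
    for c in range(k):
        pivot = max(range(c, k), key=lambda r: abs(m[r][c]))
        if abs(m[pivot][c]) < 1e-12:
            raise ValueError("singular matrix; check for perfectly collinear predictors")
        m[c], m[pivot] = m[pivot], m[c]
        pv = m[c][c]
        m[c] = [v / pv for v in m[c]]
        for r in range(k):
            if r != c:
                f = m[r][c]
                m[r] = [vr - f * vc for vr, vc in zip(m[r], m[c])]
    return [row[k:] for row in m]

def ols(rows, y):
    """Fit y = b0 + b1*x1 + ... and return coefficients, t statistics, R-square."""
    X = [[1.0] + [float(v) for v in r] for r in rows]   # intercept column first
    n, k = len(X), len(X[0])
    xtx = [[sum(X[i][p] * X[i][q] for i in range(n)) for q in range(k)] for p in range(k)]
    xty = [sum(X[i][p] * y[i] for i in range(n)) for p in range(k)]
    inv = invert(xtx)
    coef = [sum(inv[p][q] * xty[q] for q in range(k)) for p in range(k)]
    resid = [y[i] - sum(X[i][p] * coef[p] for p in range(k)) for i in range(n)]
    sse = sum(e * e for e in resid)
    mean_y = sum(y) / n
    sst = sum((v - mean_y) ** 2 for v in y)
    s2 = sse / (n - k)                                   # residual variance
    t = [coef[p] / math.sqrt(s2 * inv[p][p]) for p in range(k)]
    return {"coef": coef, "t": t, "r_squared": 1 - sse / sst, "df": n - k}

# Hypothetical values for illustration (assumption: not from the data file)
atm = [13, 9, 10, 4, 6, 12]
debit = [0, 1, 0, 0, 1, 1]
balance = [1800, 1500, 1600, 700, 1100, 2000]
fit = ols(list(zip(atm, debit)), balance)

T_CRIT = 3.182  # two-tailed .05 critical t for df = 3, read from a t table
keep = [abs(t) > T_CRIT for t in fit["t"][1:]]   # one flag per predictor
print(fit, keep)
```

A predictor whose t statistic falls short of the critical value is a candidate for removal, after which the model would be refit and R-square compared. With the real data, the degrees of freedom and critical value depend on the sample size and the number of predictors.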