**Question**

**1) (5 points) Cost of Developing Software Data.** The three basic structural elements of a data processing system are files, flows, and processes. Files are collections of permanent records in the system, flows are data interfaces between the system and the environment, and processes are functionally defined logical manipulations of the data. An investigation of the cost of developing software as related to files, flows, and processes was conducted. The ANOVA table below is the partial output from the multiple linear regression analysis.

| Source | Degrees of freedom | Sum of Squares | Mean Squares | F | p-val |
|---|---|---|---|---|---|
| Regression | 3 | | | | <0.0001 |
| Residual | | 3945.1 | | | |
| Total | 24 | 18023.2 | | | |

- (1 point) Fill out the rest of the ANOVA table based on the provided information.
- (1 point) Use the table to calculate the coefficient of determination, R^2. Interpret.
- (3 points) Perform a model utility test (F test) from the ANOVA table. Significance level = 0.01. State the null and alternative hypotheses. State the F statistic along with the numerator and denominator degrees of freedom and the p-value. What can you conclude from the test?

**2) (17 points) Multiple Linear Regression.** The electric power (Power) consumed each month at a chemical plant is under study. It is assumed that the monthly electric consumption is related to four factors: average ambient temperature (Temp), number of days in the month (Days), average product purity in percent (Purity), and the volume of production in tons (Volume). Use the data and code provided under the Data Sets and Code tab to do the following.

- (3 points) Provide and describe a scatterplot matrix. Are there any relationships between the response variable and the predictors? Are there any visible relationships between the predictors?
- (3 points) Fit a least squares regression model using all variables. From the overall F test (model utility test), is there evidence that at least one variable is a significant predictor of power consumption? Assume a significance level of 0.05. Include R summary output.
- (2 points) How well does the model fit the data? In other words, what is R^2?
- (2 points) Give the estimated least squares regression equation.
- (2 points) Interpret the estimated slope of temperature.
- (2 points) Provide a power prediction in kW when Temp = 50°, Days = 25, Purity = 90%, and Volume = 100 tons.
- (1 point) In part e) the observed power for the above predictor values is 288 kW. How far off is the predicted value from the observed value?
- (2 points) Plot the residuals from the model. Are conditions satisfied? Briefly describe the plot.

**3) (28 points) Model Selection.** In the power consumption example, not all predictors are significant or needed in the model.
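
Returning to Question 1, the arithmetic behind a partial ANOVA table can be sketched in a few lines. This is only an illustrative sketch, not the course's provided code: the function name `complete_anova` is hypothetical, and it assumes that 3945.1 is the residual sum of squares (the only reading consistent with a total of 18023.2). It relies on the standard identities df_total = df_reg + df_res, SS_total = SS_reg + SS_res, MS = SS / df, F = MS_reg / MS_res, and R^2 = SS_reg / SS_total.

```python
def complete_anova(df_reg, df_total, ss_res, ss_total):
    """Fill in the missing entries of a regression ANOVA table.

    Hypothetical helper: assumes ss_res is the residual sum of squares
    and ss_total is the total sum of squares.
    """
    df_res = df_total - df_reg      # residual degrees of freedom
    ss_reg = ss_total - ss_res      # regression sum of squares
    ms_reg = ss_reg / df_reg        # regression mean square
    ms_res = ss_res / df_res        # residual mean square
    f_stat = ms_reg / ms_res        # model utility F statistic
    r2 = ss_reg / ss_total          # coefficient of determination
    return {
        "df_res": df_res,
        "ss_reg": ss_reg,
        "ms_reg": ms_reg,
        "ms_res": ms_res,
        "f_stat": f_stat,
        "r2": r2,
    }


# Values taken from the partial table in Question 1.
table = complete_anova(df_reg=3, df_total=24, ss_res=3945.1, ss_total=18023.2)
for name, value in table.items():
    print(f"{name}: {value:.4f}")
```

The F statistic would then be compared against an F distribution with `df_reg` numerator and `df_res` denominator degrees of freedom. The table already reports the p-value as <0.0001, so the sketch does not recompute it.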