- Statistics assignment
- ·· 4 pages, 1000 words Australia
- Open 8 Refers
- (+61) 45069****
- Apr 30, 2015 | Thursday 04:51 pm | 59 days 16 hrs ago
Statistics assignment
Part C
Table of Contents
Question 1) Statistical inference topic 7. 3
Question 2) Simple linear regression model topic 8. 3
a). 3
b). 4
Question 3) multiple linear regression model topic 9. 6
a). 8
b). 8
Reference. 9
Question 1)Statistical inference topic 7
For the two variables Dom Time and Non time there are 125 values. Now the average value of Dom time and Non time are 0.43 and 0.45 respectively. Since average value of Non Time is greater than the average value of Dom Time, it can be concluded that reaction time is not quicker using the dominating hand.
Question 2) Simple linear regression model topic 8
In this problem we have to find the relationship between age and height, using simple linear regression model.
a)
The scatter plot of these two variables is given by-
This scatter plot indicates that age and height are related, and sign of this relationship is positive.
b)
The following output is obtained from SPSS
Descriptive Statistics | |||
Mean | Std. Deviation | N | |
height | 161.8080 | 13.26267 | 125 |
age | 13.6880 | 1.85096 | 125 |
Table: 1
Correlations | |||
height | age | ||
Pearson Correlation | height | 1.000 | .603 |
age | .603 | 1.000 | |
Sig. (1-tailed) | height | . | .000 |
age | .000 | . | |
N | height | 125 | 125 |
age | 125 | 125 |
Table: 2
Model Summary | |||||||||
Model | R | R Square | Adjusted R Square | Std. Error of the Estimate | Change Statistics | ||||
R Square Change | F Change | df1 | df2 | Sig. F Change | |||||
1 | .603a | .363 | .358 | 10.62655 | .363 | 70.152 | 1 | 123 | .000 |
a. Predictors: (Constant), age Table: 3 | |||||||||
b. Dependent Variable: height |
Coefficients | ||||||
Model | Unstandardized Coefficients | Standardized Coefficients | t | Sig. | ||
B | Std. Error | Beta | ||||
1 | (Constant) | 102.700 | 7.121 | 14.423 | .000 | |
age | 4.318 | .516 | .603 | 8.376 | .000 | |
a. Dependent Variable: height Table: 4 |
From table 1 it is cleared that correlation between age and height .603, and significance value (p value) of this correlation is .000. Since p value of this correlation is less than 0.05, it can be said that correlation between age and height are statistically significant. So age and height are positively correlated. Value of coefficient of determination (R square) is 0.363; this value is obtained from table2. . R square is a measure, which is used to measure how close the actual data to the fitted regression line. Simply R square is the percentage or proportion of the total variation which is explained by this multiple regression model. 0.363 is not so much high. So it can be concluded that we cannot get good fit using this model. Here the regression equation(least square regression line) is Yt= 102.700 + 4.318Xt, here Y is the dependant variable, i.e. height and X is the independent variable i.e. age, and t represents the total number of variable, for this problem t is 125. In regression analysis it is very important to study the significance of the estimated coefficients. In regression model the confidents indicates the average change in dependent variable with respect to per unit change in independent variable. To test this significance of coefficient consider the following hypothesis –
H0: there is no relationship between age and height against the alternative H1: there is relationship between age and height (DiMaggio, 2013).
By regression analysis we can get the significance value of coefficients. If this significance value is less than 0.05, then we can reject the null hypothesis at 0.05 level of significance.
From the table 4 it can be said that significance value (p value) of the coefficient of independent variable (age) is .000. So it can be said that height depends on age(Thomson, and Emery, 2014).
Question 3) multiple linear regression model topic 9
The following output is obtained from SPSS
Correlations | ||||
height | age | sex | ||
Pearson Correlation | height | 1.000 | .603 | -.227 |
age | .603 | 1.000 | .010 | |
sex | -.227 | .010 | 1.000 | |
Sig. (1-tailed) | height | . | .000 | .005 |
age | .000 | . | .458 | |
sex | .005 | .458 | . | |
N | height | 125 | 125 | 125 |
age | 125 | 125 | 125 | |
sex | 125 | 125 | 125 |
Table: 1
Model Summaryb | |||||||||
Model | R | R Square | Adjusted R Square | Std. Error of the Estimate | Change Statistics | ||||
R Square Change | F Change | df1 | df2 | Sig. F Change | |||||
1 | .646a | .417 | .408 | 10.20634 | .417 | 43.692 | 2 | 122 | .000 |
a. Predictors: (Constant), sex, age Table: 2 | |||||||||
b. Dependent Variable: height |
Coefficientsa | ||||||
Model | Unstandardized Coefficients | Standardized Coefficients | t | Sig. | ||
B | Std. Error | Beta | ||||
1 | (Constant) | 111.506 | 7.322 | 15.229 | .000 | |
age | 4.334 | .495 | .605 | 8.752 | .000 | |
sex | -6.164 | 1.831 | -.233 | -3.367 | .001 | |
Table: 3 |
a)
From table 1 it can be said that correlation between sex and height -0.227, and significance value (p value) of this correlation is .005. Since significance value of this correlation is less than 0.05, it can be said that correlation between sex and height are statistically significant. So age and height are negatively correlated for this data set. Value of coefficient of determination (R square) is 0.417; this value is obtained from table2. . R square is a measure, which is used to measure how close the actual data to the fitted regression line. Simply R square is the percentage or proportion of the total variation which is explained by this multiple regression model (Menke, 2012). Since 0.417 is not so much high, it can be concluded that we cannot get good fit using this model. Here the regression equation (least square regression line) is Yt= 111.506 + 4.334Xt – 6.164 X’t, here Y is the dependant variable ( height) and X is the first independent variable(age), and X’t is the second independent variable (sex), t represents the total number of variable, for this problem t is 125. In regression analysis it is very important to study the significance of the estimated coefficients. In regression model the confidents indicates the average change in dependent variable with respect to per unit change in independent variable. To test this significance of coefficient consider the following hypothesis –
H0: there is no relationship between sex and height against the alternative H1: there is relationship between sex and height(endat, and Piersol, 2011).
By regression analysis we can get the significance value of coefficients. If this significance value is less than 0.05, then we can reject the null hypothesis at 0.05 level of significance(Gelman, et al, 2014).
Table 3 indicates that significance value of the coefficient of sex is .001. Since 0 .001 is less than 0.05, it can be concluded that height depends on sex(Agresti, 2013).
b)
R square value of the first regression model (question 2) is 0.363, and R square value of the second regression model is .417. It is known that R square is the measure of goodness of fit. Higher the value of R square, better the fit. So, sex makes a significant contribution to the regression model(DiMaggio, 2013).
Reference
Hair, J. F. (2010). Multivariate data analysis.
Gelman, A., Carlin, J. B., Stern, H. S., and Rubin, D. B. (2014). Bayesian data analysis (Vol. 2). Chapman
Gelman, A., Carlin, J. B., Stern, H. S., and Rubin, D. B. (2014). Bayesian data analysis (Vol. 2). C
Agresti, A. (2013). Categorical data analysis. John Wiley & Sons.
endat, J. S., and Piersol, A. G. (2011). Random data: analysis and measurement procedures (Vol. 729). John Wiley & Sons.
Menke, W. (2012). Geophysical data analysis: discrete inverse theory. Academic press.
Thomson, R. E., & Emery, W. J. (2014). Data analysis methods in physical oceanography. Newnes.
DiMaggio, C. (2013). Introduction (pp. 1-5). Springer New York.