Part A
The study used the height and weight data, which was retrieved from the Kaggle database. The data contains 3 variables, which include gender, weight, and height. https://www.kaggle.com/mustafaali96/weight-height
1.Research question
Is there any relationship between the height and weight of individuals?
2.Target population
The target population are the people seeking to known their body weight through the use of height, particularly the sick.
3.Main variables
The main variables in the dataset are weight and height, which are both classified as ratio level of measurement.
4.Issues in data collection
The data collection from the Kaggle database is free; however, the user is expected to register the use.
5.Graphical format
The scatter plot above exhibits that weight and height had a strong linear relationship, whereby an increase in height may result to an increase in weight.
Part B
Question 1
a)Comparing two variables
The most appropriate graph to compare or display weight and height is the scatter plot. The scatter plot is tool that exposes the relationship between two continuous variables.
The. scatter plot above exhibits that weight and height had a strong linear relationship, whereby an increase in height may result to an increase in weight.
b)Proportion of Categorical variable
Among the three variables, gender is a categorical variable, thus the appropriate chart to describe gender is pie chart. Pie chart exposes the proportion of a categorical variable is percentage and number of levels within the variables.
The chart above exhibits that 50% of the respondents were female and 50% were male.
Question 2
a)Number of classes
Weight has 110 observations
Let k be the number of class intervals and n is the sample size.
The results above show that 27 is greater than the sample size 110 hence the researcher’s decision is suggestion of using 7 as the number of classes is correct.
b)Class width
The class width for weight is given is 20
c)Histogram
Bin Frequency
110 2
130 21
150 25
170 26
190 19
210 11
230 5
250 1
The chart above shows that weight is skewed to the right.
Question 3
a)Numerical summary
Descriptive Statistics Height Weight
Mean 66.03457 158.2645
Median 65.87537 154.9426
Mode #N/A #N/A
Standard Deviation 3.642515 29.67871
Sample Variance 13.26792 880.8257
Range 16.45348 135.0475
Minimum 58.75249 106.846
Maximum 75.20597 241.8936
1st Quartile 63.14417 134.4348
3rd Quartile 68.35192 180.0381
b)Comment on descriptive Statistics
The table above exhibits the summary statistics of both height and weight variables. As shown, height record a mean of 66.034 with a standard deviation of 3.642; besides, the maximum height was 75.206 whereas the minimum height was 58.752 thus forming and range of 16.453. On the other side, weight record a mean of 158.265 with a standard deviation of 29.679; besides, the maximum height was 241.894 whereas the minimum height was 106.846 thus forming and range of 135.048.
Question 4
The study aimed at exploring the relationship between height and weight. Thus, data was retrieved from the Kaggle database, whereby 110 observations were incorporated in the study. Among the 110 participants 55% were female whereas 45% were male. As evident, height record a mean of 66.034 with a standard deviation of 3.642 whereas weight record a mean of 158.265 with a standard deviation of 29.679. Consequently, the scatter plot above exhibits that weight and height had a strong linear relationship, whereby an increase in height may result to an increase in weight. Generally, the study successfully answered the research