Hypothesis testing of a single mean--verbal and math ability of CS students

This activity provides independent practice in use of the one-sample t-test (comparing a sample mean to a population mean) within the context of the 4 steps of hypothesis testing:

State the appropriate null and alternative hypotheses, Ho and Ha.
Obtain a random sample, collect relevant data, and check whether the data meet the conditions under which the test can be used. If the conditions are met, summarize the data by a test statistic.
Find the p-value of the test.
Based on the p-value, decide whether or not the results are significant and draw your conclusions in context.^[1]

OBS: ID number
GPA: The 3-semester grade-point average (0-4 scale)
HSM: average high school grade in math (1-10 scale, with 10=A, 9=A-, etc.)
HSS: average high school grade in science (1-10 scale, with 10=A, 9=A-, etc.)
HSE: average high school grade in English (1-10 scale, with 10=A, 9=A-, etc.)
SATM: SAT Mathematics score (circa 1980-82)
SATV: SAT Verbal score (circa 1980-82)
SEX: 1=male; 2=female

The researchers might have been interested in how the SAT Math and Verbal scores for the computer science students compared with scores from other university students. Let's assume the following population values so you can practice how to compare the sample mean to the population mean:

Mean SATM for all university students = 539
Mean SATV for all university students = 498

For the analysis, the significance level, α, is set at .05.

Dataset

Obtain the dataset from one of the following:

class website: csdata.por (portable file format)
website accompanying Introduction to the Practice of Statistics by Moore, McCabe, and Craig (zip files in various formats)
dataset list: csdata.por (portable file format)

Analyses

The following instructions and guiding questions will step you through the analysis process. Copy and paste the following sections ("SATM", "SATV", and "Summarize") into a word processor. Provide responses as indicated.

SATM

Let μ be the mean SATM score for the population of students at the university. State the hypotheses that are being tested in this problem.
Data collection and examination
- Look at the data. Using SPSS, calculate descriptive statistics and create a histogram (see instructions). Describe the data and shape of the distribution.
- Explain why the conditions which allow us to safely use the one-sample t test are met.
- Would it be valid to use the t test if the data were highly skewed with a few large outliers? Explain.
- Using SPSS, run the one-sample t test procedure.
- Report the value of the test statistic.
- How is the t statistic calculated (write the formula)?
- Describe what this t statistic value means.
Report the p-value for the statistical test.
Interpret the analysis results in the context of the research question.
- Indicate whether or not Ho is rejected. Provide evidence.
- Draw conclusions based on the results, given the context of the research question.
- If Ho is rejected, report a confidence interval appropriate to the given significance level.

SATV

Let μ be the mean SATV score for the population of students at the university. State the hypotheses that are being tested in this problem.
Data collection and examination
- Look at the data. Using SPSS, calculate descriptive statistics and create a histogram (see instructions). Describe the data and shape of the distribution.
- Explain why the conditions which allow us to safely use the one-sample t test are met.
- Using SPSS, run the one-sample t test procedure.
- Report the value of the test statistic.
Report the p-value for the statistical test.
Interpret the analysis results in the context of the research question.
- Indicate whether or not Ho is rejected. Provide evidence.
- Draw conclusions based on the results, given the context of the research question.
- If Ho is rejected, report a confidence interval appropriate to the given significance level.

Summarize

Integrate your findings from the two analyses for SATM and SATV.
What limitations (related to sample, research design, choice of analyses...) affect the validity of this research?

References

↑ Open Learning Initiative. Statistics. Retrieved from the Open Learning Initiative web site http://oli.web.cmu.edu/openlearning/forstudents/freecourses/statistics.
↑ Campbell, P.F. and McCabe, G.P. (1984). "Predicting the success of freshman in a computer science major." Communications of the ACM, pp. 1108-1113.

[1] Open Learning Initiative. Statistics. Retrieved from the Open Learning Initiative web site http://oli.web.cmu.edu/openlearning/forstudents/freecourses/statistics.

[2] Campbell, P.F. and McCabe, G.P. (1984). "Predicting the success of freshman in a computer science major." Communications of the ACM, pp. 1108-1113.

[1]

[2]

Hypothesis testing of a single mean--verbal and math ability of CS students

Contents

Research question

Dataset

Analyses

SATM

SATV

Summarize

References

Navigation menu

Personal tools

Namespaces

Variants

Views

Actions

Search

Navigation

Community

Print/export

Tools