Wading Through the Data Swamp:
Program Evaluation 201

Module 1: Descriptive Statistics in Evaluation - Page 13 of 21

Mean Alcohol Use
Alcohol Icon

Now let's look at these same kids' response to the GPRA Youth Survey Alcohol Question (Question 1).

Days of Alcohol Use, Past 30 Days

Picture of a scroll Scrolling Table! You can use the table below to scroll through the data.

Student # Days Used (x)
1 2
2 1
3 2
4 2
5 2
6 1
7 2
8 0
9 4
10 1
11 2
12 2
13 2
14 1
15 2
16 1
17 2
18 5
19 1
20 2
21 2
22 2
23 0
24 2
25 1
26 2
27 2
28 1
29 2
30 2
31 0
32 2
33 1
34 0
35 2
36 2
37 1
38 2
39 1
40 2
41 0
42 2
43 2
44 1
45 2
46 2
47 2
48 1
49 2
50 2
 

Here's how the scores look plotted on a graph.

bar graph displaying cumulative results of participants use of alcohol in the past 30 days

For these alcohol use scores, the mean is 1.64.
The median is 2 (half the scores are above 2, half below).
The mode is also 2 (the most common response).

The responses to this question show that nearly all (90 percent) of the kids were drinking some alcohol before they entered the program, but few of them drink that much. (At least not as much as the marijuana smokers used pot). Only two said they drank more than 1 to 2 days in the past 30.

This figure looks much more like a bell shape than the marijuana curve but is still not normally distributed. In order to know for sure we would need to check the skewness of the distribution. This can be done by an evaluator using a statistical software package such as SPSS or SAS.

This example shows us that we can have almost the same mean for past month use of alcohol and marijuana. However, the numbers of kids using are extremely different, as shown by the differences in the shape of the distributions. In this case, 30 percent of kids report using marijuana compared to 90 percent of kids using alcohol.

A red flag with the word warning written on it.

When Data Are Not Normally Distributed

It is important to know that some statistical tests may require that the data are normally distributed. If your data are not normally distributed, an evaluator will be able to use statistical methods to normalize your data. Ask your evaluator about this.