Descriptive Statistics and Mean Centers

Statistical Analysis of Test Scores

I have been asked to analyze the test scores from two high schools in the Eau Claire area. These schools are Eau Claire Memorial and Eau Claire North. As test scores are usually used as an indicator on the success of a school, by comparing the two, people can get a sense of which school is doing better. In the past, Eau Claire North has always scored less on standardized test compared to Memorial which leaves some people in the community wondering if Eau Claire North has an issue. To fix this, members of the community have suggested that the teachers are to blame and should be replaced.

It is my job to look further into the issue and either confirm or deny the public's allegations. To do so, I will examine a sample of each schools test scores and use various statistical methods to make my analysis. These tools are the following:

Range: The amount between the lowest data point and the highest data point. To find this, you subtract the lowest from the highest. The resulting value is your range. For the test scores, that would be the lowest score and the highest score.

Mean: The statistical average of your dataset. It is found by taking the sum of your data points and dividing it by the number of values. This is the average score on the test for each dataset.

Median: The exact center of your data. Once ordered from smallest to largest the middle value or the middle of the two middle values is your median. In regards to the test scores, this may not be one of the scores, but rather the middle of the two center ones.

Mode: The value that repeats the most in a data set. The test score that is most commonly gotten.

Kurtosis: The shape your date makes when it is distributed and the change in shape compared to that of a normal distribution. This is usually a measure of if the shape is wider or thinner than normal.

Skewness: The amount the data points differ from the mean, higher or lower. When more data points fall on one side or the other of the mean, the data is skewed either positively or negatively, often showing outliers in your data.

Standard Deviation: Value that displays how the data is located around the mean and the distance that a data point is from the mean.

                                        Figure 1: Data spreadsheet
                       Figure 2: Eau Claire Memorial Test Scores Standard Deviation Calculation
                       Figure 3: Eau Claire North Test Scores Standard Deviation


After doing the calculations for both samples, figure 1 displays my findings. There are some key comparisons to consider when trying to figure out which school does better. First, when looking at the range, you can see that EC North’s range is lower than that of EC Memorial’s. This means that EC North’s scores are closer together in the span of 83 points. EC Memorial’s scores are distributed more sparsely covering 91 points. North’s scores are thus more consistent than Memorial’s, but not by much.  Secondly, when you examine each schools score means, you can see that North averages a better score on standardize tests by roughly 2.5 points. That value debunks the idea that Memorial does better on these tests. Additionally, the median for North’s data set is also higher than Memorials, another indicator that North’s students’ scores are over all higher. Even the mode, a near pointless measure is higher for EC North. From those four figures, it is already apparent that EC North does better on standardize testing than EC Memorial. Next, looking at the kurtosis values, North is within the normal distribution while Memorial has a platykurtic, or wider distribution. This is important and relates back to range showing that the Memorial scores are less consistent. The 6th measurement is skewness, and while they are both negatively skewed, EC North has a greater skew meaning that more of their scores compared to Memorial are above the mean. Lastly, when we look at the standard deviation, EC North has a smaller value of 23.635. This means that 68% of the scores fall within 23.635 points from the mean as opposed to 27.158 points for Memorial.

                Now that is a lot of figures and values and to many it does not mean much. The overall point of the analysis is that the public’s assumption that Eau Claire North scores worse than Memorial is false. Yes, Memorial has high scores and seemingly more of them, but when it is broken down and analyzed, the statistics point to North as the stronger school. Regarding the proposal to fire teachers from Eau Claire North, they should feel safe in knowing that they are having success in their methods and it is showing on standardized tests. The biggest indicators of this in my opinion are the range, mean and skewness. North’s range is smaller than Memorial’s showing that as a whole, their scores are closer together. That is even more important when you look at the mean. First, the mean is higher than Memorials’. Simple as that, their scores are better. Lastly, skewness shows us that North had fewer scores below the mean than above it. Most people in the sample score above it.

Part II
Mean Centers and Weighted Mean Centers

Tasked with finding a couple of mean centers for the state of Wisconsin, I joined my dataset with a shapefile of Wisconsin using ArcMap. The data used is from the United States Census Bureau. A Mean Center is the middle of a geographical area based upon the x and y values of that area. A Weighted Mean Center is a mean center that uses specific data to find the middle of. For this assignment, that is population, meaning the weighted mean center will find the center of the state based on the distribution of population in the state. I was asked to find 3 mean centers, the geographical mean center, the mean center weighted by the population in the year 2000 and the mean center weighted by the population of 2015. Each point is located on the map below.

                     Figure 4: Mean Centers of Wisconsin Based On Geography and Populations

As you can see in figure 4, the geographical mean center, as indicated by the green star is located in the center of the state in eastern Wood County. Used as a reference point, the differences from the geographical center compared to when you weight it by population is drastic. The mean center weighted by the 2000 population is symbolized by the red circle. When you take the center based off population it becomes clear that the majority of the state’s population is located in the southern counties. Additionally, more people live in eastern Wisconsin than western. This is shown by the location of the mean center for the 2000 population, located in Green Lake County, noticeably shifted south west from the geographical center. This is not surprising as Wisconsin’s two largest cities, Milwaukee and Madison are both in this region of the state. Next, when you add the mean center based on the population in 2015, we see another shift, this time south of the 2000 mean center. While the shift is minor, it does indicate some changes throughout the state. In that 15 year period, the southern part of the state became even more concentrated with population.

With these changes in mean center, we can infer they have changed for a number of reasons. First, in the middle of the time period examined, 2000 to 2015, the United States had a major recession. This recession hit rural areas especially hard. People struggled with money and many lost their homes or relocated to find new jobs. As the northern part of the state is largely rural, the shift in where population is located even farther south could have to do with the recession. As more and more people relocate to the south to find jobs, the mean center shifts south, following them. Another possibility is the trend that it is becoming increasingly harder to make a decent living off agriculture. This trend follows the same outcomes as the one stated above, forcing framers to move to more urban areas for jobs. A major reason for the population centers to be shifted towards the east is the urban development of the eastern part of the state. Areas like Milwaukee County and Brown County have been steadily growing, meeting the demands of the national economy.


While these reasons are all just speculation, it goes to show that changes in mean centers weighted by population are subject to a wide range of variables. I’m sure that these moves could be seen in many states and that the trend of population shifting from rural to urban is not limited to Wisconsin, but seen across the nation. Using a map to visualize where population is focused gives the reader a lot of insight to the makeup of the state. You wouldn’t even need to know much about Wisconsin to make assumptions based off the mean centers. 

Comments

Popular posts from this blog

Coffee Shops Customer Report, San Francisco

Trader Joe's Site Selection (Geog 352)

Regression Analysis