Blog 4
March 8, 2025
Thomas Hoppner Blog 4
1) Provide any data that you’ve already collected in a readable and comprehensible form (this might be in chart or table format, as quotes assembled from various media, as some sort of word cloud or graph, etc.).
2) Explain this data to your reader. What is significant here and why? What does it mean in terms of your thesis/hypothesis/theory? Does it confirm, refute, or qualify your central claim?
3) If your data set is incomplete, you can explain what you still need to do to complete your data and why (we should already know how if your method section is any good).
4) Whether complete or incomplete, evaluate your data. What does it seem to be missing for you to make a full and comprehensive argument? What has surprised you? What value does this data have for your inquiry, for your field, or for various fields? Why does this data matter and to whom?
1)
2) I did all the statistical analysis on google sheets and I first used ANOVA (analysis of variance) tests for every basketball statistic(rebounds, steals, blocks) and I found that none of them had a pval of less than 0.05. This means that none of the values for each separate division were statistically significant in differing from each other. So, I used Cohen’s d which was used to calculate effect sizes for differences in game-related statistics (d = 0.2 for small effects, d = 0.5 for moderate effects, and d = 0.8 for large effects). I individually calculated each static’s effect size from D1-D2, D1-D3, and D2-D3. To list a few, I found that for Free throw percentage, the effect size from D1-D3 was 0.86(L) and from D2-D3 was 1.0(L), as I investigated, the mean for D3 was very low compared to the other Divisions, making free throw percentage (percent of free throws made) an important differentiating statistic for Division 3. I found offensive rebounds has an effect size of 0.47(M) from D1-D2 but low for D2-D3 and D1-D3. I also found that for turnovers, the effect size for D1-D3 and D2-D3 was 0.51 and 0.61 respectively, so D3 clearly is different from the other two divisions in this aspect; as I looked at the mean, they actually had much less turnovers than D1 and D2 which is odd since they are like a worse division which means a worse division is losing the ball(turnover) less, this interesting point does bring up strategic ideas. Specifically, maybe D1 and D2 are willing to be more risky on offense and are okay with turning the ball over more, or D3 is just super careful in not turning the ball over, or something else. There’s a lot more but the whole thing does somewhat support my claim that there are differences in the amount of each statistic per division but not to the level of pval<0.05. There are a few more tests I could do(I do not know what to focus on now) that could bring more depth into my method. My main thing I’m missing is a bigger data set for Division 3 and Division 1 I feel like. When I look at the variance for these two, It is usually higher than DIvision 2 which has 10 teams compared to the 8 for D1 and 4 for D3 but there’s nothing I can really do about it for this paper in my opinion. At first when I saw that I has no statistically significant values at pval less than 0.05, I thought I was done for but as I saw Conhen’s D on some sources with values greater than 0.05, I was surprised to find some actual data. That data provides interesting theories about not just Region X but also each NCAA division in general. Although the sample isnt so big, there can be implications for other region’s leagues and conferences and differing levels of NCAA play for many sports like soccer and baseball. This matters because finding out which statistics separate the better divisions from the worse once can give people who are either playing collegiate basketball now, are planning to play, or are just spectating, a clear idea as to which statistics to look at and try to improve upon to make it to the next level. SImilarly, coaches and staff at colleges can use this data to refine their game strategies and improve practices/training their players to tailor them to stronger Division’s expectations.
Think about the original curiosity that led to your inquiry. What other areas of inquiry might that same curiosity lead to? This statistical analysis stuff could point me in the direction of sports science. However, I feel like it picked this just because I enjoyed my time playing basketball in Atlanta and college basketball so I felt this was an exciting idea. I feel like more than just sports, data analytics can lead me to business or finance or anything like that. 
Reader Interactions
Comments
Leave a Reply
You must be logged in to post a comment.

Interesting findings! Did you use publicly available data?
If I’m understanding this right, it seems like your results aren’t statistically significant? It looks like you found another source with similar P-values that you’re relying on for your final conclusions, but other than that, is there another way you can maybe increase the significance of your results?