Lily Gebhart, DIMACS REU 2023

DIMACS REU 2023

General Information

Student:	Lily Gebhart
Mentor:	John Kolassa
School:	Occidental College
E-mail:	gebhart (at) oxy (dot) edu
Project:	Approximations for Kurtosis and Continuity on the Prentice Test

Project Description

Nonparametric statistics is a subfield of statistics that makes minimal assumptions about the distribution of the data, which makes it well suited to the analysis of real-world phenomena. The test of Prentice is a nonparametric statistical test for the two-way analysis of variance using ranks. The null distribution of this test is approximated by the chi-square distribution. However, the exact null distribution deviates from the chi-square approximation in certain cases commonly found in applications, motivating adjustments to the approximation. This summer, we presented adjustments to this null distribution, and to those of related tests with non-polynomial scoring systems, correcting for continuity, skewness, and kurtosis in the multivariate case.

Weekly Log

Week 1: May 31 - June 2

This week, I spent most of my time building an understanding of the two problems in nonparametric statistics that I will be working on this summer. I read papers [1], [2], [3], and [4] in the list below, sections of [5], and other miscellaneous resources. This reading helped me understand how to write code to study the impact of kurtosis adjustments on the Kruskal-Wallis and Friedman tests. I also found code corresponding to the improvements to the Friedman test made in [3] below.

Lastly, I prepared my talk for the introductory presentations scheduled for next Monday. The background I had built up over the week paid off here: I put the presentation together more quickly than I had expected.

Week 2: June 5 - June 9

On Monday, I gave my introductory presentation to the REU cohort. It was exciting to see what everybody would be working on over the summer!

Over the course of the week, I continued to review background material for the Kruskal-Wallis test project. I kept studying [2] to understand how to replicate its results so that I can build on them later this summer. I also read Chapters 1-3 and portions of Chapter 6 of [6] below, which built my background in asymptotics and in Edgeworth series in both univariate and multivariate forms. In addition, I spent some time reviewing content from Real Analysis, Measure Theory, Probability Theory, and Complex Analysis using online resources.

Towards the end of the week, I began looking at how to calculate the first through fourth cumulants of a multivariate rank sum distribution. I will likely continue with this next week and hopefully get code running for the results in [2].
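As a small illustration of what these cumulants look like, here is a minimal Python sketch (the project itself used R) that computes the first four cumulants of a single group's rank sum by brute-force enumeration. It covers only the univariate case with no ties, and the function name rank_sum_cumulants is made up for this example; the multivariate calculation from the project is not reproduced here.

```python
from fractions import Fraction
from itertools import combinations


def rank_sum_cumulants(n1, N):
    """First four cumulants of the sum of n1 ranks drawn from 1..N.

    Under the null hypothesis (no ties), every subset of n1 ranks is
    equally likely, so we enumerate all subsets and use exact fractions.
    Only feasible for small N.
    """
    sums = [sum(c) for c in combinations(range(1, N + 1), n1)]
    count = len(sums)
    mean = Fraction(sum(sums), count)

    def central_moment(k):
        return sum((s - mean) ** k for s in sums) / count

    mu2, mu3, mu4 = central_moment(2), central_moment(3), central_moment(4)
    # Cumulants in terms of central moments:
    # k1 = mean, k2 = mu2, k3 = mu3, k4 = mu4 - 3 * mu2^2
    return mean, mu2, mu3, mu4 - 3 * mu2 ** 2


if __name__ == "__main__":
    k1, k2, k3, k4 = rank_sum_cumulants(3, 7)
    print(k1, k2, k3, k4)
```

The first two values can be checked against the standard closed forms n1(N+1)/2 and n1(N-n1)(N+1)/12, and the third cumulant is zero because the rank sum distribution is symmetric about its mean.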

Week 3: June 12 - June 16

This week, I started coding the approximation from [2] for the null distribution of the Kruskal-Wallis test in R. By the end of the week, I had it written and could start comparing it to the known Kruskal-Wallis null distribution and to the chi-square distribution that is often used to approximate it. Next week, I will continue working on the comparisons to other tests to make sure that my code for the Yarnold approximation is correct, and I will implement a few other measures to make the code faster.
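To make the kind of comparison described above concrete, here is a minimal Python sketch (the project code itself was written in R and is not shown here). It enumerates the exact null distribution of the Kruskal-Wallis statistic for three small groups with no ties and compares an upper-tail probability with the chi-square approximation on 2 degrees of freedom. It does not implement the approximation from [2]; the function names are made up for this example.

```python
from itertools import combinations
from math import exp


def kw_statistic(rank_groups, N):
    """Kruskal-Wallis H = 12 / (N (N + 1)) * sum(R_j^2 / n_j) - 3 (N + 1)."""
    total = sum(sum(g) ** 2 / len(g) for g in rank_groups)
    return 12.0 / (N * (N + 1)) * total - 3 * (N + 1)


def exact_kw_null(sizes):
    """H for every assignment of ranks 1..N to groups of the given sizes.

    Under the null hypothesis with no ties, all assignments are equally
    likely, so the returned list is the exact null distribution.
    """
    N = sum(sizes)
    values = []

    def recurse(remaining, idx, groups):
        if idx == len(sizes) - 1:
            # The last group receives whatever ranks are left over.
            values.append(kw_statistic(groups + [tuple(remaining)], N))
            return
        for chosen in combinations(remaining, sizes[idx]):
            taken = set(chosen)
            rest = [r for r in remaining if r not in taken]
            recurse(rest, idx + 1, groups + [chosen])

    recurse(list(range(1, N + 1)), 0, [])
    return values


def exact_tail(values, h):
    """Exact P(H >= h), with a small tolerance for floating-point error."""
    return sum(v >= h - 1e-9 for v in values) / len(values)


def chi2_tail_df2(h):
    """Upper tail of the chi-square distribution with 2 degrees of freedom."""
    return exp(-h / 2)


if __name__ == "__main__":
    values = exact_kw_null((3, 3, 3))
    h = max(values)
    print(h, exact_tail(values, h), chi2_tail_df2(h))
```

For three groups of size 3, the largest possible value of H is 7.2. Its exact tail probability is 6/1680, about 0.0036, while the chi-square approximation gives exp(-3.6), about 0.027, which illustrates how far the approximation can be from the exact distribution in small samples.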
