Using R Coding and Then ChatGPT to Figure Out an Outbreak

Ultimately, you’re the human with the lived experience, and GPTs are tools. Use them (your experience and the tools) wisely.

René F. Najera, MPH, DrPH
8 min readMar 17, 2024

--

A future epidemiologist? Not quite… (Image via DALL-E artificial intelligence from OpenAI.)

In my last post, I wrote about investigating an outbreak of food borne disease, and how the p-value for the statistical analysis I did while doing so can be changed. The p-value was initially not significant, but increasing the sample size made it significant without changing the measure of association. It’s a form of p-hacking if you’re a little too obsessed with p-values.

For this post, I will use a sample of 1,000 fake people who all ate at a fictional big event and became sick. I will then show you how we can determine the most likely food source of the outbreak. And I will finish with a discussion of the statistical analysis I did using ChatGPT’s Advanced Data Analysis tool.

I will link to the data and code at the end of the post.

First, Look At Your Data

--

--

René F. Najera, MPH, DrPH
René F. Najera, MPH, DrPH

Written by René F. Najera, MPH, DrPH

DrPH in Epidemiology. Public Health Instructor. Father. Husband. "All around great guy." https://linktr.ee/rene.najera

No responses yet