Data Set: BioNTech Adolescents

Nick Paterno February 6th, 2022

The data set is biontech_adolescents. This data set is available on openintro.org/data and in the `openintro` R package.

On March 31, 2021, Pfizer and BioNTech announced that "in a Phase 3 trial in adolescents 12 to 15 years of age with or without prior evidence of SARS-CoV-2 infection, the Pfizer-BioNTech COVID-19 vaccine BNT162b2 demonstrated 100% efficacy and robust antibody responses, exceeding those recorded earlier in vaccinated participants aged 16 to 25 years old, and was well tolerated." These results are from a Phase 3 trial in 2,260 adolescents 12 to 15 years of age in the United States. In the trial, 18 cases of COVID-19 were observed in the placebo group (n = 1,129) versus none in the vaccinated group (n = 1,131).

A summary table from the Phase 3 Pfizer-BioNTech Covid-19 vaccine trial for adolescents 12 to 15 years of age. The first row shows 1131 out of 1131 adolescents in the vaccine group did not get covid and no one got covid. The second row shows that 1111 out of 1129 adolescents in the placebo group did not get covid and 18 did get covid.

This data set provides a great opportunity for students to practice some basic statistics with real world data that they see in the news on a daily basis. Instead of just taking the news as it is given they can actually do the computations themselves with the raw data. In particular, this data set allows them to conduct a chi-square test for independence to see if getting the vaccine has an effect on contracting COVID-19. Below are a few lines of code - and the output - that students can run to show that there is a relationship between not getting the vaccine and getting COVID at the 99.9% significance level.

An image of two lines of R code used to run a Chi-Square test for independence. The results of the test are in the table below.
Results from running a Chi-Square test for independence in R. The results show a test statistic of 16.21 and a p-value of 0.0000566.