An Introduction to Statistics: Choosing the Correct Statistical Test (2024)

Journal List
Indian J Crit Care Med
v.25(Suppl 2); 2021 May
PMC8327789

As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsem*nt of, or agreement with, the contents by NLM or the National Institutes of Health.
Learn more: PMC Disclaimer | PMC Copyright Notice

Indian J Crit Care Med. 2021 May; 25(Suppl 2): S184–S186.

doi:10.5005/jp-journals-10071-23815

PMCID: PMC8327789

PMID: 34345136

Priya Ranganathan¹

Author information Copyright and License information PMC Disclaimer

Abstract

The choice of statistical test used for analysis of data from a research study is crucial in interpreting the results of the study. This article gives an overview of the various factors that determine the selection of a statistical test and lists some statistical testsused in common practice.

How to cite this article: Ranganathan P. An Introduction to Statistics: Choosing the Correct Statistical Test. Indian J Crit Care Med 2021;25(Suppl 2):S184–S186.

What is the Research Hypothesis?

Sometimes, a study may just describe the characteristics of the sample, e.g., a prevalence study. Here, the statistical analysis involves only descriptive statistics. For example, Sridharan et al. aimed to analyze the clinical profile, species distribution, and susceptibility pattern of patients with invasive candidiasis.³ They used descriptive statistics to express the characteristics of their study sample, including mean (and standard deviation) for normally distributed data, median (with interquartile range) for skewed data, and percentages for categorical data.

Studies may be conducted to test a hypothesis and derive inferences from the sample results to the population. This is known as inferential statistics. The goal of inferential statistics may be to assess differences between groups (comparison), establish an association between two variables (correlation), predict one variable from another (regression), or look for agreement between measurements (agreement). Studies may also look at time to a particular event, analyzed using survival analysis.

Are the Comparisons Matched (Paired) or Unmatched (Unpaired)?

Observations made on the same individual (before–after or comparing two sides of the body) are usually matched or paired. Comparisons made between individuals are usually unpaired or unmatched. Data are considered paired if the values in one set of data are likely to be influenced by the other set (as can happen in before and after readings from the same individual). Examples of paired data include serial measurements of procalcitonin in critically ill patients or comparison of pain relief during sequential administration of different analgesics in a patient with osteoarthritis.

What are the Type of Data Being Measured?

The test chosen to analyze data will depend on whether the data are categorical (and whether nominal or ordinal) or numerical (and whether skewed or normally distributed). Tests used to analyze normally distributed data are known as parametric tests and have a nonparametric counterpart that is used for data, which is distribution-free.⁴ Parametric tests assume that the sample data are normally distributed and have the same characteristics as the population; nonparametric tests make no such assumptions. Parametric tests are more powerful and have a greater ability to pick up differences between groups (where they exist); in contrast, nonparametric tests are less efficient at identifying significant differences. Time-to-event data requires a special type of analysis, known as survival analysis.

How Many Measurements are Being Compared?

The choice of the test differs depending on whether two or more than two measurements are being compared. This includes more than two groups (unmatched data) or more than two measurements in a group (matched data).

Tests for Comparison

(Table 1 lists the tests commonly used for comparing unpaired data, depending on the number of groups and type of data. As an example, Megahed and colleagues evaluated the role of early bronchoscopy in mechanically ventilated patients with aspiration pneumonitis.⁵ Patients were randomized to receive either early bronchoscopy or conventional treatment. Between groups, comparisons were made using the unpaired t test for normally distributed continuous variables, the Mann–Whitney U-test for non-normal continuous variables, and the chi-square test for categorical variables. Chowhan et al. compared the efficacy of left ventricular outflow tract velocity time integral (LVOTVTI) and carotid artery velocity time integral (CAVTI) as predictors of fluid responsiveness in patients with sepsis and septic shock.⁶ Patients were divided into three groups— sepsis, septic shock, and controls. Since there were three groups, comparisons of numerical variables were done using analysis of variance (for normally distributed data) or Kruskal–Wallis test (for skewed data).

Table 1

Tests for comparison of unpaired data

Type of data	Two groups	More than two groups
Nominal	Chi-square test or Fisher's exact test
Ordinal or skewed	Mann–Whitney U-test (Wilcoxon rank sum test)	Kruskal–Wallis test^*
Normally distributed	Unpaired t-test	Analysis of variance (ANOVA)^*

Open in a separate window

^*To be followed by post hoc testing

A common error is to use multiple unpaired t-tests for comparing more than two groups; i.e., for a study with three treatment groups A, B, and C, it would be incorrect to run unpaired t-tests for group A vs B, B vs C, and C vs A. The correct technique of analysis is to run ANOVA and use post hoc tests (if ANOVA yields a significant result) to determine which group is different from the others.

(Table 2 lists the tests commonly used for comparing paired data, depending on the number of groups and type of data. As discussed above, it would be incorrect to use multiple paired t-tests to compare more than two measurements within a group. In the study by Chowhan, each parameter (LVOTVTI and CAVTI) was measured in the supine position and following passive leg raise. These represented paired readings from the same individual and comparison of prereading and postreading was performed using the paired t-test.⁶ Verma et al. evaluated the role of physiotherapy on oxygen requirements and physiological parameters in patients with COVID-19.⁷ Each patient had pretreatment and post-treatment data for heart rate and oxygen supplementation recorded on day 1 and day 14. Since data did not follow a normal distribution, they used Wilcoxon's matched pair test to compare the prevalues and postvalues of heart rate (numerical variable). McNemar's test was used to compare the presupplemental and postsupplemental oxygen status expressed as dichotomous data in terms of yes/no. In the study by Megahed, patients had various parameters such as sepsis-related organ failure assessment score, lung injury score, and clinical pulmonary infection score (CPIS) measured at baseline, on day 3 and day 7.⁵ Within groups, comparisons were made using repeated measures ANOVA for normally distributed data and Friedman's test for skewed data.

Table 2

Tests for comparison of paired data

Type of data	Two groups	More than two groups
Nominal	McNemar's test	Cochran's Q
Ordinal or skewed	Wilcoxon signed rank test	Friedman test^*
Normally distributed	Paired t-test	Repeated measures ANOVA^*

Tests for Association between Variables

(Table 3 lists the tests used to determine the association between variables. Correlation determines the strength of the relationship between two variables; regression allows the prediction of one variable from another. Tyagi examined the correlation between ETCO₂ and PaCO₂ in patients with chronic obstructive pulmonary disease with acute exacerbation, who were mechanically ventilated.⁸ Since these were normally distributed variables, the linear correlation between ETCO₂ and PaCO₂ was determined by Pearson's correlation coefficient. Parajuli et al. compared the acute physiology and chronic health evaluation II (APACHE II) and acute physiology and chronic health evaluation IV (APACHE IV) scores to predict intensive care unit mortality, both of which were ordinal data. Correlation between APACHE II and APACHE IV score was tested using Spearman's coefficient.⁹ A study by Roshan et al. identified risk factors for the development of aspiration pneumonia following rapid sequence intubation.¹⁰ Since the outcome was categorical binary data (aspiration pneumonia— yes/no), they performed a bivariate analysis to derive unadjusted odds ratios, followed by a multivariable logistic regression analysis to calculate adjusted odds ratios for risk factors associated with aspiration pneumonia.

Table 3

Tests for assessing the association between variables

Type of data	Test
Correlation
Both variables normally distributed	Pearson's correlation coefficient
One or both variables ordinal or skewed	Spearman's or Kendall's correlation coefficient
Nominal data	Chi-square test; odds ratio or relative risk (for binary outcomes)
Regression
Continuous outcome	Linear regression analysis
Categorical outcome (binary)	Logistic regression analysis

Open in a separate window

Tests for Agreement between Measurements

(Table 4 outlines the tests used for assessing agreement between measurements. Gunalan evaluated concordance between the National Healthcare Safety Network surveillance criteria and CPIS for the diagnosis of ventilator-associated pneumonia.¹¹ Since both the scores are examples of ordinal data, Kappa statistics were calculated to assess the concordance between the two methods. In the previously quoted study by Tyagi, the agreement between ETCO₂ and PaCO₂ (both numerical variables) was represented using the Bland–Altman method.⁸

Table 4

Tests for assessing agreement between measurements

Type of data	Test
Categorical data	Cohen's kappa
Numerical data	Intraclass correlation coefficient (numerical) and Bland–Altman plot (graphical display)

Open in a separate window

Tests for Time-to-Event Data (Survival Analysis)

Time-to-event data represent a unique type of data where some participants have not experienced the outcome of interest at the time of analysis. Such participants are considered to be “censored” but are allowed to contribute to the analysis for the period of their follow-up. A detailed discussion on the analysis of time-to-event data is beyond the scope of this article. For analyzing time-to-event data, we use survival analysis (with the Kaplan–Meier method) and compare groups using the log-rank test. The risk of experiencing the event is expressed as a hazard ratio. Cox proportional hazards regression model is used to identify risk factors that are significantly associated with the event.

Hasanzadeh evaluated the impact of zinc supplementation on the development of ventilator-associated pneumonia (VAP) in adult mechanically ventilated trauma patients.¹² Survival analysis (Kaplan–Meier technique) was used to calculate the median time to development of VAP after ICU admission. The Cox proportional hazards regression model was used to calculate hazard ratios to identify factors significantly associated with the development of VAP.

Summary

The choice of statistical test used to analyze research data depends on the study hypothesis, the type of data, the number of measurements, and whether the data are paired or unpaired. Reviews of articles published in medical specialties such as family medicine, cytopathology, and pain have found several errors related to the use of descriptive and inferential statistics.^12–15 The statistical technique needs to be carefully chosen and specified in the protocol prior to commencement of the study, to ensure that the conclusions of the study are valid. This article has outlined the principles for selecting a statistical test, along with a list of tests used commonly. Researchers should seek help from statisticians while writing the research study protocol, to formulate the plan for statistical analysis.

Orcid

Priya Ranganathanhttps://orcid.org/0000-0003-1004-5264

Footnotes

Source of support: Nil

Conflict of interest: None

References

1. Ranganathan P, Gogtay NJ. An introduction to statistics - data types, distributions and summarizing data. Indian J Crit Care Med. 2019;23(Suppl 2):S169–S170. doi:10.5005/jp-journals-10071-23198. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

2. Nayak BK, Hazra A. How to choose the right statistical test? Indian J Ophthalmol. 2011;59(2):85–86. doi:10.4103/0301-4738.77005. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

3. Sridharan S, Gopalakrishnan R, Nambi PS, Kumar S, Nandini S, Ramasubramanian V. Clinical profile of non-neutropenic patients with invasive candidiasis: a retrospective study in a tertiary care center. Indian J Crit Care Med. 2021;25(3):267–272. doi:10.5005/jp-journals-10071-23748. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

4. Hopkins S, Dettori JR, Chapman JR. Parametric and nonparametric tests in spine research: why do they matter? Global Spine J. 2018;8(6):652–654. doi:10.1177/2192568218782679. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

5. Megahed MM, El-Menshawy AM, Ibrahim AM. Use of early bronchoscopy in mechanically ventilated patients with aspiration pneumonitis. Indian J Crit Care Med. 2021;25(2):146–152. doi:10.5005/jp-journals-10071-23718. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

6. Chowhan G, Kundu R, Maitra S, Arora MK, Batra RK, Subramaniam R, et al. Efficacy of left ventricular outflow tract and carotid artery velocity time integral as predictors of fluid responsiveness in patients with sepsis and septic shock. Indian J Crit Care Med. 2021;25(3):310–316. doi:10.5005/jp-journals-10071-23764. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

7. Verma CV, Arora RD, Mistry HM, Kubal SV, Kolwankar NS, Patil PC, et al. Changes in mode of oxygen delivery and physiological parameters with physiotherapy in covid-19 patients: a retrospective study. Indian J Crit Care Med. 2021;25(3):317–321. doi:10.5005/jp-journals-10071-23763. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

8. Tyagi D, Manjunath BG, Jakka S, Chandra S, Chaudhry D. Correlation of PaCO₂ and ETCO₂ in COPD patients with exacerbation on mechanical ventilation. Indian J Crit Care Med. 2021;25(3):305–309. doi:10.5005/jp-journals-10071-23762. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

9. Parajuli BD, Shrestha GS, Pradhan B, Amatya R. Comparison of acute physiology and chronic health evaluation II and acute physiology and chronic health evaluation IV to predict intensive care unit mortality. Indian J Crit Care Med. 2015;19(2):87–91. doi:10.4103/0972-5229.151016. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

10. Roshan R, Sudhakar GD, Vijay J, Mamta M, Amirtharaj J, Priya G, et al. Aspiration during rapid sequence induction: prevalence and risk factors. Indian J Crit Care Med. 2021;25(2):140–145. doi:10.5005/jp-journals-10071-23714. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

11. Gunalan A, Sistla S, Sastry AS, Venkateswaran R. Concordance between National Healthcare Safety Network (NHSN) Surveillance Criteria and Clinical Pulmonary Infection Score (CPIS) Criteria for Diagnosis of Ventilator-associated Pneumonia (VAP). Indian J Crit Care Med. 2021;25(3):296–298. doi:10.5005/jp-journals-10071-23753. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

12. Hasanzadeh Kiabi F, Alipour A, Darvishi-Khezri H, Aliasgharian A, Emami Zeydi A. Zinc supplementation in adult mechanically ventilated trauma patients is associated with decreased occurrence of ventilator-associated pneumonia: a secondary analysis of a prospective, observational study. Indian J Crit Care Med. 2017;21(1):34–39. doi:10.4103/0972-5229.198324. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

13. Yim KH, Nahm FS, Han KA, Park SY. Analysis of statistical methods and errors in the articles published in the Korean journal of pain. Korean J Pain. 2010;23(1):35–41. doi:10.3344/kjp.2010.23.1.35. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

14. Bahar B, Pambuccian SE, Barkan GA, Akdas Y. The use and misuse of statistical methods in cytopathology studies: review of 6 journals. Lab Med. 2019;50(1):8–15. doi:10.1093/labmed/lmy036. DOI: [PubMed] [CrossRef] [Google Scholar]

15. Nour-Eldein H. Statistical methods and errors in family medicine articles between 2010 and 2014-Suez Canal University, Egypt: a cross-sectional study. J Fam Med Prim Care. 2016;5(1):24–33. doi:10.4103/2249-4863.184619. DOI: [PMC free article] [PubMed] [CrossRef] [Google Scholar]

Articles from Indian Journal of Critical Care Medicine : Peer-reviewed, Official Publication of Indian Society of Critical Care Medicine are provided here courtesy of Indian Society of Critical Care Medicine

An Introduction to Statistics: Choosing the Correct Statistical Test (2024)

FAQs

How to choose the correct statistical test? ›

7 Essential Ways to Choose the Right Statistical Test

Research Question. ...
Formulation of Null Hypothesis. ...
Level of Significance in Study Protocol. ...
The Decision Between One-tailed and Two-tailed. ...
The Number of Variables to Be Analyzed. ...
Type of Data. ...
Paired and Unpaired Study Designs.

Nov 25, 2022

Find Out More ›

What is the correct test statistic? ›

The formula for the test statistic depends on the statistical test being used. Generally, the test statistic is calculated as the pattern in your data (i.e. the correlation between variables or difference between groups) divided by the variance in the data (i.e. the standard deviation).

Know More ›

When to use ANOVA vs t-test? ›

The Student's t test is used to compare the means between two groups, whereas ANOVA is used to compare the means among three or more groups. In ANOVA, first gets a common P value. A significant P value of the ANOVA test indicates for at least one pair, between which the mean difference was statistically significant.

Read On ›

What is the very first step in choosing the appropriate statistical test in a study? ›

Step 1: Consider Your Research Question

Think about your specific research question. For instance, if you want to investigate the relationship between two continuous variables, like blood pressure and heart rate in patients, you should consider using correlation analysis.

Read The Full Story ›

What is the most basic statistical test? ›

T-tests. A t-test, also called “Student's t-Test”, is typically used to determine if there is a significant difference between the means of some numeric variable between two groups.

Find Out More ›

Which statistical method is used to determine the reliability of a test? ›

Measuring Test-Retest Reliability

This is done by calculating the correlation coefficient. To do this, statistical analysis methods, like the Pearson correlation coefficient or Cronbach's alpha, can be used to find the correlation or relationship between the two sets of scores.

Get More Info Here ›

What is an example of a test statistic? ›

For example, the test statistic for a Z-test is the Z-statistic, which has the standard normal distribution under the null hypothesis. Suppose you perform a two-tailed Z-test with an α of 0.05, and obtain a Z-statistic (also called a Z-value) based on your data of 2.5. This Z-value corresponds to a p-value of 0.0124.

What are the basic components of a statistical test? ›

Components of a statistical test. Before observing the data, the null and alternative hypotheses should be stated, a significance level (α) should be chosen (often equal to 0.05), and the test statistic that will summarize the information in the sample should be chosen as well.

Tell Me More ›

What statistical test is used to determine correlation? ›

Pearson's correlation coefficient (r) is used to demonstrate whether two variables are correlated or related to each other. When using Pearson's correlation coefficient, the two vari- ables in question must be continuous, not categorical.

Get More Info Here ›

When would you use the t-test? ›

A t test is a statistical test that is used to compare the means of two groups. It is often used in hypothesis testing to determine whether a process or treatment actually has an effect on the population of interest, or whether two groups are different from one another.

What does ANOVA tell you? ›

ANOVA stands for Analysis of Variance. It is a statistical method used to analyze the differences between the means of two or more groups or treatments. It is often used to determine whether there are any statistically significant differences between the means of different groups.

Tell Me More ›

What does ANOVA stand for? ›

Analysis of Variance (ANOVA) is a statistical formula used to compare variances across the means (or average) of different groups.

Get More Info ›

What are the 5 basic methods of statistical analysis? ›

The five basic methods are mean, standard deviation, regression, hypothesis testing, and sample size determination. It is widely used by governments, businesses, banking entities, insurance companies, etc.

Find Out More ›

What statistical test compares two groups? ›

T-tests are used when comparing the means of precisely two groups (e.g., the average heights of men and women). ANOVA and MANOVA tests are used when comparing the means of more than two groups (e.g., the average heights of children, teenagers, and adults).

Keep Reading ›

What are the basics of statistics? ›

The basics of statistics include the measure of central tendency and the measure of dispersion. The central tendencies are mean, median and mode and dispersions comprise variance and standard deviation. Mean is the average of the observations. Median is the central value when observations are arranged in order.

Keep Reading ›

How do you decide if each question is statistical or Nonstatistical? ›

A statistical question is a question that can be answered by collecting data that vary. For example, “How old am I?” is not a statistical question, but “How old are the students in my school?” is a statistical question.

See Details ›

What statistical test to use to determine significant difference? ›

A t-test is an inferential statistic used to determine if there is a statistically significant difference between the means of two variables.

Discover More Details ›

When to use a chi-square test? ›

The chi-square test is a statistical tool used to check if two categorical variables are related or independent. It helps us understand if the observed data differs significantly from the expected data. By comparing the two datasets, we can draw conclusions about whether the variables have a meaningful association.

Find Out More ›

What is the best statistical test to compare two variables? ›

A chi-square test is used when you want to see if there is a relationship between two categorical variables.

See Details ›