Statistical significance
Help - Canadian station weather data and extreme winter events

Canada main  |  Help main  |  Return to previous page


Statistical significance is determined for the ratios of extreme weather events during extreme high index days and extreme low index days. The testing for statistical significance was done for all ratios, but the less complete the data record used to figure the ratio, the less meaningful the result of the significance testing. This is explained further below in an important note on interpreting ratios and their statistical significance in this analysis.


The test of significance aims to answer the question of whether any given ratio different from 1 is robust. Can it be said with 95% confidence that the ratio is different from 1 with more event days occurring during the index extreme as observed? Or more simply, is the ratio really the color (red or blue) that is shown?

The following procedure was used to determine whether the ratios are significant.

Figures showing event ratios for all stations versus the number of extreme weather days

Important consideration when judging ratios and statistical significance testing results
When evaluating the results of statistical significance testing (as well as the ratios themselves) calculated in cases where the data record is missing a great number of days, care must be taken not to draw unwarranted conclusions. A complete data record allows for equal numbers of high and low index days to possibly go into the extreme weather days ratios. However, a data record without data for a great number of days opens the likelihood of the number of high and low index days not being equal thus affecting the expected ratio results.

For example, consider a station with TMIN data from only 3 winters, and by chance those all 3 of those happened to be El Niño winters. That means there could be many high CTI days but no low CTI days, resulting in infinite TMIN CTI ratios like 15:0 or 18:0. Those large ratios may appear meaningful and significant, yet because they were based on data exclusively from high CTI index days it cannot be concluded that extreme TMIN (or anything) occur more frequently on high CTI index days.

A less extreme example would be a station with an approximately half-complete data record where most of the data is from either the earlier or later halves of the 1948-1997 period. Though over the whole 50 years the number of high and low index days are equal, that is not the case for a shorter segment of the period. For instance, there are more high NAM days later in the period than earlier, and half the low CTI days (La Niña) are from 1950-1956. So a SNOW record with many observations from the 1980s and 1990s and few from the 1950s and 1960s may result in ratios that indicate SNOW occurs much more frequently at that station on high NAM days, but that apparent relationship may only be the result of there being more high than low NAM days with data in the record.

On the station pages the popup window showing the weather variable observations by year along with the extreme index days by year are useful for helping determine for individual cases with much missing data whether ratios are strongly influenced by from when the weather data was available.