Thresholds
QC test thresholds for salinity were based on historical Coastal Monitoring Program data. Preliminary quality control (QC) was applied to the historical data, when obvious outliers and observations impacted by suspected sensor drift (due to biofouling) were omitted.
Separate thresholds were calculated for salinity measured in Inverness County, and salinity measured elsewhere. The stations in Inverness county are in the Bras d’Or Lake, which is a brackish estuary. The salinity measured at these stations is expected to be substantially different from salinity measured at marine stations.
Gross Range Test
Sensor Thresholds
The sensor thresholds were determined based on the manual for the aquaMeasure SAL (Table 1).
Table 1: Salinity sensor thresholds for the Gross Range Test.
User Thresholds
The salinity observations are relatively normally distributed for both groups of data (Figure 1), and so the mean and standard deviation were used to determine \(user_{min}\) and \(user_{max}\). The statistics and threshold values are shown in Table 2.
Table 2: Gross Range Test statistics and user thresholds for salinity.
Climatological Test
The Climatological Test was not applied to the salinity data because there were insufficient observations to calculate robust seasonal thresholds. Both data groups were missing observations for at least one month (Figure 2). The Gross Range Test user thresholds were used as placeholders until there is sufficient seasonal data to calculate salinity thresholds for the Climatological test.
Spike Test
The distribution of the spike value is skewed right (Figure 5), and so several upper quartile values (90th, 95th, and 99.7th quartile) were evaluated to use as the \(spike_{low}\).
There were relatively few large single-value spikes in the salinity data, and so the 99.7th quartile was selected to avoid false positives. \(spike_{high}\) was defined as 3 * \(spike_{low}\) to identify especially egregious spike values.
Table 3: Spike thresholds for salinity.
Rolling Standard Deviation Test
The distribution of rolling standard deviation is skewed right (Figure 6), and so several upper quartile values were evaluated to use as the \(rolling\_sd\_max\).
The 90th, 95th, and 99.7th quartile values were each applied to the the raw data (no preliminary QC) and the results inspected. The 90th quartile was determined to be too stringent, as it generated false positives, while The 99.7th quartile was determined to be too lenient, as it resulted in false negatives. The 95th quartile was considered the most useful threshold for both data groups.
Table 4: Rolling standard deviation threshold for salinity.