Statistical combination of different types of chlorofyll-a measurements in the Dutch North Sea

2. Data exploration

Both the EO and in-situ datasets were acquired through OSPAR, whereby the in-situ data can also downloaded through the ICES data portal. The chl-a indicator is used to address eutrophication, and thus only reflects concentrations in the upper 10 meters of the water column. While this depth limitation is inherent to the EO data, any in-situ measurements taken at greater depths are excluded from the analysis. Additionally, the indicator relies on growing season means, with the growing season, as defined by OSPAR, extending from March to September. The focus of this analysis is on these seven months, although algal blooms may occur outside this window, especially as climate change influences water temperatures in the North Sea1) 2). The analysis covers the period from 1998 to 2020, as both in-situ and EO data are available for these years.

2.1 In-situ data

The number of in-situ measurements and chl-a concentrations in the Southern North Sea (SNS) assessment area are summarized in Table 1. The sample size fluctuated over the study period, with a peak in 2019, featuring approximately 131 samples during the growing season. The spatial distribution of these measurements is shown in Figure 1, which reveals that sampling sites are unevenly distributed, with a higher concentration of samples taken near the British, Belgian, and Dutch coasts. Only a few sites on the Dutch Continental Shelf were sampled throughout all months of the growing season, whereas most sites were sampled only during a single month (although not all in the same month, see distribution over months sampled per year in Table 1).

2019 was the most sampled year, and the development of quantile chl-a concentrations during that year’s growing season is depicted in Figure 2. Although no data is available for March, the results show elevated chl-a concentrations during the spring bloom, particularly near the Dutch coast. However, the sparse spatial distribution of samples makes it difficult to discern broader patterns across the entire SNS area.

Table 1. Number of in-situ measurements1) per year during the growing season in the SNS assessment area, including mean and sd of chlorophyll-a concentrations
yearchl-a (mean)chl-a (sd)nn monthsn days
19984,764,1843723
199918,9015,83128415
200010,8510,263937
20016,546,7078737
20025,043,8599741
20036,808,59108743
20045,985,68107735
20054,805,08103740
20064,234,00102747
20074,915,8280739
20084,606,6265736
20093,384,6394746
20102,933,2392735
20114,174,93103746
20122,552,38117750
20132,513,17106746
20143,393,33110743
20154,254,11107747
20164,977,63100744
20174,156,3790743
20183,874,30101750
20192,754,66131656
20202,482,03117751
1) N is based on unique samples per site and timestamp, replicates are not considered.

2.2 Earth Observation data

EO data is extremely well sampled throughout the SNS assessment area (Table 2), with millions of measurements that span the entire duration of the growing season. All months, and almost all days, are sampled during each year in contrast to the in-situ data (Table 1). For more background on the acquisition of EO data, see Van der Zande et al. (2019)3) and Lavigne et al. (2021)4).

Table 2. Number of EO measurements per year during the growing season in the SNS assessment area including mean and sd of chlorophyll-a concentrations
yearchl-a (mean)chl-a (sd)nn monthsn days
19983,193,0820723627188
19993,734,3732943127202
20002,432,1427335087190
20014,214,8733304157181
20022,791,8941210147196
20033,543,2760326447204
20042,982,4050139507203
20052,922,4846418517197
20062,772,2855339967195
20073,483,4551267527204
20083,713,5149285487206
20092,853,0655068657209
20103,433,4151092997200
20113,133,1150195367197
20122,602,5652597187201
20133,593,6957575917195
20143,043,4351874527203
20153,033,3756198667205
20162,902,9463274107207
20172,632,5458829817208
20182,722,5961988997205
20192,732,8859425787208
20203,002,5266920377210

Table 2 highlights the temporal coverage of the Earth Observation (EO) data, while Figure 3 presents the spatial distribution of this data for the year 2019. To appropriately visualize the data, chl-a concentrations were aggregated to a 5x5 km grid (the original resolution of the EO data being 1x1 km). These aggregated values were then categorized into quantiles. The results demonstrate that the EO data effectively captures both spatial and temporal patterns of chl-a concentrations throughout the growing season. Notably, higher concentrations are observed during early phytoplankton blooms from March to May, with levels being generally elevated near the coast.

In the later years of the monitoring period, the EO data provides excellent coverage of the SNS area. However, coverage was less comprehensive in the earlier years (see differences in n Table 2; and Figure 4), with some months showing gaps in 5x5 km grid data. Despite these early gaps, the EO data coverage of the SNS area remains nearly complete.

2.3 Conclusion data exploration

Large differences exist between the spatial and temporal distribution of the in-situ and EO datasets, with the two datasets not being directly comparable, as illustrated in Table 3. The number of in-situ samples is substantially smaller than that of the EO data, both in terms of temporal coverage and spatial distribution. In-situ data is not representative of the entire SNS assessment area, and in many years, there are gaps where not all months of the growing season were sampled.

Given these discrepancies, a strong recommendation is to treat in-situ samples as individual data points alongside the EO data. Aggregating both datasets before calculating growing season averages may provide a more accurate representation of the overall conditions. If and how the current weighting method may introduce biases is explored in the next chapter.

Table 3. Yearly sample sizes of EO and in-situ data with corresponding percentages
yearn EOn in-situn total% in-situ% EO
199820723624320724050,002199,9979
1999329431212832944400,003999,9961
200027335083927335470,001499,9986
200133304157833304930,002399,9977
200241210149941211130,002499,9976
2003603264410860327520,001899,9982
2004501395010750140570,002199,9979
2005464185110346419540,002299,9978
2006553399610255340980,001899,9982
200751267528051268320,001699,9984
200849285486549286130,001399,9987
200955068659455069590,001799,9983
201051092999251093910,001899,9982
2011501953610350196390,002199,9979
2012525971811752598350,002299,9978
2013575759110657576970,001899,9982
2014518745211051875620,002199,9979
2015561986610756199730,001999,9981
2016632741010063275100,001699,9984
201758829819058830710,001599,9985
2018619889910161990000,001699,9984
2019594257813159427090,002299,9978
2020669203711766921540,001799,9983

1) Holt, J., Hughes, S., Hopkins, J., Wakelin, S. L., Penny Holliday, N., Dye, S., González-Pola, C., Hjøllo, S. S., Mork, K. A., Nolan, G., Proctor, R., Read, J., Shammon, T., Sherwin, T., Smyth, T., Tattersall, G., Ward, B., & Wiltshire, K. H. (2012). Multi-decadal variability and trends in the temperature of the northwest European continental shelf: a model-data synthesis. Progress in Oceanography, 106, 96–117.
2) Tinker, J., Lowe, J., Pardaens, A., Holt, J., & Barciela, R. (2016). Uncertainty in climate projections for the 21st century northwest European shelf seas. Progress in Oceanography, 148, 56–73.
3) Van der Zande, D., Lavigne H., Blauw A., Prins T., Desmit X., Eleveld M., Gohin F., Pardo S., Tilstone G., Cardoso Dos Santos J., Coherence in Assessment Framework of Chlorophyll A and Nutrients as Part of the EU Project ‘Joint Monitoring Programme of the Eutrophication of the North Sea With Satellite Data’ (Ref: DG ENV/MSFD Second Cycle/2016) (2019). Activity 2 Report (106 pp)
4) Lavigne H., Van der Zande D., Ruddick K., Cardoso Dos Santos J., Gohin F., Brotas V., Kratzer S., Quality-control tests for OC4, OC5 and NIR-red satellite chlorophyll-a algorithms applied to coastal waters (2021). Remote Sensing of Environment 255: 112237.