1 Introduction

This report provides contextual ecosystem information for scup (Stenotomus chrysops) on the Northeast U.S. Continental Shelf. Data extractions for spring and fall are confined to the scup stock area based on respective survey strata sets. The information is intended to span a range of potential factors affecting the productivity and distribution of scup, including: surface and bottom temperature and salinity, chlorophyll concentration, indices of habitat, diet composition, and abundance of key zooplankton prey of larval scup. These factors can be used to qualitatively inform the interpretation of population status and/or quantitatively to improve model responsiveness to ecosystem factors. The range and complexity of ecosystem data makes it unlikely to find the most relevant and comprehensive factor variables with a first evaluation; this process will require an iterative approach of evaluation and feedback. Additional indices can be included to address the needs of the working group.

Summary

Scup is a species which may be benefiting from warming waters in the NES LME. Estimates of biomass have generally been higher in the past 15 years or so than in the period prior. This corresponds with warming trends over the entire period covered in this report, and with the higher temperature regime found in the fall SST data. Salinity has also increased over time in the spring and fall both at the surface and bottom. Fall salinities show regime changes to a higher salinity values in the 2010s, indicating a possible avenue of research.

Interestingly, chlorophyll concentration has shown lower concentration regimes in the spring and fall. Thus, if lower chlorophyll concentration negatively affects scup, it may have a lower effect than those of higher temperatures and salinities. The spring chlorophyll concentration doesn’t show a decreasing linear trend as the fall does, and the regime change occurs in 2016 compared to 2013. This may be important because scup spawn in the spring.

Zooplankton abundances also map well to the increasing biomass of scup in the region. Centropages typicus is an important food source for many larval fish, and is typically among the most abundant copepods collected within the stock area defined for scup. After a low period of Centropages abundance in the mid 2000s, there appears to be an increase during the rest of the time series. The non-copepod zooplankton which are important to juvenile scup, as well as to the preferred prey items of adult scup, show also strongly increasing trends in abundance in the fall.

Scup is a seasonally migratory species, so an important feature of the scup stock areas is that adult scup are caught in the bottom trawl survey primarily offshore during the spring and inshore during the fall.

Stock area

Strata map for scup (*Stenotomus chrysops*) on the NE shelf

Figure 1.1: Strata map for scup (Stenotomus chrysops) on the NE shelf

A note on figures

Unless otherwise noted, time series in this document are represented by dark blue lines. Estimates of linear trends and regime shifts are included on plots when found to be significant (p < 0.05), and are shown by light blue and green lines respectively. Trend strengths and confidence intervals are also included in plot titles when trends are present. Trends were modeled using a GLS model selection approach (see Hardison et al. 2019 for details).

Last updated on July 25 2019.

2 Temperature

An optimal interpolation procedure was used to estimate NE Shelf surface and bottom temperatures for two seasonal time frames (see methods). The temperature estimates were standardized to April 3 and October 11 for spring and fall over the period 1968-2018. Surface and bottom temperature within the scup stock areas are shown below. The fall warming trend’s magnitude is greater than in the spring. As scup have been shown to be found in greatest abundance at temperatures from 13 C to 16 C (Fritz 1965), warming waters in the region may provide better habitat.

Bottom Temperature

Thermal conditions on the Northeast Shelf have changed in recent decades as reflected in changes in bottom water temperatures in the stock areas. In both spring and fall, bottom water temperatures show a linear increase over time, with the fall trend doubling the magnitude of the spring trend. In both the spring and the fall, there appears to be a regime change early in the time series from lower temperatures to higher temperatures in spite of strong interannual variability.

Surface Temperature

Surface temperature showed no significant trends in the spring, but in the fall there appear to be two rapid regime changes in 2011 and 2017, both strong increases in mean temperature after a relatively stable temperature regime from the start of the time series to 2010.

3 Salinity

An optimal interpolation procedure was used to estimate NE Shelf surface and bottom salinity for two seasonal time frames (see methods). Though collected with temperature data, reliable instrumentation limits this time series to 1992-2018. The salinity estimates were standardized to April 3 and October 11 for spring and fall. Surface and bottom salinities within the scup stock areas are shown below.

Bottom Salinity

Bottom salinity shows a significant increasing trend in the spring, with an increased salinity regime evident starting in 2000. While there is no significant trend in the fall, there does appear to be evidence for an increased salinity regime starting in 2012 and a return to a lower salinity regime in the last year of the time series.

Surface Salinity

Surface salinity shows an increasing linear trend of about the same magnitude in both the spring and the fall. Additionally there is evidence for an increased regime in both, starting in 2000 in the spring, and 2014 in the fall.

4 Chlorophyll

Chlorophyll concentration in the spring showed no significant linear trend but there appears to be evidence for a regime change to a lower chlorophyll concentration in 2016. In the fall, there is a significant decreasing linear trend with a lowered concentration regime beginning in 2013.

Chlorophyll concentration in the stock area

The concentration of chlorophyll was measured with a suite of satellite sensors and merged into a single dataset (see methods). Chlorophyll concentrations in the spring and fall scup stock areas are shown below.

5 Zooplankton

Zooplankton are important to larval scup, and while in situ stomach data is rare for larval fish, laboratory experiments demonstrate that they eat small zooplankton (Griswold and McKenney 1984). Since spawning occurs during the spring, the spring zooplankton indices may be more relevant than the fall indices with regard to larval food availability. Both juvenile and adult scup rely heavily on epibenthic and infaunal prey items, many of which feed directly on zooplankton, thus understanding plankton dynamics may yield insight on food availability and trophic transfer to scup. With the exception of Pseudocalanus spp., the recent trends for the dominant copepod abundance indices in spring show increasing levels. Centropages typicus, an important food item for larval fish, is the most abundant taxa in this region in spring, and shows an increasing trend in abundance following a period of low abundance in the mid 2000’s. In the fall, many of the copepod indices show a pattern of decline in abundance, however the non-copepod zooplankton all show strong increasing trends and are more likely to be consumed either directly by juvenile scup, or consumed by the prey items of both juvenile and adult scup, such as annelid worms, gastropods, and amphipods.

Copepod abundance

Non-copepod zooplankton abundance

6 Habitat and abundance

Occurrence probability

The probability of occurrence was estimated using random forest classification models (see methods). The mean annual probabilities were extracted for the spring and fall stock definition areas. These data provide an estimate of the potential use of the habitat associated with the stock definition. Fall occupancy has fluctuated between 0.3 to 0.4 over the time series, with a relatively long lower period between 1998 - 2008 followed by an increasing occupancy through 2018. Spring occupancy remained below 0.15 through 2001 and then has fluctuated more year to year than the early part of the time series with the two highest occupancy years occurring in 2012 and 2016.

Habitat area

Ecosystem area

Stock

In the spring, there is a significant linear trend of increasing area likely to be occupied by scup but no evidence for regime changes. Since 2000 the interannual variability from year to year appears to be greater in magnitude than prior to 2000. Interestingly, the fall analysis did not show a significant linear trend but did show multiple regime changes. A low area regime from 1991-1998 was bracketed by higher area regimes from 1981 - 1990 and from 1999 - 2007. In 2008, an even greater area regime change began which continues through 2018 and includes the 8 highest values in the series.

Minimum Population Size

Estimates of the biomass for the minimum population size of scup was determined by re-stratifying the habitat each year depending on the distribution of occupancy habitat (see methods). The procedure provides eight population size estimates for each stock, by season, based on the use of three estimation options. The estimates are denoted by a three-letter code (Y/N|Y/N|Y/N) where the first position indicates whether data were interpolated over habitat strata. The second position indicates whether the trawl data applied to the estimate was limited to the stations made in the stock area only or if trawls from other areas were used as well. The third position indicates whether the trawl data were subjected to a test for outliers, in which outliers were eliminated. Of the eight treatment options, the biomass estimate was based on the NYN option.

Before 2002, scup biomass was estimated to be the highest in 1978 with an apparently low period in the 1990’s. The period from 2000 to 2010 was characterized by almost annual shifts in biomass from high to low to high until 2007 when there was a three year increase. After 2010, biomass didn’t fall below 6.5 and four years were above 7 whereas only three years prior to 2010 were above 7.

A somewhat similar pattern appears in the fall for biomass as seen in the spring in the mid 1990’s where biomass was relatively low after high biomass from 1989-1991 which were 3 of the only 4 years with an estimated biomass above 7.25. After 1998, biomass tended to be higher, with 12 out of the 19 years having a biomass above 7.25.

7 Methods

Surface and Bottom Temperature and Salinity

7.0.1 Study System

The U.S. Northeast Shelf ecosystem, which roughly aligns with the boundaries of the Northeast U.S. Continental Shelf Large Marine Ecosystem (LME), is the study area for the surface and bottom thermal environments. Surface and bottom temperatures were estimated over a 0.1° latitude/longitude grid, termed the estimation grid, which circumscribes the range of ecosystem assessment areas in the region. The difference between the extents of the estimation grid from the extent of the LME relate to the resource management programs that are the sources of the data, which are focused on fishery management needs in the region.

7.0.2 Data Source

Temperature and salinity were collected on the Northeast Shelf as part of ongoing resource and ecosystem surveys conducted by the Northeast Fisheries Science Center. Water column temperatures have been collected contemporaneously to trawl tows associated with a bottom trawl survey beginning in the fall of 1963 and five years later during spring (Despres-Patanjo, Azarovitz, and Byrne 1988). In addition, the ecosystem has been surveyed by multiple sampling programs with varying sampling designs. The two most comprehensive monitoring programs over the study period were the MARMAP (1977-1987) and the Ecosystem Monitoring Program or EcoMon (1992-present) programs, both serving as shelf-wide surveys of the ecosystem (Sherman et al. 1998; J. Kane 2007). Temperature measurements were made with a mix of Conductivity Temperature and Depth (CTD) instruments, analog XBT, digital XBT, mechanical BT, glass thermometers (bottle temps) and trawl mounted temperature loggers instruments collecting either water column profiles or temperatures measured at targeted depths. Salinity measurements used in this analysis was limited to 1992-2017 when CTD instrument were used. Surface and bottom temperatures were identified from these measurements. Temperatures representing the spring period include data collected during the months of February to June; however, 99% of the samples were collected during the months of March to May. Likewise, the fall period samples include data collected during September to December, with 99% of the samples collected during September to November. The total number of surface temperature measurements were 14,540 and 14,666 for spring and fall, respectively; and, 14,450 and 14,656 for spring and fall bottom temperature, respectively. On average, there were 290 temperature measurements by season, depth, and year. The number of salinity observation per year were similar.

7.0.3 Interpolation Procedure

Seasonal surface and bottom temperature fields were estimated using an optimal interpolation approach. The optimal interpolation combined a climatological depiction of temperature and an annual estimates based on the data for a particular depth and season. (There were a number of precursor steps that are described below.). The surveys used to collect the data were conducted during the same period each year, however, there was variation in survey timing. To account for this, temperatures were standardized to a collection date of April 3 for spring surveys and October 11 for fall using linear regression for each depth and season over the sample grid (see Appendix, Figure A1). For each depth and season, annual shelf-wide mean temperatures were calculated using the data from sample grid locations with at least 80% of the time series present. The annual observations were then transformed to anomalies by subtracting the appropriate annual mean. All anomalies for a season and depth were combined over years into a single anomaly field or climatology.

The annual estimate of temperature for a depth and season was imputed by used universal kriging to estimate the temperature over the estimation grid with depth as a covariate. The kriging yielded temperature estimates and a variance estimate over the same grid. The optimal interpolation field was assembled by combining the annual estimate and information from the anomaly climatology. The climatology was re-leveled from anomaly values to temperatures by adding back the appropriate annual mean. For each location in the estimation grid, temperature was calculated as a weighted mean between the kriged annual estimate and the releveled climatology. The weightings in the calculations were partitioned based on the variance field of the annual kriging. The field was divided into quartiles from low to high variance with the weighting ratio of annual:climatology temperatures of 4:1, 3:1, 2:1, and 1:1, respectively. Hence, in areas of low variance the weighted mean was based on a weighting of 4:1, which would reflect a higher contribution of information from the annual estimate and thus be closer to an observation. In areas with high variance, the weighting ratio of 1:1 would reflect a greater effect of the climatology in determining the interpolation estimate.

The optimal interpolation temperature fields were evaluated using cross validation and a comparison to external data. The performance of optimal interpolation was compared to the predictive skill of using either the climatology or annual interpolation alone by doing ten random cross validations of each treatment. Each random draw of training and test sets sampled 3% of the data for the test set, or about 500 observations per draw. The temperature fields were fit with the training data and compared with the test set data. The lowest error rates were realized with the optimal interpolation contrasted with highest predictive error associated with fields based on the climatology alone (see Appendix, Figure A2). The spatial distribution of error had distinct depth and seasonal patterns. The spatial errors associated with the surface estimates were generally low with the exception of a few locations along the shelf break between latitudes 39-41°N; the spatial error in the bottom temperature varied by season, and was concentrated along the shelf break in spring and across the shelf between latitudes 36-40°N in fall (see Appendix, Figure A3). The optimal interpolation data were also compared to external data from other collection programs. The absolute errors between surface temperature data collected by satellites and the interpolation had interquartile ranges of approximately ±0.75°C (see Appendix, Figure A4). The absolute error in a comparison of interpolated bottom temperature to opportunist sampling was approximately ±0.75°C for spring bottom temperature and approximately 0.75-1.0°C in the fall. Salinity was estimated in the same way with the exception of collection correction, which was deemed unnecessary for the salinity data.

Map of the study system with half-degree sample grid (blue lines) and extent of estimation grid shown in beige points (a). Major features of the study system with shelf break marked in purple line (b). 100m depth shown as dashed line.

Figure 7.1: Map of the study system with half-degree sample grid (blue lines) and extent of estimation grid shown in beige points (a). Major features of the study system with shelf break marked in purple line (b). 100m depth shown as dashed line.

Chlorophyll data

Chlorophyll a concentration ([Chl]) data extracted from satellite remote-sensing databases based on measurements made with the Sea-viewing Wide Field of View Sensor (SeaWiFS), Moderate Resolution Imaging Spectroradiometer on the Aqua satellite (MODIS), Medium Resolution Imaging Spectrometer (MERIS), and Visible and Infrared Imaging/Radiometer Suite (VIIRS) sensors. We used the Garver, Siegel, Maritorena Model (GSM) merged data product at 100 km (equivalent to a 1° grid) and 8-day spatial and temporal resolutions, respectively, obtained from the Hermes GlobColour website. These four sensors provide an overlapping time series of [Chl] during the period 1997 to 2018 and were combined based on a bio-optical model inversion algorithm (Maritorena et al. 2010).

Zooplankton abundance

Zooplankton abundance is measured by the Ecosystem Monitoring Program (EcoMon), which conducts shelf-wide bimonthly surveys of the NES ecosystem (J. Kane 2007). Zooplankton and ichthyoplankton are collected throughout the water column to a maximum depth of 200 m using paired 61-cm Bongo samplers equipped with 333-micron mesh nets. Sample location in this survey is based on a randomized strata design, with strata defined by bathymetry and along-shelf location. Plankton taxa are sorted and identified. We used the bio-volume of the 18 most abundant taxonomic categories as potential predictor variables (Table 2). The zooplankton sample time series has some missing values, which necessitated the removal of spring data for the years 1989, 1990, 1991, and 1994 and fall data for the years 1989, 1990, and 1992. The data for each seasonal time frame was interpolated to a complete field over the estimation grid using ordinary kriging.

7.0.4

Variable name Taxa
acarspp Acartia spp.
calfin Calanus finmarchicus
chaeto Chaetognatha
cham Centropages hamatus
cirr Cirripedia
ctyp Centropages typicus
echino Echinodermata
evadnespp Evadne spp.
gas Gastropoda
hyper Hyperiidea
larvaceans Appendicularians
mlucens Metridia lucens
oithspp Oithona spp.
para Paracalanus parvus
penilia Penilia spp.
pseudo Pseudocalanus spp.
salps Salpa
tlong Temora longicornis

Occupancy models

Occupancy and productivity habitats for Atlantic menhaden were estimated with random forest classification and regression models using a suite of static and dynamic predictor variables. Variation in species presence or absence and biomass across space, bathymetry factors, productivity factors, and climate factors were tested. Models were constructed separately for spring and fall seasons. The response variables were the occurrence and catch-per-unit-effort of Atlantic menhaden in the Northeast Fisheries Science Center bottom trawl survey, which is a fishery-independent survey on the Northeast US Shelf. The survey is conducted in the spring and fall of the year and is based on a stratified random design, which provides both spatial and temporal depictions of fish and macroinvertebrate abundances (Grosslein 1969). The independent or predictor variable set included physical environment variables, habitat descriptors, zooplankton variables, and remote sensing variables; the variables will be described in more detail below. Occupancy models were fit as two-factor classification models (absence as 0; presence as 1) using the randomForest R package (Liaw and Wiener 2002).

Prior to fitting the model, the independent variable set was first tested for multi-collinearity among the predictors and correlated variables were eliminated (R package Evans and Murphy (2018), version 2.1-3). From this reduced set of predictors, the final model variables were selected utilizing the model selection criteria of (Murphy, Evans, and Storfer 2010) as implemented in rfUtilities. Productivity models were fit as regression models with log10 transformed biomass-per-unit-effort as the response variable and the same starting set of predictor variables as in the occupancy models. As with the occupancy models, independent variables were tested for multi-collinearity and the model selection criteria was applied. Habitat was estimated from the model fits over a standard 0.1° grid, which circumscribes the range of ecosystem assessment areas in the region.

Three types of visualizations were created from the output of the tree models. The first visualization was used to see the average probability of occupancy over space and the rate of change (Sen’s slope) in occupancy over the years. The second visualization was used to see the mean occupancy gradient magnitude, or frontal strength and the rate of change (Sen’s slope) in occupancy gradient magnitude over the years. Gradient magnitude was calculated by calculating the median of the occupancy probabilities with a moving window and then summing those medians with a moving window with a matrix of weights. The third visualization was used to see the average biomass over space and the rate of change (Sen’s slope) in biomass over the years. Trends in total occupancy habitat area, with occupancy probabilities of 25, 50, and 75% over time were plotted as well by calculating the sum of the area with occupancy probabilities at each percentage during each year.

Following from the 2018 ECSA for summer flounder, the methodology was changed by the conversion of salinity, surface and bottom, from a dynamic variable to a static variable. By so doing, the fitting period is extended from 1992-2017 to 1976-2018. Dynamic salinity data was limited by the time series of modern instrumentation, which began in 1992. In the 2018 model fits, dynamic salinity was not an important variable in any species model; hence, conversion to a static variable was of little consequence.

7.0.5 Minimum swept area abundance based on habitat informed dynamic re-stratification

Estimates of the numbers and biomass minimum population size of stocks was determined by re-stratifying the habitat each year depending on the distribution of occupancy habitat. The habitat area was restricted to the portion of the shelf associated with the stock based on the roster of bottom trawl strata consider to comprise the stock area. To raise an annual estimate, the habitat was partitioned into ten intervals based on the probability of occurrence from the species distribution model. The partitions were based on equal intervals of occurrence probability; hence, the size of each habitat strata could vary. The trawl catch per unit effort values were assigned to the appropriate habitat strata based on the habitat score where the trawl was made. Once the requisite trawl hauls associated with a habitat strata were identified, a mean catch per unit effort was determined and raised to a total minimum population estimate for that stratum assuming a constant trawl path area of 0.024 km 2. The total population was the sum of the estimates for the ten strata.

\(Total\;numbers\;or\;biomass =\sum_{i=1}^{10} (strata\;area/trawl\;path\;area) * mean\;CPUE\)

These estimates were conditioned on three options intended to ameliorate some of the issues associated with the variability in survey catch rates. The first allowed for the application of catch per unit effort data from either the stock area only or including tows from outside the stock area if they were collected in areas with habitat scores matching the probability from the intervals in the stock area. It was felt this may enhance the estimate by increasing the sample size for key strata. The second option provided the choice of performing a test for outliers on the trawl catch per unit effort data. The boxplot command in R was used to identify the outliers for removal. The last option involved whether to apply a smoothing across the mean catch per unit effort estimates in each strata. If the option was invoked, a loess smoother (span = 0.75) was applied to generate the smoothed catch rates; the procedure also had the benefit of interpolating and extrapolating a rate to a habitat strata that may not have had catch samples associated with it. Therefore, for each species/stock, an estimate of seasonal total abundance or biomass minimum population size will have eight realizations that test the sensitively of the calculations to these three formulation choices.

7.0.6 Predictor Variables

Static variables were kept constant over years where dynamic variables varied annually. Hence, the length of the time series of model fits is constrained by the shortest dynamic variable time series to meet the requirement of complete cases in the Random Forest fitting. The fitting time series was constrained to 1992 – 2016, which was determined by the length of the station salinity data.

7.0.7 Physical environment

Station data included observations made contemporaneously to survey bottom trawl stations. Depth of the station was used as a static variable in the analysis. The observed depth was used in model fitting where model predictions were based on depths from the ETOPO1 dataset, which provided Northeast Shelf bathymetry at a resolution of 0.0167° (below).

7.0.8

Estimation grid for predictor variable and habitat estimates, grid location spaced by 0.1° longitude and latitude.

Figure 7.2: Estimation grid for predictor variable and habitat estimates, grid location spaced by 0.1° longitude and latitude.

Surface and bottom water temperature and salinity were used as dynamic variables in the analysis. Temperature and salinity on the NE Shelf was collected using Conductivity/Temperature/Depth (CTD) instruments with the most complete sample coverage associated with spring (February –April) and fall (September-November) time frames. Surface and bottom temperatures were used to develop date of collection corrections using linear regression for each time frame. Temperatures were standardized to a collection date of April 3 for spring collections and October 11 for fall. A date of collection correction was not indicated for salinity data. The observed date-corrected temperature (°C) and uncorrected salinity data (PSU) was used in model fitting.

Model predictions were based on temperature and salinity fields using an optimal interpolation approach where annual data were combined with a climatology by season. For a half degree grid of the ecosystem, mean bottom temperature or salinity was calculated by year and season. For grid locations that had data for at least 80% of the time series, the data from those locals were used to calculate a seasonal mean. The annual seasonal means were used to calculate anomalies, which were combined over the time series to provide seasonal, surface and bottom anomaly climatologies.

Returning to the raw data, the observations for a year, season, and depth were then used to estimate an annual field using universal kriging with depth as a covariate. The kriging was done on a standard 0.1° grid using the automap (Hiemstra et al. 2008, version 1.0–14). The annual field was then combined with the climatology anomaly field, adjusted by the annual mean, using the variance field from the kriging as the basis for a weighted mean between the two. The variance field was divided into quartiles with the first quartile (lowest kriging variance) carrying a weighting of 4:1 between the annual and climatology values. Hence, the optimal interpolated field at these locations were skewed towards the annual data reflecting their proximity to actual data locations and low kriging variance associated with them. The weighting ratio shifted to 1:1 in the highest variance quartile reflecting less information from the annual field and more from the climatology.

7.0.9 Habitat Descriptors

Habitat descriptors are a series of static variables that reflect the shape and complexity of benthic habitats. Since the response variables for these models are derived from bottom trawl gear, naturally the range of candidate taxa for modelling is skewed to benthic organisms, making these descriptors particularly relevant. Most of the variables are based on depth measurement, including the complexity, BBI, VRM, Prcurv, rugostity, seabedforms, slp, and slpslp variables (Table 1). The soft-sed variable is based on benthic sediment grain size and the vorticity variable is based on current estimates.

7.0.10 Zooplankton Data

Zooplankton abundance is measured by the Ecosystem Monitoring Program (EcoMon), which conducts shelf-wide bimonthly surveys of the NES ecosystem (J. Kane 2007). Zooplankton and ichthyoplankton are collected throughout the water column to a maximum depth of 200 m using paired 61-cm Bongo samplers equipped with 333-micron mesh nets. Sample location in this survey is based on a randomized strata design, with strata defined by bathymetry and along-shelf location. Plankton taxa are sorted and identified. We used the bio-volume of the 18 most abundant taxonomic categories as potential predictor variables (Table 2). The zooplankton sample time series has some missing values which were ameliorated by summing data over five-year time steps for each seasonal time frame and interpolating a complete field using ordinary kriging. Thus, for example, the data for spring 2000 would include the available data from 1998-2002 tows.

7.0.11 Remote Sensing Data

Chlorophyll concentration and SST from remote sensing sources were applied in the habitat models as static variables. Chlorophyll and SST were summarized as monthly means with their associated gradient magnitude or frontal fields. The basis for the chlorophyll concentration was measurements made with the Sea-viewing Wide Field of View Sensor (SeaWiFS), Moderate Resolution Imaging Spectroradiometer on the Aqua satellite (MODIS), Medium Resolution Imaging Spectrometer (MERIS), and Visible and Infrared Imaging/Radiometer Suite (VIIRS) sensors during the period 1997-2016. The data is a merged product using the Garver, Siegel, Maritorena Model (GSM) algorithm obtained from the Hermes GlobColour website.

These four sensors provide an overlapping time series of chlorophyll concentration during the period and were combined based on a bio-optical model inversion algorithm (Maritorena et al. 2010). Monthly SST fields were based on data from the MODIS Terra sensor data available from the Ocean Color Website. From these data, mean monthly fields were generated for both chlorophyll and SST. There are a range of methods used to identify fronts (Belkin and O’Reilly 2009) in oceanographic data that usually apply some focal filter to reduce noise and identify gradient magnitude with a Sobel filter. These calculations were done in R using the raster package (Hijmans 2017, version 2.6–7) using a 3 by 3 mean focal filter and a Sobel filter to generate x and y derivatives, which are then used to calculate gradient magnitude.

7.0.12 Model Selection Criteria and Variable Importance

The habitat models were evaluated for fit based on out-of-bag classification accuracy. For occupancy models accuracy, AUC (Area Under the ROC Curve), and Cohen’s Kappa were calculated using the irr R package (Gamer et al. 2012, version 0.84). For regression models, the variance explained by the model, mean absolute error, the root mean square error, and bias were calculated using the Metrics R package (Hamner and Frasco 2018, version 0.1.3). To evaluate variable importance in both occupancy and regression models, we plotted the number of times a variable was the root variable versus the mean minimum node depth for the variable, highlighting the top ten important variables using the randomForestExplainer R package (Paluszynska and Biecek 2017, version 0.9). For occupancy models we also plotted the Gini index decrease versus accuracy decrease, whereas for the regression models we plotted node purity increase versus MSE increase, also highlighting the top ten most important variables.

7.0.13

Variables Notes References
Complexity - Terrain Ruggedness Index The difference in elevation values from a center cell and the eight cells immediately surrounding it. Each of the difference values are squared to make them all positive and averaged. The index is the square root of this average. Riley, DeGloria, and Elliot (1999)
Namera bpi BPI is a second order derivative of the surface depth using the TNC Northwest Atlantic Marine Ecoregional Assessment (“NAMERA”) data with an inner radius=5 and outer radius=50. Lundblad et al. (2006)
Namera_vrm Vector Ruggedness Measure (VRM) measures terrain ruggedness as the variation in three-dimensional orientation of grid cells within a neighborhood based the TNC Northwest Atlantic Marine Ecoregional Assessment (“NAMERA”) data. Hobson (1972); Sappington, Longshore, and Thompson (2007)
Prcurv (2 km, 10 km, and 20 km) Benthic profile curvature at 2km, 10km and 20 km spatial scales was derived from depth data. A. Winship et al. (2018)
Rugosity A measure of small-scale variations of amplitude in the height of a surface, the ratio of the real to the geometric surface area. Friedman et al. (2012)
seabedforms Seabed topography as measured by a combination of seabed position and slope. www.northeastoceandata.org
Slp (2 km, 10 km, and 20 km) Benthic slope at 2km, 10km and 20km spatial scales. A. Winship et al. (2018)
Slpslp (2 km, 10 km, and 20 km) Benthic slope of slope at 2km, 10km and 20km spatial scales A. Winship et al. (2018)
soft_sed Soft-sediments is based on grain size distribution from the USGS usSeabed: Atlantic coast offshore surficial sediment data. www.northeastoceandata.org
Vort (fall - fa; spring - sp; summer - su; winter - wi) Benthic current vorticity at a 1/6 degree (approx. 19 km) spatial scale. Kinlan et al. (2016)

Diet composition

NEFSC bottom-trawl sampling occurs twice annually in the spring (March-May) and fall (September-November). The survey area encompasses about 293,000 square km of continental shelf from Cape Hatteras, NC, to Nova Scotia, Canada in depths from 8-400 m. Food habits sampling has been conducted since 1973.

Stomachs are collected at sea by NEFSC, and have been primarily analyzed at sea since 1981. Total stomach volume is estimated, each prey item is identified and sorted to the lowest possible taxonomic level, and the proportion of each prey item is estimated. Detailed methods are described in Link and Almeida (2000). Prey composition percent by weight was calculated using a weighted mean (\(\overline{W_{ijs}}\)) (Link and Almeida 2000; Smith and Link 2010) to estimate mean weight of prey \(i\) in predator \(j\) for statistical group \(s\). Note: Prey volumes are used as proxies for prey weight. It may be calculated as

\[ \overline{W_{ijs}} = \frac{\sum_{t=1}^{N_{ts}}N_{jts}\overline{w_{ijts}}}{N_{ts}} \]

where \(t\) represents an individual bottom trawl tow, \(N_{jts}\) is the number of predator \(j\) stomachs in tow \(t\) for statistical group \(s\), \(N_{ts}\) is the number of tows in statistical group \(s\), and

\[ \overline{w_{ijts}} = \frac{\sum_{k=1}^{N_{jts}}w_{ijtsk}}{N_{jts}} \]

Appendix

7.0.14 Date of collection correction

The dates of survey data collection varied by year. To date correct temperature measurements, regressions were estimated between temperature and day of the year over the sample grid (0.5° grid) by season and depth. Spring data were transformed to a temperature representing April 3 and fall to October 11 based on the slope estimates shown in the maps in Figure A1.

7.0.15

Spring (a) and fall (b) linear slope coefficients used to date correct surface temperature to standard spring and fall dates; same for spring (c) and fall (d) bottom temperature correction.

Figure 7.3: Spring (a) and fall (b) linear slope coefficients used to date correct surface temperature to standard spring and fall dates; same for spring (c) and fall (d) bottom temperature correction.


7.0.16 Cross validation performance of temperature interpolation

A series of random draws of training and test sets were taken to evaluate the predictive skill of the estimation procedure. A set of climatology, annual interpolation, or optimal interpolation fields were estimated using the training set and compared to the held-out test set.

7.0.16.1

Mean and 95\% confidence intervals of squared errors for spring (a) and fall (b) surface temperature estimates based on climatology, annual interpolation, and optimal interpolation (OI); same data for spring (c) and fall (d) bottom temperature estimates.

Figure 7.4: Mean and 95% confidence intervals of squared errors for spring (a) and fall (b) surface temperature estimates based on climatology, annual interpolation, and optimal interpolation (OI); same data for spring (c) and fall (d) bottom temperature estimates.

The distribution of errors from the optimal interpolation test sets were evaluated spatially to determine where the larger errors occurred and what ecosystem features they are associated with.

7.0.17

Mean absolute error by 0.1 degree latitude and longitude intervals for spring (a) and fall (b) surface temperature; same data for spring (c) and fall (d) bottom temperature. Red indicates a positive error where blue indicates negative.

Figure 7.5: Mean absolute error by 0.1 degree latitude and longitude intervals for spring (a) and fall (b) surface temperature; same data for spring (c) and fall (d) bottom temperature. Red indicates a positive error where blue indicates negative.


7.0.18 Comparisons to external data sources

Sea surface temperature estimated in this study were compared to the OISST dataset serving as a source of data for external comparison. The OISST data for April 3 (spring) and October 11 (fall) over the period 1982-2017 were extracted on the same 0.5° grid used in the study. Time series of matching spring and fall temperatures were differenced (external minus interpolation data) and presented in the figure below (sample size for surface spring and fall data were 5220 and 5365, respectively). The interquartile range for the surface comparisons were generally symmetric around zero with differences between 0.5-1°C. Bottom temperature estimated in this study were compared to the data from a cooperative data collection program for external comparison. The “Environmental Monitors on Lobster Traps” (EMOLT) is a cooperative research program that collects bottom temperatures. The data for spring and fall were extracted from rasters of the study data to match the extremal data. The external data covered the period 1968-2017, with 360 data points for the spring and 728 for the fall. Spring bottom comparisons yielded the smallest interquartile range of differences less than 0.5°C, where fall comparisons were slightly larger and biased to lower interpolation estimates compared to the external data.

7.0.19

Box plots (box: interquartile range, whisker: 5-95\% range, square symbol: mean, line: median) of the difference between external and interpolation data.

Figure 7.6: Box plots (box: interquartile range, whisker: 5-95% range, square symbol: mean, line: median) of the difference between external and interpolation data.

8 Document automation

To facilitate speedy document creation many steps have been automated. Species specific information are passed as arguments to a default rmarkdown template (the backbone of this document). This template is then rendered into a book using the package bookdown. Prior to rendering the data are pre-processed and output as .RData files for use in the automation steps. The specific steps are as follows:

Manunal preprocessing

  1. model outputs are created in the form of annual raster files (.RData) for chlorophyll, zooplankton, temperature, salinity, ocurrance probability (species specific), and biomass (species specific).
  2. These sets of annual raster files are collapsed into a single stack (.RData) using the script collapse_to_stack.R. The resulting stacked raster files are exported with this package.

Automation

  1. create_template.R is run to create the species specific .Rmd document with the following arguments (for additional arguments, refer to ?create_template). For {{COMMON_NAME}}:
create_template(stock_name = {{STOCK_NAME}}, render_book=TRUE, output_dir = here::here("docs"))
  1. The species specific Rmd file is then rendered. During this step:
  • The stacked raster files are clipped to the appropriate seasonal stock area (See Figure 1.1 ) using stars_to_series.R.

  • A linear trend is fitted (using the package ecotrend) to the environmental variables (temperature and salinity), chlorophyll, and species biomass. The resulting fit is ONLY plotted if found to be significant at the 5% level.

  • A regime mean is also fit (using STARS_v18.R) and displayed on all plots.

  • Zooplankton:

  • Occupancy probability:

  1. This document (the one you are viewing) will then automatically open in a web browser and all associated files can be found in the docs\{{STOCK_NAME}} folder in your current working directory.
  2. These steps are repeated for additional species.

References

Belkin, Igor M, and John E O’Reilly. 2009. “An Algorithm for Oceanic Front Detection in Chlorophyll and Sst Satellite Imagery.” Journal of Marine Systems 78 (3). Elsevier: 319–26.

Despres-Patanjo, Linda I, Thomas R Azarovitz, and Charles J Byrne. 1988. “Twenty-Five Years of Fish Surveys in the Northwest Atlantic: The Nmfs Northeast Fisheries Center’s Bottom Trawl Survey Program.” Marine Fisheries Review 50 (4). NMFS, Scientific Publications Office: 69–71.

Evans, Jeffrey S., and Melanie A. Murphy. 2018. RfUtilities. https://cran.r-project.org/package=rfUtilities.

Friedman, Ariell, Oscar Pizarro, Stefan B. Williams, and Matthew Johnson-Roberson. 2012. “Multi-Scale Measures of Rugosity, Slope and Aspect from Benthic Stereo Image Reconstructions.” PLoS ONE 7 (12). doi:10.1371/journal.pone.0050440.

Fritz, R.L. 1965. “Autumn Distribution of Groundfish Species in the Gulf of Maine and Adjacent Waters, 1955-1961.” Serial Atlas of the Marine Environment 10 (1).

Gamer, Matthias, Jim Lemon, Ian Fellows, and Puspendra Singh. 2012. Irr: Various Coefficients of Interrater Reliability and Agreement. https://CRAN.R-project.org/package=irr.

Griswold, C.A., and T.A. McKenney. 1984. “Larval Development of the Scup, Stenotomus Chrysops (Pisces: Sparidae).” Fishery Bulletin (U.S.) 82 (1).

Grosslein, MD. 1969. Groundfish Survey Methods. Bureau of Commercial Fisheries, Biological Laboratory. https://nefsc.noaa.gov/publications/series/whlrd/whlrd6902.pdf.

Hamner, Ben, and Michael Frasco. 2018. Metrics: Evaluation Metrics for Machine Learning. https://CRAN.R-project.org/package=Metrics.

Hardison, Sean, Charles T Perretti, Geret S DePiper, and Andrew Beet. 2019. “A Simulation Study of Trend Detection Methods for Integrated Ecosystem Assessment.” ICES Journal of Marine Science.

Hiemstra, P.H., E.J. Pebesma, C.J.W. Twenh"ofel, and G.B.M. Heuvelink. 2008. “Real-time automatic interpolation of ambient gamma dose rates from the Dutch Radioactivity Monitoring Network.” Computers & Geosciences.

Hijmans, Robert J. 2017. Raster: Geographic Data Analysis and Modeling. https://CRAN.R-project.org/package=raster.

Kane, Joseph. 2007. “Zooplankton abundance trends on Georges Bank, 1977-2004.” ICES Journal of Marine Science 64 (5): 909–19. doi:10.1093/icesjms/fsm066.

Kinlan, Brian P., Arliss J. Winship, Timothy P. White, and John Christensen. 2016. “Modeling At-Sea Occurrence and Abundance of Marine Birds to Support Atlantic Marine Renewable Energy Planning Phase I Report.”

Liaw, Andy, and Matthew Wiener. 2002. “Classification and Regression by randomForest.” R News 2 (3): 18–22. https://CRAN.R-project.org/doc/Rnews/.

Lundblad, Emily R., Dawn J. Wright, Joyce Miller, Emily M. Larkin, Ronald Rinehart, David F. Naar, Brian T. Donahue, S. Miles Anderson, and Tim Battista. 2006. “A benthic terrain classification scheme for American Samoa.” Marine Geodesy 29 (2): 89–111. doi:10.1080/01490410600738021.

Murphy, Melanie A., Jeffrey S. Evans, and Andrew Storfer. 2010. “Quantifying Bufo boreas connectivity in Yellowstone National Park with landscape genetics.” Ecology 91 (1): 252–61. doi:10.1890/08-0879.1.

Paluszynska, Aleksandra, and Przemyslaw Biecek. 2017. randomForestExplainer: Explaining and Visualizing Random Forests in Terms of Variable Importance. https://CRAN.R-project.org/package=randomForestExplainer.

Riley, Shawn J, Stephen D DeGloria, and Robert Elliot. 1999. “A Terrain Ruggedness Index that Qauntifies Topographic Heterogeneity.” doi:citeulike-article-id:8858430.

Sappington, J. M., Kathleen M. Longshore, and Daniel B. Thompson. 2007. “Quantifying Landscape Ruggedness for Animal Habitat Analysis: A Case Study Using Bighorn Sheep in the Mojave Desert.” Journal of Wildlife Management 71 (5): 1419–26. doi:10.2193/2005-723.

Sherman, Kenneth, A Solow, J Jossi, and J Kane. 1998. “Biodiversity and Abundance of the Zooplankton of the Northeast Shelf Ecosystem.” ICES Journal of Marine Science 55 (4). Oxford University Press: 730–38.

Smith, Brian E., and Jason S. Link. 2010. The Trophic Dynamics of 50 Finfish and 2 Squid Species on the Northeast US Continental Shelf. NOAA Technichal Memorandum NMFS-NE-216. National Marine Fisheries Service, 166 Water Street, Woods Hole, MA 02543-1026. http://www.nefsc.noaa.gov/publications/tm/tm216/.

Winship, A.J., Kinlan B.P., T.P. White, and J.B. Leirness. 2018. “Modeling At-Sea Density of Marine Birds to Support Atlantic Marine Renewable Energy Planning: Final Report.” OCS Study BOEM 2018-010.