1 Introduction

This report provides contextual ecosystem information for Atlantic menhaden (Brevoortia tyrannus) on the Northeast U.S. Continental Shelf. Data extractions for spring and fall are confined to the Atlantic menhaden stock area based on respective survey strata sets. The information is intended to span a range of potential factors affecting the productivity and distribution of Atlantic menhaden including physical factors, lower trophic level changes, and habitat distribution. These factors can be used to qualitatively inform the interpretation of population status and/or quantitatively to improve model responsiveness to ecosystem factors. The range and complexity of ecosystem data makes it unlikely to find the most relevant and comprehensive factor variables with a first evaluation; this process will require an iterative approach of evaluation and feedback. Additional indices can be included to address the needs of the working group.

Summary

Substantial changes have occurred in the coastal environment supporting the menhaden population. Thermal conditions have changed in both spring and fall seasons, with high rates of change in fall temperatures. Further physical change in the environment can be seen in a shift in salinity during the spring.

There have also been changes in lower trophic level productivity with a significant decline in fall chlorophyll concentration. Zooplankton taxa utilized by menhaden show varying patterns of time series changes, with some of the principal taxa increasing and others in decline. Zooplankton biomass has not declined during the spring, which would suggest the food resources for menhaden larvae have remained abundant. However, fall zooplankton biomass has declined over the most recent decade and may be a factor for adult feeding.

Abundance estimates based on a habitat model suggest the stock is presently at a high level although regional abundance patterns may have changed. During spring, it appears more of the population is in the northern end of the stock than in earlier years.

Stock area

The putative seasonal stock areas for menhaden are based on sets of survey strata from the Northeast Fisheries Science Center bottom trawl survey. The spring stock area is shown as the green filled strata in the map. The fall stock area includes the spring strata as well as strata in the Gulf of Maine filled in blue. These two areas are further divided into northern and southern subareas; the strata for the areas are outlined in orange and blue, respectively.

Strata map for Atlantic menhaden (*Brevoortia tyrannus*) on the NE shelf

Figure 1.1: Strata map for Atlantic menhaden (Brevoortia tyrannus) on the NE shelf

A note on figures

Unless otherwise noted, time series in this document are represented by dark blue lines. Estimates of linear trends and regime shifts are included on plots when found to be significant (p < 0.05), and are shown by green and light blue lines respectively. Trend strengths and confidence intervals are also included in plot titles when trends are present. Trends were modeled using a GLS model selection approach (see Hardison et al. 2019 for details).

Last updated on June 25 2019.

2 Temperature

An optimal interpolation procedure was used to estimate NE Shelf surface and bottom temperatures for two seasonal time frames (see methods). The temperature estimates were standardized to April 3 and October 11 for spring and fall over the period 1968-2018. Surface and bottom temperature within the Atlantic menhaden stock areas are shown below.

Bottom Temperature

Thermal conditions on the Northeast Shelf have changed in recent decades as reflected in changes in bottom water temperatures in the stock areas. In both spring and fall, bottom water temperatures have significantly increased, with the suggestion of abrupt changes in temperature early in the time series. The rate of increase has been twice as high in the fall. This index reflects conditions of the inner shelf, representing thermal conditions at relatively shallow depths and would likely have relevance to menhaden.

Surface Temperature

There does not appear to be any trend associated with spring surface temperature conditions, however, fall temperature have increased over time. Furthermore, the change point in fall conditions in 2011 may be related to the cumulative warming that marked the extreme warming in 2012.

3 Salinity

An optimal interpolation procedure was used to estimate NE Shelf surface and bottom salinity for two seasonal time frames (see methods). Though collected with temperature data, reliable instrumentation limits this time series to 1992-2018. The salinity estimates were standardized to April 3 and October 11 for spring and fall. Surface and bottom salinities within the Atlantic menhaden stock areas are shown below.

Bottom Salinity

There is a long-term trend in spring bottom salinity in the stock area and the suggestion of an abrupt change in salinity around the year 2000. This change point would correspond to other changes in the ecosystem around that time (Morse et al. 2017). The fall bottom salinity does not appear to have a long-term trend, however, there are two change points present in the time series. The first change was around 1999 and the second around the year 2012, which may be related to the changes in thermal conditions associated with the temperature conditions in 2012 (Mills et al. 2013).

Surface Salinity

The spring surface salinity, like the bottom conditions, also show a long-term increasing trend and a change point in 2013. The fall surface salinity time series was without trend or any change points. These changes in salinity conditions are likely indicators of changes in source water on the shelf and may have ramifications for patterns of primary production.

4 Chlorophyll

Chlorophyll concentration in the stock area

The concentration of chlorophyll is an indicator of lower trophic level productivity and may be of particular importance to menhaden considering both juvenile and adults directly utilize phytoplankton as a food resource. Chlorophyll concentration was extracted for the spring and fall stock areas using a merged sensor data set. The data set included measurements made with the Sea-viewing Wide Field of View Sensor (SeaWiFS), Moderate Resolution Imaging Spectroradiometer on the Aqua satellite (MODIS), Medium Resolution Imaging Spectrometer (MERIS), and Visible and Infrared Imaging/Radiometer Suite (VIIRS) sensors during the period 1997-2016. The data used were a merged product using the Garver, Siegel, and Maritorena (GSM) algorithm Maritorena et al. (2010) obtained from the Hermes GlobColour website. There is no apparent trend in spring chlorophyll concentration; however, there is a significant trend in fall chlorophyll associated with a regime change to lower levels in 2012. During this new regime, fall chlorophyll has been at the lowest levels in this time series.

Chesapeake Bay recruitment index

Recruitment is a complex process in marine fishes and in the case of menhaden, it has been long recognized that recruitment variability is related to the life history of larval menhaden on the continental shelf, the successful egress into estuarine nursery areas, and the growth and survival of post-metamorphic juveniles in nursery habitats. Particular attention was focused on the later source of variability in Houde et al. (2016), which suggested the development of spring bloom conditions may factor into the success of a year class. This study utilized local sources of data to describe the dependency of year class strength on primary production and phytoplankton biomass.

A proxy index for this relationship is developed here based on satellite remote sensing data. Time series of chlorophyll concentrations for the oligohaline, mesohaline, and polyhaline segments of the bay were extracted from the merged dataset described above.

Based on the most recent menhaden assessment, the chlorophyll time series overlaps the recruitment series for the years 1998-2016. The two largest year classes during that period appear to have occurred in 2005 and 2010 (based on age-0 numbers). The correlation between April polyhaline chlorophyll concentration and recruitment was 0.43 (P = .066). The polyhaline chlorophyll time series was at its highest value in 2005 and among its highest values in 2010; the 2011 chlorophyll concentration was also among the highest values, but it was not associated with a strong recruitment event. Time series from other portions of the estuary were not correlated with the recruitment signal. Finally, it is worth noting that the 2019 chlorophyll concentration was the third highest in the time series. If the remote sensing series is capturing a part of the recruitment signal, 2019 may be a large year class and the index may provide some forecast skill for recruitment events.

5 Zooplankton

Menhaden consume zooplankton during all their life history stages, but have a dependency on zooplankton as larval fish and a preference for zooplankton as adults (Friedland, Lynch, and Gobler 2011). June and Carlson (1971) characterized larval menhaden feeding in context to metamorphosis, partitioning their sampling between marine and estuarine waters. In marine waters, the diet consisted predominantly of copepods and mainly the genera Centropages and Acartia. In estuarine waters, the copepod genera Acartia and Temora played larger roles along with other zooplankton species. These samples were collected over two years. Lozano (2011) analyzed the feeding of larval menhaden over three seasons and found they utilized copepods, cladocerans, barnacle nauplii, bivalve larvae, ostracods, decapods, polychaetes, tunicate larvae, metatrochophores, and digested material.

The most commonly encountered copepod genera were Centropages, Acartia, Temora, Caligus, and Labidocera. Though identifications were not made to species, it is likely Centropages typicus, Acartia tonsa, and Temora longicornis were the most important prey species. Numerically, the most important prey were copepods, which accounted for approximately 46-70% of the prey, followed by barnacle nauplii and cladocerans, which ranged from 10-30% of the diet. After the metamorphosis from larval particle feeding to filter feeding, the nature of their gill raker feeding apparatus would suggest they would feed on any size zooplankton and not be limited by any minimum size like some herring species (Friedland et al. 2006). So generally, whatever taxa are abundant would likely be a useful prey resource for both larval or adult fish.

Copepod abundance

Non-copepod zooplankton abundance

6 Habitat and abundance

Occurrence probability

The probability of occurrence was estimated using random forest classification models (see methods). The mean annual probabilities were extracted for the spring and fall stock definition areas. These data provide an estimate of the potential use of the habitat associated with the stock definition. The most dramatic change in occupancy probability occurred in the spring. In the beginning of the time series, occupancy probability was less than 0.08 and has increased to well over 0.1 in recent decades. This suggests greater utilization of the continental shelf by menhaden during the spring period. Fall occupancy has remained relatively constant of the time series range between 0.12-0.13.

Habitat area

The area of the Northeast Shelf with an estimated occurrence probability for menhaden of 0.25 was determined for the stock area and the ecosystem. The area of habitat with the stock area generally ranged from 2,000-4,000 km2 with a notable peak in habitat associated with 2012 spring conditions. With the exception of 2012, spring habitat appears to have been represented by a larger area during the 1980s. Fall habitat has increased in recent years and was also at high levels in the early 1980s. The fall habitat area for menhaden over the ecosystem is larger than the areas restricted to the stock definition. Fall habitat approached 5,000 km2 reflecting a greater use of the ecosystem during the warmer part of the year. The spring habitat over the ecosystem were similar to the stock area dimensions.

Stock area

Ecosystem

Minimum Population Size

Estimates of the numbers and biomasses for the minimum population size of menhaden stocks were determined by re-stratifying the habitat each year depending on the distribution of occupancy habitat (see methods for details). The procedure provides eight population size estimates for each stock, by season, based on the use of three estimation options. The estimates are denoted by a three-letter code (Y/N|Y/N|Y/N) where the first position indicates whether data were interpolated over habitat strata. The second position indicates whether the trawl data applied to the estimate was limited to the stations made in the stock area only or if trawls from other areas were used as well. The third position indicates whether the trawl data were subjected to a test for outliers, in which outliers were eliminated.

Of the eight treatment options, the four that included outlier testing and elimination were found to produce many missing values; hence, the abundance and biomass estimates were based on the four other treatments (NNN, NYN, YNN, and YYN). The spring minimum population size in numbers suggests the stock continues the trend of high abundance that began with the 2005 recruitment. When disaggregated by northern and southern subareas, it appears the southern subunit approximates the unit abundance for most of the time series.

In recent years, the southern subareas appear to have declined whereas the northern subarea is closer to the unit population size. Similar trends in the relative role of the subareas can be seen in the minimum biomass estimates as well; the southern subarea approximates the unit early in the time series and in recent years the northern subarea estimates the unit. Collectively, both the abundance and biomass data suggest a shift in spring population distribution, with more of the population now distributed in the northern subareas. The time series trends in both abundance and biomass appear to be positive, however, the recent biomass peak level is higher than the peak biomass in the 1980. This is not the case when comparing the recent and 1980s peaks in abundance. The fall population estimates are dissimilar to the spring estimates in terms of both localized abundance and trends. The fall data suggest an abundance peak in the early 2000s that is not seen in the spring data. Furthermore, a peak in fall biomass does not accompany this peak in abundance. The fall data do not suggest any trend in abundance or biomass in the unit estimate or the in the subareas estimates either. The fall unit estimate is most closely approximated by the northern subarea estimate in all cases.

Spring numbers and biomass

Fall numbers and biomass

7 Methods

7.1 Surface and Bottom Temperature and Salinity

7.1.1 Study System

The U.S. Northeast Shelf ecosystem, which roughly aligns with the boundaries of the Northeast U.S. Continental Shelf Large Marine Ecosystem (LME), is the study area for the surface and bottom thermal environments. Surface and bottom temperatures were estimated over a 0.1° latitude/longitude grid, termed the estimation grid, which circumscribes the range of ecosystem assessment areas in the region. The difference between the extents of the estimation grid from the extent of the LME relate to the resource management programs that are the sources of the data, which are focused on fishery management needs in the region.

7.1.2 Data Source

Temperature and salinity were collected on the Northeast Shelf as part of ongoing resource and ecosystem surveys conducted by the Northeast Fisheries Science Center. Water column temperatures have been collected contemporaneously to trawl tows associated with a bottom trawl survey beginning in the fall of 1963 and five years later during spring (Despres-Patanjo, Azarovitz, and Byrne 1988). In addition, the ecosystem has been surveyed by multiple sampling programs with varying sampling designs. The two most comprehensive monitoring programs over the study period were the MARMAP (1977-1987) and the Ecosystem Monitoring Program or EcoMon (1992-present) programs, both serving as shelf-wide surveys of the ecosystem (Sherman et al. 1998; Kane 2007). Temperature measurements were made with a mix of Conductivity Temperature and Depth (CTD) instruments, analog XBT, digital XBT, mechanical BT, glass thermometers (bottle temps) and trawl mounted temperature loggers instruments collecting either water column profiles or temperatures measured at targeted depths. Salinity measurements used in this analysis was limited to 1992-2017 when CTD instrument were used. Surface and bottom temperatures were identified from these measurements. Temperatures representing the spring period include data collected during the months of February to June; however, 99% of the samples were collected during the months of March to May. Likewise, the fall period samples include data collected during September to December, with 99% of the samples collected during September to November. The total number of surface temperature measurements were 14,540 and 14,666 for spring and fall, respectively; and, 14,450 and 14,656 for spring and fall bottom temperature, respectively. On average, there were 290 temperature measurements by season, depth, and year. The number of salinity observation per year were similar.

7.1.3 Interpolation Procedure

Seasonal surface and bottom temperature fields were estimated using an optimal interpolation approach. The optimal interpolation combined a climatological depiction of temperature and an annual estimates based on the data for a particular depth and season. (There were a number of precursor steps that are described below.). The surveys used to collect the data were conducted during the same period each year, however, there was variation in survey timing. To account for this, temperatures were standardized to a collection date of April 3 for spring surveys and October 11 for fall using linear regression for each depth and season over the sample grid (see Appendix, Figure A1). For each depth and season, annual shelf-wide mean temperatures were calculated using the data from sample grid locations with at least 80% of the time series present. The annual observations were then transformed to anomalies by subtracting the appropriate annual mean. All anomalies for a season and depth were combined over years into a single anomaly field or climatology.

The annual estimate of temperature for a depth and season was imputed by used universal kriging to estimate the temperature over the estimation grid with depth as a covariate. The kriging yielded temperature estimates and a variance estimate over the same grid. The optimal interpolation field was assembled by combining the annual estimate and information from the anomaly climatology. The climatology was re-leveled from anomaly values to temperatures by adding back the appropriate annual mean. For each location in the estimation grid, temperature was calculated as a weighted mean between the kriged annual estimate and the releveled climatology. The weightings in the calculations were partitioned based on the variance field of the annual kriging. The field was divided into quartiles from low to high variance with the weighting ratio of annual:climatology temperatures of 4:1, 3:1, 2:1, and 1:1, respectively. Hence, in areas of low variance the weighted mean was based on a weighting of 4:1, which would reflect a higher contribution of information from the annual estimate and thus be closer to an observation. In areas with high variance, the weighting ratio of 1:1 would reflect a greater effect of the climatology in determining the interpolation estimate.

The optimal interpolation temperature fields were evaluated using cross validation and a comparison to external data. The performance of optimal interpolation was compared to the predictive skill of using either the climatology or annual interpolation alone by doing ten random cross validations of each treatment. Each random draw of training and test sets sampled 3% of the data for the test set, or about 500 observations per draw. The temperature fields were fit with the training data and compared with the test set data. The lowest error rates were realized with the optimal interpolation contrasted with highest predictive error associated with fields based on the climatology alone (see Appendix, Figure A2). The spatial distribution of error had distinct depth and seasonal patterns. The spatial errors associated with the surface estimates were generally low with the exception of a few locations along the shelf break between latitudes 39-41°N; the spatial error in the bottom temperature varied by season, and was concentrated along the shelf break in spring and across the shelf between latitudes 36-40°N in fall (see Appendix, Figure A3). The optimal interpolation data were also compared to external data from other collection programs. The absolute errors between surface temperature data collected by satellites and the interpolation had interquartile ranges of approximately ±0.75°C (see Appendix, Figure A4). The absolute error in a comparison of interpolated bottom temperature to opportunist sampling was approximately ±0.75°C for spring bottom temperature and approximately 0.75-1.0°C in the fall. Salinity was estimated in the same way with the exception of collection correction, which was deemed unnecessary for the salinity data.

Map of the study system with half-degree sample grid (blue lines) and extent of estimation grid shown in beige points (a). Major features of the study system with shelf break marked in purple line (b). 100m depth shown as dashed line.

(#fig:study area)Map of the study system with half-degree sample grid (blue lines) and extent of estimation grid shown in beige points (a). Major features of the study system with shelf break marked in purple line (b). 100m depth shown as dashed line.

7.2 Chlorophyll data

Chlorophyll a concentration ([Chl]) data extracted from satellite remote-sensing databases based on measurements made with the Sea-viewing Wide Field of View Sensor (SeaWiFS), Moderate Resolution Imaging Spectroradiometer on the Aqua satellite (MODIS), Medium Resolution Imaging Spectrometer (MERIS), and Visible and Infrared Imaging/Radiometer Suite (VIIRS) sensors. We used the Garver, Siegel, Maritorena Model (GSM) merged data product at 100 km (equivalent to a 1° grid) and 8-day spatial and temporal resolutions, respectively, obtained from the Hermes GlobColour website. These four sensors provide an overlapping time series of [Chl] during the period 1997 to 2018 and were combined based on a bio-optical model inversion algorithm (Maritorena et al. 2010).

7.3 Zooplankton abundance

Zooplankton abundance is measured by the Ecosystem Monitoring Program (EcoMon), which conducts shelf-wide bimonthly surveys of the NES ecosystem (Kane 2007). Zooplankton and ichthyoplankton are collected throughout the water column to a maximum depth of 200 m using paired 61-cm Bongo samplers equipped with 333-micron mesh nets. Sample location in this survey is based on a randomized strata design, with strata defined by bathymetry and along-shelf location. Plankton taxa are sorted and identified. We used the bio-volume of the 18 most abundant taxonomic categories as potential predictor variables (Table 2). The zooplankton sample time series has some missing values, which necessitated the removal of spring data for the years 1989, 1990, 1991, and 1994 and fall data for the years 1989, 1990, and 1992. The data for each seasonal time frame was interpolated to a complete field over the estimation grid using ordinary kriging.

7.3.1

Table 7.1: Variable and species names for the 18 most abundant zooplankton taxa in EcoMon survey data.
Variable name Full name
calfin Calanus finmarchicus
chaeto Chaetognatha
cham Centropages hamatus
cirr Cirripedia
ctyp Centropages typicus
echino Echinodermata
evadnespp Evadne spp.
gas Gastropoda
hyper Hyperiidea
larvaceans Appendicularians
mlucens Metridia lucens
oithspp Oithona spp.
para Paracalanus parvus
penilia Penilia spp.
pseudo Pseudocalanus spp.
salps Salpa
tlong Temora longicornis

7.4 Occupancy models

Occupancy and productivity habitats for Atlantic menhaden were estimated with random forest classification and regression models using a suite of static and dynamic predictor variables. Variation in species presence or absence and biomass across space, bathymetry factors, productivity factors, and climate factors were tested. Models were constructed separately for spring and fall seasons. The response variables were the occurrence and catch-per-unit-effort of Atlantic menhaden in the Northeast Fisheries Science Center bottom trawl survey, which is a fishery-independent survey on the Northeast US Shelf. The survey is conducted in the spring and fall of the year and is based on a stratified random design, which provides both spatial and temporal depictions of fish and macroinvertebrate abundances (Grosslein 1969). The independent or predictor variable set included physical environment variables, habitat descriptors, zooplankton variables, and remote sensing variables; the variables will be described in more detail below. Occupancy models were fit as two-factor classification models (absence as 0; presence as 1) using the randomForest R package (Liaw and Wiener 2002).

Prior to fitting the model, the independent variable set was first tested for multi-collinearity among the predictors and correlated variables were eliminated (R package Evans and Murphy (2018), version 2.1-3). From this reduced set of predictors, the final model variables were selected utilizing the model selection criteria of (Murphy, Evans, and Storfer 2010) as implemented in rfUtilities. Productivity models were fit as regression models with log10 transformed biomass-per-unit-effort as the response variable and the same starting set of predictor variables as in the occupancy models. As with the occupancy models, independent variables were tested for multi-collinearity and the model selection criteria was applied. Habitat was estimated from the model fits over a standard 0.1° grid, which circumscribes the range of ecosystem assessment areas in the region.

Three types of visualizations were created from the output of the tree models. The first visualization was used to see the average probability of occupancy over space and the rate of change (Sen’s slope) in occupancy over the years. The second visualization was used to see the mean occupancy gradient magnitude, or frontal strength and the rate of change (Sen’s slope) in occupancy gradient magnitude over the years. Gradient magnitude was calculated by calculating the median of the occupancy probabilities with a moving window and then summing those medians with a moving window with a matrix of weights. The third visualization was used to see the average biomass over space and the rate of change (Sen’s slope) in biomass over the years. Trends in total occupancy habitat area, with occupancy probabilities of 25, 50, and 75% over time were plotted as well by calculating the sum of the area with occupancy probabilities at each percentage during each year.

Following from the 2018 ECSA for summer flounder, the methodology was changed by the conversion of salinity, surface and bottom, from a dynamic variable to a static variable. By so doing, the fitting period is extended from 1992-2017 to 1976-2018. Dynamic salinity data was limited by the time series of modern instrumentation, which began in 1992. In the 2018 model fits, dynamic salinity was not an important variable in any species model; hence, conversion to a static variable was of little consequence.

7.4.1 Minimum swept area abundance based on habitat informed dynamic re-stratification

Estimates of the numbers and biomass minimum population size of stocks was determined by re-stratifying the habitat each year depending on the distribution of occupancy habitat. The habitat area was restricted to the portion of the shelf associated with the stock based on the roster of bottom trawl strata consider to comprise the stock area. To raise an annual estimate, the habitat was partitioned into ten intervals based on the probability of occurrence from the species distribution model. The partitions were based on equal intervals of occurrence probability; hence, the size of each habitat strata could vary. The trawl catch per unit effort values were assigned to the appropriate habitat strata based on the habitat score where the trawl was made. Once the requisite trawl hauls associated with a habitat strata were identified, a mean catch per unit effort was determined and raised to a total minimum population estimate for that stratum assuming a constant trawl path area of 0.024 km 2. The total population was the sum of the estimates for the ten strata.

\(Total\;numbers\;or\;biomass =\sum_{i=1}^{10} (strata\;area/trawl\;path\;area) * mean\;CPUE\)

These estimates were conditioned on three options intended to ameliorate some of the issues associated with the variability in survey catch rates. The first allowed for the application of catch per unit effort data from either the stock area only or including tows from outside the stock area if they were collected in areas with habitat scores matching the probability from the intervals in the stock area. It was felt this may enhance the estimate by increasing the sample size for key strata. The second option provided the choice of performing a test for outliers on the trawl catch per unit effort data. The boxplot command in R was used to identify the outliers for removal. The last option involved whether to apply a smoothing across the mean catch per unit effort estimates in each strata. If the option was invoked, a loess smoother (span = 0.75) was applied to generate the smoothed catch rates; the procedure also had the benefit of interpolating and extrapolating a rate to a habitat strata that may not have had catch samples associated with it. Therefore, for each species/stock, an estimate of seasonal total abundance or biomass minimum population size will have eight realizations that test the sensitively of the calculations to these three formulation choices.

7.4.2 Predictor Variables

Static variables were kept constant over years where dynamic variables varied annually. Hence, the length of the time series of model fits is constrained by the shortest dynamic variable time series to meet the requirement of complete cases in the Random Forest fitting. The fitting time series was constrained to 1992 – 2016, which was determined by the length of the station salinity data.

7.4.3 Physical environment

Station data included observations made contemporaneously to survey bottom trawl stations. Depth of the station was used as a static variable in the analysis. The observed depth was used in model fitting where model predictions were based on depths from the ETOPO1 dataset, which provided Northeast Shelf bathymetry at a resolution of 0.0167° (below).

7.4.4

Estimation grid for predictor variable and habitat estimates, grid location spaced by 0.1° longitude and latitude.

(#fig:occ_grid)Estimation grid for predictor variable and habitat estimates, grid location spaced by 0.1° longitude and latitude.

Surface and bottom water temperature and salinity were used as dynamic variables in the analysis. Temperature and salinity on the NE Shelf was collected using Conductivity/Temperature/Depth (CTD) instruments with the most complete sample coverage associated with spring (February –April) and fall (September-November) time frames. Surface and bottom temperatures were used to develop date of collection corrections using linear regression for each time frame. Temperatures were standardized to a collection date of April 3 for spring collections and October 11 for fall. A date of collection correction was not indicated for salinity data. The observed date-corrected temperature (°C) and uncorrected salinity data (PSU) was used in model fitting.

Model predictions were based on temperature and salinity fields using an optimal interpolation approach where annual data were combined with a climatology by season. For a half degree grid of the ecosystem, mean bottom temperature or salinity was calculated by year and season. For grid locations that had data for at least 80% of the time series, the data from those locals were used to calculate a seasonal mean. The annual seasonal means were used to calculate anomalies, which were combined over the time series to provide seasonal, surface and bottom anomaly climatologies.

Returning to the raw data, the observations for a year, season, and depth were then used to estimate an annual field using universal kriging with depth as a covariate. The kriging was done on a standard 0.1° grid using the automap (Hiemstra et al. 2008, version 1.0–14). The annual field was then combined with the climatology anomaly field, adjusted by the annual mean, using the variance field from the kriging as the basis for a weighted mean between the two. The variance field was divided into quartiles with the first quartile (lowest kriging variance) carrying a weighting of 4:1 between the annual and climatology values. Hence, the optimal interpolated field at these locations were skewed towards the annual data reflecting their proximity to actual data locations and low kriging variance associated with them. The weighting ratio shifted to 1:1 in the highest variance quartile reflecting less information from the annual field and more from the climatology.

7.4.5 Habitat Descriptors

Habitat descriptors are a series of static variables that reflect the shape and complexity of benthic habitats. Since the response variables for these models are derived from bottom trawl gear, naturally the range of candidate taxa for modelling is skewed to benthic organisms, making these descriptors particularly relevant. Most of the variables are based on depth measurement, including the complexity, BBI, VRM, Prcurv, rugostity, seabedforms, slp, and slpslp variables (Table 1). The soft-sed variable is based on benthic sediment grain size and the vorticity variable is based on current estimates.

7.4.6 Zooplankton Data

Zooplankton abundance is measured by the Ecosystem Monitoring Program (EcoMon), which conducts shelf-wide bimonthly surveys of the NES ecosystem (Kane 2007). Zooplankton and ichthyoplankton are collected throughout the water column to a maximum depth of 200 m using paired 61-cm Bongo samplers equipped with 333-micron mesh nets. Sample location in this survey is based on a randomized strata design, with strata defined by bathymetry and along-shelf location. Plankton taxa are sorted and identified. We used the bio-volume of the 18 most abundant taxonomic categories as potential predictor variables (Table 2). The zooplankton sample time series has some missing values which were ameliorated by summing data over five-year time steps for each seasonal time frame and interpolating a complete field using ordinary kriging. Thus, for example, the data for spring 2000 would include the available data from 1998-2002 tows.

7.4.7 Remote Sensing Data

Chlorophyll concentration and SST from remote sensing sources were applied in the habitat models as static variables. Chlorophyll and SST were summarized as monthly means with their associated gradient magnitude or frontal fields. The basis for the chlorophyll concentration was measurements made with the Sea-viewing Wide Field of View Sensor (SeaWiFS), Moderate Resolution Imaging Spectroradiometer on the Aqua satellite (MODIS), Medium Resolution Imaging Spectrometer (MERIS), and Visible and Infrared Imaging/Radiometer Suite (VIIRS) sensors during the period 1997-2016. The data is a merged product using the Garver, Siegel, Maritorena Model (GSM) algorithm obtained from the Hermes GlobColour website.

These four sensors provide an overlapping time series of chlorophyll concentration during the period and were combined based on a bio-optical model inversion algorithm (Maritorena et al. 2010). Monthly SST fields were based on data from the MODIS Terra sensor data available from the Ocean Color Website. From these data, mean monthly fields were generated for both chlorophyll and SST. There are a range of methods used to identify fronts (Belkin and O’Reilly 2009) in oceanographic data that usually apply some focal filter to reduce noise and identify gradient magnitude with a Sobel filter. These calculations were done in R using the raster package (Hijmans 2017, version 2.6–7) using a 3 by 3 mean focal filter and a Sobel filter to generate x and y derivatives, which are then used to calculate gradient magnitude.

7.4.8 Model Selection Criteria and Variable Importance

The habitat models were evaluated for fit based on out-of-bag classification accuracy. For occupancy models accuracy, AUC (Area Under the ROC Curve), and Cohen’s Kappa were calculated using the irr R package (Gamer et al. 2012, version 0.84). For regression models, the variance explained by the model, mean absolute error, the root mean square error, and bias were calculated using the Metrics R package (Hamner and Frasco 2018, version 0.1.3). To evaluate variable importance in both occupancy and regression models, we plotted the number of times a variable was the root variable versus the mean minimum node depth for the variable, highlighting the top ten important variables using the randomForestExplainer R package (Paluszynska and Biecek 2017, version 0.9). For occupancy models we also plotted the Gini index decrease versus accuracy decrease, whereas for the regression models we plotted node purity increase versus MSE increase, also highlighting the top ten most important variables.

7.4.9

Table 7.2: Habitat descriptors used in occupancy model parameterization.
Variables Notes References
Namera_vrm Vector Ruggedness Measure (VRM) measures terrain ruggedness as the variation in three-dimensional orientation of grid cells within a neighborhood based the TNC Northwest Atlantic Marine Ecoregional Assessment (“NAMERA”) data. Hobson (1972); Sappington, Longshore, and Thompson (2007)
Prcurv (2 km, 10 km, and 20 km) Benthic profile curvature at 2km, 10km and 20 km spatial scales was derived from depth data. Winship et al. (2018)
Rugosity A measure of small-scale variations of amplitude in the height of a surface, the ratio of the real to the geometric surface area. Friedman et al. (2012)
seabedforms Seabed topography as measured by a combination of seabed position and slope. http://www.northeastoceandata.org/
Slp (2 km, 10 km, and 20 km) Benthic slope at 2km, 10km and 20km spatial scales. Winship et al. (2018)
Slpslp (2 km, 10 km, and 20 km) Benthic slope of slope at 2km, 10km and 20km spatial scales Winship et al. (2018)
soft_sed Soft-sediments is based on grain size distribution from the USGS usSeabed: Atlantic coast offshore surficial sediment data. http://www.northeastoceandata.org/
Vort (fall - fa; spring - sp; summer - su; winter - wi) Benthic current vorticity at a 1/6 degree (approx. 19 km) spatial scale. Kinlan et al. (2016)

7.5 Diet composition

NEFSC bottom-trawl sampling occurs twice annually in the spring (March-May) and fall (September-November). The survey area encompasses about 293,000 square km of continental shelf from Cape Hatteras, NC, to Nova Scotia, Canada in depths from 8-400 m. Food habits sampling has been conducted since 1973.

Stomachs are collected at sea by NEFSC, and have been primarily analyzed at sea since 1981. Total stomach volume is estimated, each prey item is identified and sorted to the lowest possible taxonomic level, and the proportion of each prey item is estimated. Detailed methods are described in Link and Almeida (2000). Prey composition percent by weight was calculated using a weighted mean (\(\overline{W_{ijs}}\)) (Link and Almeida 2000; Smith and Link 2010) to estimate mean weight of prey \(i\) in predator \(j\) for statistical group \(s\). Note: Prey volumes are used as proxies for prey weight. It may be calculated as

\[ \overline{W_{ijs}} = \frac{\sum_{t=1}^{N_{ts}}N_{jts}\overline{w_{ijts}}}{N_{ts}} \]

where \(t\) represents an individual bottom trawl tow, \(N_{jts}\) is the number of predator \(j\) stomachs in tow \(t\) for statistical group \(s\), \(N_{ts}\) is the number of tows in statistical group \(s\), and

\[ \overline{w_{ijts}} = \frac{\sum_{k=1}^{N_{jts}}w_{ijtsk}}{N_{jts}} \]

7.6 Appendix

7.6.1 Date of collection correction

The dates of survey data collection varied by year. To date correct temperature measurements, regressions were estimated between temperature and day of the year over the sample grid (0.5° grid) by season and depth. Spring data were transformed to a temperature representing April 3 and fall to October 11 based on the slope estimates shown in the maps in Figure A1.

7.6.2

**Figure A1**: Spring (a) and fall (b) linear slope coefficients used to date correct surface temperature to standard spring and fall dates; same for spring (c) and fall (d) bottom temperature correction.

(#fig:doc_correction)Figure A1: Spring (a) and fall (b) linear slope coefficients used to date correct surface temperature to standard spring and fall dates; same for spring (c) and fall (d) bottom temperature correction.


7.6.3 Cross validation performance of temperature interpolation

A series of random draws of training and test sets were taken to evaluate the predictive skill of the estimation procedure. A set of climatology, annual interpolation, or optimal interpolation fields were estimated using the training set and compared to the held-out test set.

7.6.3.1

**Figure A2**: Mean and 95% confidence intervals of squared errors for spring (a) and fall (b) surface temperature estimates based on climatology, annual interpolation, and optimal interpolation (OI); same data for spring (c) and fall (d) bottom temperature estimates.

(#fig:cross_val)Figure A2: Mean and 95% confidence intervals of squared errors for spring (a) and fall (b) surface temperature estimates based on climatology, annual interpolation, and optimal interpolation (OI); same data for spring (c) and fall (d) bottom temperature estimates.


The distribution of errors from the optimal interpolation test sets were evaluated spatially to determine where the larger errors occurred and what ecosystem features they are associated with.

7.6.4

**Figure A3**: Mean absolute error by 0.1 degree latitude and longitude intervals for spring (a) and fall (b) surface temperature; same data for spring (c) and fall (d) bottom temperature. Red indicates a positive error where blue indicates negative.

(#fig:oi_mae)Figure A3: Mean absolute error by 0.1 degree latitude and longitude intervals for spring (a) and fall (b) surface temperature; same data for spring (c) and fall (d) bottom temperature. Red indicates a positive error where blue indicates negative.


7.6.5 Comparisons to external data sources

Sea surface temperature estimated in this study were compared to the OISST dataset serving as a source of data for external comparison. The OISST data for April 3 (spring) and October 11 (fall) over the period 1982-2017 were extracted on the same 0.5° grid used in the study. Time series of matching spring and fall temperatures were differenced (external minus interpolation data) and presented in the figure below (sample size for surface spring and fall data were 5220 and 5365, respectively). The interquartile range for the surface comparisons were generally symmetric around zero with differences between 0.5-1°C. Bottom temperature estimated in this study were compared to the data from a cooperative data collection program for external comparison. The “Environmental Monitors on Lobster Traps” (EMOLT) is a cooperative research program that collects bottom temperatures. The data for spring and fall were extracted from rasters of the study data to match the extremal data. The external data covered the period 1968-2017, with 360 data points for the spring and 728 for the fall. Spring bottom comparisons yielded the smallest interquartile range of differences less than 0.5°C, where fall comparisons were slightly larger and biased to lower interpolation estimates compared to the external data.

7.6.6

**Figure A4**: Box plots (box: interquartile range, whisker: 5-95% range, square symbol: mean, line: median) of the difference between external and interpolation data.

(#fig:oi_compare)Figure A4: Box plots (box: interquartile range, whisker: 5-95% range, square symbol: mean, line: median) of the difference between external and interpolation data.

References

Belkin, Igor M, and O’ReillyJohn E. 2009. “An Algorithm for Oceanic Front Detection in Chlorophyll and Sst Satellite Imagery.” Journal of Marine Systems 78 (3): 319–26.

Despres-Patanjo, Linda I, Thomas R Azarovitz, and Charles J Byrne. 1988. “Twenty-Five Years of Fish Surveys in the Northwest Atlantic: The Nmfs Northeast Fisheries Center’s Bottom Trawl Survey Program.” Marine Fisheries Review 50 (4): 69–71.

Evans, Jeffrey S., and Melanie A. Murphy. 2018. RfUtilities. https://cran.r-project.org/package=rfUtilities.

Friedland, Kevin D, Dean W Ahrenholz, Joseph W Smith, Maureen Manning, and Julia Ryan. 2006. “Sieving Functional Morphology of the Gill Raker Feeding Apparatus of Atlantic Menhaden.” Journal of Experimental Zoology Part A: Comparative Experimental Biology 305 (12): 974–85.

Friedland, Kevin D, Patrick D Lynch, and Christopher J Gobler. 2011. “Time Series Mesoscale Response of Atlantic Menhaden Brevoortia Tyrannus to Variation in Plankton Abundances.” Journal of Coastal Research 27 (6): 1148–58.

Friedman, Ariell, Oscar Pizarro, Stefan B. Williams, and Matthew Johnson-Roberson. 2012. “Multi-Scale Measures of Rugosity, Slope and Aspect from Benthic Stereo Image Reconstructions.” PLoS ONE 7 (12). https://doi.org/10.1371/journal.pone.0050440.

Gamer, Matthias, Jim Lemon, Ian Fellows, and Puspendra Singh. 2012. Irr: Various Coefficients of Interrater Reliability and Agreement. https://CRAN.R-project.org/package=irr.

Grosslein, MD. 1969. Groundfish Survey Methods. Bureau of Commercial Fisheries, Biological Laboratory. https://nefsc.noaa.gov/publications/series/whlrd/whlrd6902.pdf.

Hamner, Ben, and Michael Frasco. 2018. Metrics: Evaluation Metrics for Machine Learning. https://CRAN.R-project.org/package=Metrics.

Hardison, Sean, Charles T Perretti, Geret S DePiper, and Andrew Beet. 2019. “A Simulation Study of Trend Detection Methods for Integrated Ecosystem Assessment.” ICES Journal of Marine Science.

Hiemstra, P.H., E.J. Pebesma, Twenh"ofelC.J.W., and G.B.M. Heuvelink. 2008. “Real-time automatic interpolation of ambient gamma dose rates from the Dutch Radioactivity Monitoring Network.” Computers & Geosciences.

Hijmans, Robert J. 2017. Raster: Geographic Data Analysis and Modeling. https://CRAN.R-project.org/package=raster.

Hobson, R. D. 1972. Surface roughness in topography: quantitative approach. New York, New York: Harper; Row.

Houde, Edward D, Eric R Annis, Harding JrLawrence W, Michael E Mallonee, and Michael J Wilberg. 2016. “Factors Affecting the Abundance of Age-0 Atlantic Menhaden (Brevoortia Tyrannus) in Chesapeake Bay.” ICES Journal of Marine Science 73 (9): 2238–51.

June, Fred C, and Frank T Carlson. 1971. “Food of Young Atlantic Menhaden Brevoortia Tyrannus in Relation to Metamorphosis.” US Fish. Wildl. Serv. Fish. Bull. 68: 493–512.

Kane, Joseph. 2007. “Zooplankton abundance trends on Georges Bank, 1977-2004.” ICES Journal of Marine Science 64 (5): 909–19. https://doi.org/10.1093/icesjms/fsm066.

Kinlan, Brian P., Arliss J. Winship, Timothy P. White, and John Christensen. 2016. “Modeling At-Sea Occurrence and Abundance of Marine Birds to Support Atlantic Marine Renewable Energy Planning Phase I Report.”

Liaw, Andy, and Matthew Wiener. 2002. “Classification and Regression by randomForest.” R News 2 (3): 18–22. https://CRAN.R-project.org/doc/Rnews/.

Lozano, C. 2011. “Dynamics of Ingress, Hatch Dates, Growth, and Feeding of Atlantic Menhaden, Brevoortia Tyrannus, Larvae at the Chesapeake Bay Mouth.” Master’s thesis, Chesapeake Biological Laboratory, University of Maryland Center For Environmental Science.

Maritorena, Stéphane, d’AndonOdile Hembise Fanton, Antoine Mangin, and David A Siegel. 2010. “Merged Satellite Ocean Color Data Products Using a Bio-Optical Model: Characteristics, Benefits and Issues.” Remote Sensing of Environment 114 (8): 1791–1804.

Mills, Katherine E, Andrew J Pershing, Curtis J Brown, Yong Chen, Fu-Sung Chiang, Daniel S Holland, Sigrid Lehuta, et al. 2013. “Fisheries Management in a Changing Climate: Lessons from the 2012 Ocean Heat Wave in the Northwest Atlantic.” Oceanography 26 (2): 191–95.

Morse, RE, KD Friedland, D Tommasi, C Stock, and J Nye. 2017. “Distinct Zooplankton Regime Shift Patterns Across Ecoregions of the Us Northeast Continental Shelf Large Marine Ecosystem.” Journal of Marine Systems 165: 77–91.

Murphy, Melanie A., Jeffrey S. Evans, and Andrew Storfer. 2010. “Quantifying Bufo boreas connectivity in Yellowstone National Park with landscape genetics.” Ecology 91 (1): 252–61. https://doi.org/10.1890/08-0879.1.

Paluszynska, Aleksandra, and Przemyslaw Biecek. 2017. randomForestExplainer: Explaining and Visualizing Random Forests in Terms of Variable Importance. https://CRAN.R-project.org/package=randomForestExplainer.

Sappington, J. M., Kathleen M. Longshore, and Daniel B. Thompson. 2007. “Quantifying Landscape Ruggedness for Animal Habitat Analysis: A Case Study Using Bighorn Sheep in the Mojave Desert.” Journal of Wildlife Management 71 (5): 1419–26. https://doi.org/10.2193/2005-723.

Sherman, Kenneth, A Solow, J Jossi, and J Kane. 1998. “Biodiversity and Abundance of the Zooplankton of the Northeast Shelf Ecosystem.” ICES Journal of Marine Science 55 (4): 730–38.

Smith, Brian E., and Jason S. Link. 2010. The Trophic Dynamics of 50 Finfish and 2 Squid Species on the Northeast US Continental Shelf. NOAA Technichal Memorandum NMFS-NE-216. National Marine Fisheries Service, 166 Water Street, Woods Hole, MA 02543-1026. http://www.nefsc.noaa.gov/publications/tm/tm216/.

Winship, A.J., B.P.Kinlan, T.P. White, and J.B. Leirness. 2018. “Modeling At-Sea Density of Marine Birds to Support Atlantic Marine Renewable Energy Planning: Final Report.” OCS Study BOEM 2018-010.