19 Expected Number of Species

Description: Time Series of Expected Number of Species per Tow in NEFSC BTS

Found in: State of the Ecosystem - Gulf of Maine & Georges Bank (2021+), State of the Ecosystem - Mid-Atlantic (2021+)

Indicator category: Database pull with analysis; Published methods

Contributor(s): Sean Lucey

Data steward: Sean Lucey,

Point of contact: Sean Lucey,

Public availability statement:

19.1 Methods

Diversity estimates have been developed to understand whether the overall structure of the ecosystem has remained stable or is changing. There are a large number of diversity indices that can be used to measure diversity; for the purposes of the State of the Ecosystem report we report on the expected number of species in a sample size (\(E(S_n)\)). This index was originally developed by Sanders (1968) and later refined by Hurlbert (1971) using a hypergeometric probability distribution. These “rarefied” samples allow for comparisons between sample sites with varying number of species present. The estimate of \(E(S_n)\) is less biased than other diversity indices which usually increase with sample size. It also has a more meaningful biological interpretation than other indices. For example, if a predator eats 10 random individuals, \(E(S_n)\) will predict the number of species consumed.

19.1.1 Data sources

Data used for the calculation of the expected number of species come from the Northeast Fisheries Science Center’s survey database (SVDBS) as pulled in the Survdat data set. These data are available to qualified researchers upon request. More information on the data request process is available under the “Access Information” field here.

19.1.2 Data analysis

The expected number of species (\(E(S_n)\)) was calculated for each survey tow as:

\[\begin{equation} E(S_n) = \sum_{i=1}^S{ \Bigg( 1 - \frac{\binom{N-N_i}{n}}{\binom{N}{n}} \Bigg) } \end{equation}\]

where \(S\) is the total number of species present, \(N\) the total number of individuals, and \(N_i\) the number of individuals of ith species. The result represents a sample of n individuals randomly selected from the tow without replacement. The calculation is made using the rarefy function of the vegan package (R-vegan?) using an n of 1000.

The number of species represented in these samples of 1000 fishes are then averaged over the survey for each Ecological Production Unit. Due to the lack of survey calibration factor to account for differences in the number of species caught between the NOAA Ship Albatross IV and NOAA Ship Henry B. Bigelow, the time series are kept separate.

19.1.3 Data processing

Data were formatted for inclusion in the ecodata R package using the R code found here.

catalog page https://noaa-edab.github.io/catalog/exp_n.html

References

Hurlbert, Stuart H. 1971. “The Nonconcept of Species Diversity: A Critique and Alternative Parameters.” Ecology 52 (4): 577–86. https://doi.org/https://doi.org/10.2307/1934145.
Sanders, Howard L. 1968. “Marine Benthic Diversity: A Comparative Study.” American Naturalist 102: 247–82. https://www.journals.uchicago.edu/doi/10.1086/282541.