Home /Search
Search datasets
We have found 1,097 datasets for the keyword "species distribution models". You can continue exploring the search results in the list below.
Datasets: 104,589
Contributors: 42
Results
1,097 Datasets, Page 1 of 110
Updated Species Distribution Models for Marine Invasive Species Hotspot Identification
Monitoring data from DFO invasive species monitoring programs, along with occurrence information from online databases and the scientific literature, have been paired with high resolution environmental data and oceanographic models in species distribution models that predict present-day and project future distributions of 24 non-indigenous species (NIS) on North America`s east coast, and 31 NIS on its west coast. Future distributions were predicted for 2100, under Representative Concentration Pathway 8.5 from the Intergovernmental Panel on Climate Change’s fifth Assessment Report. Present-day and future richness of these species (i.e., hotspots) have been estimated by summing the occurrence probabilities of NIS. This data set includes the present-day and year 2100 species distribution modeling results for each species, and the estimated species richness.Cite this data as: Lyons DA., Lowen JB, Therriault TW., Brickman D., Guo L., Moore AM., Peña MA., Wang Z., DiBacco C. Data of: Updated species distribution models for marine invasive species hotspot identification. Published: November 2023. Coastal Ecosystems Science Division, Fisheries and Oceans Canada, Dartmouth, N.S. https://open.canada.ca/data/en/dataset/1439dcb3-82a6-40fd-a9a4-8f045b20ff5b
Characteristics of Environmental Data Layers for Use in Species Distribution Modelling in the Maritimes Region
Species distribution models (SDMs) are tools that combine species observations of occurrence, abundance, or biomass with environmental variables to predict the distribution of a species in unsampled locations. To produce accurate predictions of occurrence, abundance or biomass distribution, a wide range of physical and/or biological variables is desirable. Such data is often collected over limited or irregular spatial scales, and require the application of geospatial techniques to produce continuous environmental surfaces that can be used for modelling at all spatial scales. Here we provide a review of 102 environmental data layers that were compiled for the entire spatial extent of Fisheries and Oceans Canada’s (DFO) Maritimes Region. Variables were obtained from a broad range of physical and biological data sources and spatially interpolated using geostatistical methods. For each variable we document the underlying data distribution, provide relevant diagnostics of the interpolation models and an assessment of model performance, and present the final standard error and interpolation surfaces. These layers have been archived in a common (raster) format at the Bedford Institute of Oceanography to facilitate future use. Based on the diagnostic summaries in this report, a subset of these variables has subsequently been used in species distribution models to predict the distribution of deep-water corals, sponges, and other significant benthic taxa in the Maritimes Region.Cite this data as: Beazley, Lindsay; Guijarro, Javier, Lirette; Camille; Wang, Zeliang; Kenchington, Ellen (2020). Characteristics of Environmental Data Layers for Use in Species Distribution Modelling in the Maritimes Region. Published July 2023. Ocean Ecosystems Science Division, Fisheries and Oceans Canada, Dartmouth, N.S. https://open.canada.ca/data/en/dataset/34a917cb-a0e3-403c-91c7-af3dc20628b1
Species distribution models and occurrence data for marine invasive species hotspot identification
Since 2005, Fisheries and Oceans Canada has been collecting monitoring data for aquatic invasive species (e.g. https://open.canada.ca/data/en/dataset/8d87f574-0661-40a0-822f-e9eabc35780d, https://open.canada.ca/data/en/dataset/503a957e-7d6b-11e9-aef3-f48c505b2a29, https://open.canada.ca/data/en/dataset/8661edcf-f525-4758-a051-cb3fc8c74423). This monitoring data, as well additional occurrence information from online databases and the scientific literature, have been paired with high resolution environmental data and oceanographic models in species distribution models that predict the present-day and future potential distributions of 12 moderate to high risk invasive species on Canada’s east and west coasts. Future distributions were predicted for 2075, under Representative Concentration Pathway 8.5 from the Intergovernmental Panel on Climate Change’s fifth Assessment Report. Present-day and future richness of these species (i.e., hotspots) has also been estimated by summing their occurrence probabilities. This data set includes the occurrence locations of each species, the present-day and future species distribution modeling results for each species, and the estimated species richness. This research has been published in the scientific literature(Lyons et al. 2020).Lyons DA, Lowen JB, Therriault TW, Brickman D, Guo L, Moore AM, Peña MA, Wang Z, DiBacco C. (In Press) Identifying Marine Invasion Hotspots Using Stacked Species Distribution Models. Biological InvasionsCite this data as: Lyons DA., Lowen JB, Therriault TW., Brickman D., Guo L., Moore AM., Peña MA., Wang Z., DiBacco C. Data of: Species distribution models and occurrence data for marine invasive species hotspot identification. Published: November 2020. Coastal Ecosystems Science Division, Fisheries and Oceans Canada, Dartmouth, N.S. https://open.canada.ca/data/en/dataset/1bbd5131-8b34-4245-b999-3b4c4259d74f
Species Distribution Modelling of Corals and Sponges in the Maritimes Region for Use in the Identification of Significant Benthic Areas
Effective fisheries and habitat management processes require knowledge of the distribution of areas of high ecological or biological significance. On the Scotian Shelf and Slope, a number of benthic ecologically or biologically significant areas consisting of habitat-forming species such as sponges and deep-water corals have been identified. However, knowledge of their spatial distribution is largely based on targeted surveys that are limited in their spatial extent. We used a species distribution modelling approach called random forest (RF) to predict the probability of occurrence and biomass of sponges, sea pens, and large and small gorgonian corals across the entire spatial extent of Fisheries and Oceans Canada’s (DFO) Maritimes Region. We also modelled the rare sponge Vazella pourtalesi, which forms the largest known aggregation of its kind on the Scotian Shelf. We utilized a number of data sources including DFO multispecies trawl catch data and in situ benthic imagery observations. Most models had excellent predictive capacity with cross-validated Area Under the Receiver Operating Characteristic Curve (AUC) values ranging from 0.760 to 0.977. Areas of suitable habitat were identified for each taxon and were contrasted against their known distribution and when applicable, the location of closure areas designated for their protection. Generalized additive models (GAMs) were developed to predict the biomass distribution of each taxonomic group and serve as a comparison to the RF models. The RF and GAM models provided comparable results, although GAMs provided superior predictions of biomass along the continental slope for some taxonomic groups. In the absence of data observations, the results of this study could be used to identify the potential distribution of sensitive benthic taxa for use in fisheries and habitat management applications. These results could also be used to refine significant concentrations of these taxa as identified through the kernel density analyses.Cite this data as: Beazley, Lindsay; Kenchington, Ellen; Murillo-Perez, Javier; Lirette, Camille; Guijarro-Sabaniel, Javier; McMillan, Andrew; Knudby, Anders (2019). Species Distribution Modelling of Corals and Sponges in the Maritimes Region for Use in the Identification of Significant Benthic Areas. Published July 2023. Ocean Ecosystems Science Division, Fisheries and Oceans Canada, Dartmouth, N.S. https://open.canada.ca/data/en/dataset/356e92f3-5bf3-4810-98b1-3e10cd7742aa
Predicted distributions of 65 groundfish species in Canadian Pacific waters
Description:This dataset contains layers of predicted occurrence for 65 groundfish species as well as overall species richness (i.e., the total number of species present) in Canadian Pacific waters, and the median standard error per grid cell across all species. They cover all seafloor habitat depths between 10 and 1400 m that have a mean summer salinity above 28 PSU. Two layers are provided for each species: 1) predicted species occurrence (prob_occur) and 2) the probability that a grid cell is an occurrence hotspot for that species (hotspot_prob; defined as being in the lower of: 1) 0.8, or 2) the 80th percentile of the predicted probability of occurrence values across all grid cells that had a probability of occurrence greater than 0.05.). The first measure provides an overall prediction of the distribution of the species while the second metric identifies areas where that species is most likely to be found, accounting for uncertainty within our model. All layers are provided at a 1 km resolution.Methods:These layers were developed using a species distribution model described in Thompson et al. 2023. This model integrates data from three fisheries-independent surveys: the Fisheries and Oceans Canada (DFO) Groundfish Synoptic Bottom Trawl Surveys (Sinclair et al. 2003; Anderson et al. 2019), the DFO Groundfish Hard Bottom Longline Surveys (Lochead and Yamanaka 2006, 2007; Doherty et al. 2019), and the International Pacific Halibut Commission Fisheries Independent Setline Survey (IPHC 2021). Further details on the methods are found in the metadata PDF available with the dataset.Abstract from Thompson et al. 2023:Predictions of the distribution of groundfish species are needed to support ongoing marine spatial planning initiatives in Canadian Pacific waters. Data to inform species distribution models are available from several fisheries-independent surveys. However, no single survey covers the entire region and different gear types are required to survey the range of habitats that are occupied by groundfish. Bottom trawl gear is used to sample soft bottom habitat, predominantly on the continental shelf and slope, whereas longline gear often focuses on nearshore and hardbottom habitats where trawling is not possible. Because data from these two gear types are not directly comparable, previous species distribution models in this region have been limited to using data from one survey at a time, restricting their spatial extent and usefulness at a regional scale. Here we demonstrate a method for integrating presence-absence data across surveys and gear types that allows us to predict the coastwide distributions of 66 groundfish species in British Columbia. Our model leverages the use of available data from multiple surveys to estimate how species respond to environmental gradients while accounting for differences in catchability by the different surveys. Overall, we find that this integrated method has two main benefits: 1) it increases the accuracy of predictions in data-limited surveys and regions while having negligible impacts on the accuracy when data are already sufficient to make predictions, 2) it reduces uncertainty, resulting in tighter confidence intervals on predicted species occurrences. These benefits are particularly relevant in areas of our coast where our understanding of habitat suitability is limited due to a lack of spatially comprehensive long-term groundfish research surveys.Data Sources:Research data was provided by Pacific Science’s Groundfish Data Unit for research surveys from the GFBio database between 2003 and 2020 for all species which had at least 150 observations, across all gear type and survey datasets available.Uncertainties:These are modeled results based on species observations at sea and their related environmental covariate predictions that may not always accurately reflect real-world groundfish distributions though methods that integrate different data types/sources have been demonstrated to improve model inference by increasing the accuracy of the predictions and reducing uncertainty.
Spatial estimates of Blue Shark, Salmon Shark, Pacific Sleeper Shark and Bluntnose Sixgill Shark presence in British Columbia
Description:Spatial information on ecologically important species is needed to support marine spatial planning initiatives in British Columbia’s (BC) marine environment. For data deficient taxa, such as shark species, species distribution models that integrate presence-absence data from different sources can be used to predict their coastwide distributions. Here we provide spatial estimates of the distribution of Blue Shark (Prionace glauca), Salmon Shark (Lamna ditropis), Pacific Sleeper Shark (Somniosus pacificus) and Bluntnose Sixgill Shark (Hexanchus griseus). These estimates were generated using spatial generalized linear mixed effects models and are based on data from two scientific surveys and the commercial hook and line, midwater trawl and bottom trawl fisheries. For each species, we provide predicted probability of occurrence and prediction uncertainty at a 3 km resolution for the British Columbia coast, and parameter estimates for model covariates (depth, slope, year, data source). Results show variable predicted distributions across species, with Blue Shark and Pacific Sleeper Shark showing higher probability of presence along the continental slope, while Salmon Shark show low probability of occurrence coastwide and Bluntnose Sixgill Shark show the highest probability of occurrence in the Strait of Georgia. The results from this study can support ongoing marine spatial planning initiatives in the BC and support the conservation and management of these important species.Methods:Data Sources The species distribution models (SDMs) are based on data from two fishery independent scientific surveys and from the commercial hook and line fishery, which are all conducted within Canadian Pacific waters. The scientific surveys include the Fisheries and Oceans Canada (DFO) hard bottom longline surveys and the International Pacific Halibut Commission (IPHC) fishery-independent setline survey. The study area is bound by the outer convex hull of these three data sources. Other DFO research surveys, such as the groundfish synoptic bottom trawl surveys, midwater trawl surveys and sablefish trap surveys were investigated as potential data sources, but were found to have insufficient presence observations for the species of interest to warrant their inclusion in the analysis. For more information on the details of the source data please refer to Proudfoot et al. 2024.Modelling Approach and Comparison For each species, we fit a suite of generalized linear mixed effects models (GLMMs) using the sdmTMB package (Anderson et al. 2022). For each species, we fit four models, each with a different set of fixed effects/environmental predictors. Additionally, we compared the predictive power of four models for each species, with each model having a different combination of environmental predictors (i.e., slope, depth, slope + depth, none). A summary of the candidate models is provided in Table 2 of Proudfoot et al. 2024. For each species, we selected the model with the highest predictive accuracy (assessed using the predicted log likelihood based on the cross-validation) as the best fit.Spatial Species Distribution Predictions We made predictions of species occurrence using the selected model and a 3 km resolution spatial prediction grid. Our predictions were made for the entire BC coast, and species distribution predictions were made using models fit to the full dataset, as opposed to models fit using cross-validation. We made predictions with year set to 2014 (the approximate midpoint of the dataset) and type set to IPHC (the dataset with the most even spatial distribution of data points).Uncertainties:Because limited survey and commercial catch data exists for deep areas off the continental shelf, predictions in these areas are likely more uncertain than predictions on the shelf. To illustrate this, uncertainty (standard deviation derived from the 500 simulated values from the joint precision matrix of selected models) was mapped across the full study area for each species. Additionally, because these models are based on data that likely do not span the full spatiotemporal extent of the species’ habitat (i.e., mid depths, surface waters, and data across all seasons may not be captured), these results illustrate a snapshot of occurrence but do not account for more complex migration and movement patterns undertaken by these species.
BC Tree Species Map/Likelihoods 2015
Dominant Species Map 2015The data represent dominant tree species for British Columbia forests in 2015, are based upon Landsat data and modeling, with results mapped at 30 m spatial resolution. The map was generated with the Random Forests classifier that used predictor variables derived from Landsat time series including surface reflectance, land cover, forest disturbance, and forest structure, and ancillary variables describing the topography and position. Training and validation samples were derived from the Vegetation Resources Inventory (VRI), from a pool of polygons with homogeneous internal conditions and with low discrepancies with the remotely sensed predictions. Local models were applied over 100x100 km tiles that considered training samples from the 5x5 neighbouring tiles to avoid edge effects. An overall accuracy of 72% was found for the species which occupy 80% of the forested areas. Satellite data and modeling have demonstrated the capacity for up-to-date, wall-to-wall, forest attribute maps at sub-stand level for British Columbia, Canada.BC Species Likelihood 2015The tree species class membership likelihood distribution data included in this product focused on the province of British Columbia, based upon Landsat data and modeling, with results mapped at 30 m spatial resolution. The data represent tree species class membership likelihood in 2015. The map was generated with the Random Forests classifier that used predictor variables derived from Landsat time series including surface reflectance, land cover, forest disturbance, and forest structure, and ancillary variables describing the topography and position. Training and validation samples were derived from the Vegetation Resources Inventory (VRI) selecting from a stratified pool of polygons with homogeneous internal conditions and with low discrepancies when related to remotely sensed information. Local models were applied over 100x100 km tiles that, to avoid edge effects, considered training samples from the 5x5 neighbouring tiles. An overall accuracy of 72% was found for the species which occupy 80% of the forested areas. As an element of the mapping process, we also obtain the votes received for each class by the Random Forest models. The votes can be understood as analogous to class membership likelihoods, providing enriched information on land cover class uncertainty for use in modeling. Tree species class membership likelihoods lower than 5% have been masked and converted to zero.When using this data, please cite as: Shang, C., Coops, N.C., Wulder, M.A., White, J.C., Hermosilla, T., 2020. Update and spatial extension of strategic forest inventories using time series remote sensing and modeling. International Journal of Applied Earth Observation and Geoinformation 84, 101956. DOI: 10.1016/j.jag.2019.101956 ( Shang et al. 2020).
Kernel Density Analyses of Coral and Sponge Catches from Research Vessel Survey Data (2016)
Kernel density estimation (KDE) utilizes spatially explicit data to model the distribution of a variable of interest. It is a simple non-parametric neighbour-based smoothing function that relies on few assumptions about the structure of the observed data. It has been used in ecology to identify hotspots, that is, areas of relatively high biomass/abundance, and in 2010 was used by Fisheries and Oceans Canada to delineate significant concentrations of corals and sponges. The same approach has been used successfully in the Northwest Atlantic Fisheries Organization (NAFO) Regulatory Area. Here, we update the previous analyses with the catch records from up to 5 additional years of trawl survey data from Eastern Canada, including the Gulf of Saint Lawrence. We applied kernel density estimation to create a modelled biomass surface for each of sponges, small and large gorgonian corals, and sea pens, and applied an aerial expansion method to identify significant concentrations of these taxa. We compared our results to those obtained previously and provided maps of significant concentrations as well as point data co-ordinates for catches above the threshold values used to construct the significant area polygons. The borders of the polygons can be refined using knowledge of null catches and species distribution models of species presence/absence and/or biomass.
3D model spatial extents, Groundwater Geoscience Program
The dataset shows the distribution and spatial extent of the 3D models that were created in the context of Canadian aquifers mapping projects from the Geological Survey of Canada.
Distribution areas of terrestrial mammals, reptiles, reptiles, amphibians, and freshwater fish
The data represent the distribution of species of amphibians, reptiles, reptiles, terrestrial mammals and freshwater and migratory fish in Quebec.The files represent:amphibians: 21 speciesreptiles: 17 speciesterrestrial mammals: 69 speciesfreshwater and migratory fish: 118 speciesThe ranges were established on the basis of various sources of information and validated by the Main Directorate of expertise on terrestrial fauna (DPEFT), the Main Directorate for Threatened or Vulnerable Species (DPEMV) and the Main Directorate of Expertise on Aquatic Wildlife (DPEFA) of the Ministry of the Environment, the Fight against Climate Change, Climate Change, Wildlife and Parks (MELCCFP).The ranges of species of _freshwater and migratory fish_ are also illustrated in the [“Freshwater Fish of Quebec”] poster (https://cdn-contenu.quebec.ca/cdn-contenu/faune/documents/animaux/affiche-poissons-eau-douce.pdf). Some ranges have been slightly modified since they were included in the poster.__There may be differences between the ranges of the species shown in the files and the current spatial distribution of the species. __The distribution areas were produced on a small scale; they provide indicative information on the presence of the species in Quebec.The cards are the property of MELCCFP.__Atten:__ The ranges of marine mammals that frequent the coasts of the province of Quebec are not included in this dataset.**This third party metadata element was translated using an automated translation tool (Amazon Translate).**
Tell us what you think!
GEO.ca is committed to open dialogue and community building around location-based issues and
topics that matter to you.
Please send us your feedback