Dataset Identification:

Resource Abstract:
description: Data presented here are a subset of a larger plankton imagery data set collected in the subtropical Straits of Florida from 2014-05-28 to 2014-06-14. Imagery data were collected using the In Situ Ichthyoplankton Imaging System (ISIIS-2) as part of an NSF-funded project to assess the biophysical drivers affecting fine-scale (spatial) interactions between larval fish, their prey, and predators in a subtropical pelagic marine ecosystem. This subset of images was used in the inaugural National Data Science Bowl (www.datasciencebowl.com), a machine learning competition hosted by Kaggle and sponsored by Booz Allen Hamilton. Image segments extracted from the raw data were sorted into 121 plankton classes, split 50:50 into train and test data sets, and provided for the competition. No hierarchical relationships were explicit in the 121 plankton classes, though the class naming convention and a tree-like diagram (see file "Plankton Relationships.pdf") indicated relationships between classes, whether taxonomic or structural (size and shape). We intend for this dataset to be available to the machine learning and computer vision community as a standard machine learning benchmark. This “Plankton 1.0” dataset is a medium-size dataset with a fair amount of complexity where image classification improvements can still be made.
Citation
Title PlanktonSet 1.0: Plankton imagery data collected from F.G. Walton Smith in Straits of Florida from 2014-06-03 to 2014-06-06 and used in the 2015 National Data Science Bowl (NCEI Accession 0127422).
creation  Date   2018-02-08T01:11:33.144897
Resource language:
Processing environment:
Digital Transfer Options
Linkage for online resource
name Dublin Core references URL
URL:https://doi.org/10.7289/V5D21VJD
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Linkage for online resource
name Dublin Core references URL
URL:https://accession.nodc.noaa.gov/oas/127422
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Linkage for online resource
name Dublin Core references URL
URL:https://accession.nodc.noaa.gov/download/127422
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Linkage for online resource
name Dublin Core references URL
URL:https://www.ncei.noaa.gov/
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Linkage for online resource
name Dublin Core references URL
URL:http://hmsc.oregonstate.edu/
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Linkage for online resource
name Dublin Core references URL
URL:http://www.rsmas.miami.edu
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Linkage for online resource
name Dublin Core references URL
URL:http://gcmd.gsfc.nasa.gov/learn/keywords.html
protocol WWW:LINK-1.0-http--link
link function information
Description URL provided in Dublin Core references element.
Metadata date stamp:  2018-08-06T20:43:04Z
Resource Maintenance Information
maintenance or update frequency:
notes: This metadata record was generated by an XSLT transformation from a Dublin Core (DC) metadata record. Transform by Stephen M. Richard, based on a transform by Damian Ulbricht. Run on 2018-08-06T20:43:04Z.
Metadata contact - pointOfContact
organisation Name  CINERGI Metadata catalog
Contact information
Address
electronic Mail Address  cinergi@sdsc.edu
Metadata language  eng
Metadata character set encoding:   utf8
Metadata standard for this record:  ISO 19139 Geographic Information - Metadata - Implementation Specification
standard version:  2007
Metadata record identifier:  urn:dciso:metadataabout:8c3a7a46-e9a6-4648-8fb9-9cd7fd96fd2e

Metadata record format is ISO19139 XML (MD_Metadata)