Formatting for incorporation into JEDI
http://data.calcofi.net/zooplankton.html
Save data from CalCOFI database
Import into Excel
Change month number to word
Change Day/Night
Day to 12:00:00 (Day was classified as the time between Nautical sunrise and sunset)
Night to 00:00:00 (Day was classified as the time between Nautical sunset and sunrise)
Removed Region, Tow, and Pooled/Unpooled data. Seen as irrelevant.
Added
Net Opening, found in Materials and Methods
Net Mesh, found in net information
Integrated depth, included in dataset obtained from ZooDB as search parameter
Numeric integrated, included in dataset obtained from ZooDB as search parameter
Project Title, CalCOFI Zooplankton Data Base
Project Owner, California Cooperative Oceanic Fisheries Investigations
Contact, Dr. Mark D. Ohman
Taxon, provided in ZooDB as search parameter
Phylum, found on http://www.itis.gov, when applicable
Class, found on http://www.itis.gov, when applicable
Order, found on http://www.itis.gov, when applicable
Family, found on http://www.itis.gov, when applicable
Genus, found on http://www.itis.gov, when applicable
Species, some provided in ZooDB as search parameter
Data Type, Quantitative based on numeric abundance of Zooplankton
Collection Method, plankton net
6 Nov 2010 - Kelly Robinson
Presence_Absence JEDI field
#All records with '0.00' values in the fields 'numeric_density' and 'depth_integrated_density' were denoted as 'absent.'
#All records with values greater than '0.00' in the fields 'numeric_density' and 'depth_integrated_density' were denoted as 'present.'
Formatting for JEDI
#Changed owner from Dr. Marc Ohman to 'California Cooperative Oceanic Fisheries Investigations.'
#Ran the 'jedi-prep' R validation script.
#Kicked back error message about the 'is_public' field failing a 'boolan' test. Couldn't find anything wrong with field or values in column. Not sure what that means so wrote Jim Regetz. Jim's response: First, the data set is failing a 'boolean test,' not a 'boolan' test. Whoops for typo in program. Second, data set failed validation because the only values accepted in the 'is_public' field are (T, TRUE, F, FALSE).
#Records with no values in 'numeric_density" and "depth_integrated_density" in the Excel file "CalCOFI_JEDI_dataset_6Nov2010_KR.xls" deleted. Looked like they left over duplicates from when Stacy combined records with only 'numeric_density' or 'depth_integrated_density' values into a single record that had values for both fields.
#Deleted records saved in a separate Excel file 'Deleted records from CalCOFI_JEDI_dataset_6Nov2010_KR.xls"
#Added 13 records of Salpa maxima from day sampling that were missing.
#Removed genus names from 'species' field.
#Numeric_density values in original data set from the web are given as (no/1000m3). Divided all values by 1000 to yield (no/m3).