dleucas / wmmsdb (public) (License: GPLv3) (since 2018-07-08) (hash sha1)
A collection of scripts to download, transform and normalize the Watkins Marine Mammal Sound Database.

Credit:

“Watkins Marine Mammal Sound Database, Woods Hole Oceanographic Institution.”

http://cis.whoi.edu/science/B/whalesounds/index.cfm

/DATA.md (fa6fa76919a7cf93fba42f24e05ce897839c8bd2) (2019 bytes) (mode 100644) (type blob)

Notes on the data

Total number of records 15254

RN: Always present, all unique 
CU / Cue: 89 missing, 12631 contain "B" buffer size flag
NC / channels: 8 missing, 40 unique values, some invalid format
SR / sample rate: 5 missing, 48 unique values, some with . delemiter, mixed khz hz writing
CS / cut size: 11 missing, 6131 values, mostly seconds (n.n+), some with minutes (n:n.n+)?
PL / recorder: 7 missing, 253 unqiue values
SC / signal class, 952 missing, 26 unique values, quality not always present, flags in no order
ID / vocal animal id: 13338 missing, 18 unique values, species code not always present
AG / age: 14769 missing, 13 unique values, using ? as placeholder if age is unknown, species code might be name
IA / interaction: 15211 missing, 5 unique values, multiple interaction with | separated, always pairs
GS / genus: always present, 307 unique values, | separated, other species codes X / O / E
GA / geo A code: 20 missing, 194 unique values, | separated
OD / observation date, always present, 496 unique values, | separated
NT / note: 4 missing, 5398 unique values, free text
DA / record date: 30 missing, 437 unique values, Month written 3 or 4 letters, some extra noise
IP / ID of con present: 15 records, 2 unique values, | separated
AG / age of con present: 15 records, 2 unique values
BH / behavior: 2442 records, 48 unqiue values, some variation/free text, normalize?
OS / other species: 3995 records, 75 unique values, | separated, not vocalizing species?
NA / number of animals vocalizing: 14889 records, 420 unique values, ranges 1-2, or 1+, handle space, | separated, some noise
GB / Geo B: 13354 records, 362 unique values
GC / Geo C: 13910 records, 224 unique values, | separated
OT / observation time: 7141 records, sometimes range nnnn - nnnn, | or ; separated
SH / ship: 13675 records, 62 unique values
AU / author: 14204 records, 58 unique values, | separated
LO / storage location: 16 missing
HY: 8075 missing
RC: 9524 missing
RG: 2208 missing
SL: 15253 missing
ST: 1648 missing



Mode Type Size Ref File
100644 blob 11 dea9097138acee79ce13ef95ecaed594eea1fc34 .gitignore
100644 blob 2019 fa6fa76919a7cf93fba42f24e05ce897839c8bd2 DATA.md
100755 blob 1964 77e9f5c9baaa1276ae068eeb31f51d0fe7ca4893 GeoJSON.jq
100644 blob 864 e0082817c39b573df78e0ed2d3e2d4cfc9255036 README.md
100644 blob 865 6ac29799fea3cd2dd8c0e8116a12e6da93809572 TODO.md
100755 blob 1509 d154c6171b793c0b32b8f3020b6703ca4fdcc1ee download.sh
100644 blob 218 9100d4eb109a354733264a3b989d0de699db3c9c index.jq
100644 blob 4648 8029c7e3c56f4c61e38ec0abc8135bc1a063071b index.mapping.json
100755 blob 482 7abb8bcf9a49d7c849ab50538e47c9033a921f28 index.sh
040000 tree - 60e19bf3f6e7f2fba709ae362e565cf2df36ac29 snd
040000 tree - 6e02959705bd738b6776f95e9db4ad6e7abdbcd1 srv
100644 blob 8614 475fb0324d694a9e90c19017b971e5dd5b23aa15 transform.jq
100755 blob 1336 93d9e62fb6f1e4b2444fb3c8ded4ea6a856c7c13 transform.sh
040000 tree - 4e40690ecd154bee3b72ed1a53388af8534ddafb webroot
Hints:
Before first commit, do not forget to setup your git environment:
git config --global user.name "your_name_here"
git config --global user.email "your@email_here"

Clone this repository using HTTP(S):
git clone https://rocketgit.com/user/dleucas/wmmsdb

Clone this repository using ssh (do not forget to upload a key first):
git clone ssh://rocketgit@ssh.rocketgit.com/user/dleucas/wmmsdb

Clone this repository using git:
git clone git://git.rocketgit.com/user/dleucas/wmmsdb

You are allowed to anonymously push to this repository.
This means that your pushed commits will automatically be transformed into a merge request:
... clone the repository ...
... make some changes and some commits ...
git push origin main