dleucas / wmmsdb (public) (License: GPLv3) (since 2018-07-08) (hash sha1)
A collection of scripts to download, transform and normalize the Watkins Marine Mammal Sound Database.

Credit:

“Watkins Marine Mammal Sound Database, Woods Hole Oceanographic Institution.”

http://cis.whoi.edu/science/B/whalesounds/index.cfm
List of commits:
Subject Hash Author Date (UTC)
fix: spelling f9f762ebe566a72f3f1a1f00d5025b33d353f53b dleucas 2022-11-18 02:10:43
add var fields to schema and search 9f81adb53860c282edb0ce4d2035d7c6cef50d9c dleucas 2021-12-28 17:04:20
extract and transform signal type 862c51f027d9a8a94d7015874c854f2b610dd259 dleucas 2021-12-28 14:11:40
extract number of vocal animals 0098cbc1d57cef5d616f649b8e1ea6681027a5ce dleucas 2021-12-22 20:23:38
extract and transform playback gear and settings 875fd6790e7ea524dd0b9a95673e00233ea22375 dleucas 2021-12-22 16:48:12
add last updated date to about page ae4dde781b3f24b1ba9414cb963a6828d5446642 dleucas 2021-12-22 12:54:32
link fix e67dd97c20bab60e5166909156407016ee561f1c dleucas 2021-12-22 12:46:24
add recording gear 5751b54f70ce451374b467e5a3a227cecc81b0aa dleucas 2021-12-22 12:43:29
add recording conditions c5603786d17346690a789c5392137fd9afd78b28 dleucas 2021-12-22 12:18:07
add signal level with comment on missing data af643d86e29e82e0879eaf899e6bfea2a505adab dleucas 2021-12-22 11:20:28
docu updates 0912bbabf50c7c106fc45f623497ca87f7503377 dleucas 2021-12-22 09:54:28
extract storage location b73fc943e55d51fd3df9fe26d46220b67d2efe75 dleucas 2021-12-19 18:32:04
extract and transform author names 0ff3dd29773a1b93c7a893c4971dda5522fa488b dleucas 2021-12-19 17:56:35
extract and transform ship names 12eb822d4ae70fbd2f01b0f03a77eee5e0d9a2b7 dleucas 2021-12-19 14:40:54
extract and transform observation time 595eb911201f8697c9c5f1aa1c40ee502e8009c2 dleucas 2021-12-19 12:04:29
formating f82ed6a3f1c0dccfc50aaa2a0fa3e866f5741acf dleucas 2021-12-18 23:18:45
update readme for GeoJSON command e5c1a6f49ea17a8860cecffdcf280d1d2fdb3ac2 dleucas 2021-12-18 23:13:31
add species names to GeoJSON and World Map 86af1d9597c3cb3c58f71c777902148d2ce333f8 dleucas 2021-12-18 22:51:11
re-worked GeoJSON from transformed JSON. identical output. added license 8ec5e1678966c6b1c441d035a6833f55586325ec dleucas 2021-12-18 22:01:19
remove genus, add species common and scientific names, display names in record details e2025219ade78f9ac2e6dfd81c479d242b9db24b dleucas 2021-12-18 17:18:28
Commit f9f762ebe566a72f3f1a1f00d5025b33d353f53b - fix: spelling
Author: dleucas
Author date (UTC): 2022-11-18 02:10
Committer name: dleucas
Committer date (UTC): 2022-11-18 02:10
Parent(s): 9f81adb53860c282edb0ce4d2035d7c6cef50d9c
Signing key:
Tree: 52a25ba7f48aa2cf68e7863646a50abc648b7d32
File        Lines added  Lines deleted
DATA.md     4            5
README.md   1            1
File DATA.md changed (mode: 100644) (index fa6fa76..4b92f96)
... ... Total number of records 15254
 5  5   RN: Always present, all unique
 6  6   CU / Cue: 89 missing, 12631 contain "B" buffer size flag
 7  7   NC / channels: 8 missing, 40 unique values, some invalid format
 8    - SR / sample rate: 5 missing, 48 unique values, some with . delemiter, mixed khz hz writing
    8 + SR / sample rate: 5 missing, 48 unique values, some with . delimiter, mixed kHz Hz writing
 9  9   CS / cut size: 11 missing, 6131 values, mostly seconds (n.n+), some with minutes (n:n.n+)?
10    - PL / recorder: 7 missing, 253 unqiue values
   10 + PL / recorder: 7 missing, 253 unique values
11 11   SC / signal class, 952 missing, 26 unique values, quality not always present, flags in no order
12 12   ID / vocal animal id: 13338 missing, 18 unique values, species code not always present
13 13   AG / age: 14769 missing, 13 unique values, using ? as placeholder if age is unknown, species code might be name
14 14   IA / interaction: 15211 missing, 5 unique values, multiple interaction with | separated, always pairs
15 15   GS / genus: always present, 307 unique values, | separated, other species codes X / O / E
16    - GA / geo A code: 20 missing, 194 unique values, | separated
   16 + GA / Geo A code: 20 missing, 194 unique values, | separated
17 17   OD / observation date, always present, 496 unique values, | separated
18 18   NT / note: 4 missing, 5398 unique values, free text
19 19   DA / record date: 30 missing, 437 unique values, Month written 3 or 4 letters, some extra noise
20 20   IP / ID of con present: 15 records, 2 unique values, | separated
21 21   AG / age of con present: 15 records, 2 unique values
22    - BH / behavior: 2442 records, 48 unqiue values, some variation/free text, normalize?
   22 + BH / behavior: 2442 records, 48 unique values, some variation/free text, normalize?
23 23   OS / other species: 3995 records, 75 unique values, | separated, not vocalizing species?
24 24   NA / number of animals vocalizing: 14889 records, 420 unique values, ranges 1-2, or 1+, handle space, | separated, some noise
25 25   GB / Geo B: 13354 records, 362 unique values

... ... RC: 9524 missing
33 33   RG: 2208 missing
34 34   SL: 15253 missing
35 35   ST: 1648 missing
36    -
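
The CS / cut size note in DATA.md describes two value shapes: plain seconds (n.n+) and, tentatively, minutes with seconds (n:n.n+). A minimal Python sketch of how such values might be normalized to seconds is shown here. The function name `parse_cut_size` and the regular expression are illustrative assumptions, not code from this repository, and the minutes reading follows the note's own "?" rather than a confirmed rule.

```python
import re
from typing import Optional

# Assumed value shapes, based only on the DATA.md note for CS / cut size:
#   "n.n+"   -> seconds, e.g. "12.5"
#   "n:n.n+" -> minutes:seconds, e.g. "1:02.5" (DATA.md marks this with "?")
_CUT_SIZE = re.compile(r"^(?:(?P<minutes>\d+):)?(?P<seconds>\d+(?:\.\d+)?)$")


def parse_cut_size(raw: Optional[str]) -> Optional[float]:
    """Return the cut size in seconds, or None if missing or unrecognized."""
    if raw is None:
        return None
    match = _CUT_SIZE.match(raw.strip())
    if match is None:
        # Unknown shape: leave it for manual review instead of guessing.
        return None
    minutes = int(match.group("minutes") or 0)
    return minutes * 60 + float(match.group("seconds"))
```

Returning None for anything that does not match keeps the 11 missing values and any noisy entries visible for later inspection instead of silently coercing them.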
File README.md changed (mode: 100644) (index 843ceb9..e510d9c)
 1  1   # Watkins Marine Mammal Sound Database, Woods Hole Oceanographic Institution
 2  2
 3    - ## Remasterd Delux Edition
    3 + ## Remastered Deluxe Edition
 4  4
 5  5   - [Data Source](http://cis.whoi.edu/science/B/whalesounds/fullCuts.cfm)
 6  6   - [Source Code](https://codeberg.org/dleucas/wmmsdb)
Hints:
Before your first commit, do not forget to set up your Git environment:
git config --global user.name "your_name_here"
git config --global user.email "your@email_here"

Clone this repository using HTTP(S):
git clone https://rocketgit.com/user/dleucas/wmmsdb

Clone this repository using SSH (do not forget to upload a key first):
git clone ssh://rocketgit@ssh.rocketgit.com/user/dleucas/wmmsdb

Clone this repository using the Git protocol:
git clone git://git.rocketgit.com/user/dleucas/wmmsdb

You are allowed to anonymously push to this repository.
This means that your pushed commits will automatically be transformed into a merge request:
... clone the repository ...
... make some changes and some commits ...
git push origin main