File webroot/data.md changed (mode: 100644) (index 929aa71..14d2ee1) |
... |
... |
Note that the database is not available in it's native DOS based format, as crea |
10 |
10 |
|
|
11 |
11 |
[whoi.report]: https://whoicf2.whoi.edu/science/B/whalesounds/WHOI-92-31.pdf |
[whoi.report]: https://whoicf2.whoi.edu/science/B/whalesounds/WHOI-92-31.pdf |
12 |
12 |
|
|
13 |
|
### Database fields |
|
14 |
|
|
|
15 |
|
Total number of records 15254 |
|
16 |
|
|
|
17 |
|
| Field | Description | Missing | Unique | Multi-Valued | Note | |
|
18 |
|
|-------+-------------+---------+--------+--------------|------| |
|
19 |
|
| RN | Record Number | 0 | 15254 | No | Record Number: always present, all unique | |
|
20 |
|
| CU | Cue | 89 | | No | Cue: 12631 contain "B" buffer size flag | |
|
21 |
|
| NC | Number of Audio Channels | 8 | 40 | No | Some invalid formatting | |
|
22 |
|
| SR | Sample Rate | 5 | 48 | No | Some with dot as delimiter and mixed khz hz writing | |
|
23 |
|
| CS | Cut Size | 11 | 6131 | No | Mostly seconds (n.n+), some with minutes (n:n.n+) | |
|
24 |
|
| PL | Recorder | 7 | 253 | No | | |
|
25 |
|
| SC | signal class| 952 | 26 | No | quality not always present, flags in no order | |
|
26 |
|
| ID | vocal animal id | 13338 | 18 | Yes | species code not always present | |
|
27 |
|
| AG | age | 14769 | 13 | Yes | using ? as placeholder if age is unknown, species code might be name | |
|
28 |
|
| IA | interaction | 15211 | 5 | Yes | multiple interactions with pipe separated, always in pairs | |
|
29 |
|
| GS | genus | 0 | 307 | Yes | pipe separated, other species codes X / O / E | |
|
30 |
|
| GA | geo A code | 20 | 194 | Yes | pipe separated | |
|
31 |
|
| OD | observation date | 0 | 496 | Yes | pipe separated | |
|
32 |
|
| NT | note | 4 | 5398 | No | free text | |
|
33 |
|
| DA | record date | 30 | 437 | No | Month written 3 or 4 letters, some extra noise | |
|
34 |
|
| IP | ID of con present | 15 records | 2 | Yes | pipe separated | |
|
35 |
|
| AG | age of con present | 15 records | 2 | | | |
|
36 |
|
| BH | behavior | 2442 records | 48 | No | some variation/free text, normalize? | |
|
37 |
|
| OS | other species | 3995 records | 75 | Yes | pipe separated, not vocalizing species? | |
|
38 |
|
| NA | number of animals vocalizing | 14889 records | 420 | Yes | ranges 1-2, or 1+, handle space, pipe separated, some noise | |
|
39 |
|
| GB | Geo B | 13354 records | 362 | |
|
40 |
|
| GC | Geo C | 13910 records | 224 | Yes | pipe separated | |
|
41 |
|
| OT | observation time | 7141 records | ? | Yes | sometimes range nnnn - nnnn, pipe or ; separated | |
|
42 |
|
| SH | ship | 13675 records | 62 | |
|
43 |
|
| AU | author | 14204 records | 58 | Yes | pipe separated | |
|
44 |
|
| LO | 16 | |
|
45 |
|
| HY | 8075 | |
|
46 |
|
| RC | 9524 | |
|
47 |
|
| RG | 2208 | |
|
48 |
|
| SL | 15253 | |
|
49 |
|
| ST | 1648 | |
|
50 |
|
|
|
|
13 |
|
### Database Fields |
|
14 |
|
|
|
15 |
|
The following table describes each database field (work in progress). |
|
16 |
|
|
|
17 |
|
| Field | Description | Records | Missing | Unique | Multi-Valued | Note | |
|
18 |
|
|------:+-------------+--------:+--------:+-------:+--------------+------| |
|
19 |
|
| RN | Record Number | 15254 | 0 | 15254 | No | Always present, all unique | |
|
20 |
|
| CU | Cue | | 89 | | No | 12631 contain "B" buffer size flag | |
|
21 |
|
| NC | Number of Audio Channels | | 8 | 40 | No | Some invalid formatting | |
|
22 |
|
| SR | Sample Rate | | 5 | 48 | No | Some with dot as delimiter and mixed khz hz writing | |
|
23 |
|
| CS | Cut Size | | 11 | 6131 | No | Mostly seconds (n.n+), some with minutes (n:n.n+) | |
|
24 |
|
| PL | Recorder | | 7 | 253 | No | | |
|
25 |
|
| SC | Signal Class| | 952 | 26 | No | quality not always present, flags in no order | |
|
26 |
|
| ID | Vocal Animal ID | | 13338 | 18 | Yes | species code not always present | |
|
27 |
|
| AG | Age | | 14769 | 13 | Yes | using ? as placeholder if age is unknown, species code might be name | |
|
28 |
|
| IA | Interaction | | 15211 | 5 | Yes | multiple interactions with pipe separated, always in pairs | |
|
29 |
|
| GS | Genus | | 0 | 307 | Yes | pipe separated, other species codes X / O / E | |
|
30 |
|
| GA | Geo A Code | | 20 | 194 | Yes | pipe separated | |
|
31 |
|
| OD | Observation Date | | 0 | 496 | Yes | pipe separated | |
|
32 |
|
| NT | Note | | 4 | 5398 | No | Free Text | |
|
33 |
|
| DA | Record Date | | 30 | 437 | No | Month written 3 or 4 letters, some extra noise | |
|
34 |
|
| IP | ID of con present | 15 | | 2 | Yes | pipe separated | |
|
35 |
|
| AG | Age of con present | 15 | | 2 | | | |
|
36 |
|
| BH | Behavior | 2442 | | 48 | No | some variation/free text, normalize? | |
|
37 |
|
| OS | Other Species | 3995 | | 75 | Yes | pipe separated, not vocalizing species? | |
|
38 |
|
| NA | Number of Animals Vocalizing | 14889 | | 420 | Yes | ranges 1-2, or 1+, handle space, pipe separated, some noise | |
|
39 |
|
| GB | Geo B | 13354 | | 362 | | | |
|
40 |
|
| GC | Geo C | 13910 | | 224 | Yes | pipe separated | |
|
41 |
|
| OT | Observation Time | 7141 | | | Yes | sometimes range nnnn - nnnn, pipe or ; separated | |
|
42 |
|
| SH | Ship | 13675 | | 62 | No | | |
|
43 |
|
| AU | Author | 14204 | | 58 | Yes | pipe separated | |
|
44 |
|
| LO | | 16 | | | | | |
|
45 |
|
| HY | | 8075 | | | | | |
|
46 |
|
| RC | | 9524 | | | | | |
|
47 |
|
| RG | | 2208 | | | | | |
|
48 |
|
| SL | | 15253 | | | | | |
|
49 |
|
| ST | | 1648 | | | | | |
|
50 |
|
|
|
51 |
|
### Examples and Transformations |
|
52 |
|
|
|
53 |
|
(TODO) |
|
54 |
|
|
|
55 |
|
#### GS / Genus |
|
56 |
|
|
|
57 |
|
(TODO) |
|
58 |
|
|