Skip to content

Commit

Permalink
Merge pull request #37 from mbsantiago/docs
Browse files Browse the repository at this point in the history
Enhance docs
  • Loading branch information
mbsantiago authored Oct 9, 2024
2 parents ce5ea11 + 43d7998 commit 7fa94d6
Show file tree
Hide file tree
Showing 81 changed files with 3,361 additions and 2,452 deletions.
2 changes: 1 addition & 1 deletion Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -6,7 +6,7 @@ BACK_DIR := back
.PHONY: dev dev-front dev-back dev-docs storybook install-dev build-frontend build-guide bundle publish clean

dev:
docker-compose -f compose-dev.yaml up --watch --build frontend backend storybook
docker-compose -f compose-dev.yaml up --watch --build frontend backend

dev-front:
cd $(FRONT_DIR) && npm run dev
Expand Down
16 changes: 16 additions & 0 deletions back/docs/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -32,6 +32,22 @@ Whombat bridges this gap by focusing on the following key requirements:
_Whombat_'s design prioritizes **careful, manual data curation**.
We believe this emphasis on precision will enable the community to build gold-standard bioacoustic datasets, fueling the development of cutting-edge machine learning models.

## New to Whombat?

Get started quickly with these resources:

1. **Install:** Follow the [installation instructions](user_guide/installation.md) to set up Whombat on your machine.
2. **Explore:** Dive into the [user guide](user_guide/index.md) and familiarize yourself with Whombat's features, including:
- [Datasets](user_guide/datasets.md): Learn how to manage and organize your recordings and their metadata.
- [Annotation Projects](user_guide/annotation_projects.md): Create and manage annotation projects.
- [Evaluation Sets](user_guide/evaluation.md): Evaluate model and user predictions and compare against annotated data.
- [Exploration Tools](user_guide/exploration.md): Discover whombat's tools for data exploration.
3. **Import Data:** Get hands-on with example data to experience Whombat's capabilities ([import guide](user_guide/import.md)).
4. **Learn More:** Deepen your understanding with detailed guides:
- [Spectrogram Display](user_guide/spectrogram_display.md): Tune the spectrogram display to visualize your recordings.
- [Annotation Export](user_guide/export.md): Export your annotations for machine learning development.
5. **Questions?** Check out the [FAQ](user_guide/faq.md).

## Learn more

For those keen on delving into the internal workings of _Whombat_, integrating it with Python scripts, or contributing to its development, keep an eye on the upcoming [Developer Guide](developer_guide/index.md).
Expand Down
272 changes: 107 additions & 165 deletions back/docs/user_guide/annotation_projects.md

Large diffs are not rendered by default.

93 changes: 39 additions & 54 deletions back/docs/user_guide/datasets.md
Original file line number Diff line number Diff line change
@@ -1,32 +1,31 @@
# Managing Datasets and Recordings

In Whombat, you have the flexibility to efficiently manage multiple recording
datasets.
In Whombat, you have the flexibility to efficiently manage multiple recording datasets.

!!! info "What is a dataset?"

A dataset, simply put, is a collection of recordings. However, these
recordings typically share a common context, whether it's from a single research
project, a specific field season, or files downloaded from a particular website.
While you can structure your datasets as you prefer, creating coherent datasets
is generally considered good practice.
A **dataset**, simply put, is a collection of recordings.
However, these recordings typically share a common context, whether it's from a single research project, a specific field season, or files downloaded from a particular website.
While you can structure your datasets as you prefer, creating coherent datasets is generally considered good practice.

Navigate to the dataset list using the sidebar or the cards on the homepage.

!!! tip "Explore Whombat with Example Data"

Curious to see how a fully populated dataset looks in Whombat? Import the example dataset, complete with rich metadata and recordings, to explore its features and functionalities.
The [import data guide](import.md) provides step-by-step instructions.

## Selecting a Dataset to Work On

On the dataset list page, you'll find an overview of all available datasets.
Clicking on the name of a dataset takes you to a dedicated page where you can
explore its details and the recordings it contains.
Clicking on the name of a dataset takes you to a dedicated page where you can explore its details and the recordings it contains.

![Dataset List](../assets/img/dataset_list.png)

## Create a dataset

To create a new dataset, click the **+Create** button. You'll be prompted to
provide a name and a description for your dataset. Additionally, paste the full
path of the folder containing the recordings you want to add. Whombat will scan
the folder and automatically include all available audio files it can find.
To create a new dataset, click the **+Create** button.
You'll be prompted to provide a name and a description for your dataset.
Additionally, paste the full path of the folder containing the recordings you want to add.
Whombat will scan the folder and automatically include all available audio files it can find.

??? info "Supported Audio Formats"

Expand All @@ -46,23 +45,18 @@ the folder and automatically include all available audio files it can find.
an external hard drive, copy the recordings you wish to annotate into your home
folder. If you wish to modify this configuration, feel free to reach out.

Whombat automatically extracts fundamental information from each discovered
file, such as recording **duration**, **sampling rate**, and the number of
**audio channels**. While Whombat can store additional details like date, time,
and location of the recording, automatic extraction is not applied due to the
diverse formats in which this information can be stored. Manual input is
required for these specific details.
Whombat automatically extracts fundamental information from each discovered file, such as recording **duration**, **sampling rate**, and the number of **audio channels**.
While Whombat can store additional details like date, time, and location of the recording, automatic extraction is not applied due to the diverse formats in which this information can be stored.
Manual input is required for these specific details.

## Import a dataset

Whombat offers a convenient option to import all dataset information from a
file. Currently, we support datasets in a JSON-based format known as AOEF. Refer
to this [link](https://mbsantiago.github.io/soundevent/) for comprehensive
information about the AOEF format.
Whombat offers a convenient option to import all dataset information from a file.
Currently, we support datasets in a JSON-based format known as AOEF.
Refer to this [link](https://mbsantiago.github.io/soundevent/) for comprehensive information about the AOEF format.

To import a dataset, simply click the **import** button. Select the JSON file
you want to upload and provide the path to the directory where the recordings
are stored.
To import a dataset, simply click the **import** button.
Select the JSON file you want to upload and provide the path to the directory where the recordings are stored.

??? note "What path should I provide?"

Expand All @@ -73,22 +67,18 @@ are stored.

## Dataset Home Page

Upon selecting or creating a dataset, you'll be directed to the dataset detail
page, offering a comprehensive overview of your dataset. Here, you'll find key
statistics, including the total number of recordings. Additionally, the page
provides insights into potential issues with files or their metadata, along with
a detailed breakdown of the recording count.
Upon selecting or creating a dataset, you'll be directed to the dataset detail page, offering a comprehensive overview of your dataset.
Here, you'll find key statistics, including the total number of recordings.
Additionally, the page provides insights into potential issues with files or their metadata, along with a detailed breakdown of the recording count.

![Dataset Dashboard](../assets/img/dataset_dashboard.png)

From this dashboard, you can easily handle your dataset in various ways. **A)**
shows the total number of recordings, while **B)** points out any issues found
for specific recordings. **C)** Highlights recordings that were initially
registered but are currently missing from your files, offering essential
visibility. **D)** Gives a breakdown of the recordings based on tags, helping
you organize efficiently. **E)** Showcases the latest recording issues and
notes. Lastly, **F)** provides detailed information about the dataset and allows
for easy editing.
From this dashboard, you can easily handle your dataset in various ways.
**A)** shows the total number of recordings, while **B)** points out any issues found for specific recordings.
**C)** Highlights recordings that were initially registered but are currently missing from your files, offering essential visibility.
**D)** Gives a breakdown of the recordings based on tags, helping you organize efficiently.
**E)** Showcases the latest recording issues and notes.
Lastly, **F)** provides detailed information about the dataset and allows for easy editing.

!!! warning "Deleting the dataset"

Expand All @@ -109,11 +99,9 @@ for easy editing.

## Manage the Recordings Metadata

To access the table of recordings, simply click on the Recordings button in the
navigation header. Within this table, you can conveniently edit the metadata of
your recordings in a tabular format. This includes manually inputting the date
and time of recording, specifying the location where the recording was made, and
most importantly, adding any number of tags to your recordings.
To access the table of recordings, simply click on the Recordings button in the navigation header.
Within this table, you can conveniently edit the metadata of your recordings in a tabular format.
This includes manually inputting the date and time of recording, specifying the location where the recording was made, and most importantly, adding any number of tags to your recordings.

![Recording Table](../assets/img/recordings_table.png)

Expand Down Expand Up @@ -147,11 +135,9 @@ most importantly, adding any number of tags to your recordings.

## See the Recording Spectrogram

If you want to view the spectrogram of a recording, simply click on the path of
the recording. This will lead you to a dedicated page where you can access all
the recording information, visualize the spectrogram, and play the recording.
Additionally, you have the option to edit the recording metadata directly from
this page.
If you want to view the spectrogram of a recording, simply click on the path of the recording.
This will lead you to a dedicated page where you can access all the recording information, visualize the spectrogram, and play the recording.
Additionally, you have the option to edit the recording metadata directly from this page.

![Recording Detail](../assets/img/recording_detail.png)

Expand All @@ -169,6 +155,5 @@ this page.

## Ready to start annotating

Now that you've registered a dataset and ensured its metadata is accurate,
you're all set to begin annotating your audio material. Head to the next section
to learn how to create and manage an annotation project.
Now that you've registered a dataset and ensured its metadata is accurate, you're all set to begin annotating your audio material.
Head to the next section to learn how to create and manage an annotation project.
Loading

0 comments on commit 7fa94d6

Please sign in to comment.