Skip to content

Dataset representation

Each dataset is represented by a pandas dataframe. Each row corresponds to one entry, which depicts one individual animal. Usually one entry corresponds to one image and one animal. However, sometimes, there are multiple animals in an image and then one image may generate multiple entries differentiated by additional information such as bounding box. Columns are descriptions of the entry.

Required columns

The following three columns must be part of all dataframes.

Column Type Description
image_id int or str Unique id of the entry.
identity int or str Identity (or label) of the depicted individual animal.
path str Relative path to the image.

There is a special value for identity which describes an unknown individual. Its default value for unknown animals is

from wildlife_datasets import datasets

datasets.WildlifeDataset.unknown_name
unknown

When a dataset contains unknown inidividuals, the identity entry should be changed to the default value described above.

Optional columns

The following columns may be present in the dataframe. Besides these columns, it is possible to define additional columns.

Column Type Description
bbox List[float] Bounding box in the form [x, y, w, h]. Therefore, the topleft corner has coordinates [x, y], while the bottomright corner has coordinates [x+w, y+h].
date special Timestamp of the photo. The preferred format is %Y-%m-%d %H:%M:%S from the datetime package but it is sufficient to be amenable to pd.to_datetime(x).
keypoints List[float] Keypoints coordinates in the image such as eyes or joints.
position str Position from which each photo was taken. The usual values are left and right.
segmentation List[float] or special Segmentation mask in the form [x1, y1, x2, y2, ...]. Additional format are possible such as file path to a mask image, or pytorch RLE.
species str or List[str] The depicted species for datasets with multiple species.
video int The index of a video.

Metadata

Besides the dataframe, each dataset also contains some metadata. All entries are optional.

Column Description
licenses License file for the dataset.
licenses_url URL for the license file.
url URL for the dataset.
cite Citation in Google Scholar type of the paper.
animals List of all animal scientific names in the dataset.
animals_simple List of all animal common names in the dataset.
real_animals Determines whether the images are of real animals as opposed to computer generated image.
year Publication year of the dataset.
reported_n_total The reported number of total animals.
reported_n_individuals The reported number of individuals.
wild Determines whether the environment that it was photographed in is wild.
clear_photos Determines whether the database is quality-controlled such that image quality is consistent across all images.
pose Determines whether the photos have one orientation (single), two orientation such as left and right flanks (double) or more (multiple).
unique_pattern Determines whether the animals have unique features (fur patern, fin shape) for recognition.
from_video Determines whether the dataset was created from photos or videos.
cropped Determines whether the photos are cropped.
span The span of the dataset (the time difference between the last and first photos).
size Size of the zipped datasets (in MB).