Dataset representation

Each dataset is represented by a pandas dataframe. Each row corresponds to one entry, which depicts one individual animal. Usually one entry corresponds to one image and one animal. However, sometimes, there are multiple animals in an image and then one image may generate multiple entries differentiated by additional information such as bounding box. Columns are descriptions of the entry.

Required columns

The following three columns must be part of all dataframes.

Column	Type	Description
image_id	`int` or `str`	Unique id of the entry.
identity	`int` or `str`	Identity (or label) of the depicted individual animal.
path	`str`	Relative path to the image.

There is a special value for identity which describes an unknown individual. Its default value for unknown animals is

from wildlife_datasets import datasets

datasets.WildlifeDataset.unknown_name

unknown

When a dataset contains unknown inidividuals, there identities should be changed to the default value.

Optional columns

The following columns may be present in the dataframe. Besides these columns, it is possible to define additional columns.

Column	Type	Description
bbox	`List[float]`	Bounding box in the form [x, y, w, h]. Therefore, the topleft corner has coordinates [x, y], while the bottomright corner has coordinates [x+w, y+h].
date	special	Timestamp of the photo. The preferred format is `%Y-%m-%d %H:%M:%S` from the `datetime` package but it is sufficient to be amenable to `pd.to_datetime(x)`.
keypoints	`List[float]`	Keypoints coordinates in the image such as eyes or joints.
position	`str`	Position from each the photo was taken. The usual values are left and right.
segmentation	`List[float]` or special	Segmentation mask in the form [x1, y1, x2, y2, ...]. Additional format are possible such as file path to a mask image, or `pytorch` RLE.
species	`str` or `List[str]`	The depicted species for datasets with multiple species.
video	`int`	The index of a video.

Metadata

Besides the dataframe, each dataset also contains some metadata. The metadata are saved in a separate csv file, which currently contains the following information. All entries are optional.

Column	Description
name	Name of the dataset.
licenses	License file for the dataset.
licenses_url	URL for the license file.
url	URL for the dataset.
cite	Citation in Google Scholar type of the paper.
animals	List of all animal species in the dataset.
animals_simple	List of all animal species in the dataset.
real_animals	Determines whether the dataset contains real animals.
year	Publication year of the dataset.
reported_n_total	The reported number of total animals.
reported_n_identified	The reported number of identified animals.
reported_n_photos	The reported number of photos.
eported_n_individuals	The reported number of individuals.
wild	Determines whether the photos were taken in the wild.
clear_photos	Determines whether the photos are clear.
pose	Determines whether the photos have one orientation (single), two orientation such as left and right flanks (double) or more (multiple).
unique_pattern	Determines whether the animals have unique features (fur patern, fin shape) for recognition.
from_video	Determines whether the dataset was created from photos or videos.
cropped	Determines whether the photos are cropped.
span	The span of the dataset (the time difference between the last and first photos).
size	Size of the zipped datasets (in MB).