Task: We will be focusing on identifying different species of plants, so we need to choose the best dataset to do so. The larger the range of plant species, the better, but we can also focus on plants native to North America/Vancouver if we need to choose a smaller subset.

App idea is similar to this application that I found on Github:

https://github.com/UrbanPlanter/urbanplanterapp

The dataset I would go with is likely the Herbarium one (we would need to significant preprocessing that I could work on for next week - turning all the images into black-and-white, boundary detection, apply different convolutions and edge smoothing etc.)

Herbarium Dataset

https://www.kaggle.com/competitions/herbarium-2022-fgvc9/data

https://www.kaggle.com/code/ihelon/herbarium-2021-exploratory-data-analysis/notebook

Flower Dataset

https://www.kaggle.com/datasets/alxmamaev/flowers-recognition

Leaf Dataset

https://www.kaggle.com/competitions/leaf-classification/data