Task: We will be focusing on identifying different species of plants, so we need to choose the best dataset to do so. The larger the range of plant species, the better, but we can also focus on plants native to North America/Vancouver if we need to choose a smaller subset.
App idea is similar to this application that I found on Github:
https://github.com/UrbanPlanter/urbanplanterapp
The dataset I would go with is likely the Herbarium one (we would need to significant preprocessing that I could work on for next week - turning all the images into black-and-white, boundary detection, apply different convolutions and edge smoothing etc.)
Herbarium Dataset
https://www.kaggle.com/competitions/herbarium-2022-fgvc9/data
https://www.kaggle.com/code/ihelon/herbarium-2021-exploratory-data-analysis/notebook
Flower Dataset
https://www.kaggle.com/datasets/alxmamaev/flowers-recognition
Leaf Dataset
https://www.kaggle.com/competitions/leaf-classification/data