ViT (Vision Transformer) is a transformer model that can be used for image classification. This repository shows how to fine-tune it on your own dataset.
First, install the required Python libraries:

    python -m pip install -r requirements.txt

Create three subfolders: test, train, and val.
Inside each of these three folders, create one folder for each category of your data.
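
The folder layout described above can also be created with a short script instead of by hand. This is a minimal sketch, not part of the repository: the function name `create_layout`, the root folder `data`, and the category names `cats` and `dogs` are all hypothetical placeholders, and the location where runner.py expects the three subfolders is an assumption you should check against the script itself.

```python
from pathlib import Path

# The three split folders named in this README.
SPLITS = ("train", "val", "test")


def create_layout(root, categories):
    """Create root/<split>/<category> for every split and category.

    `root` and `categories` are placeholders; use your own values.
    Returns the list of folders that now exist.
    """
    root = Path(root)
    created = []
    for split in SPLITS:
        for category in categories:
            folder = root / split / category
            # exist_ok=True makes the script safe to run more than once.
            folder.mkdir(parents=True, exist_ok=True)
            created.append(folder)
    return created


if __name__ == "__main__":
    # Hypothetical example: two categories under a "data" root folder.
    for folder in create_layout("data", ["cats", "dogs"]):
        print(folder)
```

After running it, copy your images into the matching category folder of each split.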
Then run the scripts:

    python runner.py
    python inferenceTest.py