Given images of people walking through a doorway, the project's aim was to recognize, in real time, the different dresses worn by the people entering. Two approaches were explored. The first was to use a person detector to find a bounding box for each person and then run a classification model on the contents of that box. This idea was scrapped in favour of building a one-shot detector that directly detects all the different dresses present in the image. In particular, this project achieves the following:
An important point to note is that the model works best on images of a particular orientation and type, namely those taken looking straight at the dresses. The report that details the entire project can be found here.
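For reference, the two-stage approach described above (a person detector followed by a classifier on each bounding box), which was dropped in favour of the one-shot detector, can be sketched roughly as follows. This is only a minimal illustration, not the project's code: `detect_people_stub` and `classify_dress_stub` are hypothetical placeholders for real models, the image is a plain nested list of grayscale values, and the labels `"dark"` and `"light"` are invented for the example.

```python
from typing import Callable, List, Sequence, Tuple

Box = Tuple[int, int, int, int]        # (x_min, y_min, x_max, y_max) in pixels
Image = Sequence[Sequence[int]]        # rows of grayscale values (stand-in for a real image)


def crop(image: Image, box: Box) -> List[List[int]]:
    """Return the pixels inside the bounding box."""
    x0, y0, x1, y1 = box
    return [list(row[x0:x1]) for row in image[y0:y1]]


def two_stage_pipeline(
    image: Image,
    detect_people: Callable[[Image], List[Box]],
    classify_dress: Callable[[Image], str],
) -> List[Tuple[Box, str]]:
    """Stage 1: find one box per person. Stage 2: classify each crop."""
    return [(box, classify_dress(crop(image, box))) for box in detect_people(image)]


# Hypothetical stand-ins for trained models (assumptions, for illustration only).
def detect_people_stub(image: Image) -> List[Box]:
    """Pretend there is one person in each half of the image."""
    height, width = len(image), len(image[0])
    return [(0, 0, width // 2, height), (width // 2, 0, width, height)]


def classify_dress_stub(patch: Image) -> str:
    """Label a crop by its mean brightness; a real classifier would be a trained model."""
    pixels = [value for row in patch for value in row]
    return "dark" if sum(pixels) / len(pixels) < 128 else "light"


if __name__ == "__main__":
    toy_image = [[10, 10, 200, 200] for _ in range(4)]
    for box, label in two_stage_pipeline(toy_image, detect_people_stub, classify_dress_stub):
        print(box, label)
```

A one-shot detector replaces both stages with a single model that outputs boxes and dress labels together in one pass over the image.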