SeeTree is a leading company in the agritech field, looking to bring technology into the world of farming. The primary focus of SeeTree is to monitor the fruiting status of trees in order to increase their productivity. This requires a system for the automatic detection of ripe fruits using machine learning techniques.
SeeTree partnered with Ciklum to approach this task through deep learning methods considered the state-of-the-art in tackling image recognition tasks.
The prototype with near human-level performance was built in only two months.
Ciklum’s team of skilled R&D experts had to find a solution to overcome the following challenges:
Duplicate photos, images with unlabelled oranges or invalid polygons, etc. were filtered out from the dataset. The resulting dataset was divided into train/validation/test sets stratified by the number of oranges per image.
The best result was obtained with the Faster R-CNN architecture family. This model was a state-of-the-art approach for solving object detection tasks at the time. Faster R-CNN is a two-stage object detection system in which the first stage generates a sparse set of candidate object locations and the second stage classifies each candidate location as one of the foreground classes or as background using a convolutional neural network.
Ciklum’s R&D team’s research and experiments showed that Faster R-CNN with FPN Inception-ResNetV2 backbone is the most suitable for the problem and had the best performance.
The best result was with the Faster R-CNN with FPN InceptionResNetV2 backbone. Performance using the mAP metric was about 85%. The figure below shows predicted bounding boxes (red squares – labels, green squares – predicted bounding boxes).
Ciklum’s prototype had the following advantages: