The fruit collection robot with vision is an intelligent robotics application designed to provide autonomous fallen fruit management in orchards. The system includes a Raspberry Pi 5 microcontroller with Pi Camera Module 3, which is capable of detecting and classifying fallen tomatoes with a mean average precision of 89.1% using a custom trained YOLO11 Nano model.
Development and performance benchmarking of three distinct architectural paradigms that are MobileNetV2 (Efficient CNN), YOLOv11n (State-of-the-art CNN), and ResNet50 (Real-Time Detection Transformer) to determine the optimal balance of precision and latency for edge-based orchard robots. Design of a 3-DOF servo-actuated robotic manipulator optimized for gentle fruit handling without damage.
The fruit collecting manipulator includes a 3 DOF arm fixed atop a 4-wheel driven mobile robot, where picking actions are executed via a parallel-jaw gripper and controlled using geometric inverse kinematics from the Law of Cosines and camera-to-millimeter space coordinate conversion via homographic matrix calibration. The gripper acts upon detecting fruits in an arcade-claw fashion by descending onto the identified target perpendicularly to reliably collect fruit without sideways misalignment. The fallen fruits are automatically sorted into two bins based on their ripeness and quality: edible fruits (ripe and half-ripe tomatoes) are sorted into the left bin, while the remaining (diseased, overripe, rot, and unripe) tomatoes are sorted into the right bin.
The sorting process ensures no contamination of fruits during picking operations. The robot utilizes an Arduino Uno microcontroller as a slave controller responsible for coordinating the gripper motions in a degree-by-degree manner and serially confirming its position with a handshake mechanism preventing joint collision and avoiding high current demand simultaneously by the servo motors and the Li-ion power source. The robot features an interactive diagnostic view available via web browser, which shows in real time the YOLO11 detections, classifications, and confidence scores of the falling fruit targets. This entire system is a viable, economic, and efficient solution to save human effort and reduce the loss of fruits while allowing the orchard to be monitored based on data analysis.
Project Highlights
The pipeline is split across two boards. The Raspberry Pi 5 handles all perception — running the YOLO11 Nano model at inference, computing the homographic transform, and dispatching pick coordinates over serial UART.
The Arduino Uno receives coordinates and drives three servo motors via a custom inverse kinematics routine, moving the end-effector to the target position, triggering the gripper, and returning to a home pose.
A simple state machine governs the full cycle: Scan → Detect → Classify → Localise → Pick → Sort → Return.