Counting Objects in a Robotic Hand

Francis Tsow,Tianze Chen,Yu Sun
2024-04-10
Abstract:A robot performing multi-object grasping needs to sense the number of objects in the hand after grasping. The count plays an important role in determining the robot's next move and the outcome and efficiency of the whole pick-place process. This paper presents a data-driven contrastive learning-based counting classifier with a modified loss function as a simple and effective approach for object counting despite significant occlusion challenges caused by robotic fingers and objects. The model was validated against other models with three different common shapes (spheres, cylinders, and cubes) in simulation and in a real setup. The proposed contrastive learning-based counting approach achieved above 96\% accuracy for all three objects in the real setup.
Robotics,Artificial Intelligence
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the issue of how a robotic hand can accurately count the number of objects it has grasped after picking up multiple items. Specifically, after the robotic hand grabs objects from a container, it needs to determine the number of objects it has grasped. This is crucial for deciding the robot's next action and for the outcome and efficiency of the entire grasp-and-place process. The paper proposes a counting classifier based on contrastive learning and modifies the loss function to tackle the significant occlusion challenges caused by the robot's fingers and the objects. This method has been proven to be simple and effective in both simulated environments and real-world settings for three common shapes (spheres, cylinders, and cubes). Experimental results show that for these three shapes, the method achieves an accuracy of over 96% in real-world environments. In summary, the core issue of the paper is to develop a simple and effective method to estimate the number of objects a robotic hand has grasped after lifting them from the source container, and it proposes a solution based on improved contrastive learning.