Abstract: Since cohabitation with robots is transitioning from fiction to reality, ensuring their safe and efficient adoption is essential. However, a robot learning a new task needs to explore the ...
Abstract: Vision-language models such as CLIP have boosted the performance of open-vocabulary object detection, where the detector is trained on base categories but required to detect novel categories ...