We have worked on robot perception challenges, especially on using vision to understand 3D scenes through problems such as object detection and 6D object pose estimation. We have also addressed other perception challenges, such as bearing-only navigation and localization, as well as localization based on wireless signal strength.
6D Pose Estimation
Planning the motion of a robotic arm in a cluttered environment requires detecting the objects in the robot's vicinity and estimating their 6D pose (i.e., position and orientation). The goal of our work is to build the capability to compute accurate pose estimates for objects in cluttered scenes. In particular, we have been working on 1) developing intelligent techniques to autonomously generate labeled datasets for training object recognition pipelines, and 2) developing search-based algorithms for scene estimation given RGB-D data and 3D CAD models of objects.
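For illustration only, the short sketch below shows one way a search-based scene-estimation step can score a candidate 6D pose: points sampled from an object's 3D CAD model are transformed by the hypothesized pose and compared against the observed RGB-D point cloud. The function name, inlier threshold, and synthetic data are assumptions made for this example and do not reproduce the exact pipeline of our papers.

```python
import numpy as np
from scipy.spatial import cKDTree
from scipy.spatial.transform import Rotation

def score_pose_hypothesis(model_points, observed_points, rotation_xyzw, translation,
                          inlier_thresh=0.005):
    """Score how well a candidate 6D pose explains an observed point cloud.

    model_points:    (N, 3) points sampled from the object's CAD model (meters).
    observed_points: (M, 3) points segmented from the RGB-D depth image (meters).
    rotation_xyzw:   quaternion of the hypothesized orientation (x, y, z, w).
    translation:     (3,) hypothesized position.
    """
    R = Rotation.from_quat(rotation_xyzw).as_matrix()
    transformed = model_points @ R.T + translation   # model placed at the hypothesized pose
    tree = cKDTree(observed_points)
    dists, _ = tree.query(transformed)               # nearest observed point per model point
    return np.mean(dists < inlier_thresh)            # fraction of model points explained by data

if __name__ == "__main__":
    # Synthetic example: a fake 10 cm object observed at a known pose.
    rng = np.random.default_rng(0)
    model = rng.uniform(-0.05, 0.05, size=(500, 3))
    true_R = Rotation.from_euler("z", 30, degrees=True)
    observed = model @ true_R.as_matrix().T + np.array([0.4, 0.0, 0.6])
    good = score_pose_hypothesis(model, observed, true_R.as_quat(), np.array([0.4, 0.0, 0.6]))
    bad = score_pose_hypothesis(model, observed, true_R.as_quat(), np.array([0.5, 0.1, 0.6]))
    print(f"good hypothesis score: {good:.2f}, perturbed hypothesis score: {bad:.2f}")
```

A full scene-level search would generate many such hypotheses per object and select a physically consistent combination, but the scoring idea is the same.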
Navigation without a Map using Bearing-Only Sensors
Robot navigation schemes often rely on accurate distance information from laser range scanners or sonar. This work focuses on navigation using only bearing information rather than distance information. The robot can accurately determine the relative bearing of landmarks in its environment using a panoramic camera. Using this bearing information, the robot can execute a long and complex trajectory to complete a desired task and then return to its original position with a high degree of accuracy. The work focuses on the theoretical guarantees provided under an ideal model and proves navigability in two-dimensional workspaces under this model.
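As a toy illustration of the sensing model (not the actual system), the sketch below shows how a landmark's pixel column in a 360-degree panoramic image maps to a relative bearing, and how an ideal bearing sensor can be simulated from known robot and landmark positions; the image geometry and the heading_offset calibration term are assumptions for this example.

```python
import numpy as np

def bearing_from_panorama(pixel_column, image_width, heading_offset=0.0):
    """Map a landmark's pixel column in a 360-degree panorama to a relative bearing.

    Assumes the panorama spans [-pi, pi) around the robot's forward direction and that
    heading_offset is a (hypothetical) calibration term for the camera mounting.
    """
    bearing = -np.pi + 2.0 * np.pi * (pixel_column / image_width) + heading_offset
    return (bearing + np.pi) % (2.0 * np.pi) - np.pi   # wrap to [-pi, pi)

def relative_bearing(robot_xy, robot_heading, landmark_xy):
    """Ground-truth relative bearing of a landmark, useful for simulating an ideal sensor."""
    dx, dy = landmark_xy[0] - robot_xy[0], landmark_xy[1] - robot_xy[1]
    return (np.arctan2(dy, dx) - robot_heading + np.pi) % (2.0 * np.pi) - np.pi

if __name__ == "__main__":
    # A landmark directly to the robot's left should have a bearing of +pi/2.
    print(relative_bearing((0.0, 0.0), 0.0, (0.0, 1.0)))               # ~1.5708
    print(bearing_from_panorama(pixel_column=1536, image_width=2048))  # 3/4 across -> +pi/2
```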
SLAM with Bearing-Only Sensors
This work studies the problem of bearing-only Simultaneous Localization and Mapping (SLAM), in which a robot must localize itself and map its environment using only bearing measurements. It provides a broad study of different approaches to the problem, investigating methods such as the Extended Kalman Filter (EKF), Expectation Maximization (EM), and Particle Filtering. The work shows that particle filters perform particularly well, especially when extra steps are taken to improve their robustness to outliers.
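To make the particle-filtering idea concrete, the following is a minimal bearing-only particle filter sketch. For brevity it localizes the robot against known landmarks, whereas full bearing-only SLAM would also estimate the landmark positions; the noise parameters, function name, and resampling scheme are illustrative assumptions rather than the configuration used in the study.

```python
import numpy as np

def particle_filter_bearing_only(bearings, controls, landmarks, n_particles=1000,
                                 motion_noise=(0.05, 0.02), bearing_noise=0.05, seed=0):
    """Minimal bearing-only particle filter for robot localization with known landmarks.

    bearings:  list of arrays, one per time step, with the measured bearing to each landmark.
    controls:  list of (forward_distance, heading_change) odometry commands.
    landmarks: (L, 2) known landmark positions (full bearing-only SLAM would estimate these too).
    Returns the mean (x, y, theta) particle estimate at each step.
    """
    rng = np.random.default_rng(seed)
    particles = np.zeros((n_particles, 3))            # x, y, theta
    weights = np.full(n_particles, 1.0 / n_particles)
    estimates = []

    for z, (d, dtheta) in zip(bearings, controls):
        # Motion update: propagate particles through a noisy unicycle model.
        noisy_d = d + rng.normal(0.0, motion_noise[0], n_particles)
        noisy_dt = dtheta + rng.normal(0.0, motion_noise[1], n_particles)
        particles[:, 2] += noisy_dt
        particles[:, 0] += noisy_d * np.cos(particles[:, 2])
        particles[:, 1] += noisy_d * np.sin(particles[:, 2])

        # Measurement update: weight particles by how well predicted bearings match measurements.
        for j, (lx, ly) in enumerate(landmarks):
            predicted = np.arctan2(ly - particles[:, 1], lx - particles[:, 0]) - particles[:, 2]
            err = (z[j] - predicted + np.pi) % (2.0 * np.pi) - np.pi   # wrapped bearing error
            weights *= np.exp(-0.5 * (err / bearing_noise) ** 2)
        weights += 1e-300                               # avoid degeneracy from all-zero weights
        weights /= weights.sum()

        # Systematic resampling.
        positions = (rng.random() + np.arange(n_particles)) / n_particles
        idx = np.searchsorted(np.cumsum(weights), positions)
        particles = particles[idx]
        weights = np.full(n_particles, 1.0 / n_particles)

        # Naive mean (heading averaging ignores wraparound for simplicity).
        estimates.append(particles.mean(axis=0))
    return np.array(estimates)
```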
Publications:
2025
Ramesh, D; Keskar, S; Sivaramakrishnan, A; Bekris, K; Yu, J; Boularias, A. PROBE: Proprioceptive Obstacle Detection and Estimation while Navigating in Clutter. IEEE International Conference on Robotics and Automation (ICRA), 2025. URL: https://dhruvmetha.github.io/legged-probe/
Abstract: In critical applications, including search-and-rescue in degraded environments, blockages can be prevalent and prevent the effective deployment of certain sensing modalities, particularly vision, due to occlusion and the constrained range of view of onboard camera sensors. To enable robots to tackle these challenges, we propose a new approach, Proprioceptive Obstacle Detection and Estimation while navigating in clutter (PROBE), which instead relies only on the robot's proprioception to infer the presence or the absence of occluded rectangular obstacles while predicting their dimensions and poses in SE(2). The approach is a Transformer neural network that receives as input a history of applied torques and sensed whole-body movements of the robot and returns a parameterized representation of the obstacles in the environment. The effectiveness of PROBE is evaluated on simulated environments in Isaac Gym and with a real Unitree Go1 quadruped robot.
2023
Lu, S; Chang, H; Jing, E; Boularias, A; Bekris, K. OVIR-3D: Open-Vocabulary 3D Instance Retrieval without Training on 3D Data. Conference on Robot Learning (CoRL), Atlanta, GA, 2023. URL: https://proceedings.mlr.press/v229/lu23a/lu23a.pdf
Abstract: This work presents OVIR-3D, a straightforward yet effective method for open-vocabulary 3D object instance retrieval without using any 3D data for training. Given a language query, the proposed method is able to return a ranked set of 3D object instance segments based on the feature similarity of the instance and the text query. This is achieved by a multi-view fusion of text-aligned 2D region proposals into 3D space, where the 2D region proposal network could leverage 2D datasets, which are more accessible and typically larger than 3D datasets. The proposed fusion process is efficient as it can be performed in real-time for most indoor 3D scenes and does not require additional training in 3D space. Experiments on public datasets and a real robot show the effectiveness of the method and its potential for applications in robot navigation and manipulation.
Chang, H; Boyalakuntla, K; Lu, S; Cai, S; Jing, E; Keskar, S; Geng, S; Abbas, A; Zhou, L; Bekris, K; Boularias, A. Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs. Conference on Robot Learning (CoRL), Atlanta, GA, 2023. URL: https://arxiv.org/abs/2309.15940
Abstract: We present an Open-Vocabulary 3D Scene Graph (OVSG), a formal framework for grounding a variety of entities, such as object instances, agents, and regions, with free-form text-based queries. Unlike conventional semantic-based object localization approaches, our system facilitates context-aware entity localization, allowing for queries such as "pick up a cup on a kitchen table" or "navigate to a sofa on which someone is sitting". In contrast to existing research on 3D scene graphs, OVSG supports free-form text input and open-vocabulary querying. Through a series of comparative experiments using the ScanNet dataset and a self-collected dataset, we demonstrate that our proposed approach significantly surpasses the performance of previous semantic-based localization techniques. Moreover, we highlight the practical application of OVSG in real-world robot navigation and manipulation experiments.
Lu, S; Deng, Y; Boularias, A; Bekris, K. Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos. IEEE International Conference on Robotics and Automation (ICRA), London, UK, 2023. URL: https://arxiv.org/abs/2304.04325
Abstract: This work proposes a self-supervised learning system for segmenting rigid objects in RGB images. The proposed pipeline is trained on unlabeled RGB-D videos of static objects, which can be captured with a camera carried by a mobile robot. A key feature of the self-supervised training process is a graph-matching algorithm that operates on the over-segmentation output of the point cloud that is reconstructed from each video. The graph matching, along with point cloud registration, is able to find reoccurring object patterns across videos and combine them into 3D object pseudo labels, even under occlusions or different viewing angles. Projected 2D object masks from 3D pseudo labels are used to train a pixel-wise feature extractor through contrastive learning. During online inference, a clustering method uses the learned features to cluster foreground pixels into object segments. Experiments highlight the method's effectiveness on both real and synthetic video datasets, which include cluttered scenes of tabletop objects. The proposed method outperforms existing unsupervised methods for object segmentation by a large margin.
Nakhimovich, D; Miao, Y; Bekris, K. Resolution Complete In-Place Object Retrieval Given Known Object Models. IEEE International Conference on Robotics and Automation (ICRA), London, UK, 2023. URL: https://arxiv.org/abs/2303.14562
Abstract: This work proposes a robot task planning framework for retrieving a target object in a confined workspace among multiple stacked objects that obstruct the target. The robot can use prehensile picking and in-workspace placing actions. The method assumes access to 3D models for the visible objects in the scene. The key contribution is in achieving desirable properties, i.e., to provide (a) safety, by avoiding collisions with sensed obstacles, objects, and occluded regions, and (b) resolution completeness (RC) - or probabilistic completeness (PC), depending on implementation - which indicates a solution will eventually be found (if it exists) as the resolution of algorithmic parameters increases. A heuristic variant of the basic RC algorithm is also proposed to solve the task more efficiently while retaining the desirable properties. Simulation results compare using random picking and placing operations against the basic RC algorithm that reasons about object dependency as well as its heuristic variant. The success rate is higher for the RC approaches given the same amount of time. The heuristic variant is able to solve the problem even more efficiently than the basic approach. The integration of the RC algorithm with perception, where an RGB-D sensor detects the objects as they are being moved, enables real robot demonstrations of safely retrieving target objects from a cluttered shelf.
2022
Wen, B; Lian, W; Bekris, K; Schaal, S. You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration. Robotics: Science and Systems (RSS), 2022. (Nomination for Best Paper Award.) URL: https://www.roboticsproceedings.org/rss18/p044.pdf
Abstract: Promising results have been achieved recently in category-level manipulation that generalizes across object instances. Nevertheless, it often requires expensive real-world data collection and manual specification of semantic keypoints for each object category and task. Additionally, coarse keypoint predictions and ignoring intermediate action sequences hinder adoption in complex manipulation tasks beyond pick-and-place. This work proposes a novel, category-level manipulation framework that leverages an object-centric, category-level representation and model-free 6 DoF motion tracking. The canonical object representation is learned solely in simulation and then used to parse a category-level task trajectory from a single demonstration video. The demonstration is reprojected to a target trajectory tailored to a novel object via the canonical representation. During execution, the manipulation horizon is decomposed into long-range, collision-free motion and last-inch manipulation. For the latter part, a category-level behavior cloning (CatBC) method leverages motion tracking to perform closed-loop control. CatBC follows the target trajectory, projected from the demonstration and anchored to a dynamically selected category-level coordinate frame. The frame is automatically selected along the manipulation horizon by a local attention mechanism. This framework makes it possible to teach different manipulation strategies by providing only a single demonstration, without complicated manual programming. Extensive experiments demonstrate its efficacy in a range of challenging industrial tasks in high-precision assembly, which involve learning complex, long-horizon policies. The process exhibits robustness against uncertainty due to dynamics as well as generalization across object instances and scene configurations.
Lu, S; Wang, R; Miao, Y; Mitash, C; Bekris, K. Online Object Model Reconstruction and Reuse for Lifelong Improvement of Robot Manipulation. IEEE International Conference on Robotics and Automation (ICRA), 2022. (Nomination for Best Paper Award in Manipulation.) URL: https://arxiv.org/abs/2109.13910
Abstract: This work proposes a robotic pipeline for picking and constrained placement of objects without geometric shape priors. Compared to recent efforts developed for similar tasks, where every object was assumed to be novel, the proposed system recognizes previously manipulated objects and performs online model reconstruction and reuse. Over a lifelong manipulation process, the system keeps learning features of objects it has interacted with and updates their reconstructed models. Whenever an instance of a previously manipulated object reappears, the system aims to first recognize it and then register its previously reconstructed model given the current observation. This step greatly reduces object shape uncertainty allowing the system to even reason for parts of objects, which are currently not observable. This also results in better manipulation efficiency as it reduces the need for active perception of the target object during manipulation. To get a reusable reconstructed model, the proposed pipeline adopts: i) TSDF for object representation, and ii) a variant of the standard particle filter algorithm for pose estimation and tracking of the partial object model. Furthermore, an effective way to construct and maintain a dataset of manipulated objects is presented. A sequence of real-world manipulation experiments is performed. They show how future manipulation tasks become more effective and efficient by reusing reconstructed models of previously manipulated objects, which were generated during their prior manipulation, instead of treating objects as novel every time.
Mitash, C; Boularias, A; Bekris, K. Physics-Based Scene-Level Reasoning for Object Pose Estimation in Clutter. International Journal of Robotics Research (IJRR), 2022. URL: https://arxiv.org/pdf/1806.10457.pdf
Abstract: This paper focuses on vision-based pose estimation for multiple rigid objects placed in clutter, especially in cases involving occlusions and objects resting on each other. Progress has been achieved recently in object recognition given advancements in deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort to label objects. This limits their applicability in robotics, where solutions must scale to a large number of objects and variety of conditions. Moreover, the combinatorial nature of the scenes that could arise from the placement of multiple objects is hard to capture in the training dataset. Thus, the learned models might not produce the desired level of precision required for tasks, such as robotic manipulation. This work proposes an autonomous process for pose estimation that spans from data generation, to scene-level reasoning and self-learning. In particular, the proposed framework first generates a labeled dataset for training a Convolutional Neural Network (CNN) for object detection in clutter. These detections are used to guide a scene-level optimization process, which considers the interactions between the different objects present in the clutter to output pose estimates of high precision. Furthermore, confident estimates are used to label online real images from multiple views and re-train the process in a self-learning pipeline. Experimental results indicate that this process is quickly able to identify in cluttered scenes physically-consistent object poses that are more precise than the ones found by reasoning over individual instances of objects. Furthermore, the quality of pose estimates increases over time given the self-learning process.
Wen, B; Lian, W; Bekris, K; Schaal, S. Catgrasp: Learning Category-Level Task-Relevant Grasping in Clutter from Simulation. IEEE International Conference on Robotics and Automation (ICRA), 2022. URL: https://arxiv.org/abs/2109.09163
Abstract: Task-relevant grasping is critical for industrial assembly, where downstream manipulation tasks constrain the set of valid grasps. Learning how to perform this task, however, is challenging, since task-relevant grasp labels are hard to define and annotate. There is also no consensus yet on proper representations for modeling or off-the-shelf tools for performing task-relevant grasps. This work proposes a framework to learn task-relevant grasping for industrial objects without the need for time-consuming real-world data collection or manual annotation. To achieve this, the entire framework is trained solely in simulation, including supervised training with synthetic label generation and self-supervised, hand-object interaction. In the context of this framework, this paper proposes a novel, object-centric canonical representation at the category level, which allows establishing dense correspondence across object instances and transferring task-relevant grasps to novel instances. Extensive experiments on task-relevant grasping of densely-cluttered industrial objects are conducted in both simulation and real-world setups, demonstrating the effectiveness of the proposed framework. Code and data are released at https://sites.google.com/view/catgrasp.
2021
Wen, B; Bekris, K. BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2021. URL: https://arxiv.org/abs/2108.00516
Abstract: Tracking the 6D pose of objects in video sequences is important for robot manipulation. Prior efforts, however, often assume that the target object's CAD model, at least at a category level, is available for offline training or during online template matching. This work proposes BundleTrack, a general framework for 6D pose tracking of novel objects, which does not depend upon instance- or category-level 3D models. It leverages the complementary attributes of recent advances in deep learning for segmentation and robust feature extraction, as well as memory-augmented pose-graph optimization for achieving spatiotemporal consistency. This enables long-term, low-drift tracking under various challenging scenarios, including significant occlusions and object motions. Comprehensive experiments given two public benchmarks demonstrate that the proposed approach significantly outperforms state-of-the-art category-level 6D tracking or dynamic-SLAM methods. When compared against state-of-the-art methods that rely on an object instance CAD model, comparable performance is achieved, despite the proposed method's reduced information requirements. An efficient implementation in CUDA provides a real-time performance of 10 Hz for the entire framework.
Morgan, A; Wen, B; Junchi, L; Boularias, A; Dollar, A; Bekris, K. Vision-Driven Compliant Manipulation for Reliable, High-Precision Assembly Tasks. Robotics: Science and Systems (RSS), 2021.
Abstract: Highly constrained manipulation tasks continue to be challenging for autonomous robots as they require high levels of precision, typically less than 1mm, which is often incompatible with what can be achieved by traditional perception systems. This paper demonstrates that the combination of state-of-the-art object tracking with passively adaptive mechanical hardware can be leveraged to complete precision manipulation tasks with tight, industrially-relevant tolerances (0.25mm). The proposed control method closes the loop through vision by tracking the relative 6D pose of objects in the relevant workspace. It adjusts the control reference of both the compliant manipulator and the hand to complete object insertion tasks via within-hand manipulation. Contrary to previous efforts for insertion, our method does not require expensive force sensors, precision manipulators, or time-consuming, online learning, which is data hungry. Instead, this effort leverages mechanical compliance and utilizes an object-agnostic manipulation model of the hand learned offline, off-the-shelf motion planning, and an RGBD-based object tracker trained solely with synthetic data. These features allow the proposed system to easily generalize and transfer to new tasks and environments. This paper describes in detail the system components and showcases its efficacy with extensive experiments involving tight tolerance peg-in-hole insertion tasks of various geometries as well as open-world constrained placement tasks.
2020
Wen, B; Mitash, C; Ren, B; Bekris, K. se(3)-TrackNet: Data-Driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, 2020. URL: http://arxiv.org/abs/2007.13866
Abstract: Tracking the 6D pose of objects in video sequences is important for robot manipulation. This task, however, introduces multiple challenges: (i) robot manipulation involves significant occlusions; (ii) data and annotations are troublesome and difficult to collect for 6D poses, which complicates machine learning solutions; and (iii) incremental error drift often accumulates in long-term tracking to necessitate re-initialization of the object's pose. This work proposes a data-driven optimization approach for long-term, 6D pose tracking. It aims to identify the optimal relative pose given the current RGB-D observation and a synthetic image conditioned on the previous best estimate and the object's model. The key contribution in this context is a novel neural network architecture, which appropriately disentangles the feature encoding to help reduce domain shift, and an effective 3D orientation representation via Lie algebra. Consequently, even when the network is trained only with synthetic data, it can work effectively over real images. Comprehensive experiments over benchmarks - existing ones as well as a new dataset with significant occlusions related to object manipulation - show that the proposed approach achieves consistently robust estimates and outperforms alternatives, even though they have been trained with real images. The approach is also the most computationally efficient among the alternatives and achieves a tracking frequency of 90.9 Hz.
Mitash, C; Shome, R; Wen, B; Boularias, A; Bekris, K. Task-Driven Perception and Manipulation for Constrained Placement of Unknown Objects. IEEE Robotics and Automation Letters (RA-L) (also appearing at IEEE/RSJ IROS 2020), 2020. URL: https://arxiv.org/abs/2006.15503
Abstract: Recent progress in robotic manipulation has dealt with the case of no prior object models in the context of relatively simple tasks, such as bin-picking. Existing methods for more constrained problems, however, such as deliberate placement in a tight region, depend more critically on shape information to achieve safe execution. This work introduces a possibilistic object representation for solving constrained placement tasks without shape priors. A perception method is proposed to track and update the object representation during motion execution, which respects physical and geometric constraints. The method operates directly over sensor data, modeling the seen and unseen parts of the object given observations. It results in a dynamically updated conservative representation, which can be used to plan safe manipulation actions. This task-driven perception process is integrated with a manipulation task planning architecture for a dual-arm manipulator to discover efficient solutions for the constrained placement task with minimal sensing. The planning process can make use of handoff operations when necessary for safe placement given the conservative representation. The pipeline is evaluated with data from over 240 real-world experiments involving constrained placement of various unknown objects using a dual-arm manipulator. While straightforward pick-sense-and-place architectures frequently fail to solve these problems, the proposed integrated pipeline achieves more than 95% success and faster execution times.
Mitash, C. Scalable, Physics-Aware 6D Pose Estimation for Robot Manipulation. PhD Thesis, Rutgers University, 2020. URL: https://rucore.libraries.rutgers.edu/rutgers-lib/64961/
Abstract: Robot manipulation often depends on some form of pose estimation to represent the state of the world and allow decision making both at the task level and for motion or grasp planning. Recent progress in deep learning gives hope for a pose estimation solution that could generalize over textured and texture-less objects, objects with or without distinctive shape properties, and under different lighting conditions and clutter scenarios. Nevertheless, it gives rise to a new set of challenges, such as the painful task of acquiring large-scale labeled training datasets and of dealing with their stochastic output over unforeseen scenarios that are not captured by the training. This restricts the scalability of such pose estimation solutions in robot manipulation tasks that often deal with a variety of objects and changing environments. The thesis first describes an automatic data generation and learning framework to address the scalability challenge. Learning is bootstrapped by generating labeled data via physics simulation and rendering. Then it self-improves over time by acquiring and labeling real-world images via a search-based pose estimation process. The thesis proposes algorithms to generate and validate object poses online based on the objects' geometry and on the physical consistency of their scene-level interactions. These algorithms provide robustness even when there exists a domain gap between the synthetic training and the real test scenarios. Finally, the thesis proposes a manipulation planning framework that goes beyond model-based pose estimation. By utilizing a dynamic object representation, this integrated perception and manipulation framework can efficiently solve the task of picking unknown objects and placing them in a constrained space. The algorithms are evaluated over real-world robot manipulation experiments and over large-scale public datasets. The results indicate the usefulness of physical constraints in both the training and the online estimation phases. Moreover, the proposed framework, while only utilizing simulated data, can obtain robust estimation in challenging scenarios such as densely-packed bins and clutter, where other approaches suffer as a result of large occlusion and ambiguities due to similar-looking texture-less surfaces.
Wang, R; Mitash, C; Lu, S; Boehm, D; Bekris, K. Safe and Effective Picking Paths in Clutter Given Discrete Distributions of Object Poses. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Las Vegas, NV, 2020. URL: https://arxiv.org/abs/2008.04465
Abstract: Picking an item in the presence of other objects can be challenging as it involves occlusions and partial views. Given object models, one approach is to perform object pose estimation and use the most likely candidate pose per object to pick the target without collisions. This approach, however, ignores the uncertainty of the perception process both regarding the target's and the surrounding objects' poses. This work proposes first a perception process for 6D pose estimation, which returns a discrete distribution of object poses in a scene. Then, an open-loop planning pipeline is proposed to return safe and effective solutions for moving a robotic arm to pick, which (a) minimizes the probability of collision with the obstructing objects; and (b) maximizes the probability of reaching the target item. The planning framework models the challenge as a stochastic variant of the Minimum Constraint Removal (MCR) problem. The effectiveness of the methodology is verified given both simulated and real data in different scenarios. The experiments demonstrate the importance of considering the uncertainty of the perception process in terms of safe execution. The results also show that the methodology is more effective than conservative MCR approaches, which avoid all possible object poses regardless of the reported uncertainty.
Wen, B; Mitash, C; Soorian, S; Kimmel, A; Sintov, A; Bekris, K. Robust, Occlusion-Aware Pose Estimation for Objects Grasped by Adaptive Hands. IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 2020. URL: https://arxiv.org/abs/2003.03518
Abstract: Many manipulation tasks, such as placement or within-hand manipulation, require the object's pose relative to a robot hand. The task is difficult when the hand significantly occludes the object. It is especially hard for adaptive hands, for which it is not easy to detect the finger's configuration. In addition, RGB-only approaches face issues with texture-less objects or when the hand and the object look similar. This paper presents a depth-based framework, which aims for robust pose estimation and short response times. The approach detects the adaptive hand's state via efficient parallel search given the highest overlap between the hand's model and the point cloud. The hand's point cloud is pruned and robust global registration is performed to generate object pose hypotheses, which are clustered. False hypotheses are pruned via physical reasoning. The remaining poses' quality is evaluated given agreement with observed data. Extensive evaluation on synthetic and real data demonstrates the accuracy and computational efficiency of the framework when applied on challenging, highly-occluded scenarios for different object types. An ablation study identifies how the framework's components help in performance. This work also provides a dataset for in-hand 6D object pose estimation. Code and dataset are available at: https://github.com/wenbowen123/icra20-hand-object-pose
2019 |
Mitash, C; Wen, B; Bekris, K; Boularias, A: Scene-Level Pose Estimation for Multiple Instances of Densely Packed Objects. Conference on Robot Learning (CoRL), Osaka, Japan, 2019. URL: https://arxiv.org/pdf/1910.04953.pdf
Abstract: This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to learn semantic and instance-boundary detectors without manual labeling. An adversarial training framework in conjunction with physics-based simulation is used to achieve detectors that behave similarly in synthetic and real data. Given the stochastic output of such detectors, candidates for object poses are sampled. The second objective is to automatically learn a single score for each pose candidate that represents its quality in terms of explaining the entire scene via a gradient boosted tree. The proposed method uses features derived from surface and boundary alignment between the observed scene and the object model placed at hypothesized poses. Scene-level, multi-instance pose estimation is then achieved by an integer linear programming process that selects hypotheses that maximize the sum of the learned individual scores, while respecting constraints, such as avoiding collisions. To evaluate this method, a dataset of densely packed objects with challenging setups for state-of-the-art approaches is collected. Experiments on this dataset and a public one show that the method significantly outperforms alternatives in terms of 6D pose accuracy while trained only with synthetic datasets. |
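To make the selection step above concrete, here is a small, hedged sketch: an exhaustive search over hypothesis subsets stands in for the paper's integer linear program, collides is a hypothetical pairwise constraint check, and the one-hypothesis-per-instance constraint is omitted for brevity:

```python
# Pick a scene-level set of pose hypotheses that maximizes the sum of learned
# per-hypothesis scores while respecting a pairwise "no collision" constraint.
# Exhaustive search is only viable for small candidate sets; the paper instead
# formulates this as an integer linear program.
from itertools import combinations

def select_scene(hypotheses, scores, collides):
    """hypotheses: list of pose candidates; scores: parallel list of floats;
    collides(h1, h2) -> bool is a hypothetical pairwise collision check."""
    best_subset, best_value = (), float("-inf")
    indices = range(len(hypotheses))
    for r in range(1, len(hypotheses) + 1):
        for subset in combinations(indices, r):
            if any(collides(hypotheses[i], hypotheses[j])
                   for i, j in combinations(subset, 2)):
                continue                      # violates a scene-level constraint
            value = sum(scores[i] for i in subset)
            if value > best_value:
                best_subset, best_value = subset, value
    return [hypotheses[i] for i in best_subset], best_value
```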
Shome, R; Tang, W; Song, C; Mitash, C; Kourtev, C; Yu, J; Boularias, A; Bekris, K: Towards Robust Product Packing with a Minimalistic End-Effector. IEEE International Conference on Robotics and Automation (ICRA), 2019. (Nomination for Best Paper Award in Automation.) URL: http://rl.cs.rutgers.edu/publications/ICRA-2019-Packing.pdf
Abstract: Advances in sensor technologies, object detection algorithms, planning frameworks and hardware designs have motivated the deployment of robots in warehouse automation. A variety of such applications, like order fulfillment or packing tasks, require picking objects from unstructured piles and carefully arranging them in bins or containers. Desirable solutions need to be low-cost, easily deployable and controllable, making minimalistic hardware choices desirable. The challenge in designing an effective solution to this problem relates to appropriately integrating multiple components, so as to achieve a robust pipeline that minimizes failure conditions. The current work proposes a complete pipeline for solving such packing tasks, given access only to RGB-D data and a single robot arm with a minimalistic, vacuum-based end-effector. To achieve the desired level of robustness, three key manipulation primitives are identified, which take advantage of the environment and simple operations to successfully pack multiple cubic objects. The overall approach is demonstrated to be robust to execution and perception errors. The impact of each manipulation primitive is evaluated by considering different versions of the proposed pipeline that incrementally introduce reasoning about object poses and corrective manipulation actions. |
2018 |
Mitash, C; Boularias, A; Bekris, K: Robust 6D Pose Estimation with Stochastic Congruent Sets. British Machine Vision Conference (BMVC), Newcastle, UK, 2018. URL: https://arxiv.org/abs/1805.06324
Abstract: Object pose estimation is frequently achieved by first segmenting an RGB image and then, given depth data, registering the corresponding point cloud segment against the object's 3D model. Despite the progress due to CNNs, semantic segmentation output can be noisy, especially when the CNN is only trained on synthetic data. This causes registration methods to fail in estimating a good object pose. This work proposes a novel stochastic optimization process that treats the segmentation output of CNNs as a confidence probability. The algorithm, called Stochastic Congruent Sets (StoCS), samples pointsets from the point cloud according to the soft segmentation distribution, so as to agree with the object's known geometry. The pointsets are then matched to congruent sets on the 3D object model to generate pose estimates. StoCS is shown to be robust on an APC dataset, despite the fact that the CNN is trained only on synthetic data. On the YCB dataset, StoCS outperforms a recent network for 6D pose estimation and alternative pointset matching techniques. |
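A loose, assumption-heavy sketch of the sampling idea (probability-weighted point selection constrained by known object geometry); this is a stand-in for illustration, not the StoCS algorithm itself:

```python
# Sample a base of points with probability proportional to the CNN's soft
# segmentation confidence, keeping only bases whose pairwise distances fit
# within the object's known diameter (a crude geometric plausibility check).
import numpy as np

def sample_weighted_base(points, seg_confidence, object_diameter,
                         base_size=4, max_tries=100, rng=None):
    """points: (N, 3) array; seg_confidence: (N,) nonnegative weights."""
    rng = rng or np.random.default_rng()
    probs = seg_confidence / seg_confidence.sum()
    for _ in range(max_tries):
        idx = rng.choice(len(points), size=base_size, replace=False, p=probs)
        base = points[idx]
        pairwise = np.linalg.norm(base[:, None] - base[None, :], axis=-1)
        if pairwise.max() <= object_diameter:   # geometrically plausible base
            return base
    return None  # no plausible base found within the try budget
```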
Mitash, C; Boularias, A; Bekris, K: Improving 6D Pose Estimation of Objects in Clutter via Physics-Aware Monte Carlo Tree Search. IEEE International Conference on Robotics and Automation (ICRA), Brisbane, Australia, 2018. URL: https://arxiv.org/pdf/1710.08577
Abstract: This work proposes a process for efficiently searching over combinations of individual object 6D pose hypotheses in cluttered scenes, especially in cases involving occlusions and objects resting on each other. The initial set of candidate object poses is generated from state-of-the-art object detection and global point cloud registration techniques. The best-scored pose per object according to these techniques may not be accurate due to overlaps and occlusions. Nevertheless, experimental indications provided in this work show that object poses with lower ranks may be closer to the real poses than ones with high ranks according to registration techniques. This motivates a global optimization process for improving these poses by taking into account scene-level physical interactions between objects. It also implies that the Cartesian product of candidate poses for interacting objects must be searched so as to identify the best scene-level hypothesis. To perform the search efficiently, the candidate poses for each object are clustered so as to reduce their number while keeping sufficient diversity. Then, searching over the combinations of candidate object poses is performed through a Monte Carlo Tree Search (MCTS) process that uses the similarity between the observed depth image of the scene and a rendering of the scene given the hypothesized poses as a score that guides the search procedure. MCTS handles in a principled way the tradeoff between fine-tuning the most promising poses and exploring new ones, by using the Upper Confidence Bound (UCB) technique. Experimental results indicate that this process is able to quickly identify in cluttered scenes physically-consistent object poses that are significantly closer to ground truth compared to poses found by point cloud registration methods. |
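The UCB rule mentioned above is standard and easy to sketch; the node fields and exploration constant below are assumptions for illustration, not the paper's exact implementation:

```python
# UCB selection for balancing refinement of promising scene hypotheses against
# exploration of new combinations of candidate object poses during tree search.
# child.total_score would accumulate render-and-compare depth scores; `c` is
# the exploration constant.
import math

def ucb_select(children, c=1.4):
    """children: nodes with .total_score and .visits attributes; the parent
    visit count is taken as the sum of child visits in this simplified sketch."""
    parent_visits = sum(child.visits for child in children) or 1
    def ucb(child):
        if child.visits == 0:
            return float("inf")             # always try unvisited children first
        exploit = child.total_score / child.visits
        explore = c * math.sqrt(math.log(parent_visits) / child.visits)
        return exploit + explore
    return max(children, key=ucb)
```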
Hodan, T; Kouskouridas, R; Kim, T; Tombari, F; Bekris, K; Drost, B; Groueix, T; Walas, K; Lepetit, V; Leonardis, A; Steger, C; Michel, F; Sahin, C; Rother, C; Matas, J: A Summary of the 4th International Workshop on Recovering 6D Object Pose. In: Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 2018. URL: https://arxiv.org/abs/1810.03758
Abstract: This document summarizes the 4th International Workshop on Recovering 6D Object Pose, which was organized in conjunction with ECCV 2018 in Munich. The workshop featured four invited talks, oral and poster presentations of accepted workshop papers, and an introduction of the BOP benchmark for 6D object pose estimation. The workshop was attended by 100+ people working on relevant topics in both academia and industry, who shared up-to-date advances and discussed open problems. |
2017 |
Mitash, C; Bekris, K; Boularias, A: A Self-Supervised Learning System for Object Detection Using Physics Simulation and Multi-View Pose Estimation. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, Canada, 2017. URL: https://www.cs.rutgers.edu/~kb572/pubs/physics_object_detection.pdf
Abstract: Impressive progress has been achieved in object detection with the use of deep learning. Nevertheless, such tools typically require a large amount of training data and significant manual effort for labeling objects. This limits their applicability in robotics, where it is necessary to scale solutions to a large number of objects and a variety of conditions. The present work proposes a fully autonomous process to train a Convolutional Neural Network (CNN) for object detection and pose estimation in robotic setups. The application involves detection of objects placed in clutter and in tight environments, such as a shelf. In particular, given access to 3D object models, several aspects of the environment are simulated and the models are placed in physically realistic poses with respect to their environment to generate a labeled synthetic dataset. To further improve object detection, the network self-trains over real images that are labeled using a robust multi-view pose estimation process. The proposed training process is evaluated on several existing datasets and on a dataset that we collected with a Motoman robotic manipulator. Results show that the proposed process outperforms popular training processes relying on synthetic data generation and manual annotation. |
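The self-training loop described above can be summarized at a high level; the callables below (train, detect, multi_view_pose_label) are hypothetical placeholders, so this is a schematic sketch rather than the actual pipeline:

```python
# Schematic self-training loop: bootstrap a detector from physics-simulated,
# labeled synthetic data, then grow the training set with real images that are
# labeled automatically via multi-view pose estimation, and retrain.
def self_train_detector(train, detect, multi_view_pose_label,
                        synthetic_data, real_image_batches, rounds=3):
    """train(dataset) -> detector; detect(detector, images) -> detections;
    multi_view_pose_label(detections) -> automatically labeled examples."""
    detector = train(synthetic_data)            # bootstrap from simulation only
    labeled = list(synthetic_data)
    for _ in range(rounds):
        for images in real_image_batches:
            detections = detect(detector, images)
            # Multi-view consistency turns detections into labels
            # without any manual annotation.
            labeled.extend(multi_view_pose_label(detections))
        detector = train(labeled)               # retrain on the grown dataset
    return detector
```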
2016 |
Rennie, C; Shome, R; Bekris, K; Souza, A: A Dataset for Improved RGBD-Based Object Detection and Pose Estimation for Warehouse Pick-and-Place. IEEE Robotics and Automation Letters (RA-L), 1, pp. 1179–1185, 2016. [Also accepted to appear at the 2016 IEEE International Conference on Robotics and Automation (ICRA).] URL: http://www.cs.rutgers.edu/~kb572/pubs/icra16_pose_estimation.pdf
Abstract: An important logistics application of robotics involves manipulators that pick-and-place objects placed in warehouse shelves. A critical aspect of this task corresponds to detecting the pose of a known object in the shelf using visual data. Solving this problem can be assisted by the use of an RGB-D sensor, which also provides depth information beyond visual data. Nevertheless, it remains a challenging problem since multiple issues need to be addressed, such as low illumination inside shelves, clutter, texture-less and reflective objects as well as the limitations of depth sensors. This paper provides a new rich data set for advancing the state-of-the-art in RGBD-based 3D object pose estimation, which is focused on the challenges that arise when solving warehouse pick-and-place tasks. The publicly available data set includes thousands of images and corresponding ground truth data for the objects used during the first Amazon Picking Challenge at different poses and clutter conditions. Each image is accompanied by ground truth information to assist in the evaluation of algorithms for object detection. To show the utility of the data set, a recent algorithm for RGBD-based pose estimation is evaluated in this paper. Based on the measured performance of the algorithm on the data set, various modifications and improvements are applied to increase the accuracy of detection. These steps can be easily applied to a variety of different methodologies for object pose detection and improve performance in the domain of warehouse pick-and-place. |
2012 |
Fallah, N; Apostolopoulos, I; Bekris, K; Folmer, E: The User as a Sensor: Navigating Users with Visual Impairments in Indoor Spaces Using Tactile Landmarks. ACM SIGCHI Conference on Human Factors in Computing Systems (CHI), Austin, TX, 2012. URL: http://www.cs.rutgers.edu/~kb572/pubs/userasasensor.pdf
Abstract: Indoor navigation systems for users who are visually impaired typically rely upon expensive physical augmentation of the environment or expensive sensing equipment; consequently few systems have been implemented. We present an indoor navigation system called Navatar that allows for localization and navigation by exploiting the physical characteristics of indoor environments, taking advantage of the unique sensing abilities of users with visual impairments, and minimalistic sensing achievable with low-cost accelerometers available in smartphones. Particle filters are used to estimate the user's location based on the accelerometer data as well as the user confirming the presence of anticipated tactile landmarks along the provided path. Navatar has a high possibility of large-scale deployment, as it only requires an annotated virtual representation of an indoor environment. A user study with six blind users determines the accuracy of the approach, collects qualitative experiences and identifies areas for improvement. |
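A toy, one-dimensional particle-filter sketch in the spirit of the description above (hypothetical corridor model and parameters; not the Navatar implementation):

```python
# Particles are advanced with noisy step estimates from the accelerometer and
# reweighted when the user confirms reaching an anticipated tactile landmark.
import random

def predict(particles, step_length, noise=0.1):
    return [p + random.gauss(step_length, noise) for p in particles]

def confirm_landmark(particles, landmark_pos, tolerance=1.0):
    # Keep particles consistent with the confirmed landmark, then resample.
    weights = [1.0 if abs(p - landmark_pos) <= tolerance else 1e-6
               for p in particles]
    return random.choices(particles, weights=weights, k=len(particles))

particles = [random.uniform(0.0, 30.0) for _ in range(500)]   # unknown start
for _ in range(12):                                           # ~12 steps walked
    particles = predict(particles, step_length=0.7)
particles = confirm_landmark(particles, landmark_pos=8.5)     # "found the door"
estimate = sum(particles) / len(particles)
print(f"estimated position along corridor: {estimate:.2f} m")
```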
Navkar, N V; Deng, Z; Shah, D; Bekris, K; Tsekos, N: Visual and Force-Feedback Guidance for Robot-Assisted Interventions in the Beating Heart with Real-Time MRI. IEEE International Conference on Robotics and Automation (ICRA), St. Paul, Minnesota, USA, May 14-18, 2012, pp. 689–694. DOI: 10.1109/ICRA.2012.6224582
Abstract: Robot-assisted surgical procedures are perpetually evolving due to potential improvements in patient treatment and healthcare cost reduction. Integrating an imaging modality intraoperatively further strengthens these procedures by incorporating information pertaining to the area of intervention. Such information needs to be effectively rendered to the operator as a human-in-the-loop requirement. In this work, we propose a guidance approach that uses real-time MRI to assist the operator in performing a robot-assisted procedure in a beating heart. Specifically, this approach provides both real-time visualization and force-feedback-based guidance for maneuvering an interventional tool safely inside the dynamic environment of a heart's left ventricle. The functionality of this approach was evaluated on a simulated scenario of transapical aortic valve replacement and demonstrated improvement in control and manipulation by providing effective and accurate assistance to the operator in real time. |
2010 |
Apostolopoulos, I; Fallah, N; Folmer, E; Bekris, K: Feasibility of Interactive Localization and Navigation of People with Visual Impairments. IEEE Intelligent Autonomous Systems Conference (IAS), Ottawa, Canada, 2010. URL: http://www.cs.rutgers.edu/~kb572/pubs/navatar_feasibility.pdf
Abstract: Indoor localization and navigation systems for individuals who are visually impaired (VI) typically rely upon expensive physical augmentation of the environment or expensive sensing equipment, which is why only a few such systems have been implemented. This work conducts a feasibility study of whether it is possible to localize and guide the navigation of people with VI using inexpensive sensors, such as compasses and pedometers, which are already widely available in portable devices such as smartphones. The proposed approach takes advantage of active interaction between the autonomous intelligent system and the human user and employs the map of the world as a prior. Experiments are employed to study what kind of instructions are most successful in assisting human users to reach their destination. These experiments also show that the application of Bayesian localization tools can provide sufficient localization accuracy, while achieving real-time operation, despite the minimalistic, noisy nature of the available information and the limited computational resources available on smartphones. This line of research opens the door to many exciting new applications for methods from robotics in the area of human-centered intelligent systems. |
2006 |
Bekris, K; Glick, M; Kavraki, L: Evaluation of Algorithms for Bearing-Only SLAM. IEEE International Conference on Robotics and Automation (ICRA), Orlando, FL, 2006. URL: http://www.cs.rutgers.edu/~kb572/pubs/bearing_only_slam.pdf
Abstract: An important milestone for building affordable robots that can become widely popular is to robustly address the Simultaneous Localization and Mapping (SLAM) problem with inexpensive, off-the-shelf sensors, such as monocular cameras. These sensors, however, impose significant challenges on SLAM procedures because they provide only bearing data related to environmental landmarks. This paper starts by providing an extensive comparison of different techniques for bearing-only SLAM in terms of robustness under different noise models, landmark densities and robot paths. We have experimented in a simulated environment with a variety of existing online algorithms, including Rao-Blackwellized Particle Filters (RBPFs). Our experiments suggest that RBPFs are more robust compared to other existing methods and run considerably faster. Nevertheless, their performance suffers in the presence of outliers. In order to overcome this limitation, we propose an augmentation of RBPFs with: (a) Gaussian Sum Filters for landmark initialization and (b) an online, unsupervised outlier rejection policy. This framework exhibits impressive robustness and efficiency even in the presence of outliers. |
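As a rough illustration of an online outlier gate of this flavor (a simplification, not the paper's policy), a bearing measurement can be rejected when it is implausible under the current particle set:

```python
# Gate a bearing measurement by its average likelihood over the particle set.
import math

def bearing_likelihood(particle, landmark, measured_bearing, sigma=0.05):
    px, py, ptheta = particle                      # particle pose (x, y, heading)
    expected = math.atan2(landmark[1] - py, landmark[0] - px) - ptheta
    err = math.atan2(math.sin(measured_bearing - expected),
                     math.cos(measured_bearing - expected))  # wrap to [-pi, pi]
    return math.exp(-0.5 * (err / sigma) ** 2)

def is_outlier(particles, landmark, measured_bearing, threshold=1e-3):
    avg = sum(bearing_likelihood(p, landmark, measured_bearing)
              for p in particles) / len(particles)
    return avg < threshold   # too unlikely under every particle: reject it
```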
Bekris, K; Argyros, A; Kavraki, L: Exploiting Panoramic Vision for Angle-Based Robot Homing. Book chapter in Lecture Notes in Computer Science, vol. 33, Springer, 2006. URL: http://www.cs.rutgers.edu/~kb572/pubs/omnidirectional_homing.pdf
Abstract: Omni-directional vision allows for the development of techniques for mobile robot navigation that have minimum perceptual requirements. In this work, we focus on robot navigation algorithms that do not require range information or metric maps of the environment. More specifically, we present a homing strategy that enables a robot to return to its home position after executing a long path. The proposed strategy relies on measuring the angle between pairs of features extracted from panoramic images, which can be achieved accurately and robustly. At the heart of the proposed homing strategy lies a novel, local control law that enables a robot to reach any position on the plane by exploiting the bearings of at least three landmarks of unknown position, without making assumptions regarding the robot's orientation and without making use of a compass. This control law is the result of the unification of two other local control laws which guide the robot by monitoring the bearing of landmarks and which are able to reach complementary sets of goal positions on the plane. Long-range homing is then realized through the systematic application of the unified control law between automatically extracted milestone positions connecting the robot's current position to the home position. Experimental results, conducted both in a simulated environment and on a robotic platform equipped with a panoramic camera, validate the employed local control laws as well as the overall homing strategy. Moreover, they show that panoramic vision can assist in simplifying the perceptual processes required to support robust and accurate homing behaviors. |
2005 |
Argyros, A; Bekris, K; Orphanoudakis, S; Kavraki, L: Robot Homing by Exploiting Panoramic Vision. Autonomous Robots, 19 (1), pp. 7–25, 2005. URL: http://www.cs.rutgers.edu/~kb572/pubs/robot_homing_panoramic.pdf
Abstract: We propose a novel, vision-based method for robot homing, the problem of computing a route so that a robot can return to its initial "home" position after the execution of an arbitrary "prior" path. The method assumes that the robot tracks visual features in panoramic views of the environment that it acquires as it moves. By exploiting only angular information regarding the tracked features, a local control strategy moves the robot between two positions, provided that there are at least three features that can be matched in the panoramas acquired at these positions. The strategy is successful when certain geometric constraints on the configuration of the two positions relative to the features are fulfilled. In order to achieve long-range homing, the features' trajectories are organized in a visual memory during the execution of the "prior" path. When homing is initiated, the robot selects Milestone Positions (MPs) on the "prior" path by exploiting information in its visual memory. The MP selection process aims at picking positions that guarantee the success of the local control strategy between two consecutive MPs. The sequence of successive MPs successfully guides the robot even if the visual context in the "home" position is radically different from the visual context at the position where homing was initiated. Experimental results from a prototype implementation of the method demonstrate that homing can be achieved with high accuracy, independent of the distance traveled by the robot. The contribution of this work is that it shows how a complex navigational task such as homing can be accomplished efficiently, robustly and in real-time by exploiting primitive visual cues. Such cues carry implicit information regarding the 3D structure of the environment. Thus, the computation of explicit range information and the existence of a geometric map are not required. |
2004 |
Bekris, K; Argyros, A; Kavraki, L: Angle-Based Methods for Mobile Robot Navigation: Reaching the Entire Plane. IEEE International Conference on Robotics and Automation (ICRA), New Orleans, LA, 2004. URL: http://www.cs.rutgers.edu/~kb572/pubs/angle_based_navigation.pdf
Abstract: Popular approaches for mobile robot navigation involve range information and metric maps of the workspace. For many sensors, however, such as cameras and wireless hardware, the angle between two extracted features or beacons is easier to measure. With these sensors' features in mind, this paper initially presents a control law, which allows a robot equipped with an omni-directional sensor to reach a subset of the plane by monitoring the angles of only three landmarks. By analyzing the properties of this law, a second law has been developed that reaches the complementary set of points. The two methods are then combined in a path planning framework that reaches any possible goal configuration in a planar obstacle-free workspace with three landmarks. The proposed framework could be combined with other techniques, such as obstacle avoidance and topological maps, to improve the efficiency of autonomous navigation. Experiments conducted on a robotic platform equipped with a panoramic camera exhibit the effectiveness and accuracy of the proposed techniques. This work provides evidence that navigational tasks can be performed using only a small number of primitive sensor cues and without the explicit computation of range information. |
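For intuition only, here is a naive bearing-only homing sketch: at each step it numerically picks the small motion that best reduces the mismatch between the current and goal landmark bearings. Unlike the control laws analyzed in the paper, this greedy scheme carries no convergence guarantee, and landmark positions are used only to simulate the bearing measurements:

```python
import math

def bearings(pos, landmarks):
    return [math.atan2(ly - pos[1], lx - pos[0]) for lx, ly in landmarks]

def bearing_error(current, goal):
    # Sum of squared, angle-wrapped differences between two bearing sets.
    return sum(math.atan2(math.sin(c - g), math.cos(c - g)) ** 2
               for c, g in zip(current, goal))

def homing_step(pos, goal_bearings, landmarks, step=0.05):
    # Greedily move along whichever of 16 candidate headings most reduces
    # the bearing mismatch (a crude numerical descent, which can stall).
    candidates = [(pos[0] + step * math.cos(a), pos[1] + step * math.sin(a))
                  for a in (k * math.pi / 8 for k in range(16))]
    return min(candidates,
               key=lambda p: bearing_error(bearings(p, landmarks), goal_bearings))

landmarks = [(0.0, 5.0), (4.0, -1.0), (-3.0, 2.0)]   # three visible landmarks
goal_bearings = bearings((1.0, 1.0), landmarks)      # bearings recorded at home
pos = (4.0, 4.0)                                     # where homing is triggered
for _ in range(200):
    pos = homing_step(pos, goal_bearings, landmarks)
print(pos)   # inspect how close the final position is to home (1.0, 1.0)
```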
2001 |
Argyros, A; Bekris, K; Orphanoudakis, S: Robot Homing Based on Corner Tracking in a Sequence of Panoramic Images. Computer Vision and Pattern Recognition Conference (CVPR), Hawaii, USA, 2001. URL: http://www.cs.rutgers.edu/~kb572/pubs/homing_corner_tracking.pdf
Abstract: In robotics, homing can be defined as that behavior which enables a robot to return to its initial (home) position, after traveling a certain distance along an arbitrary path. Odometry has traditionally been used for the implementation of such a behavior, but it has been shown to be an unreliable source of information. In this work, a novel method for visual homing is proposed, based on a panoramic camera. As the robot departs from its initial position, it tracks characteristic features of the environment (corners). As soon as homing is activated, the robot selects intermediate target positions on the original path. These intermediate positions (IPs) are then visited sequentially, until the home position is reached. For the robot to move between two consecutive IPs, it is only required to establish correspondence among at least three corners. This correspondence is obtained through a feature tracking mechanism. The proposed homing scheme is based on the extraction of very low-level sensory information, namely the bearing angles of corners, and has been implemented on a robotic platform. Experimental results show that the proposed scheme achieves homing with a remarkable accuracy, which is not affected by the distance traveled by the robot. |