
Showing papers by "Willow Garage" published in 2009


Proceedings ArticleDOI
10 Oct 2009
TL;DR: It is found that personal experience with pets and robots decreases a person's personal space around robots, that the personality trait of agreeableness decreases personal space when people approach robots, and that neuroticism and negative attitudes toward robots increase personal space when robots approach people.
Abstract: As robots enter the everyday physical world of people, it is important that they abide by society's unspoken social rules such as respecting people's personal spaces. In this paper, we explore issues related to human personal space around robots, beginning with a review of the existing literature in human-robot interaction regarding the dimensions of people, robots, and contexts that influence human-robot interactions. We then present several research hypotheses which we tested in a controlled experiment (N=30). Using a 2 (robotics experience vs. none: between-participants) × 2 (robot head oriented toward a participant's face vs. legs: within-participants) mixed design experiment, we explored the factors that influence proxemic behavior around robots in several situations: (1) people approaching a robot, (2) people being approached by an autonomously moving robot, and (3) people being approached by a teleoperated robot. We found that personal experience with pets and robots decreases a person's personal space around robots. In addition, when the robot's head is oriented toward the person's face, it increases the minimum comfortable distance for women, but decreases the minimum comfortable distance for men. We also found that the personality trait of agreeableness decreases personal spaces when people approach robots, while the personality trait of neuroticism and having negative attitudes toward robots increase personal spaces when robots approach people. These results have implications for both human-robot interaction theory and design.

370 citations


Proceedings ArticleDOI
10 Oct 2009
TL;DR: This paper presents a system of visual mapping, using only input from a stereo camera, that continually updates an optimized metric map in large indoor spaces with movable objects: people, furniture, partitions, etc.
Abstract: The typical SLAM mapping system assumes a static environment and constructs a map that is then used without regard for ongoing changes. Most SLAM systems, such as FastSLAM, also require a single connected run to create a map. In this paper we present a system of visual mapping, using only input from a stereo camera, that continually updates an optimized metric map in large indoor spaces with movable objects: people, furniture, partitions, etc. The system can be stopped and restarted at arbitrary disconnected points, is robust to occlusion and localization failures, and efficiently maintains alternative views of a dynamic environment. It operates completely online at a 30 Hz frame rate.

175 citations
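
The abstract above describes the mapping system at the architecture level. As a rough illustration of its lowest layer only, here is a minimal sketch, assuming OpenCV and a calibrated rectified stereo rig, of one keyframe-matching step: triangulate features from a stereo pair, then estimate the pose of a later frame against that keyframe. The camera parameters, feature choice (ORB), and thresholds are illustrative assumptions, not the authors' implementation.

```python
# Hedged sketch of a keyframe-based stereo matching step (not the paper's code).
import cv2
import numpy as np

fx, fy, cx, cy, baseline = 525.0, 525.0, 319.5, 239.5, 0.12   # assumed rig
K = np.array([[fx, 0, cx], [0, fy, cy], [0, 0, 1]], dtype=np.float64)

stereo = cv2.StereoSGBM_create(minDisparity=0, numDisparities=64, blockSize=9)
orb = cv2.ORB_create(nfeatures=1000)
matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)

def make_keyframe(left, right):
    """Features plus per-feature 3D points from one rectified stereo pair."""
    disp = stereo.compute(left, right).astype(np.float32) / 16.0
    kps, desc = orb.detectAndCompute(left, None)
    pts3d = []
    for kp in kps:
        u, v = int(kp.pt[0]), int(kp.pt[1])
        d = disp[v, u]
        z = fx * baseline / d if d > 1.0 else np.nan   # depth from disparity
        pts3d.append([(u - cx) * z / fx, (v - cy) * z / fy, z])
    return kps, desc, np.array(pts3d, dtype=np.float32)

def pose_against_keyframe(keyframe, cur_left):
    """Estimate the current camera pose relative to a stored keyframe (PnP + RANSAC)."""
    kps_ref, desc_ref, pts3d_ref = keyframe
    kps_cur, desc_cur = orb.detectAndCompute(cur_left, None)
    matches = matcher.match(desc_ref, desc_cur)
    obj = np.array([pts3d_ref[m.queryIdx] for m in matches], dtype=np.float32)
    img = np.array([kps_cur[m.trainIdx].pt for m in matches], dtype=np.float32)
    valid = np.isfinite(obj[:, 2])
    ok, rvec, tvec, inliers = cv2.solvePnPRansac(obj[valid], img[valid], K, None)
    return ok, rvec, tvec, inliers
```

In the paper's system such relative-pose constraints feed a continually optimized graph of keyframes; that back end is omitted here.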


Proceedings ArticleDOI
01 Sep 2009
TL;DR: A novel 3D scene interpretation approach for robots in mobile manipulation scenarios that combines a set of 3D point features (Fast Point Feature Histograms) with probabilistic graphical methods (Conditional Random Fields), using real-time stereo with textured light to obtain dense depth maps in the working space of the robot's manipulators.
Abstract: This paper proposes a novel 3D scene interpretation approach for robots in mobile manipulation scenarios using a set of 3D point features (Fast Point Feature Histograms) and probabilistic graphical methods (Conditional Random Fields). Our system uses real-time stereo with textured light to obtain dense depth maps in the working space of the robot's manipulators. For the purposes of manipulation, we want to interpret the planar supporting surfaces of the scene and recognize and segment the object classes into their primitive parts in 6 degrees of freedom (6DOF), so that the robot knows what it is attempting to use and where it may be handled. The scene interpretation algorithm uses a two-layer classification scheme: i) we estimate Fast Point Feature Histograms (FPFH) as local 3D point features to segment the objects of interest into geometric primitives; and ii) we learn and categorize object classes using a novel Global Fast Point Feature Histogram (GFPFH) scheme which uses the previously estimated primitives at each point. To show the validity of our approach, we analyze the proposed system for the problem of recognizing the object class of 20 objects in 500 table-setting scenarios. Our algorithm identifies the planar surfaces, decomposes the scene and objects into geometric primitives with 98.27% accuracy, and uses those primitives to identify each object's class with an accuracy of 96.69%.

115 citations
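
As a concrete, hedged illustration of the two-layer idea (local FPFH features labeled into geometric primitives, then pooled per object), the following sketch uses Open3D for plane extraction and FPFH estimation; the paper's CRF layer is stood in for by a generic per-point classifier supplied by the caller. All radii and thresholds are assumptions, not the authors' values.

```python
# Hedged sketch: planar-surface removal + FPFH descriptors + a stand-in
# per-point primitive classifier (the paper uses a CRF over neighbouring points).
import numpy as np
import open3d as o3d

def interpret_scene(pcd, primitive_classifier):
    # 1) Planar supporting surface via RANSAC.
    plane_model, inliers = pcd.segment_plane(distance_threshold=0.01,
                                             ransac_n=3, num_iterations=1000)
    objects = pcd.select_by_index(inliers, invert=True)

    # 2) Local layer: one 33-bin FPFH descriptor per remaining point.
    objects.estimate_normals(
        o3d.geometry.KDTreeSearchParamHybrid(radius=0.02, max_nn=30))
    fpfh = o3d.pipelines.registration.compute_fpfh_feature(
        objects, o3d.geometry.KDTreeSearchParamHybrid(radius=0.05, max_nn=100))
    descriptors = np.asarray(fpfh.data).T                  # shape (N, 33)

    # 3) Label each point with a geometric-primitive class; the global
    #    (GFPFH) object-class layer would then pool these labels per object.
    primitive_labels = primitive_classifier.predict(descriptors)
    return plane_model, objects, primitive_labels
```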


Journal IssueDOI
TL;DR: The main components that comprise the system, including stereo processing, obstacle and free space interpretation, long-range perception, online terrain traversability learning, visual odometry, map registration, planning, and control are described.
Abstract: The challenge in the DARPA Learning Applied to Ground Robots (LAGR) project is to autonomously navigate a small robot using stereo vision as the main sensor. During this project, we demonstrated a complete autonomous system for off-road navigation in unstructured environments. The system is very robust: we can typically give it a goal position several hundred meters away and expect it to get there. In this paper we describe the main components that comprise the system, including stereo processing, obstacle and free space interpretation, long-range perception, online terrain traversability learning, visual odometry, map registration, planning, and control. At the end of 3 years, the system we developed outperformed all nine other teams in final blind tests over previously unseen terrain. © 2008 Wiley Periodicals, Inc.

103 citations
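
Of the components listed above, the obstacle and free-space interpretation step lends itself to a compact illustration. Below is a minimal sketch, assuming a rectified disparity image and a camera at a known height; the geometry, cell size, and height threshold are placeholder values, not the project's actual pipeline.

```python
# Hedged sketch: one disparity image -> a local free-space / obstacle grid.
import numpy as np

def local_grid(disparity, fx=400.0, cx=320.0, cy=240.0, baseline=0.12,
               cam_height=1.0, cell=0.2, extent=10.0, obstacle_height=0.25):
    """Boolean (free, obstacle) grids; rows = forward distance, cols = lateral offset."""
    rows, cols = int(extent / cell), int(2 * extent / cell)
    free = np.zeros((rows, cols), dtype=bool)
    obstacle = np.zeros((rows, cols), dtype=bool)
    v, u = np.nonzero(disparity > 1.0)            # valid disparities only
    z = fx * baseline / disparity[v, u]           # forward range (m)
    x = (u - cx) * z / fx                         # lateral offset (m)
    h = cam_height - (v - cy) * z / fx            # height above ground (m), fx ~ fy
    keep = (z > 0.5) & (z < extent) & (np.abs(x) < extent)
    r = (z[keep] / cell).astype(int)
    c = ((x[keep] + extent) / cell).astype(int)
    high = h[keep] > obstacle_height
    obstacle[r[high], c[high]] = True
    free[r[~high], c[~high]] = True
    free &= ~obstacle                             # obstacle evidence wins per cell
    return free, obstacle
```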


Proceedings ArticleDOI
10 Oct 2009
TL;DR: A modular and distributed architecture is proposed, which seamlessly integrates the creation of 3D maps for collision detection and semantic annotations with a real-time motion replanning framework.
Abstract: This paper presents significant steps towards the online integration of 3D perception and manipulation for personal robotics applications. We propose a modular and distributed architecture, which seamlessly integrates the creation of 3D maps for collision detection and semantic annotations with a real-time motion replanning framework. To validate our system, we present results obtained during a comprehensive mobile manipulation scenario, which includes the fusion of the above components with a higher-level executive.

93 citations
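
The architecture itself is described only at a high level; the sketch below shows, under assumed generic interfaces (a map object, a planner, and a trajectory executor, none of which are the paper's actual components), the control flow the abstract implies: keep folding new sensor data into the collision map and replan whenever the current path is invalidated.

```python
# Hedged sketch of a map-update / replanning loop with assumed duck-typed interfaces.
import time

class ReplanningLoop:
    def __init__(self, planner, collision_map, executor, goal):
        self.planner = planner      # assumed: plan(start_state, goal) -> path
        self.map = collision_map    # assumed: integrate_latest_sensor_data(), path_in_collision(path)
        self.executor = executor    # assumed: current_state(), follow(path), done()
        self.goal = goal
        self.path = None

    def spin_once(self):
        self.map.integrate_latest_sensor_data()        # 3D map module keeps running
        if self.path is None or self.map.path_in_collision(self.path):
            # A newly observed obstacle invalidated the plan: replan from the
            # robot's current state rather than from the original start.
            self.path = self.planner.plan(self.executor.current_state(), self.goal)
            self.executor.follow(self.path)

    def run(self, rate_hz=10.0):
        while not self.executor.done():
            self.spin_once()
            time.sleep(1.0 / rate_hz)
```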


Book ChapterDOI
11 Jul 2009
TL;DR: This paper presents an approach that learns kinematic models by inferring the connectivity of rigid parts and the articulation models for the corresponding links, using a mixture of parameterized and parameter-free representations and finding low-dimensional manifolds that best explain the given observations.
Abstract: Robots operating in home environments must be able to interact with articulated objects such as doors or drawers. Ideally, robots are able to autonomously infer articulation models by observation. In this paper, we present an approach to learn kinematic models by inferring the connectivity of rigid parts and the articulation models for the corresponding links. Our method uses a mixture of parameterized and parameter-free (Gaussian process) representations and finds low-dimensional manifolds that provide the best explanation of the given observations. Our approach has been implemented and evaluated using real data obtained in various realistic home environment settings.

71 citations
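
To make the model-selection idea concrete, here is a heavily simplified sketch: it fits a prismatic (line) and a revolute (circle) model to observed 3D positions of a single link and keeps the one with the lower penalized residual. The paper works with full 6-DOF pose observations and adds Gaussian-process, parameter-free models; none of that is reproduced here, and the penalty constants are arbitrary.

```python
# Hedged sketch: prismatic vs. revolute model selection from observed 3D positions.
import numpy as np

def fit_prismatic(p):
    """Fit a line (origin + direction); return mean squared residual."""
    origin = p.mean(axis=0)
    _, _, vt = np.linalg.svd(p - origin)
    direction = vt[0]
    proj = origin + np.outer((p - origin) @ direction, direction)
    return np.mean(np.sum((p - proj) ** 2, axis=1))

def fit_revolute(p):
    """Fit a circle in the best-fit plane; return mean squared residual."""
    centroid = p.mean(axis=0)
    _, _, vt = np.linalg.svd(p - centroid)
    xy = (p - centroid) @ vt[:2].T                  # coordinates in the plane
    # Algebraic circle fit: solve [2x 2y 1] [a b c]^T = x^2 + y^2
    A = np.column_stack([2 * xy, np.ones(len(xy))])
    (a, b, c), *_ = np.linalg.lstsq(A, np.sum(xy ** 2, axis=1), rcond=None)
    radius = np.sqrt(c + a ** 2 + b ** 2)
    in_plane = np.sum((np.linalg.norm(xy - [a, b], axis=1) - radius) ** 2)
    out_of_plane = np.sum(((p - centroid) @ vt[2]) ** 2)
    return (in_plane + out_of_plane) / len(p)

def select_model(p, dof_penalty=1e-4):
    """Pick the model with the lower penalized residual (more parameters cost more)."""
    scores = {"prismatic": fit_prismatic(p) + 6 * dof_penalty,
              "revolute":  fit_revolute(p) + 7 * dof_penalty}
    return min(scores, key=scores.get)
```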


Proceedings Article
07 Dec 2009
TL;DR: This work implements a novel LDA-SIFT formulation which performs LDA prior to any vector quantization step, discovers latent topics that are characteristic of particular transparent patches, and quantizes the SIFT space into transparent visual words according to the latent topic dimensions.
Abstract: Existing methods for visual recognition based on quantized local features can perform poorly when local features exist on transparent surfaces, such as glass or plastic objects. There are characteristic patterns to the local appearance of transparent objects, but they may not be well captured by distances to individual examples or by a local pattern codebook obtained by vector quantization. The appearance of a transparent patch is determined in part by the refraction of a background pattern through a transparent medium: the energy from the background usually dominates the patch appearance. We model transparent local patch appearance using an additive model of latent factors: background factors due to scene content, and factors which capture a local edge energy distribution characteristic of the refraction. We implement our method using a novel LDA-SIFT formulation which performs LDA prior to any vector quantization step; we discover latent topics which are characteristic of particular transparent patches and quantize the SIFT space into transparent visual words according to the latent topic dimensions. No knowledge of the background scene is required at test time; we show examples recognizing transparent glasses in a domestic environment.

69 citations
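
A minimal, hedged sketch of the central trick follows: each SIFT descriptor is treated as a bag of gradient-bin counts, latent topics are learned over raw descriptors before any vector quantization, and quantization then happens in topic space. OpenCV's SIFT, scikit-learn's LatentDirichletAllocation and KMeans, and all parameter values are stand-ins, not the authors' implementation.

```python
# Hedged sketch of the LDA-SIFT idea: topic modelling on raw SIFT descriptors,
# then quantization in topic space into "transparent visual words".
import cv2
import numpy as np
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.cluster import KMeans

sift = cv2.SIFT_create()

def descriptors(gray_images):
    """Stack SIFT descriptors from a list of grayscale images."""
    descs = []
    for img in gray_images:
        _, d = sift.detectAndCompute(img, None)
        if d is not None:
            descs.append(d)
    return np.vstack(descs)

def train_transparent_vocabulary(train_images, n_topics=32, n_words=256):
    X = descriptors(train_images)            # each row: 128 non-negative bin values
    lda = LatentDirichletAllocation(n_components=n_topics).fit(X)
    topics = lda.transform(X)                # latent-topic representation per patch
    words = KMeans(n_clusters=n_words, n_init=10).fit(topics)
    return lda, words                        # quantizer defined over topic space

def transparent_visual_words(image, lda, words):
    """Map one image's patches to transparent-visual-word indices."""
    _, d = sift.detectAndCompute(image, None)
    return words.predict(lda.transform(d)) if d is not None else np.array([], int)
```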


Proceedings ArticleDOI
04 Apr 2009
TL;DR: The effects of agent disagreement and agent voice location in a collaborative human-agent desert survival task are studied and it is found that people changed their answers more often when agents disagreed with them and felt more similar to agents that always agreed with them, even when substantive content was identical.
Abstract: As computational agents become more sophisticated, it will frequently be necessary for the agents to disagree with users. In these cases, it might be useful for the agent to use politeness strategies that defuse the person's frustrations and preserve the human-computer relationship. One such strategy is distancing, which we implemented by spatially distancing an agent's voice from its body. In a 2 (agent disagreement: none vs. some) x 2 (agent voice location: on robotic body vs. in control box) between-participants experiment, we studied the effects of agent disagreement and agent voice location in a collaborative human-agent desert survival task (N=40). People changed their answers more often when agents disagreed with them and felt more similar to agents that always agreed with them, even when substantive content was identical. Strikingly, people felt more positively toward the disagreeing agent whose voice came from a separate control box rather than from its body; for agreement, the body-attached voice was preferred.

67 citations


Proceedings ArticleDOI
01 Sep 2009
TL;DR: This paper shows that the sparseness of these signatures can be exploited to compact them, speed up the computation, and drastically reduce memory usage, and highlights the approach's effectiveness by incorporating it into two very different SLAM packages and demonstrating substantial performance increases.
Abstract: Prominent feature point descriptors such as SIFT and SURF allow reliable real-time matching but at a computational cost that limits the number of points that can be handled on PCs, and even more on less powerful mobile devices. A recently proposed technique that relies on statistical classification to compute signatures has the potential to be much faster but at the cost of using very large amounts of memory, which makes it impractical for implementation on low-memory devices. In this paper, we show that we can exploit the sparseness of these signatures to compact them, speed up the computation, and drastically reduce memory usage. We base our approach on Compressive Sensing theory. We also highlight its effectiveness by incorporating it into two very different SLAM packages and demonstrating substantial performance increases.

66 citations
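
The compaction step can be illustrated in a few lines. The sketch below is a rough stand-in rather than the paper's method: it projects each sparse signature through a random +-1 measurement matrix (a standard compressive-sensing ensemble) and matches nearest neighbours directly on the short codes, whose distances are approximately preserved.

```python
# Hedged sketch: compressive-sensing-style compaction of sparse signature vectors.
import numpy as np

rng = np.random.default_rng(0)

def measurement_matrix(m, n):
    """Random +-1 matrix, a standard compressive-sensing measurement ensemble."""
    return rng.choice([-1.0, 1.0], size=(m, n)) / np.sqrt(m)

def compress(signatures, phi):
    """signatures: (N, n) sparse rows; returns (N, m) compact codes."""
    return signatures @ phi.T

def match(query_code, database_codes):
    """Nearest neighbour directly on compressed codes."""
    d = np.linalg.norm(database_codes - query_code, axis=1)
    return int(np.argmin(d)), float(d.min())
```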


Proceedings Article
22 Jun 2009
TL;DR: In this paper, a 3D perception pipeline is used to annotate doors and their handles solely from sensed laser data, without any a priori model learning, and the robustness of the approach is demonstrated by real-world experiments conducted on a large set of doors.
Abstract: In this paper, we present a laser-based approach for door and handle identification. The approach builds on a 3D perception pipeline to annotate doors and their handles solely from sensed laser data, without any a priori model learning. In particular, we segment the parts of interest using robust geometric estimators and statistical methods applied on geometric and intensity distribution variations in the scan. We present experimental results on a mobile manipulation platform (PR2) intended for indoor manipulation tasks. We validate the approach by generating trajectories that position the robot's end-effector in front of door handles and grasp them. The robustness of our approach is demonstrated by real-world experiments conducted on a large set of doors.

62 citations
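
A hedged sketch of the geometric part of the pipeline is given below, using Open3D in place of the paper's estimators and ignoring the intensity-distribution cues: fit the dominant plane as the door, keep points offset a few centimetres from it, and take the largest remaining cluster's centroid as the handle candidate. Thresholds are illustrative only.

```python
# Hedged sketch: door plane via RANSAC, handle as the largest protruding cluster.
import numpy as np
import open3d as o3d

def find_handle(points_xyz, min_offset=0.02, max_offset=0.10):
    pcd = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(points_xyz))
    # Dominant plane in the scan is taken to be the door surface.
    (a, b, c, d), _ = pcd.segment_plane(distance_threshold=0.01,
                                        ransac_n=3, num_iterations=1000)
    signed = points_xyz @ np.array([a, b, c]) + d      # signed distance to the plane
    offset = np.abs(signed)
    cand = points_xyz[(offset > min_offset) & (offset < max_offset)]
    if len(cand) == 0:
        return None
    cand_pcd = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(cand))
    labels = np.asarray(cand_pcd.cluster_dbscan(eps=0.03, min_points=20))
    if labels.size == 0 or labels.max() < 0:
        return None
    # Centroid of the largest cluster = handle candidate for grasping.
    best = np.argmax(np.bincount(labels[labels >= 0]))
    return cand[labels == best].mean(axis=0)
```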


Proceedings ArticleDOI
01 Dec 2009
TL;DR: A comprehensive perception system with applications to mobile manipulation and grasping for personal robotics, which makes use of dense 3D point cloud data acquired using stereo vision cameras by projecting textured light onto the scene.
Abstract: In this paper we present a comprehensive perception system with applications to mobile manipulation and grasping for personal robotics. Our approach makes use of dense 3D point cloud data acquired using stereo vision cameras by projecting textured light onto the scene. To create models suitable for grasping, we extract the supporting planes and model object clusters with different surface geometric primitives. The resultant decoupled primitive point clusters are then reconstructed as smooth triangular mesh surfaces, and their use is validated in grasping experiments using OpenRAVE [1]. To annotate the point cloud data with primitive geometric labels we make use of our previously proposed Fast Point Feature Histograms [2] and probabilistic graphical methods (Conditional Random Fields), and obtain a classification accuracy of 98.27% for different object geometries. We show the validity of our approach by analyzing the proposed system for the problem of building object models usable in grasping applications with the PR2 robot (see Figure 1).
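
The following sketch illustrates the geometric portion of this pipeline under stated assumptions: Open3D stands in for the stereo/textured-light front end and the FPFH/CRF labeling, and convex hulls stand in for the smooth triangular mesh reconstruction, but the flow (remove the supporting plane, split into object clusters, build one mesh per object for the grasp planner) mirrors the abstract.

```python
# Hedged sketch: supporting-plane removal, per-object clustering, per-cluster meshes.
import numpy as np
import open3d as o3d

def object_meshes(pcd):
    # Remove the supporting plane (e.g. the table top).
    _, table = pcd.segment_plane(distance_threshold=0.01,
                                 ransac_n=3, num_iterations=1000)
    objects = pcd.select_by_index(table, invert=True)
    # Split the remaining points into per-object clusters.
    labels = np.asarray(objects.cluster_dbscan(eps=0.02, min_points=50))
    meshes = []
    if labels.size == 0:
        return meshes
    for k in range(labels.max() + 1):
        idx = np.nonzero(labels == k)[0].tolist()
        cluster = objects.select_by_index(idx)
        hull, _ = cluster.compute_convex_hull()    # placeholder for mesh reconstruction
        meshes.append(hull)                        # handed to the grasp planner
    return meshes
```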

Proceedings ArticleDOI
10 Oct 2009
TL;DR: The Stanford Testbed of Autonomous Rotorcraft for Multi-Agent Control, a fleet of quadrotor helicopters, has been developed as a testbed for novel algorithms that enable autonomous operation of aerial vehicles, and has been used to validate algorithms such as reactive collision avoidance, collision avoidance through Nash bargaining, path planning, cooperative search, and aggressive maneuvering.
Abstract: The Stanford Testbed of Autonomous Rotorcraft for Multi-Agent Control, a fleet of quadrotor helicopters, has been developed as a testbed for novel algorithms that enable autonomous operation of aerial vehicles. The testbed has been used to validate multiple algorithms such as reactive collision avoidance, collision avoidance through Nash Bargaining, path planning, cooperative search and aggressive maneuvering. This article briefly describes the algorithms presented and provides references for a more in-depth formulation, and the accompanying movie shows the demonstration of the algorithms on the testbed.

Proceedings ArticleDOI
04 Apr 2009
TL;DR: This paper describes how a suite of butlers can opportunistically and proactively offer information to the user in the moment, allowing mobile users to stay focused on their task at hand.
Abstract: Advances in mobile phones and cellular network capabilities have enabled many opportunities for information access on the move. These capabilities provide instant access for the mobile user, but have exacerbated the problem of interaction in a mobile context. Mobile users are often engaged in another task that makes it difficult for them to filter and interact with their mobile device at the same time. Mobile multitasking creates an attention deficit for the user. This paper proposes using butlers as a model to overcome this problem by offloading the burden of interaction from the user to the device. We describe how a suite of butlers can opportunistically and proactively offer information to the user in the moment, allowing mobile users to stay focused on their task at hand.

Proceedings ArticleDOI
04 Apr 2009
TL;DR: In this system, audio priming is used to prepare a person's state of mind and improve their sociability in the upcoming social context.
Abstract: Psychologically preparing for upcoming events can be a difficult task, particularly when switching social contexts, e.g., from office work to a family event. To help with such transitions, the audio priming system uses pre-recorded audio messages to psychologically prepare a person for an upcoming event. In this system, audio priming is being used to prepare a person's state of mind to improve one's sociability in the upcoming social context.