Sumario: | Endowing robots with reasoning capabilities allowing them not to just answer the question 2where am I?" but also giving them the ability to infer, for example, the type of room they are in, through semantic recognition of the objects, persons,... in the room, is an extremely dificult research goal. Arnau Ramisa, in this PhD thesis, has addressed these issues and the results he has obtained are a step towards this goal. The thesis has two parts. The first one addresses the localization problem and the second one addresses the object recognition problem. The main contribution of the first part is the obtention of a signature to characterize places relying on a constellation of combinations of diferent types of visual feature regions extracted from a panoramic image of the place that is represented by a node of a topological map graph. The proposed method has been tested with panoramic images of different rooms in various UAB Campus buildings. The results show that through the signature obtained the robot can compute its location even if some changes in the environment have occurred. The second contribution of the first part of this thesis is the development of a biologically-inspired and computationally cheap homing method based on the Average Landmark Vector (ALV). Normally, artificial landmarks had been used together with the Average Landmark Vector, but this thesis introduces a method that combines the ALV with the feature regions used for global localization extracted from the panoramic images mentioned above. Although the localization method proposed in this first part of the thesis is able to reliably model and recognize places, a powerful space representation constructed from semantically rich elements is needed. To progress towards this goal, the second part of this thesis presents and evaluation of two state of the art object recognition methods - SIFT and Vocabulary Tree - and, more importantly, several improvements of these methods are proposed in order to adapt them to the more strict computational requirements of mobile robots. Furthermore, since the evaluation has shown that the suitability of an object recognition method depends on some specific properties of the acquired images, a Reinforcement Learning based approach has been proposed to select, on-line, which object recognition method should be applied in each situation. This learning approach constitutes the last contribution of this thesis.
|