This article identifies, using a zero-shot method (Gen6D), the 3D bounding box of a target at a far distance from a UAV. Furthermore, it infers the pose of the camera attached to the drone, based on the underlying training on the visual data. These visual data are used within a YOLO framework to detect targets belonging to a given class. The vertices of the orthogonal 3D box are then used in a visual-servoing scheme for the gimbal mounted on the UAV. The camera has a varying focal length (zoom), and the indirect objective is to move the UAV closer to the target while reducing the zoom factor. Initially, the UAV starts with a large zoom factor (36×) at a far distance (100 m) from the target. The UAV then approaches the target using the visual-servoing scheme, reducing its zoom in discrete steps while maintaining focus. Experimental results demonstrate the effectiveness of the proposed method.
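The approach-with-zoom-reduction loop described above can be sketched as follows. This is a minimal illustrative mock, not the paper's implementation: the detection and pose-estimation outputs (Gen6D, YOLO) are simulated, the distance thresholds for each discrete zoom step are assumed, and the servo law is reduced to a simple proportional controller on the pixel error of the projected box center (a stand-in for the full vertex-based scheme).

```python
# Hypothetical sketch of the zoom-scheduled visual-servoing loop.
# All thresholds, gains, and function names are illustrative assumptions.

# Assumed mapping: (minimum distance in meters, zoom factor) per discrete step,
# starting at 36x for the initial 100 m stand-off.
ZOOM_SCHEDULE = [(70, 36), (45, 24), (25, 12), (12, 6), (5, 3), (0, 1)]

def zoom_for_distance(d):
    """Pick the discrete zoom factor for the current target distance."""
    for threshold, zoom in ZOOM_SCHEDULE:
        if d >= threshold:
            return zoom
    return 1

def servo_step(box_center_px, image_center_px, gain=0.01):
    """Proportional gimbal command (pan/tilt rates) from the pixel error of
    the projected 3D-box center relative to the image center."""
    ex = box_center_px[0] - image_center_px[0]
    ey = box_center_px[1] - image_center_px[1]
    return (-gain * ex, -gain * ey)

# Simulated approach: the UAV closes in from 100 m while zoom steps down.
distance = 100.0
image_center = (320, 240)
trace = []
while distance > 5.0:
    zoom = zoom_for_distance(distance)
    box_center = (360.0, 210.0)           # mock detection (would come from Gen6D/YOLO)
    pan_rate, tilt_rate = servo_step(box_center, image_center)
    distance -= 5.0                       # UAV advances one step toward the target
    trace.append((distance, zoom))
```

The schedule keeps the target large enough in the image for reliable detection at long range, then relaxes the magnification as the UAV closes in, so the zoom factor in the trace decreases monotonically from 36× toward 1×.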