Towards Lifelong Aerial Autonomy: Geometric Memory Management for Continual Visual Place Recognition in Dynamic Environments
A new memory management system helps drones stay accurately located across changing seasons and conditions without forgetting what they've already learned.

The Thesis
Drones and autonomous aerial systems need to know where they are at all times — and they usually figure that out by recognizing visual landmarks from their cameras. The problem is that environments change: lighting shifts, seasons turn, construction crews rearrange landscapes. When a drone updates its visual memory to handle new conditions, it tends to overwrite what it knew before — a failure mode called catastrophic forgetting. This paper proposes a hybrid memory system that keeps a stable layer of satellite-derived map anchors alongside a smaller, constantly refreshed buffer of recent flight experience. The key finding is that keeping a structurally diverse set of visual memories — covering as wide a range of appearances as possible — outperforms either selecting hard examples or preserving class averages. The catch is that all experiments are conducted in simulation and structured mission sequences; real-world, unscripted deployment remains unproven.
Catalyst
Drone delivery and long-endurance UAV (unmanned aerial vehicle) programs now operate across months-long missions that span seasonal change — something short-hop test flights never exposed. At the same time, continual learning as a subfield has matured enough to offer concrete benchmarks, and satellite imagery at sub-meter resolution is cheaply accessible, making the hybrid anchor-plus-replay approach computationally tractable on embedded hardware for the first time.
What's New
Prior visual place recognition systems for aerial platforms were trained once and deployed statically, meaning they degraded silently whenever real-world appearance drifted from training data. Continual learning methods developed for object classification tried to mitigate this by replaying stored examples, but those methods assumed stable, well-separated categories — an assumption that breaks down when the same geographic location looks radically different under snow, rain, or seasonal foliage. This paper separates memory into two tiers: persistent satellite-image anchors that never change and a small rotating replay buffer selected for maximum feature-space diversity, achieving a 7.8% improvement in knowledge retention over random buffer selection across 21 mission sequences.
The Counter
The 7.8% retention gain over random buffer selection sounds meaningful, but the entire evaluation rests on 21 curated mission sequences drawn from existing aerial datasets — not live, unscripted flights. Real UAV deployments encounter sensor noise, aggressive maneuvers, and unpredictable occlusions that structured benchmarks systematically underrepresent. The 'static satellite anchor' assumption is also fragile: satellite imagery goes stale, is unavailable in denied environments, and often has resolution or timestamp mismatches with what the drone actually sees. The paper explicitly acknowledges that intra-class variation in geographic features is severe, yet the proposed solution — diversity-driven replay — is validated only in the domain-incremental setting where mission boundaries are known in advance. In the wild, those boundaries are fuzzy. Finally, the compute and memory budgets claimed to be 'onboard-feasible' are not benchmarked on actual embedded hardware, leaving open the question of whether this runs in real time on a Jetson-class device or merely fits in RAM.
Longs
- AVAV (AeroVironment) — long-endurance ISR drones that require persistent geo-localization
- KTOS (Kratos Defense) — autonomous drone programs needing GPS-denied navigation
- ESLT (Elbit Systems) — airborne intelligence systems with onboard processing constraints
- JOBY — future urban air mobility fleets dependent on robust visual localization
- BOTZ (robotics/automation ETF) — broad exposure to autonomous aerial and ground systems
Shorts
- Traditional HD-map vendors (HERE Technologies, TomTom) — if onboard learned maps replace expensive pre-built map products for aerial platforms
- GPS-augmentation hardware suppliers — if robust visual localization reduces dependence on RTK-GPS and WAAS correction services in UAVs
Enablers (Picks & Shovels)
- Maxar Technologies / Planet Labs — sub-meter satellite imagery used as static anchor data
- NVIDIA Jetson edge compute platform — onboard inference for memory-constrained UAVs
- OpenStreetMap and open aerial tile services — geographic prior data for anchor construction
- PyTorch continual learning libraries (e.g., Avalanche, Mammoth) — open-source CL frameworks the method builds on
Private Watchlist
- Shield AI — autonomous military UAV navigation in GPS-denied environments
- Skydio — autonomous drone perception and localization stack
- Joby Aviation (pre-revenue) — urban air mobility requiring robust geo-localization
- Windracers — long-endurance logistics UAVs operating across varied terrain
Resources
The Paper
Robust geo-localization in changing environmental conditions is critical for long-term aerial autonomy. While visual place recognition (VPR) models perform well when airborne views match the training domain, adapting them to shifting distributions during sequential missions triggers catastrophic forgetting. Existing continual learning (CL) methods often fail here because geographic features exhibit severe intra-class variations. In this work, we formulate aerial VPR as a mission-based domain-incremental learning (DIL) problem and propose a novel heterogeneous memory framework. To respect strict onboard storage constraints, our "Learn-and-Dispose" pipeline decouples geographic knowledge into static satellite anchors (preserving global geometric priors) and a dynamic experience replay buffer (retaining domain-specific features). We introduce a spatially-constrained allocation strategy that optimizes buffer selection based on sample difficulty or feature space diversity. To facilitate systematic assessment, we provide three evaluation criteria and a comprehensive benchmark derived from 21 diverse mission sequences. Extensive experiments demonstrate that our architecture significantly boosts spatial generalization; our diversity-driven buffer selection outperforms the random baseline by 7.8% in knowledge retention. Unlike class-mean preservation methods that fail in unstructured environments, maximizing structural diversity achieves a superior plasticity-stability balance and ensures order-agnostic robustness across randomized sequences. These results prove that maintaining structural feature coverage is more critical than sample difficulty for resolving catastrophic forgetting in lifelong aerial autonomy.