Physics-Informed Reinforcement Learning of Spatial Density Velocity Potentials for Map-Free Racing

Generative AI & LLMs

Published: arXiv: 2604.09499v1

Authors

Shathushan Sivashangaran Apoorva Khairnar Sepideh Gohari Vihaan Dutta Azim Eskandarian

Abstract

Autonomous racing without prebuilt maps is a grand challenge for embedded robotics that requires kinodynamic planning from instantaneous sensor data at the acceleration and tire friction limits. Out-Of-Distribution (OOD) generalization to various racetrack configurations utilizes Machine Learning (ML) to encode the mathematical relation between sensor data and vehicle actuation for end-to-end control, with implicit localization. These comprise Behavioral Cloning (BC) that is capped to human reaction times and Deep Reinforcement Learning (DRL) which requires large-scale collisions for comprehensive training that can be infeasible without simulation but is arduous to transfer to reality, thus exhibiting greater performance than BC in simulation, but actuation instability on hardware. This paper presents a DRL method that parameterizes nonlinear vehicle dynamics from the spectral distribution of depth measurements with a non-geometric, physics-informed reward, to infer vehicle time-optimal and overtaking racing controls with an Artificial Neural Network (ANN) that utilizes less than 1% of the computation of BC and model-based DRL. Slaloming from simulation to reality transfer and variance-induced conservatism are eliminated with the combination of a physics engine exploit-aware reward and the replacement of an explicit collision penalty with an implicit truncation of the value horizon. The policy outperforms human demonstrations by 12% in OOD tracks on proportionally scaled hardware, by maximizing the friction circle with tire dynamics that resemble an empirical Pacejka tire model. System identification illuminates a functional bifurcation where the first layer compresses spatial observations to extract digitized track features with higher resolution in corner apexes, and the second encodes nonlinear dynamics.

Paper Summary

Problem

The main problem this paper addresses is the challenge of autonomous racing without prebuilt maps, which requires kinodynamic planning from instantaneous sensor data at the acceleration and tire friction limits. Current approaches rely on detailed prebuilt maps, global reference trajectories, and computationally expensive optimal solvers, but these are brittle in the real world and prevent effective Out-Of-Distribution (OOD) generalization.

Key Innovation

The key innovation of this work is a Deep Reinforcement Learning (DRL) method that parameterizes nonlinear vehicle dynamics from the spectral distribution of depth measurements with a non-geometric, physics-informed reward. This allows the agent to infer vehicle time-optimal and overtaking racing controls with an Artificial Neural Network (ANN) that uses less than 1% of the computation of Behavioral Cloning (BC) and model-based DRL.

Practical Impact

This research has significant practical impact in the field of autonomous racing, as it enables map-free racing without prebuilt maps, which is a grand challenge for embedded robotics. The proposed method can be applied in real-world scenarios, such as racing competitions, where the ability to adapt to new track layouts and conditions is crucial. The method also enables parameterizing dynamics-optimized overtaking with the same RL formulation, making it a promising approach for multi-agent environments.

Analogy / Intuitive Explanation

Imagine driving a car on a track without knowing the map or layout. Traditional approaches would rely on prebuilt maps and precise trajectory planning, but this can be brittle and prone to errors. The proposed method is like having a "smart" co-pilot that can learn the track layout and dynamics from sensor data and adjust its driving strategy accordingly, using a non-geometric, physics-informed reward to optimize performance. This co-pilot can adapt to new track layouts and conditions, making it a game-changer for autonomous racing.

Paper Information

Categories:

cs.RO

Published Date:

arXiv ID:

2604.09499v1

Quick Actions

Back to Home