Abstract
Beacon-aided autonomous underwater vehicle (AUV) localization supporting maritime surveillance applications in underwater acoustic sensor networks selects a fixed number of beacons with constant transmit power, and thus has degradation of localization accuracy with severe channel fading and position fluctuation of beacons. In this paper, we propose a reinforcement learning based AUV localization scheme to choose the beacons and their transmit power to improve the localization accuracy and energy efficiency based on the AUV depth, the received signal strength, the number of selected beacons and the beacon energy consumption. According to the least squares method, the AUV position is calculated based on the isogradient sound speed model and the round-trip time of the localization signals. The localization error averaged over different beacon sets is evaluated to formulate the localization policy distribution. Deep neural network is designed to estimate the expected long-term discounted utility with higher feature extraction efficiency for the underwater networks with a large number of beacons. The Cramer-Rao lower bounds of the proposed localization schemes are derived to analyze the effect of the position fluctuation of beacons on the localization accuracy. Simulation results verify the performance gain in terms of the localization accuracy and the beacon energy consumption over the benchmark.
| Original language | English |
|---|---|
| Pages (from-to) | 7799-7811 |
| Number of pages | 13 |
| Journal | IEEE Transactions on Vehicular Technology |
| Volume | 73 |
| Issue number | 6 |
| DOIs | |
| State | Published - 1 Jun 2024 |
| Externally published | Yes |
UN SDGs
This output contributes to the following UN Sustainable Development Goals (SDGs)
-
SDG 7 Affordable and Clean Energy
Keywords
- Cramer-Rao lower bound
- Underwater acoustic sensor networks
- autonomous underwater vehicles
- localization
- reinforcement learning
Fingerprint
Dive into the research topics of 'Efficient Beacon-Aided AUV Localization: A Reinforcement Learning Based Approach'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver