Review 评论Underwater robot sensing technology: A survey
水下机器人传感技术:调查
Keywords 关键词
1. Introduction 1.导言
Recently, underwater robot sensing technologies have drawn remarkable attention for marine engineering and resource exploration. On the one hand, underwater robots need to perceive the environment and perform autonomous navigation and obstacle avoidance. On the other hand, underwater robots also depend on the ability of sensing technology to perform a variety of practical application tasks (e.g., object detection, underwater robot grasp, and underwater high-precision 3D measurement). In summary, underwater robot sensing technology is playing an increasingly important role in robots.
近年来,水下机器人传感技术在海洋工程和资源勘探领域引起了广泛关注。一方面,水下机器人需要感知环境并进行自主导航 和避障。另一方面,水下机器人也需要依靠传感技术来完成各种实际应用任务(如物体探测、水下机器人抓取和水下高精度三维测量)。总之,水下机器人传感技术在机器人中发挥着越来越重要的作用。
Most underwater sensing technologies rely on acoustic signals (e.g., sonar), light signals, electromagnetic signals, and bionic sensors. Specifically, sonar estimates the position of a submerged object by measuring the travel time and phase difference of acoustic pulses, which can work at a much longer range and cannot be affected by water turbidity. Although underwater acoustic sensing methods have a large sensing range, their resolution is low, which limits the practical applications of underwater sonar. Optical sensors capture the light rays of their surroundings to acquire environmental information, which can achieve higher resolution and refresh rate. However, due to the complicated underwater light conditions (absorption and scattering), optical sensors can only achieve short-range sensing [1]. An electromagnetic-based sensor could be applied in an underwater environment to estimate the distance precisely as well [2]. However, environmental electromagnetic fields may interfere with the precision. Additionally, researchers begin to study underwater bionics sensing technologies (e.g., whisker and lateral lines [3]). However, these technologies are not mature enough and need to be improved in practice [3]. Each of the above mentioned technologies has their own advantages and drawbacks, and researchers have to combine multiple sensing methods to carry out various underwater exploration tasks in practice.
大多数水下传感技术依赖于声学信号(如声纳)、光信号、电磁信号和仿生传感器。具体来说,声纳通过测量声脉冲的传播时间和相位差来估计水下物体的位置,这种方法可以在更远的范围内工作,并且不受水体浑浊度的影响。虽然水下声波传感方法的传感范围大,但分辨率低,这限制了水下声纳的实际应用。光学传感器捕捉周围环境的光线来获取环境信息,可以获得较高的分辨率和刷新率。然而,由于水下光线条件复杂(吸收和散射),光学传感器只能实现短距离传感[1]。在水下环境中应用基于电磁的传感器也可以精确地估计距离 [2]。然而,环境电磁场可能会干扰精确度。此外,研究人员开始研究水下仿生传感技术(如须和侧线[3])。然而,这些技术还不够成熟,需要在实践中不断改进[3]。 上述每种技术都有各自的优点和缺点,研究人员必须结合多种传感方法,才能在实践中完成各种水下探测任务。
2. Underwater Robots 2.水下机器人
Many countries have performed long-term research on underwater robots. For instance, the U.S. military designed the "bluefin" autonomous underwater vehicle (AUV), as shown in Fig. 1(a), which can perform autonomous underwater navigation and object detection and played a significant role in the search for missing Malaysia Airlines MH370 data in 2014. Russia designed the "Peace 1" and "Peace 2" underwater robots, which are the only pair of manned submersibles in the world that can perform collaborative underwater exploration [4]. Germany developed an AUV called "Deep C", which is an undersea vehicle of 4000 meters and can work 60 hours in the deep sea. France develops the "VICTOR 6000", as shown in Fig. 1(b), which is a cable-operated underwater robot that can acquire a high-quality underwater optical image [5]. Britain developed the fully automatic "Autosub6000" submarine, as shown in Fig. 1(c), which installed batteries and sensors enabling them to navigate independently [6]. Japan developed a deep ocean underwater robot, named the "Kaiko" ROV as shown in Fig. 1(d), which is mounted with various underwater sensors and has dived 296 times. China has also performed extensive research on submarine robots. For instance, the Shenyang Institute of Automation (SIA) developed the "Qianlong" and "Haidou" underwater robots, as shown in Fig. 1(e) and Fig. 1(f), which are equipped with sonar, cameras and lights and have performed a large variety of manipulation tasks at different depths from the sea surface to the seabed. The China Ship Scientific Research Center, SIA and other institutions developed the Jiaolong- and Fendouzhe-manned underwater submarines, as shown in Fig. 1(g) and Fig. 1(h), respectively, which have been used for deep sea exploration. Additionally, Harbin Engineering University developed underwater robots such as "Orange Shark" and "Hai Ling," which can perform underwater environment exploration by installing a variety of underwater sensors. The Institute of Automation, CAS China designed the "Bionic Dolphin" underwater robot that operates at a depth of up to 800 meters.
许多国家都对水下机器人进行了长期研究。例如,美国军方设计的 "蓝鳍 "自主水下机器人(AUV) , 如图 1(a) 所示 ,可进行自主水下导航和物体探测,在 2014 年马航MH370 失联数据搜寻中发挥了重要作用。俄罗斯设计了 "和平 1 号 "和 "和平 2 号 "水下机器人,这是 世界上唯一一对 可以进行水下协同探测的载人 潜水器 [4]。德国开发了一种名为 "Deep C "的自动潜航器,这是一种深达 4000 米的水下航行器,可在深海工作 60 小时。法国研制出 "VICTOR 6000", 如图 1(b)所示 ,它是一种缆索操作的水下机器人,可获取高质量的水下光学图像[5]。英国开发了全自动 "Autosub6000 "潜艇, 如图 1(c)所示,该 潜艇安装了电池和传感器,能够独立航行 [6] 。 日本开发了一种深海水下机器人,命名为 "Kaiko "ROV,如图 1(d)所示,该机器人安装了各种水下传感器,已下潜 296 次。中国也对水下机器人进行了大量研究。例如,沈阳自动化研究所(SIA)开发了 "乾隆 "和 "海斗 "水下机器人, 如图 1( e)和图 1(f)所示,它们配备了声纳、摄像头和照明灯,在从海面到海底的不同深度执行了大量的操纵任务。中国船舶重工集团公司科学研究中心、新航等单位研制的 "蛟龙 "号 和"风斗箭 "号载人水下潜艇 ,分别如图 1(g)和图 1(h)所示 ,已用于深海探测。此外,哈尔滨工程大学研制了 "橙鲨 "和 "海灵 "等水下机器人,通过安装各种水下传感器,可以进行水下环境探测。中国科学院自动化研究所设计了 "仿生海豚 "水下机器人,其工作深度可达 800 米。
Although there are many robot platforms for underwater environment exploration, they often require a variety of sensors to achieve environmental information. Therefore, the development of sensor sensing technologies has an important influence on underwater exploration.
虽然有许多用于水下环境探测的机器人平台,但它们往往需要各种传感器来获取环境信息。因此,传感器传感技术的发展对水下探测有着重要影响。
3. Underwater acoustic sensing
3.水下声学传感
The acoustic sensing technique is widely applied in underwater environments (e.g., in underwater robot localization and navigation, marine engineering, ship maintenance and pipeline measurement). In addition, in military applications, based on sonar data, warships can promptly identify threats (e.g., torpedoes, submarines and anti-submarine aircraft). In this section, we introduce underwater acoustic sensors and underwater acoustic sensing applications.
声学传感技术广泛应用于水下环境(如水下机器人定位和导航、海洋工程、船舶维护和管道测量)。此外,在军事应用中,根据声纳数据,军舰可以及时识别威胁(如鱼雷、潜艇和反潜机)。本节将介绍水下声学传感器和水下声学传感应用。
3.1. Underwater Acoustic Sensors
3.1.水下声学传感器
Underwater acoustic sensors are the most favorable sensing technology in underwater robot applications. Generally, underwater acoustic sensors can be roughly categorized into the following two classes: acoustic ranging/imaging sensors and acoustic positioning sensors.
水下声学传感器是水下机器人应用中最受欢迎的传感技术。一般来说,水下声学传感器可大致分为以下两类:声学测距/成像传感器和声学定位传感器。
3.1.1. Underwater Acoustic Ranging/Imaging Sensor
3.1.1.水下声学测距/成像传感器
Underwater acoustic ranging/imaging sensors mainly include single-beam sonar, side-scan sonar, and multibeam sonar.
水下声学测距/成像传感器主要包括单波束声纳、侧扫声纳和 多波束声纳。
Single-Beam Sonar: As shown in Fig. 2(a), the single-beam sonar receives a beam of the short-pulse acoustic signal emitted by a transducer and computes the depth of a submerged object by travel time accordingly. Due to its low cost and easy to use, single-beam sonar is widely used in marine engineering and resource exploration. However, it cannot obtain high-precision measurement results and a wide range of coverage.
单波束声纳:如图 2(a)所示,单波束声纳接收换能器发射的一束短脉冲声学信号,并通过相应的传播时间计算出水下物体的深度。由于单波束声纳成本低、使用方便,被广泛应用于海洋工程和资源勘探。然而,它无法获得高精度的测量结果和大范围的覆盖范围。
Side-Scan Sonar: The side-scan sonar is composed of submodules such as the control unit, towed body, cable, and recorder, which aims at detailed study of topography, geology, and minerals and further performs object search and tracking. As shown in Fig. 2(b), the side-scan sonar emits directional pulse acoustic signals, where the horizontal beam angle is extremely small (less than 2 degrees) and the vertical beam angle is large (approximately 32 degrees). By analyzing the received acoustic image data, an object on the sea floor can be identified. However, side-scan sonar can only roughly estimate the direction of the object and cannot accurately measure the depth of the submarine.
侧扫声纳:侧扫声纳由控制单元、拖曳体、电缆和记录器 等子模块组成,旨在对地形、地质和矿物进行详细研究,并进一步执行目标搜索和跟踪。 如图 2(b)所示 ,侧扫声纳发射定向脉冲声波信号,其中水平波束角极小(小于 2 度),垂直波束角较大(约 32 度)。通过分析接收到的声学图像数据,可以识别海底的物体。不过,侧扫声纳只能大致估计物体的方向,无法准确测量潜艇的深度。
Multibeam Sonar: Multibeam sonar is the combination of multiple single-beam sonars, as shown in Fig. 2(c), which can obtain the high-precision direction and depth value of the submarine object by travel time. Compared with single-beam sonar, the multibeam system can provide larger coverage of the seabed area with faster speed and higher accuracy, which greatly improves the efficiency of ocean exploration.
多波束声纳:多波束声纳是由多个单波束声纳组合而成,如图 2(c)所示,它可以通过行进时间获得海底物体的高精度方向和深度值。与单波束声纳相比,多波束系统能以更快的速度和更高的精度覆盖更大的海底区域,大大提高了海洋探测的效率。
3.1.2. Underwater acoustic positioning sensors
3.1.2.水下声学定位传感器
Underwater acoustic positioning sensors can estimate the position of a measured object (e.g., underwater robot). Depending on the length of the baseline, underwater acoustic positioning systems can be divided into the following three categories: the ultrashort baseline (USBL), the short baseline (SBL), and the long baseline (LBL) positioning systems, as shown in Fig. 3.
水下声学定位传感器可以估算被测物体(如水下机器人) 的位置。根据基线长度的不同,水下声学定位系统可分为以下三类:超短基线(USBL)、短基线(SBL)和长基线(LBL)定位系统, 如图 3 所示 。
USBL and SBL: Both methods combine an array of acoustic transducers installed on a ship and a submarine transponder placed on an underwater robot. By providing the space transformation matrix between the array of acoustic transducers and the hull, we can estimate the relative pose by travel time and phase difference. The major difference between USBL and SBL is the space distances among the transceivers. For the USBL, the distance among transceivers is less than one meter. The SBL system generally contains more than three transducers with a distance of 20–50 m. By increasing the space distances, the measurement accuracy can be improved. However, the main drawback of these sensors is that they are difficult to calibrate.
USBL 和 SBL:这两种方法都是将安装在船上的声学传感器阵列和安装在水下机器人上的水下应答器结合起来。通过提供 声学传感器阵列与船体之间的空间 变换矩阵,我们可以通过移动时间和相位差来估计相对姿态。USBL 和 SBL 的主要区别在于收发器之间的空间距离。对于 USBL,收发器之间的距离不到一米。SBL 系统一般包含三个以上的收发器,距离为 20-50 米。不过,这些传感器的主要缺点是难以校准。
LBL: Different from the USBL, as shown in Figs. 3(b) and 3(c), LBL uses a set of acoustic transponders placed on the sea floor with a known relative position. Based on at least three acoustic beacons with a baseline length of 100 m to 20 km, we can compute the localization of robots within the coverage area of the acoustic signal. The LBL sensor can obtain high measurement accuracy and is not affected by the water depth, but the system is costly to establish and maintain because an underwater acoustic network needs to be deployed and recovered regularly.
LBL:与 USBL 不同,如图 3(b) 和 3(c) 所示,LBL 使用一组放置在海底的已知相对位置的声学应答器。根据至少三个基线长度为 100 米至 20 千米的声学信标,我们可以计算出声学信号覆盖范围内机器人的定位。LBL 传感器可以获得很高的测量精度,并且不受水深的影响,但该系统的建立和维护成本很高,因为需要定期部署和恢复水下声学网络。
3.2. Underwater Acoustic Sensing Application
3.2.水下声学传感应用
Applications based on sonar images mainly include underwater acoustic image detection/tracking and 3D reconstruction.
基于声纳图像的应用主要包括水下声学图像检测/跟踪和三维重建。
3.2.1. Underwater Acoustics Image Detection/Tracking
3.2.1.水下声学图像探测/跟踪
Since sonar can acquire medium- and long-distance underwater acoustic image data, it is widely used in underwater object detection and tracking, as shown in Figs. 4(a) and 4(b). In the early period, object detection and tracking based on sonar were performed through expert processing (e.g., handcrafted selection). However, this method is time-consuming and affects the performance of sonar sensors. Then, researchers began to explore acoustic features (e.g., time, frequency, and time-frequency features). Among the above features, the time-frequency feature is suitable for nonstationary object detection and can obtain better results. For example, depending on the wavelet analysis and the relative Hilbert-Huang transform, Wang et al. [9] performed frequency decomposition and reconstruction of acoustic signals for underwater object detection.
由于声纳可以获取中远距离的水下声学图像数据,因此被广泛应用于水下物体探测和跟踪, 如图 4(a)和 4(b)所示 。早期,基于声纳的物体探测和跟踪是通过专家处理(如手工选择)来完成的。然而,这种方法耗时较长,而且会影响声纳传感器的性能。随后,研究人员开始探索声学特征(如时间、频率和时频特征)。在上述特征中,时频特征适用于非稳态物体检测,能获得更好的结果。例如,根据小波分析和相对希尔伯特-黄变换,Wang 等人 [9] 对声学信号进行了频率分解和重构,用于水下物体检测。
Recently, machine learning has achieved many progresses in object detection and classification. For example, Zhang et al. [10] summarized 19 conventional classifiers for underwater acoustic object detection (e.g., support vector machines (SVMs), K nearest neighbors (KNNs), and decision trees (DTs)). Ke et al. [11] proposed a supervised feature extraction algorithm to perform the object classification of underwater sonar images. In addition, deep learning is widely used in underwater acoustic signal analysis. Wang et al. [12] adopt deep neural networks (DNNs) to learn and fuse features and further utilize the Gaussian mixture model (GMM) to improve the performance. Phung et al. [13] proposed an unsupervised image statistics algorithm that combines deep semantic features to localize sonar targets.
最近,机器学习在物体检测和分类方面取得了许多进展。例如,Zhang等人[10]总结了 19 种用于水下声学物体检测的传统分类器(如 支持向量机(SVM)、K 近邻 (KNN) 和决策树 (DT))。Ke 等人[11] 提出了一种监督特征提取算法,用于对水下声纳图像进行物体分类。此外,深度学习也被广泛应用于水下声学信号分析。Wang 等人[12] 采用深度神经网络(DNN)来学习和融合特征,并进一步利用 高斯混合模型(GMM)来提高性能。Phung 等人[13] 提出了一种结合深度语义特征的无监督图像统计算法来定位声纳目标。
3.2.2. Underwater Acoustics Image 3D Reconstruction
3.2.2.水下声学图像三维重建
Acoustic images also obtain significant developments in underwater acoustic 3D reconstruction (e.g., sparse reconstruction, dense reconstruction, and acoustic and optical fusion reconstruction), as shown in Figs. 4(c) and 4(d) [14], [15].
如图 4(c)和 4(d)所示,声学图像在水下声学三维重建(如稀疏重构、密集重构以及声光融合重构)方面也取得了重大进展[14],[15 ]。
Sparse Reconstruction: Approaches to perform sparse reconstruction typically detect the corner points on the sonar image and further solve the elevation angle and relative attitude transformation based on the sampling method. However, these methods are susceptible to the degradation of the sonar sensor model and the initial value of the algorithm [16]. An alternative method is structure from motion, which tracks and matches the scene's sparse features and further performs sparse scene reconstruction. Huang et al. [17] designed an acoustic based structure from motion model named as ASFM, which adopts handcrafted features to recover the 3D location of landmarks. Subsequently, Yonghoon et al. [18] optimized the ASFM algorithm to improve feature extraction and data association modules for acoustic sparse reconstruction.
稀疏重建:进行稀疏重建的方法通常是检测声纳图像上的角点,并根据采样方法进一步解决仰角和相对姿态变换问题。然而,这些方法容易受到声纳传感器模型和算法初始值退化的影响 [16]。另一种方法是运动结构法,它可以跟踪和匹配场景的稀疏特征,并进一步进行稀疏场景重构。Huang等人[17]设计了一种基于声学结构的运动模型,命名为 ASFM,该模型采用手工特征来恢复地标的三维位置。随后,Yonghoon等人[18]优化了 ASFM 算法,改进了声学稀疏重建的特征提取和数据关联模块。
Dense Reconstruction: Acoustic dense reconstruction is a challenging problem because of the low signal-noise ratio of acoustic images. To address this problem, Zerr et al. [19] propose a two-step algorithm for dense reconstruction of the sea floor, which can generate a 3D target model by combining a height map and reflection map. Cho et al. [20] propose to estimate the pitch angle and assume that the top of the submarine target generates acoustic backscatter, which can obtain more details in the reconstructed results.
密集重建:声学密集重建是一个具有挑战性的问题,因为声学图像的信噪比很低。针对这一问题,Zerr等人[19]提出了一种两步海底密集重建算法,该算法可通过结合高度图和反射图生成三维目标模型。Cho等人[20] 提出估计俯仰角并假定潜艇目标顶部会产生声学反向散射,这样可以在重建结果中获得更多细节。
Acoustic and Optical Fusion Reconstruction: Methods based on acoustic and optical fusion reconstruction can combine the advantages of both sonar and visual images to improve underwater reconstruction performance. For example, Sharmin et al. [21] designed a navigation algorithm using sonar, vision and inertial information for underwater scene reconstruction. Since acoustic data provide robust information about underwater obstacles, the proposed method combines visual features with sonar features to optimize the robot position and 3D scene points. Subsequently, they also achieved high-precision 3D reconstruction results of underwater caves by combining sonar with vision sensors [22].
声光融合重建:基于声光融合重建的方法可以结合声纳和视觉图像的优势,提高水下重建性能。例如,Sharmin等人[21] 设计了一种利用声纳、视觉和惯性信息进行水下场景重建的导航算法。由于声学数据可提供有关水下障碍物的可靠信息,因此所提出的方法将视觉特征与声纳特征相结合,以优化机器人位置和三维场景点。随后,他们还通过将声纳与视觉传感器相结合,实现了水下洞穴的高精度三维重建结果 [22]。
4. Underwater Optical Sensing
4.水下光学传感
Underwater acoustic sensing methods have a large sensing range, but their resolution is low, which limits the practical applications of underwater sonar. Underwater optical images can achieve high resolution and accuracy at short distances (e.g., underwater small object detection, underwater biological observation [8], underwater robot grasping, and underwater archaeology [23], [24]), as shown in Fig. 5. In this section, we mainly review the underwater 2D visual sensing and underwater 3D visual sensing technology.
水下声学传感方法传感范围大,但分辨率低,限制了水下声纳的实际应用。水下光学图像可在短距离内实现高分辨率和高精度(如水下小物体探测、水下生物观测[8]、水下机器人抓取和水下考古[23]、[24 ]),如图 5 所示。本节主要介绍水下二维视觉传感和水下三维视觉传感技术。
4.1. 2D Visual Sensing 4.1. 2D 视觉传感
Underwater 2D visual sensing is crucial for underwater environment observation and object recognition in underwater robots due to its simplicity. In this subsection, we mainly introduce underwater image quality restoration and underwater object detection and tracking.
水下二维视觉传感因其简单易行而对水下环境观测和水下机器人的目标识别至关重要。本小节主要介绍水下图像质量恢复和水下物体检测与跟踪。
4.1.1. Underwater Image Enhancement
4.1.1.水下图像增强
Due to the complex underwater scenario with light absorption and scattering, capturing a clear underwater images is still a challenging problem. Furthermore, these effects can reduce visibility and contrast [25]. As shown in Fig. 6(a), the image captured in the marine environment obtains low-quality results. By enhancing the image quality, we can achieve a high-quality image, as shown in Fig. 6(b). Generally, the methods of underwater image enhancement can be roughly subdivided into three categories (i.e., nonphysical model-based methods, physical model-based methods, and data-driven methods).
由于水下环境复杂,存在光吸收和散射现象,拍摄清晰的水下图像仍然是一个具有挑战性的问题。此外,这些影响会降低可见度和对比度 [25]。如图 6(a)所示,在海洋环境中拍摄的图像质量较低。通过增强图像质量,我们可以获得高质量的图像,如图 6(b) 所示。一般来说,水下图像增强方法可大致分为三类(即基于非物理模型的方法、基于物理模型的方法和数据驱动的方法)。
Nonphysical model-based methods: These methods often enhance the image quality by changing the contrast and color space of the image. Iqbal et al. [26] adjusted the pixel range in HSV and RGB color space to enhance the underwater image quality. Based on the multiscale fusion strategy, Ancuti et al. [27] combined contrast enhancement with color correction methods to improve the image quality. However, since these methods do not build a real-world physical model for underwater image enhancement, they cannot obtain high-quality image results.
基于非物理模型的方法:这些方法通常通过改变图像的对比度和色彩空间来提高图像质量。Iqbal等人[26]调整了 HSV 和 RGB 色彩空间的像素范围,以提高水下图像质量。Ancuti等人[27]基于多尺度融合策略,将对比度增强与色彩校正方法相结合,提高了图像质量。然而,由于这些方法没有为水下图像增强建立真实世界的物理模型,因此无法获得高质量的图像结果。
Physical Model-Based Methods: Methods based on physical models improve image quality by building a physical model of degradation. (e.g., Jaffe-McGlamery equation), which can be denoted as:where denotes the pixel value, is the distance from the object to the camera, denotes the veiling light, and is the true but unknown color of the 3D point. and represent the light attenuation coefficient.
基于物理模型的方法:基于物理模型的方法通过建立退化的物理模型来提高图像质量。(例如,Jaffe-McGlamery 方程),可表示为: ,其中 表示像素值, 表示物体到摄像机的距离, 表示遮挡光线, 表示三维点真实但未知的颜色。 和 表示光衰减系数。
Based on the Jaffe-McGlamery equation, Galdran et al. [28], [29] considered that the absorption rate of red light is greater than both blue and green light and proposed a model based on the color channel of the image, which achieves image color restoration by establishing the relationship between the color value of the image and the medium parameters of the underwater environment. However, these methods require an accurate physical model of light degradation and a uniform distribution of the absorption coefficient. Therefore, these methods easily lead to unstable image restoration results. Recently, Akkaynak et al. [24] assumed that the absorption coefficient is not uniformly distributed in an underwater scenario and depends on the distance and reflection of the object. Based on this finding, they built an accurate underwater attenuation model to enhance the quality of the underwater image, which improves the robustness of image quality restoration.
Galdran等人[28]、[29] 在 Jaffe-McGlamery 方程的基础上,考虑到红光的吸收率大于蓝光和绿光,提出了基于图像颜色通道的模型,通过建立图像颜色值与水下环境介质参数之间的关系实现图像颜色还原。然而,这些方法需要精确的光降解物理模型和均匀分布的吸收系数。因此,这些方法很容易导致不稳定的图像复原结果。最近,Akkaynak等人[24]假设吸收系数在水下场景中并非均匀分布,而是取决于距离和物体的反射。基于这一发现,他们建立了一个精确的水下衰减模型来提高水下图像的质量,从而提高了图像质量恢复的鲁棒性。
Data-Driven Methods: These methods can be roughly subdivided into two categories (i.e., methods based on synthetic data and methods based on real data). More specifically, methods based on synthetic data are trained through artificially synthesized data pairs. However, underwater image models depend on special scenes, light conditions, temperature, and turbidity. Therefore, this method cannot be used conveniently in a practical environment. An alternative strategy is to train the model through real underwater data. Li et al. [30] propose an underwater image restoration model (i.e., WaterGAN), which combines the RGBD data in the air with the scene parameters to synthesize underwater images. Based on the synthetic data, they trained the deep learning model and enhanced the underwater image quality with a two-stage network. Li et al. [31] proposed a cycle-consistent weakly supervised underwater color restoration model, which trains the model with an adversarial network and can be used in an unknown water domain. However, due to the nature of multiple potential outputs [32], it tends to produce inauthentic results in some instances . Li et al. [33] built an underwater image enhancement dataset including low-quality underwater images and reference high-quality images and evaluated related underwater image enhancement algorithms. However, the performance of deep learning based methods are worse than physical model based models in terms of robustness and generalization.
数据驱动方法:这些方法大致可分为两类(即基于合成数据的方法和基于真实数据的方法)。更具体地说,基于合成数据的方法是通过人工合成的数据对进行训练。然而,水下图像模型取决于特殊的场景、光照条件、温度和浊度。因此,这种方法无法方便地应用于实际环境。另一种策略是通过真实的水下数据来训练模型。Li 等人[30]提出了一种水下图像复原模型(即 WaterGAN),该模型结合空气中的 RGBD 数据和场景参数合成水下图像。基于合成数据,他们训练了深度学习模型,并通过两级网络增强了水下图像质量。Li 等人[31]提出了一种循环一致的弱监督水下色彩还原模型,该模型采用对抗网络进行训练,可用于未知水域。然而,由于存在多个潜在输出[32],在某些情况下往往会产生不真实的结果。Li等人[33] 建立了一个水下图像增强数据集,包括低质量水下图像和参考高质量图像,并评估了相关的水下图像增强算法。然而,基于深度学习的方法在鲁棒性和泛化方面的表现不如基于物理模型的模型。
4.1.2. Underwater Object Detection and Tracking
4.1.2.水下物体探测和跟踪
Underwater object detection aims to find the objects of interest in an image, as shown in Fig. 6(c). Generally, underwater object detection can be roughly divided into two categories (i.e., methods based on traditional features and methods based on deep learning).
水下物体检测的目的是在图像中找到感兴趣的物体,如图 6(c) 所示。一般来说,水下物体检测可大致分为两类(即基于传统特征的方法和基于深度学习的方法)。
Methods Based on Traditional Features: These methods use hand-crafted features (e.g., local binary patterns (LBP), histogram of oriented gradient (HOG), and Haar features (Haar)). The method includes the following three steps: 1) generating the multilevel image pyramid by down-sampling the original image in both the horizontal and vertical directions and 2) extracting the image features by sliding on the image pyramid with a fixed-size window and sending the extracted features to the extractor and classifier (e.g., SVM and AdaBoost).
基于传统特征的方法:这些方法使用手工制作的特征(如 局部二值模式(LBP)、定向梯度直方图(HOG)和哈氏特征(Haar))。该方法包括以下三个步骤:1) 通过在水平和垂直方向上对原始图像进行下采样,生成多级图像金字塔;2) 通过在图像金字塔上滑动固定大小的窗口提取图像特征,并将提取的特征发送给提取器和分类器(如 SVM和 AdaBoost)。
Methods Based on Deep Learning: These methods can be categorized into single-stage object detection methods and two-stage object detection methods. More specifically, the first method directly localizes objects by estimate the score of the bounding box directly that can be obtained by dense sampling on the input image (e.g., YOLO [34] and SSD [35]), which has fast detection speed but low accuracy. In contrast, the second method can obtain higher precision by generating object proposals and obtain more accurate results through region proposal networks (RPNs) (e.g., Faster RCNN [36]).
基于深度学习的方法:这些方法可分为单阶段物体检测方法和双阶段物体检测方法。具体来说,第一种方法通过对输入图像进行密集采样(如 YOLO[34]和 SSD[35 ]),直接估计边界框的得分来定位物体,这种方法检测速度快,但精度低。相比之下,第二种方法可以通过生成对象建议来获得更高的精度,并通过区域建议网络(RPN)获得更精确的结果(如 Faster RCNN[36])。
Underwater object tracking involves continuously predicting and updating the statuses (scale, position, and rotation) of a specified object in the subsequent images, as shown in Fig. 6(d), which often encounters some problems (e.g., object deformation, object occlusion, similar object characteristics in the object and background, shadow and illumination changes). Classical methods can be divided into the following three categories: optical flow, mean shift, and convolutional network tracking (CNT). More specifically, the method based on the optical flow requires robust features in the image, which is not effective for motion blur; the method based on mean shift also requires features, which can track non-rigid objects efficiently and is robust to distance changes; CNTs can directly learn features from raw image data, which can extract the internal structure and local geometric information of the image and obtain better performance in this task.
水下物体跟踪是指连续预测和更新指定物体在后续图像中的状态(比例、位置和旋转),如图 6(d)所示,其中经常会遇到一些问题(如物体变形、物体遮挡、物体和背景中相似的物体特征、阴影和光照变化等)。经典方法可分为以下三类:光流、均值偏移和卷积网络跟踪(CNT)。具体来说,基于光流的方法需要图像中的稳健特征,对运动模糊效果不佳;基于均值平移的方法也需要特征,可以高效跟踪非刚性物体,对距离变化具有稳健性;卷积网络跟踪可以直接从原始图像数据中学习特征,从而提取图像的内部结构和局部几何信息,在这项任务中获得更好的性能。
4.2. 3D Visual Sensing 4.2.3D 视觉传感
Underwater 3D visual sensing has attracted much attention due to its significance in robot environment exploration, which can not only recover underwater 3D structures but also accurately localize object positions. In this subsection, the underwater camera calibration methods are demonstrated. We then introduce underwater structured light 3D reconstruction and underwater multiview 3D reconstruction.
水下三维视觉传感因其在机器人环境探索中的重要作用而备受关注,它不仅可以恢复水下三维结构,还能准确定位物体位置。本小节将 演示 水下相机 校准方法。然后介绍水下结构光三维重建和水下多视角三维重建。
4.2.1. Underwater Camera Calibration
4.2.1.水下摄像机校准
Underwater camera calibration that relates image pixels and 3D world points is a prerequisite for many tasks (e.g., 3D reconstruction and localization). Most studies ignore the influence of medium refraction and directly use the pinhole camera model in underwater scenarios. However, these methods often lead to inaccurate 3D reconstruction results [37]. To improve the accuracy of underwater camera calibration, many studies focus on exploring the impact of refraction on the underwater camera model. The Woods Hole Oceanographic Institute studies the ray-based underwater camera model and found that the back-projection rays do not intersect at one point, so the underwater camera model cannot be represented by a single-view pinhole camera model [37]. Agrawal et al. [38] proposed a refractive camera model, which can simplify the underwater camera model as an axis camera model. Furthermore, based on the axis camera model, the forward and backward projection equations can be well formulated. Chen et al. [39] proposed a three-wavelength dispersion method to calibrate an underwater camera. Tomasz et al. [40] built a precomputed lookup table to quickly correct refraction distortion, but they assumed that refraction distortion is not related with the depth of the scene and rectified all images through the same look-up table.
将图像像素与三维世界点联系起来的水下相机校准是许多任务(如三维重建和定位) 的先决条件。大多数研究都忽略了介质折射的影响,在水下场景中直接使用针孔摄像机模型。然而,这些方法往往会导致不准确的三维重建结果 [37]。为了提高水下摄像机校准的准确性,许多研究都侧重于探索折射对水下摄像机模型的影响。伍兹霍尔海洋研究所研究了基于射线的水下摄像机模型,发现背投射线并不相交于一点,因此水下摄像机模型不能用单视角针孔摄像机模型表示[37]。Agrawal等人[38] 提出了折射相机模型,可将水下相机模型简化为轴相机模型。此外,在轴相机模型的基础上,可以很好地建立前向和后向投影方程。Chen 等人[39]提出了校准水下摄像机的三波长色散方法。Tomasz等人[38]提出了一种校准水下摄像机的三波长色散方法。[40] 建立了一个预先计算的查找表来快速纠正折射失真,但他们假设折射失真与场景深度无关,并通过相同的查找表对所有图像进行纠正。
4.2.2. Underwater 3D Data Acquisition
4.2.2.水下三维数据采集
Some studies achieve 3D reconstruction using structured light, laser imaging (e.g., TOF), and multiview reconstruction. Among the above techniques, the structured light with high precision has been used for underwater 3D data acquisition. Therefore, we mainly introduce underwater structured light 3D reconstruction in this subsection.
一些研究利用结构光、激光成像(如 TOF)和多视角重建实现了三维重建。在上述技术中,高精度的结构光已被用于水下三维数据采集。因此,本小节主要介绍水下结构光三维重建。
Underwater structured light sensors acquire the 3D point cloud by computing the intersection point between the laser light plane and the camera beam. Bodenmann et al. [41] designed an underwater laser scanner, as shown in Fig. 7(a), which can simultaneously capture the gray image and laser stripe image. Furthermore, they transform the collected color image data into 3D point clouds, which can obtain accurate shape and color information. Bleier et al. [42] designed a cross-line laser scanner, as shown in Fig. 7(b), where the camera captures the cross-line projected by the laser and computes the 3D point clouds by ray-plane triangulation. However, these methods ignore the influence of medium refraction in underwater environments. Palomer et al [43] developed an underwater scene scanning sensor with only laser rotation, which uses a stepper motor to quickly rotate the laser in the field of view and can perform 3D scene reconstruction in the field of view in a short time. Subsequently, Palomer et al [43], [44] found that the laser plane entering the underwater scenario is refracted twice and cannot be described by a plane equation. Therefore, they denote the light plane as an elliptic cone and obtain higher 3D reconstruction accuracy with ray-cone triangulation. They also install the designed laser scanner on the underwater robot to perform underwater robot grasping, as shown in Fig. 7(c). Recently, the Shenyang Institute of Automation designed an underwater structured light sensor, as shown in Fig. 7(d). By explicitly considering medium refraction, the sensor can achieve high-precision reconstruction performance [45].
水下结构光传感器通过计算激光光平面与相机光束之间的交点来获取三维点云。Bodenmann等人[41] 设计了一种水下激光扫描仪 ,如图 7(a) 所示 ,可以同时采集灰度图像和激光条纹图像。此外,他们还将采集到的彩色图像数据转化为三维点云,从而获得精确的形状和颜色信息。Bleier等人[42]设计了一种交叉线激光扫描仪,如图 7(b) 所示,相机捕捉激光投射的交叉线,并通过射线平面三角测量法计算三维点云。然而,这些方法都忽略了水下环境中介质折射的影响。Palomer等人[43] 开发了一种仅靠激光旋转的水下场景扫描传感器,它利用步进电机在 视场 中快速旋转激光 ,可在短时间内完成视场中的三维场景重建。 随后,Palomer等人[43]、[44] 发现,进入水下场景的激光平面会发生两次折射,无法用平面方程描述。因此,他们将光面表示为椭圆锥,并通过射线锥三角测量法获得了更高的三维重建精度。他们还将设计的激光扫描仪安装在水下机器人上,实现了水下机器人抓取,如图 7(c)所示。最近,沈阳自动化研究所设计了一种水下结构光传感器,如图 7(d) 所示。通过明确考虑介质折射,该传感器可实现高精度重建性能[45]。
4.2.3. Underwater Multi-View 3D Reconstruction
4.2.3.水下多视角三维重建
Underwater multiview 3D reconstruction can simultaneously estimate the camera pose and reconstruct the 3D scene structure based on the 2D re-projection error, where the minimization function can be denoted as:where is the captured feature that occurs in the image. is the spatial transformation matrix from the image to the image. denotes the 3D point. denotes the projection function. Classical multiview 3D reconstruction can be classified into the following two classes: simultaneous localization and mapping (SLAM) [[46] and structure from motion (SFM) [47]].
水下多视角三维重建可同时估计摄像机姿态并基于二维重投影误差重建三维场景结构,其中最小化函数可表示为: ,其中 是 在 图像中出现的捕捉到的 特征。 是 图像到 图像的 空间变换矩阵 。 表示三维点。 表示投影函数。经典的多视角三维重建可分为以下两类:同步定位与映射(SLAM)[[46]和运动结构(SFM)[47]]。
Underwater SLAM: SLAM can estimate the robot's pose and reconstruct the 3D scene structure simultaneously. Due to the complicated underwater environment (e.g., scattering, absorption, turbidity, and refraction), underwater SLAM is a challenging problem [48]. Ferrera et al [49] collected multiple underwater datasets for the underwater SLAM task, where the trajectories computed by the structure-from-motion (SfM) library Colmap were used as reference trajectories. Bharat et al. [46] also collected an underwater SLAM dataset with an underwater robot, where they performed underwater trajectory estimation and evaluated the performance of the most recent open-source packages (e.g., LSD-SLAM, DSO, SVO, ORB-SLAM2, ROVIO, OKVIS, VINS-Mono) on the collected datasets, as shown in Fig. 8(a). Table 1 summarizes the characteristics of the open-source SLAM method. From the evaluated results, OKVIS and ROVIO can achieve good results in terms of robustness and accuracy; ORB-SLAM2, SVO, and VINS-Mono achieve good results in terms of the absolute scale; based on minimizing a photometric error, DSO can obtain dense 3D reconstruction results.
水下 SLAM:SLAM 可以同时估计机器人的姿态和重建三维场景结构。由于水下环境复杂(如散射、吸收、浊度和折射),水下 SLAM 是一个具有挑战性的问题[48]。Ferrera等人[49] 为水下 SLAM 任务收集了多个水下数据集,并将结构运动(SfM)库 Colmap 计算出的轨迹作为参考轨迹。Bharat 等人[46]也收集了水下机器人的水下 SLAM 数据集,他们在数据集上进行了水下轨迹估计,并评估了最新开源软件包(如 LSD-SLAM、DSO、SVO、ORB-SLAM2、ROVIO、OKVIS、VINS-Mono)的性能,如图 8(a)所示。表 1总结了开源 SLAM 方法的特点。从评估结果来看,OKVIS 和 ROVIO 在鲁棒性和准确性方面都能达到很好的效果;ORB-SLAM2、SVO 和 VINS-Mono 在绝对尺度方面都能达到很好的效果;DSO 在最小化光度误差的基础上,能获得高密度的三维重建结果。
Method 方法 | Camera 照相机 | IMU | Loop Closure 环形闭合 |
---|---|---|---|
LSD-SLAM | mono 单声道 | × | √ |
DSO | mono 单声道 | × | × |
SVO | multi 多 | Optional 可选 | × |
ORB-SLAM2 | Mono, stereo 单声道、立体声 | × | √ |
ROVIO | multi 多 | √ | × |
OKVIS | multi 多 | √ | × |
VINS-Mono | mono 单声道 | √ | √ |
Underwater SFM: SFM recovers the 3D scene structure from unordered images. Most existing methods ignore the influence of medium variation and adopt the pinhole camera model for underwater SFM directly, as shown in Fig. 8(b) [50]. However, if we adopt the pinhole camera model for underwater SFM, the systematic geometric bias will interfere with the precision of the 3D reconstruction because of the multirefraction between different media. Recently, some methods have begun to consider the influence of medium refraction for 3D reconstruction in underwater scenarios. To perform underwater refractive pose estimation, Jordt et al. [51] formulate a cost function by combining a real camera (refractive camera model) and a virtual camera (pinhole camera model). Along with refractive pose estimation, they perform underwater 3D reconstruction based on minimizing the 2D reprojection error. Chadebecq et al. [52] introduced Plucker coordinates and constructed a refractive fundamental matrix for underwater object and fluid-immersed organ reconstruction. However, these methods of underwater SFM often require a complex computation process and cannot perform underwater SFM efficiently.
水下 SFM:SFM 可从无序图像中恢复三维场景结构。现有的大多数方法都忽略了介质变化的影响,直接采用针孔摄像机模型进行水下 SFM, 如图 8(b)所示 [50]。然而,如果采用针孔摄像机模型进行水下 SFM,由于不同介质之间存在多重折射,系统性几何偏差会干扰三维重建的精度。最近,一些方法开始考虑介质折射对水下场景三维重建的影响。为了进行水下折射姿态估计,Jordt等人[51] 结合真实摄像机(折射摄像机模型)和虚拟摄像机(针孔摄像机模型)制定了一个成本函数。在进行折射姿态估计的同时,他们还根据二维重投影误差最小化原则进行水下三维重建。Chadebecq 等人[52] 引入了 Plucker 坐标,并构建了用于水下物体和流体浸没器官重建的折射基本矩阵。然而,这些水下 SFM 方法往往需要复杂的计算过程,无法高效地进行水下 SFM。
5. Other Underwater Sensing Methods
5.其他水下传感方法
Although underwater acoustic and optical sensing technologies have been widely used in many tasks, there still exist many challenging problems (e.g., acoustic multipath effects, acoustic reverberation and optical attenuation). To improve the sensing ability of underwater robots, there is an urgent need to explore diversified underwater sensing technology. Recently, underwater electromagnetic sensing [53] and underwater bionic sensing [54] have drawn considerable attention in marine engineering.
尽管水下声学和光学传感技术已在许多任务中得到广泛应用,但仍存在许多具有挑战性的问题(如声学多径效应、声学 混响 和光学衰减)。为了提高水下机器人的传感能力,迫切需要探索多样化的水下传感技术。最近,水下电磁传感 [53]和水下仿生传感[54]在海洋工程领域引起了广泛关注。
5.1. Underwater Electromagnetic Sensing
5.1.水下电磁传感
In this section, we introduce both underwater electric field sensing and underwater magnetic sensing.
本节将介绍水下电场传感和水下磁场传感。
Underwater Electric Field Sensing: Underwater electric field sensing can enable robot communication in complicated underwater environments and effectively avoid acoustic multipath effects. For example, inspired by the South American electric eel and African pipe fish, Xie et al. [55] developed a communication system based on the bionic electric field that can perform effective communication in a complex underwater environment.
水下电场传感:水下电场感应可使机器人在复杂的水下环境中进行通信,并有效避免声学多径效应。例如,受南美电鳗和非洲管鱼的启发,Xie等人[55] 开发了一种基于仿生电场的通信系统,可在复杂的水下环境中进行有效通信。
Underwater Magnetic Sensing: Underwater magnetic sensing has many advantages (e.g., high concealment, robust detection performance and high positioning accuracy); therefore, this technique can work in complicated conditions (e.g., turbid water and turbulent water flow) [53]. For example, American and Canadian navies deployed electromagnetic induction electrodes on icebergs around the Bering Straitand cooperated with the satellite positioning system to successfully detect the former Soviet Union's "Tresala" nuclear submarine [53]. The Russian VNIIOFI research institute developed an ultralong-range, long-range (100 km, resolution 250 m) underwater electromagneticearly warning system called Anagram, which has been successfully used for the detection and tracking of signals from submarines andships. Chinese research on underwater magnetic sensing technologyalso makes significant strides in electromagnetic feld generationmechanisms and marine electromagnetic exploration technology. For example, the Naval University of Engineering developed low-noise, high-precision silver electrode sensors (e.g., fluxgate sensors and optical pump magnetometers) for underwater electromagnetic feld measurements [53].
水下磁感应:水下磁感应有许多优点(如隐蔽性强、探测性能稳定和定位精度高),因此,这种技术可以在复杂的条件下(如浑浊的水和湍急的水流)发挥作用[53]。例如,美国和加拿大海军在白令海峡附近的冰山上部署了电磁感应电极,并与卫星定位系统合作,成功探测到了前 苏联的"特雷萨拉 "号核潜艇 [53]。俄罗斯 VNIIOFI 研究所开发了一种名为 "Anagram "的超远程、远距离(100 千米,分辨率 250 米)水下电磁预警系统,已成功用于探测和跟踪来自潜艇和舰艇的信号。中国的水下磁感应技术研究在电磁场产生机制和海洋电磁探测技术方面也取得了重大进展。例如,海军工程大学开发了用于水下电磁场测量的低噪声、高精度银电极传感器(如浮门传感器和光泵磁力计)[53]。
5.2. Underwater Bionic Sensing
5.2.水下仿生传感
To improve the sensing ability of underwater robots, especially in the case of ultraclose distances (less than 1 meter), researchers have gradually paid attention to underwater sensing technology based on bionic principles (e.g., lateral lines [54] and whiskers [56], [57]).
为了提高水下机器人的传感能力,尤其是在超近距离(小于 1 米)的情况下,研究人员逐渐关注基于仿生原理的水下传感技术(如侧线[54]和须[ 56 ]、[57 ])。
Lateral Line: The lateral line is a sensing organ of the fish, which could perceive changes in the flow of the surrounding water and further help the fish perceive the surrounding environment in darkness, as shown in Fig. 9(a). Inspired by bionics, artificial lateral lines mainly focus on sensor and system design, which is summarized in [54]. In Fig. 10, representative artificial lateral line sensors are given. Additionally, researchers have begun to apply sensors to practical environments. For example, Dervries et al. [58] explore the application of distributed lateral line sensing systems in the closed-loop control strategy of a robotic fish.
侧线侧线是鱼类的感知器官,可以感知周围水流的变化,并进一步帮助鱼类感知周围的黑暗环境,如图 9(a)所示。受仿生学的启发,人工侧线主要集中在传感器和系统设计方面,相关研究综述见文献[54]。图 10 给出了具有代表性的人工侧线传感器。此外,研究人员已开始将传感器应用到实际环境中。例如,Dervries等人[58] 探索了分布式侧线传感系统在机器鱼闭环控制策略中的应用。
Whiskers: Whiskers are important sensing organs for underwater creatures that are used to identify, locate and track prey [64]. Inspired by the seal's whiskers, as shown in Fig. 9(b), researchers have developed many artificial whisker sensors. For example, Wolf et al. [56] developed an artificial whisker based on the cantilever beam structure, which can sense a change in hydrological information of 5 microns/second. Gui et al. [65] designed a fully 3D printed artificial whisker by graphene 3D printing technology, which can perform qualitative analysis of the underwater vortex, velocity and flow direction. Leporade et al. [11] developed an active whisker that uses a motor to control its movement. Experiments show that active whiskers can achieve higher sensing precision.
须须是水下生物重要的传感器官,用于识别、定位和追踪猎物[64]。受海豹胡须的启发(如图 9(b) 所示),研究人员开发了许多人工胡须传感器。例如,Wolf等 人 [56] 开发了一种基于悬臂梁结构的人工须,它能感知每秒 5 微米的水文信息变化。Gui 等[65] 利用石墨烯3D 打印技术设计了一种全 3D 打印的人工晶须,可对水下涡流、流速和流向进行定性分析。Leporade 等人 [11]开发了一种主动晶须,利用电机控制其运动。实验表明,主动晶须可以实现更高的传感精度。
6. Challenges and Future Directions
6.挑战和未来方向
Although many progresses have been achieved for underwater sensing technology, there are still some problems (e.g., sensing distance, accuracy, efficiency, robustness) that affect the underwater information acquisition and further limit the efficiency of the underwater robot exploration. Inspired by the recent works on the underwater sensing technology, we have the following suggestions.
尽管水下传感技术已经取得了许多进展,但仍存在一些问题(如传感距离、精度、效率、鲁棒性),影响了水下信息的获取,进一步限制了水下机器人的探测效率。受近年来水下传感技术研究的启发,我们提出以下建议。
Underwater Bionic Sensing: Compared with underwater creatures, current artificial underwater sensors exist a large gap not only in detection accuracy, distance, sensitivity and other technical indicators, but also in the sensor power consumption and volume. Aiming at this challenge, the first strategy is to learn how the underwater creatures sense the environmental information. For instance, by observing animals to avoid obstacle through acoustics, we design the sonar sensor. The second one is to learn the perceptual structure of animal organs, by learning its benefits, the sensing abilities of underwater robots can be improved effectively.
水下仿生传感:与水下生物相比,目前的人工水下传感器不仅在探测精度、距离、灵敏度等技术指标上存在较大差距,而且在传感器功耗和体积上也存在较大差距。针对这一挑战,第一个策略是学习水下生物是如何感知环境信息的。例如,通过观察动物通过声学来避开障碍物,我们设计了声纳传感器。第二个策略是学习动物器官的感知结构,通过学习其优点,有效提高水下机器人的感知能力。
New Underwater Materials R&D: How to convert various environmental information into electric signals to obtain the environmental information, and perform autonomous processing are the core of underwater sensors, some strategies could be adopted. The first strategy is to enhance the data conversion capabilities of the existing materials. For instance, we can continue to develop the underwater sonar transducer systems to improve the measurement accuracy. The second strategy is to design a new material to achieve the underwater information. Relying on the new material, the underwater robot can achieve more valuable information.
新型水下材料研发:如何将各种环境信息转化为电信号,从而获取环境信息,并进行自主处理,是水下传感器的核心,可以采取一些策略。第一种策略是增强现有材料的数据转换能力。例如,我们可以继续开发水下声纳换能器系统,以提高测量精度。第二种策略是设计一种新材料来实现水下信息。依靠新材料,水下机器人可以获得更多有价值的信息。
Multi-Source Information Fusion: As we known, each sensing mechanism exists its limitations. Moreover, underwater creatures usually rely on multiple sensing organs to percept the environments. Therefore, how to fuse multiple sensors, learn from each other's strengths and obtain complementary benefits is still a challenge. In order to overcome these, the first strategy to fuse multiple data source together, where the data from one sensor is considered as an independent source to other sensors. The second one is to integrate multiple data source into a tightly coupled algorithms by optimizing them simultaneously.
多源信息融合:众所周知,每种感知机制都有其局限性。此外,水下生物通常依靠多种感知器官来感知环境。因此,如何融合多种传感器,取长补短,仍然是一个难题。为了克服这些问题,第一种策略是将多个数据源融合在一起,将一个传感器的数据视为其他传感器的独立数据源。第二种策略是将多个数据源整合到一个紧密耦合的算法中,同时对它们进行优化。
Underwater Data Sets and Robot Competitions: In recent years, many underwater robot competitions have been held, which acquire various underwater datasets. Representative competitions include the Underwater Robot Target Grasping Competition (URPC) organized by the National Natural Science Foundation of China, and the Underwater Robot Competition (URC) organized by the International Underwater Robot Federation, the International Underwater Robot Competition (RoboSub) jointly organized by the International Federation of Unmanned Systems (AUVSI) and the US Naval Equipment Research Institute (ONR) as shown in Table 2. In the future, for the typical applications of underwater robots, the construction of common and general test platforms and offline dataset is a trend in underwater sensing technologies.
水下数据集和机器人竞赛:近年来,举办了许多水下机器人竞赛,获取各种水下数据集。如表 2 所示,具有代表性的竞赛包括由国家自然科学基金委员会组织的水下机器人目标抓取竞赛(URPC)、由国际水下机器人联合会组织的水下机器人竞赛(URC)、由国际无人系统联合会(AUVSI)和美国海军装备研究所(ONR)联合组织的国际水下机器人竞赛(RoboSub)等。未来,针对水下机器人的典型应用,构建通用的通用测试平台和离线数据集是水下传感技术的发展趋势。
Name 名称 | Competition content 竞赛内容 | URL |
---|---|---|
URPC | Underwater target recognition and grasp 水下目标识别与抓取 | http://www.cnurpc.org/index.html |
URC | Underwater robot manipulation and racing 水下机器人操纵和竞赛 | http://www.ilur.org/comp |
RoboSub 机器人潜艇 | Autonomous driving, grasp, positioning 自动驾驶、把握、定位 | https://robosub.org/programs |
7. Conclusion 7.结论
In this review, we intend to contribute to this growing area of research in underwater robot sensing. Therefore, we survey the related works of underwater robot sensing technologies including underwater acoustic sensing, underwater optical sensing, underwater magnetic sensing, as well as underwater bionic sensing. Finally, we also propose some valuable suggestions and future challenges and directions for future researches.
在本综述中,我们希望为水下机器人传感这一日益增长的研究领域做出贡献。因此,我们调查了水下机器人传感技术的相关工作,包括水下声学传感、水下光学传感、水下磁性传感以及水下仿生传感。最后,我们还提出了一些有价值的建议以及未来的挑战和研究方向。
Declaration of Competing Interest
竞争利益声明
The authors declare no conflict of interest.
作者声明没有利益冲突。
Acknowledgements 致谢
This work is supported by the National Key Research and Development Program of China (2019YFB1310300) and National Nature Science Foundation of China under Grant (61722311, 61821005).
本研究得到国家重点研发计划(2019YFB1310300)和国家自然科学基金(61722311、61821005)的资助。
References
- [1]Challenges in underwater visual navigation and slam, in AI Technology for Underwater RobotsSpringer, 96 (2020), pp. 125-135
- [2]Underwater 3-d spatial attenuation characteristics of electromagnetic waves with omnidirectional antennaIEEE/ASME Trans. Mechatron., 21 (3) (2016), pp. 1409-1419
- [3]Artificial lateral line with biomimetic neuromasts to emulate fish sensingBioinspiration Biomim., 5 (1) (2010), Article 016001
- [4]Modeling of the manipulation operation sunken submarines for the underwater remotelyoperated vehicleInternational Journal of Modeling and Optimization, 2 (5) (2012), p. 579
- [5]L.D. Barker, M.V. Jakuba, A.D. Bowen, et al., “Scientific challenges and present capabilities in underwater robotic vehicle design and navigation for oceanographic exploration under-ice,” Remote Sensing, vol. 12, no. 16, p. 2588, 2020.
- [6]Experimental analysis of low-altitude terrain following for hover-capable flight-style autonomous underwater vehiclesJournal of Field Robotics, 36 (8) (2019), pp. 1399-1421
- [7]Review of mission planning for autonomous marine vehicle fleetsJournal of Field Robotics, 36 (2) (2019), pp. 333-354
- [8]Novel event analysis for human-machine collaborative underwater explorationPattern Recognition, 96 (2019), pp. 1-11
- [9]Bark-wavelet analysis and hilbert–huang transform for underwater target recognitionDefence Technology, 9 (2) (2013), pp. 115-120
- [10]W. Zhang, Y. Wu, D. Wang, et al., “Underwater target feature extraction and classification based on gammatone filter and machine learning,” in Proceedings of the 2018 International Conference on Wavelet Analysis and Pattern Recognition, 2018, pp. 42–47.
- [11]Underwater acoustic target classification based on dense convolutional neural networkIEEE Geoscience and Remote Sensing Letters (2020)
- [12]Underwater acoustic target recognition: A combination of multi-dimensional fusion features and modified deep neural networkRemote Sensing, 11 (16) (2019), p. 1888
- [13]Deep gabor neural network for automatic detection of mine-like objects in sonar imageryIEEE Access, 8 (6) (2020), pp. 94 126-94 139
- [14]Three-dimensional reconstruction of underwater objects using wide-aperture imaging sonarJournal of Field Robotics, 35 (6) (2018), pp. 890-905
- [15]S. Suresh, “Localization and active exploration in indoor underwater environments,” Master’s thesis, Pittsburgh, PA, August 2019.
- [16]N. Brahim, D. Gueriot, S. Daniel, et al., “3d reconstruction of ´ underwater scenes using didson acoustic sonar image sequences through evolutionary algorithms,” in OCEANS, 2011, pp. 1–6.
- [17]T.A. Huang and M. Kaess, “Incremental data association for acoustic structure from motion,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2016, pp. 1334–1341.
- [18]Y. Ji, S. Kwak, A. Yamashita, et al., “Acoustic camera-based 3d measurement of underwater objects through automated extraction and association of feature points,” in Proceedings of the 2016 IEEE Conference on Multisensor Fusion and Integration for Intelligent Systems, 2016, pp. 224–230.
- [19]Three-dimensional reconstruction of underwater objects from a sequence of sonar images, 3 (1996), pp. 927-930
- [20]Auv-based underwater 3-d point cloud generation using acoustic lens-based multibeam sonarIEEE Journal of Oceanic Engineering, 43 (4) (2017), pp. 856-872
- [21]S. Rahman, A.Q. Li, I. Rekleitis, “Sonar visual inertial slam of underwater structures,” in Proceedings of the 2018 IEEE International Conference on Robotics and Automation, 2018, pp. 1–7.
- [22]S. Rahman and A. Quattrini, “Contour based reconstruction of underwater structures using sonar, visual, inertial, and depth sensor,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 8054–8059.
- [23]In-water visual ship hull inspection using a hover-capable underwater vehicle with stereo visionJournal of Field Robotics, 36 (3) (2019), pp. 531-546
- [24]D. Akkaynak and T. Treibitz, “Sea-thru: A method for removing water from underwater images,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019, pp. 1682–1691.
- [25]C. Li, C. Guo, W. Ren, et al., “An underwater image enhancement benchmark dataset and beyond,” IEEE Transactions on Image Processing, vol. 29, pp. 4376–4389, 2020.
- [26]Enhancing the low quality images using unsupervised colour correction methodMan and Cybernetics (2010), pp. 1703-1709
- [27]X. Fu, Z. Fan, M. Ling, et al., “Two-step approach for single underwater image enhancement,” in Proceedings of the 2017 International Symposium on Intelligent Signal Processing and Communication Systems, 2017, pp. 789–794.
- [28]Initial results in underwater single image dehazingOceans (2010), pp. 1-8
- [29]Automatic red-channel underwater image restorationJournal of Visual Communication and Image Representation, 26 (2015), pp. 132-145
- [30]Watergan: Unsupervised generative network to enable real-time color correction of monocular underwater imagesIEEE Robotics and Automation Letters, 3 (1) (2017), pp. 387-394
- [31]C. Li, S. Anwar, F. Porikli, “Underwater scene prior inspired deep underwater image and video enhancement,” Pattern Recognition, vol. 98, p. 107038, 2020.
- [32]Underwater image enhancement using a multiscale dense generative adversarial networkIEEE Journal of Oceanic Engineering, 45 (3) (2020), pp. 862-870
- [33]An underwater image enhancement benchmark dataset and beyondIEEE Transactions on Image Processing, 29 (2019), pp. 4376-4389
- [34]You only look once: Unified, real-time object detection, IEEE (2016), pp. 779-788
- [35]Ssd: Single shot multibox detector,” in European Conference on Computer V, Springer (2016), pp. 21-37
- [36]H. Jiang and E. Learned-Miller, “Face detection with the faster r-cnn,” in Proceedings of the 2017 12th IEEE conference on Automatic Face Gesture Recognition, 2017, pp. 650–657.
- [37]Flat refractive geometry,” IEEE transactions on pattern analysis and machine intelligence, 34 (1) (2011), pp. 51-65
- [38]A. Agrawal, S. Ramalingam, Y. Taguchi, et al., “A theory of multi-layer flat refractive geometry,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2012, pp. 3346–3353.
- [39]F. Chadebecq, F. Vasconcelos, G. Dwyer, et al., “Refractive structure-from-motion through a flat refractive interface,” in Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 5315–5323.
- [40]The pinax-model for accurate and efficient refraction correction of underwater cameras in flat-pane housingsOcean Engineering, 133 (2017), pp. 9-22
- [41]Generation of high-resolution three-dimensional reconstructions of the seafloor in color using a single camera and structured lightJournal of Field Robotics, 34 (5) (2017), pp. 833-851
- [42]Scout3d–an underwater ¨ laser scanning system for mobile mapping,” International Archives of PhotogrammetryRemote Sensing and Spatial Information Sciences, 42 (2019), pp. 13-18
- [43]Inspection of an underwater structure using point-cloud slam with an auv and a laser scannerJournal of Field Robotics, 36 (8) (2019), pp. 1333-1344
- [44]Underwater laser scanner: Ray-based model and calibrationIEEE/ASME Transactions on Mechatronics, 24 (5) (2019), pp. 1986-1997
- [45]C. Gu, Y. Cong, G. Sun, "Three Birds, One Stone: Unified Laser-Based 3-D Reconstruction Across Different Media," in IEEE Transactions on Instrumentation and Measurement, vol. 70, pp. 1-12, 2021.
- [46]B. Joshi, S. Rahman, M. Kalaitzakis, et al., “Experimental comparison of open source visual-inertial-based state estimation algorithms in the underwater domain,” in Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2019, pp. 7227–7233.
- [47]F. Chadebecq, F. Vasconcelos, G. Dwyer, et al., “Refractive structure-from-motion through a flat refractive interface,” in Proceedings of the IEEE International Conference on Computer Vision, 2017, pp. 5315–5323.
- [48]Environment driven underwater cameraimu calibration for monocular visual-inertial slam, IEEE (2019), pp. 2405-2411
- [49]Aqualoc: An underwater dataset for visual–inertial–pressure localizationThe International Journal of Robotics Research, 38 (14) (2019), pp. 1549-1559
- [50]E. Iscar, K.A. Skinner, M. Johnson-Roberson, “Multi-view 3d reconstruction in underwater environments: Evaluation and benchmark,” in OCEANS, 2017, pp. 1–8.
- [51]A. Jordt-Sedlazeck and R. Koch, “Refractive structure-from-motion on underwater images,” in Proceedings of the IEEE International Conference on Computer Vision, 2013, pp. 57–64.
- [52]Refractive two-view reconstruction for underwater 3d visionInternational Journal of Computer Vision, 128 (2019), pp. 1-17
- [53]R. J. Jinfang Cheng, Jiawei Zhang, “Development of underwater electromagnetic detection technology,” Mine Warfare and Ship Protection, vol. 002, no. 004, pp. 45–49, 2019.
- [54]A review of artificial lateral line in sensor fabrication and bionic applications for robot fish,”AppliedBionics and Biomechanics,2016,(2016-12-27) (2016), pp. 1-15
- [55]W. Wang, J. Liu, G. Xie, et al., “A bio-inspired electrocommunication system for small underwater robots,” Bioinspiration biomimetics, vol. 12, no. 3, p. 036002, 2017.
- [56]B. J. Wolf, J. A. Morton, W. N. MacPherson, et al., “Bio-inspired all-optical artificial neuromast for 2d flow sensing,” Bioinspiration & biomimetics, vol. 13, no. 2, p. 026013, 2018.
- [57]H. Beem, M. Hildner, M. Triantafyllou, “Calibration and validation of a harbor seal whisker-inspired flow sensor,” Smart Materials and Structures, vol. 22, no. 1, p. 014012, 2012.
- [58]Distributed flow estimation and closed-loop control of an underwater vehicle with a multi-modal artificial lateral lineBioinspirationbio biomimetics, 10 (2) (2015), p. 025002
- [59]Y. Yang, N. Nguyen, N. Chen, et al., “Artificial lateral line with biomimetic neuromasts to emulate fish sensing,” Bioinspiration and biomimetics, vol. 5, no. 1, p. 016001, 2010.
- [60]A flexible underwater pressure sensor array using a conductive elastomer strain gaugeJournal of microelectromechanical systems, 21 (4) (2012), pp. 897-907
- [61]Artificial fish skin of self-powered micro-electromechanical systems hair cells for sensing hydrodynamic flow phenomenaJournal of the Royal Society Interface, 12 (111) (2015), p. 20150322
- [62]Y. Yang, J. Chen, J. Engel, et al., “Distant touch hydrodynamic imaging with an artificial lateral line,” Proceedings of the National Academy of Sciences, vol. 103, no. 50, pp. 18 891–18 895, 2006.
- [63]Wiegerink,“Mems based hair flow-sensors as model systems for acoustic perception studiesNanotechnology, 17 (4) (2006), p. S84
- [64]N.F. Lepora, N. Burnus, Y. Tao, et al., “Active touch with a biomimetic 3d-printedwhiskered robot,” in Proceedings of the Conference on Biomimetic and Biohybrid Systems. Spring.
- [65]Fully 3d printed multi-material soft bio-inspired whisker sensor for underwater-induced vortex detection, SoftRobotics, 5 (2) (2018), pp. 122-132
Cited by (81)
A comprehensive review on the evolution of bio-inspired sensors from aquatic creatures
2024, Cell Reports Physical ScienceMagnetic focusing sensor and its characterizations of defect non-destructive testing for ferromagnetic steel plate
2024, NDT and E InternationalVisual sensing on marine robotics for the 3D documentation of Underwater Cultural Heritage: A review
2024, Journal of Archaeological Science
Yang Cong is a full professor of Chinese Academy of Sciences. He received the B.Sc. degree from Northeast University in 2004, and the Ph.D. degree from State Key Laboratory of Robotics, Chinese Academy of Sciences in 2009. He was a Research Fellow of National University of Singapore (NUS) and Nanyang Technological University (NTU) from 2009 to 2011, respectively; and a visiting scholar of University of Rochester. He has served on the editorial board of the Journal of Multimedia. His current research interests include image processing, compute vision, machine learning, multimedia, medical imaging, data mining and robot navigation. He has authored over 100 technical papers. He is also a senior member of IEEE.