Iterative refinement for singular value decomposition based on matrix multiplication

doi:10.1016/j.cam.2019.112512

Journal of Computational and Applied Mathematics

Volume 369, 1 May 2020, 112512
第 369 卷，2020 年 5 月 1 日，112512

https://doi.org/10.1016/j.cam.2019.112512 Get rights and content 获取权利和内容

Under an Elsevier user license
在 Elsevier 用户许可下

open archive 打开档案

Abstract 抽象

We propose a refinement algorithm for singular value decomposition (SVD) of a real matrix. In the same manner as Newton’s method, the proposed algorithm converges quadratically if a modestly accurate initial guess is given. Since the proposed algorithm is based on matrix multiplication, it can efficiently be implemented. Numerical results demonstrate the excellent performance of the proposed algorithm in terms of the convergence rate and the measured computing time compared to a standard approach using multiple precision arithmetic.
我们提出了一种用于实矩阵奇异值分解（SVD）的细化算法。与牛顿方法相同，如果给出适度准确的初始估计值，则所提出的算法将呈二次方收敛。由于所提出的算法是基于矩阵乘法的，因此可以有效地实现。数值结果表明，与使用多精度算术的标准方法相比，所提算法在收敛速率和实测计算时间方面具有优异的性能。

MSC MSC 系列

65F30

15A18

15A23

Keywords 关键字

SVD

Iterative refinement

Convergence analysis

Higher-precision arithmetic

SVD

迭代细化

收敛分析

更高精度的算术

1. Introduction 1. 引言

Let

A

be a real

m \times n

matrix. In this paper, we propose a refinement algorithm for the singular value decomposition (SVD) of

A

with

m \geq n

. If

m < n

, considering the SVD of

A^{T}

yields equivalent results. It is well known that the SVD has many applications in various fields, such as signal processing [1], [2], statistical analysis [3], [4], and so forth. Excellent overviews of the SVD can be found in [5], [6].
设

A

为实

m \times n

矩阵。在本文中，我们提出了一种 with

m \geq n

的

A

奇异值分解（SVD）的细化算法。如果

m < n

，考虑的 SVD

A^{T}

会产生等效的结果。众所周知，SVD 在各个领域都有许多应用，例如信号处理、统计分析等。SVD 的精彩概述可以在、中找到。

Throughout the paper, let

I_{n}

and

O

denote the

n \times n

identity matrix and the zero matrix of appropriate size, respectively. Moreover,

‖ \cdot ‖

denotes the spectral norm for matrices. If necessary, we distinguish between the approximate quantities and the computed results, e.g., for some quantity

α

, we write

\tilde{α}

and

\hat{α}

as an approximation of

α

and a computed result for

α

, respectively.
在整篇论文中，let

I_{n}

和

O

分别表示适当大小的

n \times n

单位矩阵和零矩阵。此外，

‖ \cdot ‖

表示矩阵的谱范数。如有必要，我们区分近似量和计算结果，例如，对于某个量

α

，我们分别将和

\hat{α}

写

\tilde{α}

为

α

的近似

α

值和计算结果。

Let

σ_{i} \in R

i = 1, \dots, n

, denote the singular values of

A

. We consider the (full size) SVD of

A

such that

A = U Σ V^{T}, U \in R^{m \times m}, V \in R^{n \times n}, Σ \in R^{m \times n},

where both

U

and

V

are orthogonal and

Σ

is diagonal with

Σ_{i i} = σ_{i}

. For simplicity, we assume that

σ_{1} > σ_{2} > \dots > σ_{n} > 0 .

In other words, we consider the case that all singular values are simple, and

A

has full column rank. If there are multiple or nearly multiple singular values, we need some special care as in [7], [8].
设

σ_{i} \in R

，

i = 1, \dots, n

，表示的

A

奇异值。我们考虑的（全尺寸）SVD 是

A

这样的，其中

U

和

V

都是正交的，并且

Σ

是与

Σ_{i i} = σ_{i}

的对角线。为简单起见，我们假设换句话说，我们考虑所有奇异值都是简单的，并且

A

具有完整的列排名。如果存在多个或几乎多个奇异值，则需要特别小心，如，。

Recently, the authors proposed refinement algorithms for symmetric eigenvalue decomposition in [7], [8]. In the same spirit of the previous papers, the use of higher-precision arithmetic in our proposed refinement algorithm for the SVD is primarily restricted to matrix multiplication, which accounts for most of the computational cost. There are several approaches for higher-precision matrix multiplication. For example, XBLAS (extra precise BLAS) [9] and fast and accurate algorithms for dot products [10] and matrix products [11] based on error-free transformations are available for efficient implementation.
最近，作者在 [7]、[8] 中提出了对称特征值分解的改进算法。本着与前几篇论文相同的精神，在我们提出的 SVD 细化算法中，更高精度算术的使用主要限于矩阵乘法，这占了大部分计算成本。有几种方法可以进行更高精度的矩阵乘法。例如，XBLAS （超精确 BLAS） [9] 和基于无差错变换的点积 [10] 和矩阵积 [11] 的快速准确算法可用于高效实现。

The idea of our algorithm is to use the following relations: (1)

U^{T} U = I_{m} (orthogonality of U)

(2)

V^{T} V = I_{n} (orthogonality of V)

(3)

U^{T} A V = Σ (diagonality of A as the SVD)

Using these relations, we develop a refinement algorithm for the SVD in the same manner as Newton’s method. Thus, the proposed algorithm has quadratic convergence.
我们算法的思路是使用以下关系式：（1）

U^{T} U = I_{m} （U 的 正交性 ）（

2）

V^{T} V = I_{n} （V 的 正交性 ）

（3）

Unexpected text node: '（'

利用这些关系，我们以与牛顿方法相同的方式为 SVD 开发了一种细化算法。因此，所提出的算法具有二次收敛性。

There exist several refinement algorithms for SVD that are based on Newton’s method for nonlinear equations (cf. e.g., [12]). Since this sort of algorithm is designed to improve a triplet

(σ, u, v) \in R \times R^{n} \times R^{n}

individually, where

σ

is a singular value and

u

and

v

are corresponding left and right singular vectors, applying such an approach to all triplets requires

O (n^{4})

arithmetic operations. In [13], Davies and Smith proposed an iterative refinement algorithm for updating the singular value decomposition in

O (n^{3})

operations. However, similarly to the Davies–Modi algorithm [14] for the symmetric eigenvalue decomposition as mentioned in the previous paper [7], the Davies–Smith algorithm has the limitation of achievable accuracy of the results. The reason is as follows. The Davies–Smith algorithm assumes that a given real matrix

A

is preconditioned to a nearly diagonal matrix such as

{\hat{U}}^{T} A \hat{V}

, where

\hat{U}

and

\hat{V}

are computed SVD factors, i.e.,

\hat{U}

and

\hat{V}

are approximately orthogonal matrices. Since

\hat{U}

and

\hat{V}

involve numerical errors, the matrix multiplications in

{\hat{U}}^{T} A \hat{V}

are generally not orthogonal transformations, and the singular values of

{\hat{U}}^{T} A \hat{V}

are slightly perturbed from the original matrix

A

. Then, the singular vectors are also perturbed. Therefore, even if the Davies–Smith algorithm provides accurate singular vectors of

{\hat{U}}^{T} A \hat{V}

, they are not necessarily accurate ones of

A

. On the other hand, our proposed algorithm uses the original matrix

A

for obtaining accurate singular vectors of

A

.
SVD 有几种基于牛顿非线性方程法的细化算法（参见 e.g.）。由于这种算法旨在单独改进三元组

(σ, u, v) \in R \times R^{n} \times R^{n}

，其中

σ

是奇异值和

u

和

v

是相应的左和右奇异向量，因此将这种方法应用于所有三元组需要

O (n^{4})

算术运算。在中，Davies 和 Smith 提出了一种迭代细化算法，用于更新运算中的

O (n^{3})

奇异值分解。然而，与上一篇文章中提到的对称特征值分解的 Davies-Modi 算法类似，Davies-Smith 算法在结果的可实现精度方面存在局限性。原因如下。Davies-Smith 算法假设给定的实数矩阵

A

被预设为接近对角线的矩阵，例如

{\hat{U}}^{T} A \hat{V}

，其中

\hat{U}

和

\hat{V}

是计算的 SVD 因子，即

\hat{U}

和

\hat{V}

是近似正交矩阵。由于

\hat{U}

和

\hat{V}

涉及数值误差，因此中的

{\hat{U}}^{T} A \hat{V}

矩阵乘法通常不是正交变换，并且的

{\hat{U}}^{T} A \hat{V}

奇异值与原始矩阵

A

略有不同。然后，奇异向量也会受到扰动。因此，即使 Davies-Smith 算法提供了的

{\hat{U}}^{T} A \hat{V}

精确奇异向量，它们也不一定是的

A

准确向量。另一方面，我们提出的算法使用原始矩阵

A

来获取的

A

精确奇异向量。

The rest of the paper is organized as follows. In Section 2, we present a refinement algorithm for the SVD. In Section 3, we provide a convergence analysis of the proposed algorithm. In Section 4, we present some numerical results showing the behavior and performance of the proposed algorithm.
本文的其余部分组织如下。在第 2 节中，我们提出了 SVD 的优化算法。在第 3 节中，我们提供了所提出的算法的收敛分析。在第 4 节中，我们提供了一些数值结果，显示了所提出的算法的行为和性能。

2. Proposed algorithm 2. 建议的算法

Let

\hat{U} \in R^{m \times m}

and

\hat{V} \in R^{n \times n}

be given approximation of

U

and

V

, respectively. Let further

F \in R^{m \times m}

and

G \in R^{n \times n}

be correction matrices satisfying

U = \hat{U} (I_{m} + F)

and

V = \hat{V} (I_{n} + G)

, respectively. Let

ϵ

be defined as (4)

ϵ ≔ max (ϵ_{F}, ϵ_{G}), ϵ_{F} ≔ ‖ F ‖, ϵ_{G} ≔ ‖ G ‖ .

We assume that

ϵ < 1

. Then, both

I_{m} + F

and

I_{n} + G

are nonsingular, and

{(I_{m} + F)}^{- 1} = I_{m} - F + Δ_{F}, Δ_{F} ≔ \sum_{k = 2}^{\infty} {(- F)}^{k}, ‖ Δ_{F} ‖ \leq \frac{ϵ_{F}}{1 - ϵ_{F}}, {(I_{n} + G)}^{- 1} = I_{n} - G + Δ_{G}, Δ_{G} ≔ \sum_{k = 2}^{\infty} {(- G)}^{k}, ‖ Δ_{G} ‖ \leq \frac{ϵ_{G}}{1 - ϵ_{G}} .

Inserting

U = \hat{U} (I_{m} + F)

into (1), we have

(I_{m} + F^{T}) {\hat{U}}^{T} \hat{U} (I_{m} + F) = I_{m}

and

{\hat{U}}^{T} \hat{U} = {(I_{m} + F^{T})}^{- 1} {(I_{m} + F)}^{- 1} = (I_{m} - F^{T} + Δ_{F}^{T}) (I_{m} - F + Δ_{F}),

which yields (5)

F + F^{T} = I_{m} - {\hat{U}}^{T} \hat{U} + Δ_{1}, Δ_{1} ≔ Δ_{F} + Δ_{F}^{T} + {(F - Δ_{F})}^{T} (F - Δ_{F}) .

Similarly, inserting

V = \hat{V} (I_{n} + G)

into (2), we have (6)

G + G^{T} = I_{n} - {\hat{V}}^{T} \hat{V} + Δ_{2}, Δ_{2} ≔ Δ_{G} + Δ_{G}^{T} + {(G - Δ_{G})}^{T} (G - Δ_{G}) .

Moreover, inserting

U = \hat{U} (I_{m} + F)

and

V = \hat{V} (I_{n} + G)

into (3), we have (7)

Σ - F^{T} Σ - Σ G = {\hat{U}}^{T} A \hat{V} + Δ_{3}, Δ_{3} ≔ - Σ Δ_{G} - Δ_{F}^{T} Σ - {(F - Δ_{F})}^{T} Σ (G - Δ_{G}) .

Here, (8)

‖ Δ_{1} ‖ \leq \frac{(3 - 2 ϵ_{F}) ϵ_{F}^{2}}{{(1 - ϵ_{F})}^{2}} \leq χ (ϵ) ϵ^{2},

(9)

‖ Δ_{2} ‖ \leq \frac{(3 - 2 ϵ_{G}) ϵ_{G}^{2}}{{(1 - ϵ_{G})}^{2}} \leq χ (ϵ) ϵ^{2},

(10)

‖ Δ_{3} ‖ \leq \frac{ϵ_{F}^{2} + ϵ_{G}^{2} + (1 - ϵ_{F} - ϵ_{G}) ϵ_{F} ϵ_{G}}{(1 - ϵ_{F}) (1 - ϵ_{G})} ‖ Σ ‖ \leq χ (ϵ) ϵ^{2} ‖ A ‖,

where
设

\hat{U} \in R^{m \times m}

和

\hat{V} \in R^{n \times n}

分别得到

U

和

V

的近似值。设 further

F \in R^{m \times m}

和

G \in R^{n \times n}

分别是满足

U = \hat{U} (I_{m} + F)

和

V = \hat{V} (I_{n} + G)

的校正矩阵。设

ϵ

定义为我们假设

ϵ < 1

。那么，和都是

I_{m} + F

奇异的，并且插入

U = \hat{U} (I_{m} + F)

到，我们有和，得到同样，插入

V = \hat{V} (I_{n} + G)

到，我们有此外，插入

U = \hat{U} (I_{m} + F)

和

V = \hat{V} (I_{n} + G)

进入，我们有这里

I_{n} + G

，其中(11)

χ (ϵ) ≔ \frac{3 - 2 ϵ}{{(1 - ϵ)}^{2}} .

Omitting the second-order terms

Δ_{1}

Δ_{2}

, and

Δ_{3}

from (5), (6), and (7) in a similar way to Newton’s method, we obtain a system of matrix equations for

\tilde{F} = ({\tilde{f}}_{i j}) \in R^{m \times m}

\tilde{G} = ({\tilde{g}}_{i j}) \in R^{n \times n}

, and

\tilde{Σ} = diag ({\tilde{σ}}_{i}) \in R^{m \times n}

as (12)

\{\begin{matrix} \tilde{F} + {\tilde{F}}^{T} = R, R ≔ I_{m} - {\hat{U}}^{T} \hat{U} \\ \tilde{G} + {\tilde{G}}^{T} = S, S ≔ I_{n} - {\hat{V}}^{T} \hat{V} \\ \tilde{Σ} - {\tilde{F}}^{T} \tilde{Σ} - \tilde{Σ} \tilde{G} = T, T ≔ {\hat{U}}^{T} A \hat{V} \end{matrix}

(13)

\Leftrightarrow \{\begin{matrix} {\tilde{f}}_{i j} + {\tilde{f}}_{j i} = r_{i j} & for 1 \leq i, j \leq m \\ {\tilde{g}}_{i j} + {\tilde{g}}_{j i} = s_{i j} & for 1 \leq i, j \leq n \\ {\tilde{Σ}}_{i j} - {\tilde{σ}}_{j} {\tilde{f}}_{j i} - {\tilde{σ}}_{i} {\tilde{g}}_{i j} = t_{i j} & for 1 \leq i \leq m, 1 \leq j \leq n \end{matrix} .

All that remains is to solve (12) for

\tilde{F}

\tilde{G}

, and

\tilde{Σ}

.
以与牛顿方法类似的方式省略（5）、（6）和（7）中的二阶项

Δ_{1}

、

Δ_{2}

和

Δ_{3}

，我们得到一个矩阵

\tilde{F} = ({\tilde{f}}_{i j}) \in R^{m \times m}

方程组，

\tilde{G} = ({\tilde{g}}_{i j}) \in R^{n \times n}

\tilde{Σ} = diag ({\tilde{σ}}_{i}) \in R^{m \times n}

因为 (12)

\{\begin{matrix} \tilde{F} + {\tilde{F}}^{T} = R, R ≔ I_{m} - {\hat{U}}^{T} \hat{U} \\ \tilde{G} + {\tilde{G}}^{T} = S, S ≔ I_{n} - {\hat{V}}^{T} \hat{V} \\ \tilde{Σ} - {\tilde{F}}^{T} \tilde{Σ} - \tilde{Σ} \tilde{G} = T, T ≔ {\hat{U}}^{T} A \hat{V} \end{matrix}

(13)

\Leftrightarrow \{\begin{matrix} {\tilde{f}}_{i j} + {\tilde{f}}_{j i} = r_{i j} & for 1 \leq i, j \leq m \\ {\tilde{g}}_{i j} + {\tilde{g}}_{j i} = s_{i j} & for 1 \leq i, j \leq n \\ {\tilde{Σ}}_{i j} - {\tilde{σ}}_{j} {\tilde{f}}_{j i} - {\tilde{σ}}_{i} {\tilde{g}}_{i j} = t_{i j} & for 1 \leq i \leq m, 1 \leq j \leq n \end{matrix} .

剩下的就是求解

\tilde{F}

、和

\tilde{G}

\tilde{Σ}

的（12）。

In the following, we will show that we can easily solve the system of matrix equations (12). We partition

\tilde{F}, \tilde{Σ}, R, T

as follows:

\begin{matrix} \overset{n}{\overset{︷}{}} & \overset{m - n}{\overset{︷}{}} \\ \tilde{F} =[ & {\tilde{F}}_{11} & {\tilde{F}}_{12} & ] & } n \\ {\tilde{F}}_{21} & {\tilde{F}}_{22} & } m - n, \end{matrix} \begin{matrix} \overset{n}{\overset{︷}{}} \\ \tilde{Σ} =[ & {\tilde{Σ}}_{n} & ] & } n \\ O & } m - n \end{matrix} \begin{matrix} \overset{n}{\overset{︷}{}} & \overset{m - n}{\overset{︷}{}} \\ R =[ & R_{11} & R_{12} & ] & } n \\ R_{21} & R_{22} & } m - n, \end{matrix} \begin{matrix} \overset{n}{\overset{︷}{}} \\ T =[ & T_{1} & ] & } n \\ T_{2} & } m - n \end{matrix}

Then it follows from (12) that (14a)

{\tilde{F}}_{11} + {\tilde{F}}_{11}^{T} = R_{11},

(14b)

{\tilde{F}}_{21} + {\tilde{F}}_{12}^{T} = R_{21},

(14c)

{\tilde{F}}_{22} + {\tilde{F}}_{22}^{T} = R_{22},

and (15a)

{\tilde{Σ}}_{n} - {\tilde{F}}_{11}^{T} {\tilde{Σ}}_{n} - {\tilde{Σ}}_{n} \tilde{G} = T_{1},

(15b)

{\tilde{F}}_{12}^{T} {\tilde{Σ}}_{n} = - T_{2} \Leftrightarrow {\tilde{Σ}}_{n} {\tilde{F}}_{12} = - T_{2}^{T} .

在下文中，我们将展示我们可以轻松求解矩阵方程组（12）。我们按如下方式进行分区

\tilde{F}, \tilde{Σ}, R, T

：{n {m−n F ̃= F ̃11 F ̃12 }n F ̃21 F ̃22 }m−n， {n Σ ̃= Σ ̃n }n O }m−n {n {m−n R= R11 R12 }n R21 R22 }m−n， {n T= T1 }n T2 }m−n 然后，从（12） (14a)

{\tilde{F}}_{11} + {\tilde{F}}_{11}^{T} = R_{11},

(14b)

{\tilde{F}}_{21} + {\tilde{F}}_{12}^{T} = R_{21},

(14c)

{\tilde{F}}_{22} + {\tilde{F}}_{22}^{T} = R_{22},

可以得出和 (15a)

{\tilde{Σ}}_{n} - {\tilde{F}}_{11}^{T} {\tilde{Σ}}_{n} - {\tilde{Σ}}_{n} \tilde{G} = T_{1},

(15b)

{\tilde{F}}_{12}^{T} {\tilde{Σ}}_{n} = - T_{2} \Leftrightarrow {\tilde{Σ}}_{n} {\tilde{F}}_{12} = - T_{2}^{T} .

First, we focus on the diagonal parts of

{\tilde{F}}_{11}

and

\tilde{G}

. It follows from the first and second equations in (13) that

{\tilde{f}}_{i i} = \frac{r_{i i}}{2}, {\tilde{g}}_{i i} = \frac{s_{i i}}{2} for 1 \leq i \leq n .

Moreover, the third equation in (13) yields

(1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i}) {\tilde{σ}}_{i} = (1 - (r_{i i} + s_{i i}) ∕ 2) {\tilde{σ}}_{i} = t_{i i} for 1 \leq i \leq n .

Thus, if

r_{i i} + s_{i i} \neq 2

for

1 \leq i \leq n

, we have (16)

{\tilde{σ}}_{i} = \frac{t_{i i}}{1 - (r_{i i} + s_{i i}) ∕ 2} for 1 \leq i \leq n .

首先，我们关注

{\tilde{F}}_{11}

和的

\tilde{G}

对角线部分。从（13）中的第一个和第二个方程中可以得出，

{\tilde{f}}_{i i} = \frac{r_{i i}}{2}, {\tilde{g}}_{i i} = \frac{s_{i i}}{2} for 1 \leq i \leq n .

此外，（13）中的第三个方程得出

(1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i}) {\tilde{σ}}_{i} = (1 - (r_{i i} + s_{i i}) ∕ 2) {\tilde{σ}}_{i} = t_{i i} for 1 \leq i \leq n .

因此，如果

r_{i i} + s_{i i} \neq 2

for

1 \leq i \leq n

，我们有 (16)

{\tilde{σ}}_{i} = \frac{t_{i i}}{1 - (r_{i i} + s_{i i}) ∕ 2} for 1 \leq i \leq n .

Remark 1 注 1

In theory, there is a possibility that

r_{i i} + s_{i i} = 2

. However,

R

and

S

are residuals in terms of orthogonality, and it is likely that

| r_{i i} | ≪ 1

and

| s_{i i} | ≪ 1

, and

| r_{i i} + s_{i i} | ≪ 1

in practice.
理论上，有可能

r_{i i} + s_{i i} = 2

.但是，

R

and

S

是正交性的残差，并且很可能是

| r_{i i} | ≪ 1

和

| s_{i i} | ≪ 1

，并且

| r_{i i} + s_{i i} | ≪ 1

在实践中。

Next, we focus on the off-diagonal parts of

{\tilde{F}}_{11}

and

\tilde{G}

. Combining (13), (16), they can be determined by solving 4 × 4 linear systems (17)

{\tilde{f}}_{i j} + {\tilde{f}}_{j i} = r_{i j}

(18)

{\tilde{g}}_{i j} + {\tilde{g}}_{j i} = s_{i j}

(19)

{\tilde{σ}}_{i} {\tilde{f}}_{i j} + {\tilde{σ}}_{j} {\tilde{g}}_{j i} = - t_{j i}

(20)

{\tilde{σ}}_{j} {\tilde{f}}_{j i} + {\tilde{σ}}_{i} {\tilde{g}}_{i j} = - t_{i j}

for

1 \leq i, j \leq n

i \neq j

. By multiplying (19) by

{\tilde{σ}}_{i}

and (20) by

{\tilde{σ}}_{j}

{\tilde{σ}}_{i}^{2} {\tilde{f}}_{i j} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} {\tilde{g}}_{j i} = - {\tilde{σ}}_{i} t_{j i},

{\tilde{σ}}_{j}^{2} {\tilde{f}}_{j i} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} {\tilde{g}}_{i j} = - {\tilde{σ}}_{j} t_{i j},

and

{\tilde{σ}}_{i}^{2} {\tilde{f}}_{i j} + {\tilde{σ}}_{j}^{2} {\tilde{f}}_{j i} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} ({\tilde{g}}_{i j} + {\tilde{g}}_{j i}) = - {\tilde{σ}}_{i} t_{j i} - {\tilde{σ}}_{j} t_{i j} .

Inserting (18) into this yields

{\tilde{σ}}_{i}^{2} {\tilde{f}}_{i j} + {\tilde{σ}}_{j}^{2} {\tilde{f}}_{j i} = - {\tilde{σ}}_{i} t_{j i} - {\tilde{σ}}_{j} t_{i j} - {\tilde{σ}}_{i} {\tilde{σ}}_{j} s_{i j} .

Combining this and (17), we have

({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) {\tilde{f}}_{i j} = {\tilde{σ}}_{j}^{2} r_{i j} + {\tilde{σ}}_{i} t_{j i} + {\tilde{σ}}_{j} t_{i j} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} s_{i j} = {\tilde{σ}}_{j} (t_{i j} + {\tilde{σ}}_{j} r_{i j}) + {\tilde{σ}}_{i} (t_{j i} + {\tilde{σ}}_{j} s_{i j}) .

Similarly, using (17)–(20), we obtain

({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) {\tilde{g}}_{i j} = {\tilde{σ}}_{i} (t_{i j} + {\tilde{σ}}_{j} r_{i j}) + {\tilde{σ}}_{j} (t_{j i} + {\tilde{σ}}_{j} s_{i j}) .

Hence, (21)

\begin{matrix} {\tilde{f}}_{i j} = \frac{α_{i j} {\tilde{σ}}_{j} + β_{i j} {\tilde{σ}}_{i}}{{\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}}, {\tilde{g}}_{i j} = \frac{α_{i j} {\tilde{σ}}_{i} + β_{i j} {\tilde{σ}}_{j}}{{\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}} \end{matrix} if {\tilde{σ}}_{i} \neq {\tilde{σ}}_{j} for 1 \leq i, j \leq n, i \neq j,

where

α_{i j} ≔ t_{i j} + {\tilde{σ}}_{j} r_{i j}

and

β_{i j} ≔ t_{j i} + {\tilde{σ}}_{j} s_{i j}

. Moreover, combining (15b), (16),

{\tilde{F}}_{12}

can also be determined as (22)

{\tilde{f}}_{i j} = - \frac{t_{j i}}{{\tilde{σ}}_{i}} if {\tilde{σ}}_{i} \neq 0 for 1 \leq i \leq n, n + 1 \leq j \leq m .

Furthermore, combining (14b), (22),

{\tilde{F}}_{21}

is determined as

{\tilde{F}}_{21} = R_{21} - {\tilde{F}}_{12}^{T}

and

{\tilde{f}}_{i j} = r_{i j} - {\tilde{f}}_{j i} = r_{i j} + t_{i j} ∕ {\tilde{σ}}_{j} if {\tilde{σ}}_{j} \neq 0 for n + 1 \leq i \leq m, 1 \leq j \leq n .

Finally,

{\tilde{F}}_{22}

can arbitrarily be determined on the condition (14c). Thus we choose

{\tilde{f}}_{i j}

{\tilde{f}}_{i j} = \frac{r_{i j}}{2} for n + 1 \leq i, j \leq m, i \neq j .

接下来，我们关注

{\tilde{F}}_{11}

和

\tilde{G}

的非对角线部分。将（13）、（16）组合起来，可以通过求解、的 4 × 4 个线性方程组 (17)

{\tilde{f}}_{i j} + {\tilde{f}}_{j i} = r_{i j}

(18)

{\tilde{g}}_{i j} + {\tilde{g}}_{j i} = s_{i j}

(19)

{\tilde{σ}}_{i} {\tilde{f}}_{i j} + {\tilde{σ}}_{j} {\tilde{g}}_{j i} = - t_{j i}

(20)

{\tilde{σ}}_{j} {\tilde{f}}_{j i} + {\tilde{σ}}_{i} {\tilde{g}}_{i j} = - t_{i j}

1 \leq i, j \leq n

来确定

i \neq j

它们。通过将（19）乘以

{\tilde{σ}}_{i}

和（20）乘

{\tilde{σ}}_{j}

以，

{\tilde{σ}}_{i}^{2} {\tilde{f}}_{i j} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} {\tilde{g}}_{j i} = - {\tilde{σ}}_{i} t_{j i},

{\tilde{σ}}_{j}^{2} {\tilde{f}}_{j i} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} {\tilde{g}}_{i j} = - {\tilde{σ}}_{j} t_{i j},

并将

{\tilde{σ}}_{i}^{2} {\tilde{f}}_{i j} + {\tilde{σ}}_{j}^{2} {\tilde{f}}_{j i} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} ({\tilde{g}}_{i j} + {\tilde{g}}_{j i}) = - {\tilde{σ}}_{i} t_{j i} - {\tilde{σ}}_{j} t_{i j} .

（18）插入其中，得到

{\tilde{σ}}_{i}^{2} {\tilde{f}}_{i j} + {\tilde{σ}}_{j}^{2} {\tilde{f}}_{j i} = - {\tilde{σ}}_{i} t_{j i} - {\tilde{σ}}_{j} t_{i j} - {\tilde{σ}}_{i} {\tilde{σ}}_{j} s_{i j} .

将此和（17）组合起来，我们得到

({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) {\tilde{f}}_{i j} = {\tilde{σ}}_{j}^{2} r_{i j} + {\tilde{σ}}_{i} t_{j i} + {\tilde{σ}}_{j} t_{i j} + {\tilde{σ}}_{i} {\tilde{σ}}_{j} s_{i j} = {\tilde{σ}}_{j} (t_{i j} + {\tilde{σ}}_{j} r_{i j}) + {\tilde{σ}}_{i} (t_{j i} + {\tilde{σ}}_{j} s_{i j}) .

同样，使用（17）–（20），我们得到

({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) {\tilde{g}}_{i j} = {\tilde{σ}}_{i} (t_{i j} + {\tilde{σ}}_{j} r_{i j}) + {\tilde{σ}}_{j} (t_{j i} + {\tilde{σ}}_{j} s_{i j}) .

因此， (21)

\begin{matrix} {\tilde{f}}_{i j} = \frac{α_{i j} {\tilde{σ}}_{j} + β_{i j} {\tilde{σ}}_{i}}{{\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}}, {\tilde{g}}_{i j} = \frac{α_{i j} {\tilde{σ}}_{i} + β_{i j} {\tilde{σ}}_{j}}{{\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}} \end{matrix} if {\tilde{σ}}_{i} \neq {\tilde{σ}}_{j} for 1 \leq i, j \leq n, i \neq j,

其中

α_{i j} ≔ t_{i j} + {\tilde{σ}}_{j} r_{i j}

和

β_{i j} ≔ t_{j i} + {\tilde{σ}}_{j} s_{i j}

。此外，结合（15b）、（16）

{\tilde{F}}_{12}

也可以确定为 (22)

{\tilde{f}}_{i j} = - \frac{t_{j i}}{{\tilde{σ}}_{i}} if {\tilde{σ}}_{i} \neq 0 for 1 \leq i \leq n, n + 1 \leq j \leq m .

此外，结合（14b）、（22）

{\tilde{F}}_{21}

被确定为

{\tilde{F}}_{21} = R_{21} - {\tilde{F}}_{12}^{T}

和

{\tilde{f}}_{i j} = r_{i j} - {\tilde{f}}_{j i} = r_{i j} + t_{i j} ∕ {\tilde{σ}}_{j} if {\tilde{σ}}_{j} \neq 0 for n + 1 \leq i \leq m, 1 \leq j \leq n .

最后，

{\tilde{F}}_{22}

可以根据条件（14c）任意确定。因此，我们选择

{\tilde{f}}_{i j}

{\tilde{f}}_{i j} = \frac{r_{i j}}{2} for n + 1 \leq i, j \leq m, i \neq j .

Summarizing the above discussion, we present a refinement algorithm for the SVD of a real matrix in Algorithm 1.
总结上述讨论，我们在算法 1 中提出了一种实矩阵 SVD 的细化算法。

Remark 2 注 2

In Algorithm 1, we assume that

{\tilde{σ}}_{i} \neq {\tilde{σ}}_{j}

for all

(i, j)

. If

{\tilde{σ}}_{i} = {\tilde{σ}}_{j}

for some

(i, j)

, we need some care in a similar way to the treatment for the symmetric eigenvalue problem in [7].
在算法 1 中，

{\tilde{σ}}_{i} \neq {\tilde{σ}}_{j}

我们假设对于所有

(i, j)

.如果

{\tilde{σ}}_{i} = {\tilde{σ}}_{j}

对于某些

(i, j)

，我们需要一些小心，就像 [7] 中对称特征值问题的处理一样。

Remark 3 注 3

Algorithm 1 would not work for the thin SVD unless

C (\hat{U}) = C (U)

, as

C ({\hat{U}}^{'}) \subset C (\hat{U})

at each iteration, where

C (X)

is the column space of a matrix

X

.
算法 1 不适用于瘦 SVD，除非

C (\hat{U}) = C (U)

，就像在每次迭代中一样

C ({\hat{U}}^{'}) \subset C (\hat{U})

，其中

C (X)

是矩阵

X

的列空间。

In the next section, we will discuss the convergence of the proposed algorithms in this section, which is proved to be quadratic.
在下一节中，我们将讨论本节中提出的算法的收敛性，它被证明是二次的。

3. Convergence analysis 3. 收敛分析

Here we prove quadratic convergence of Algorithm 1. Let

ϵ

be defined as in (4). Recall that

F, \tilde{F}, G, \tilde{G}

are obtained from the following equations: (23)

F + F^{T} = R + Δ_{1}, R ≔ I_{m} - {\hat{U}}^{T} \hat{U}, ‖ Δ_{1} ‖ \leq χ (ϵ) ϵ^{2},

(24)

G + G^{T} = S + Δ_{2}, S ≔ I_{n} - {\hat{V}}^{T} \hat{V}, ‖ Δ_{2} ‖ \leq χ (ϵ) ϵ^{2},

(25)

Σ - F^{T} Σ - Σ G = T + Δ_{3}, T ≔ {\hat{U}}^{T} A \hat{V}, ‖ Δ_{3} ‖ \leq χ (ϵ) ‖ A ‖ ϵ^{2},

(26)

\tilde{F} + {\tilde{F}}^{T} = R,

(27)

\tilde{G} + {\tilde{G}}^{T} = S,

(28)

\tilde{Σ} - {\tilde{F}}^{T} \tilde{Σ} - \tilde{Σ} \tilde{G} = T .

在这里，我们证明了算法 1 的二次收敛性。设

ϵ

定义为（4）中。回想一下，

F, \tilde{F}, G, \tilde{G}

从以下方程式中获得： (23)

F + F^{T} = R + Δ_{1}, R ≔ I_{m} - {\hat{U}}^{T} \hat{U}, ‖ Δ_{1} ‖ \leq χ (ϵ) ϵ^{2},

(24)

G + G^{T} = S + Δ_{2}, S ≔ I_{n} - {\hat{V}}^{T} \hat{V}, ‖ Δ_{2} ‖ \leq χ (ϵ) ϵ^{2},

(25)

Σ - F^{T} Σ - Σ G = T + Δ_{3}, T ≔ {\hat{U}}^{T} A \hat{V}, ‖ Δ_{3} ‖ \leq χ (ϵ) ‖ A ‖ ϵ^{2},

(26)

\tilde{F} + {\tilde{F}}^{T} = R,

(27)

\tilde{G} + {\tilde{G}}^{T} = S,

(28)

\tilde{Σ} - {\tilde{F}}^{T} \tilde{Σ} - \tilde{Σ} \tilde{G} = T .

The main difference from the discussion about the symmetric eigenvalue decompositions is that we consider the case of rectangular matrices, i.e.,

m > n

. In connection with this, for

n + 1 \leq i \leq m

, the

i

th columns of

U

are not unique. Hence, we uniquely determine

U

depending on a given

\hat{U}

as follows. Define

U

such that the lower right

(m - n) \times (m - n)

submatrix of

{\hat{U}}^{- 1} U

is symmetric positive definite; see [7, § 3.2] for the proof of its uniqueness. Then,

F_{22}

is symmetric. Moreover, the next lemma can be proved in the same manner as [7, Lemma 3].
与对称特征值分解的讨论的主要区别在于，我们考虑了矩形矩阵的情况，即

m > n

。与此相关，对于

n + 1 \leq i \leq m

，的

i

U

第 th 列不是唯一的。因此，我们根据给定

\hat{U}

的给定来唯一确定

U

，如下所示。定义

U

的右下角

(m - n) \times (m - n)

子矩阵

{\hat{U}}^{- 1} U

是对称的正定矩阵;参见 [7， § 3.2] 以证明其唯一性。然后，

F_{22}

是对称的。此外，下一个引理可以用与 [7，引理 3] 相同的方式证明。

Lemma 1 引理 1

Let

A \in R^{m \times n}

\hat{U} \in R^{m \times m}

, and

\hat{V} \in R^{n \times n}

with

m > n

. In addition, let

U

be a set of orthogonal matrices comprising the normalized left singular vectors of

A

. For

{\hat{U}}^{'} \in R^{m \times m}

obtained by Algorithm1 and any fixed

U_{α} \in U

, we define

F_{α}

such that (29)

U_{α} = {\hat{U}}^{'} (I_{m} + F_{α}) .

In addition, we define

F^{'}

such that (30)

U^{'} = {\hat{U}}^{'} (I_{m} + F^{'}),

where

U^{'} \in R^{m \times m}

comprises normalized left singular vectors such that the lower right

(m - n) \times (m - n)

submatrix of

{\hat{U}^{'}}^{- 1} U^{'}

is symmetric positive definite. Then, we have (31)

‖ F^{'} ‖ \leq 3 ‖ F_{α} ‖ .

设

A \in R^{m \times n}

\hat{U} \in R^{m \times m}

，和

\hat{V} \in R^{n \times n}

和

m > n

。此外，设

U

为一组正交矩阵，其中包含的

A

归一化左奇异向量。对于

{\hat{U}}^{'} \in R^{m \times m}

通过算法1 和任何固定

U_{α} \in U

获得，我们定义

F_{α}

如下 (29)

U_{α} = {\hat{U}}^{'} (I_{m} + F_{α}) .

此外，我们定义

F^{'}

(30)

U^{'} = {\hat{U}}^{'} (I_{m} + F^{'}),

where 包含归一化的左奇异向量，使得的

{\hat{U}^{'}}^{- 1} U^{'}

右

(m - n) \times (m - n)

下子矩阵是对称的正定。

U^{'} \in R^{m \times m}

那么，我们有 (31)

‖ F^{'} ‖ \leq 3 ‖ F_{α} ‖ .

Noting the above lemma, we prove the quadratic convergence. First, we estimate

‖ \tilde{F} - F ‖

and

‖ \tilde{G} - G ‖

in some neighborhood of the solutions.
注意到上述引理，我们证明了二次收敛。首先，我们估计

‖ \tilde{F} - F ‖

并在

‖ \tilde{G} - G ‖

解的某个邻域中。

Lemma 2 引理 2

Suppose

m \geq n

and

m \geq 2

, and define

σ_{n + 1} ≔ 0

for the sake of convenience. Let

ϵ

be defined as in (4). If (32)

ϵ < \frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{30 m ‖ A ‖}

is satisfied, then (33)

ϵ < \frac{1}{60} .

Moreover, letting (34)

η (ϵ) ≔ \frac{2 χ (ϵ)}{(1 - 2 ϵ) (1 - 2 ϵ - χ (ϵ) ϵ^{2})},

we obtain (35)

max (‖ \tilde{F} - F ‖, ‖ \tilde{G} - G ‖) \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) m ‖ A ‖ ϵ^{2}}{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1}) - 2 η (ϵ) ‖ A ‖ ϵ^{2}},

where

χ (ϵ)

in(11) and

η (ϵ)

in (34) satisfy (36)

χ (ϵ) \leq 3.068 \dots, η (ϵ) \leq 6.572 \dots .

假设

m \geq n

和

m \geq 2

，并为方便起见定义

σ_{n + 1} ≔ 0

。设

ϵ

定义为 in（4）。如果 (32)

ϵ < \frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{30 m ‖ A ‖}

满足，则 (33)

ϵ < \frac{1}{60} .

此外，让我们 (34)

η (ϵ) ≔ \frac{2 χ (ϵ)}{(1 - 2 ϵ) (1 - 2 ϵ - χ (ϵ) ϵ^{2})},

得到 (35)

max (‖ \tilde{F} - F ‖, ‖ \tilde{G} - G ‖) \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) m ‖ A ‖ ϵ^{2}}{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1}) - 2 η (ϵ) ‖ A ‖ ϵ^{2}},

其中

χ (ϵ)

in（11）和

η (ϵ)

in（34）满足 (36)

χ (ϵ) \leq 3.068 \dots, η (ϵ) \leq 6.572 \dots .

Proof 证明

Since we have (33) from

ϵ < \frac{1}{30} \cdot \frac{1}{m} \cdot \frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{‖ A ‖} \leq \frac{1}{30} \cdot \frac{1}{2} \cdot 1 = \frac{1}{60}

in (32), it is easy to see that

χ (ϵ)

in (11) and

η (ϵ)

satisfy (36).
由于我们在（32）中有

ϵ < \frac{1}{30} \cdot \frac{1}{m} \cdot \frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{‖ A ‖} \leq \frac{1}{30} \cdot \frac{1}{2} \cdot 1 = \frac{1}{60}

（33），因此很容易看出

χ (ϵ)

in （11）并

η (ϵ)

满足（36）。

First of all, we estimate the diagonal elements of

F - \tilde{F}

. From (23), (26), we have (37)

(F - \tilde{F}) + {(F - \tilde{F})}^{T} = Δ_{1}, ‖ Δ_{1} ‖ \leq χ (ϵ) ϵ^{2} .

In addition, we see (38)

(G - \tilde{G}) + {(G - \tilde{G})}^{T} = Δ_{2}, ‖ Δ_{2} ‖ \leq χ (ϵ) ϵ^{2}

in the same manner as (37). Therefore, we obtain (39)

max (| f_{i i} - {\tilde{f}}_{i i} |, | g_{i i} - {\tilde{g}}_{i i} |) \leq \frac{χ (ϵ)}{2} ϵ^{2} for 1 \leq i \leq n, | f_{i i} - {\tilde{f}}_{i i} | \leq \frac{χ (ϵ)}{2} ϵ^{2} for n + 1 \leq i \leq m .

首先，我们估计的

F - \tilde{F}

对角线元素。从（23）、（26）中，我们有 (37)

(F - \tilde{F}) + {(F - \tilde{F})}^{T} = Δ_{1}, ‖ Δ_{1} ‖ \leq χ (ϵ) ϵ^{2} .

此外，我们以与（37）相同的方式看到 (38)

(G - \tilde{G}) + {(G - \tilde{G})}^{T} = Δ_{2}, ‖ Δ_{2} ‖ \leq χ (ϵ) ϵ^{2}

。因此，我们获得 (39)

max (| f_{i i} - {\tilde{f}}_{i i} |, | g_{i i} - {\tilde{g}}_{i i} |) \leq \frac{χ (ϵ)}{2} ϵ^{2} for 1 \leq i \leq n, | f_{i i} - {\tilde{f}}_{i i} | \leq \frac{χ (ϵ)}{2} ϵ^{2} for n + 1 \leq i \leq m .

Next, we estimate

\tilde{Σ} - Σ

. From (25), (28),

\tilde{Σ}

and

Σ

are determined as

{\tilde{σ}}_{i} = t_{i i} ∕ (1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i})

and

σ_{i} = (t_{i i} - Δ_{3} (i, i)) ∕ (1 - f_{i i} - g_{i i})

. Thus, from easy calculations,

{\tilde{σ}}_{i} - σ_{i} = \frac{t_{i i} (1 - f_{i i} - g_{i i}) - t_{i i} (1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i})}{(1 - f_{i i} - g_{i i}) (1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i})} + \frac{Δ_{3} (i, i)}{1 - f_{i i} - g_{i i}} = - \frac{t_{i i} (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i})}{(1 - f_{i i} - g_{i i}) (1 - f_{i i} - g_{i i} + (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i}))} + \frac{Δ_{3} (i, i)}{1 - f_{i i} - g_{i i}} .

On the right hand side, we have (40)

| \frac{Δ_{3} (i, i)}{1 - f_{i i} - g_{i i}} | \leq \frac{χ (ϵ) ‖ A ‖ ϵ^{2}}{1 - 2 ϵ} .

In addition,

| t_{i i} - σ_{i} | \leq ‖ A ‖ (2 ϵ + χ (ϵ) ϵ^{2})

from (25). It then follows that

| t_{i i} | \leq ‖ A ‖ (1 + 2 ϵ + χ (ϵ) ϵ^{2}) .

Hence, it is easy to see that (41)

| \frac{t_{i i} (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i})}{(1 - f_{i i} - g_{i i}) (1 - f_{i i} - g_{i i} + (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i}))} | \leq \frac{χ (ϵ) ‖ A ‖ (1 + 2 ϵ + χ (ϵ) ϵ^{2}) ϵ^{2}}{(1 - 2 ϵ) (1 - 2 ϵ - χ (ϵ) ϵ^{2})} .

Using (34), we have (42)

| {\tilde{σ}}_{i} - σ_{i} | < η (ϵ) ‖ A ‖ ϵ^{2} for 1 \leq i \leq n .

In addition, we see

{\tilde{σ}}_{i} > 0

for

i = 1, \dots, n

in view of (43)

{\tilde{σ}}_{i} > σ_{i} + η (ϵ) ‖ A ‖ ϵ^{2} > 30 m ‖ A ‖ ϵ + η (ϵ) ‖ A ‖ ϵ^{2} = (30 m - η (ϵ) ϵ) ‖ A ‖ ϵ > 0

from (42), (32), and (36).
接下来，我们估计

\tilde{Σ} - Σ

。从（25）、（28）

\tilde{Σ}

和

Σ

确定为

{\tilde{σ}}_{i} = t_{i i} ∕ (1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i})

和

σ_{i} = (t_{i i} - Δ_{3} (i, i)) ∕ (1 - f_{i i} - g_{i i})

。因此，从简单的计算中，

{\tilde{σ}}_{i} - σ_{i} = \frac{t_{i i} (1 - f_{i i} - g_{i i}) - t_{i i} (1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i})}{(1 - f_{i i} - g_{i i}) (1 - {\tilde{f}}_{i i} - {\tilde{g}}_{i i})} + \frac{Δ_{3} (i, i)}{1 - f_{i i} - g_{i i}} = - \frac{t_{i i} (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i})}{(1 - f_{i i} - g_{i i}) (1 - f_{i i} - g_{i i} + (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i}))} + \frac{Δ_{3} (i, i)}{1 - f_{i i} - g_{i i}} .

在右侧，我们有 (40)

| \frac{Δ_{3} (i, i)}{1 - f_{i i} - g_{i i}} | \leq \frac{χ (ϵ) ‖ A ‖ ϵ^{2}}{1 - 2 ϵ} .

此外，

| t_{i i} - σ_{i} | \leq ‖ A ‖ (2 ϵ + χ (ϵ) ϵ^{2})

来自（25）。因此

| t_{i i} | \leq ‖ A ‖ (1 + 2 ϵ + χ (ϵ) ϵ^{2}) .

，很容易看出 (41)

| \frac{t_{i i} (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i})}{(1 - f_{i i} - g_{i i}) (1 - f_{i i} - g_{i i} + (f_{i i} - {\tilde{f}}_{i i} + g_{i i} - {\tilde{g}}_{i i}))} | \leq \frac{χ (ϵ) ‖ A ‖ (1 + 2 ϵ + χ (ϵ) ϵ^{2}) ϵ^{2}}{(1 - 2 ϵ) (1 - 2 ϵ - χ (ϵ) ϵ^{2})} .

使用（34），我们有 (42)

| {\tilde{σ}}_{i} - σ_{i} | < η (ϵ) ‖ A ‖ ϵ^{2} for 1 \leq i \leq n .

此外，我们从 (43)

{\tilde{σ}}_{i} > σ_{i} + η (ϵ) ‖ A ‖ ϵ^{2} > 30 m ‖ A ‖ ϵ + η (ϵ) ‖ A ‖ ϵ^{2} = (30 m - η (ϵ) ϵ) ‖ A ‖ ϵ > 0

（42）、（32）和（36）中看到

{\tilde{σ}}_{i} > 0

for

i = 1, \dots, n

。

In what follows, we estimate the off-diagonal elements of

\tilde{F}

and

\tilde{G}

. Combining (25) with (42), we have (44)

\tilde{Σ} - F^{T} \tilde{Σ} - \tilde{Σ} G = T + {\tilde{Δ}}_{5},

where (45)

| {\tilde{Δ}}_{5} (i, j) | \leq (χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2} for i \neq j .

In addition, from (28), (46)

{(F - \tilde{F})}^{T} \tilde{Σ} + \tilde{Σ} (G - \tilde{G}) = - {\tilde{Δ}}_{5}

holds. Using (37), (38), and (46), we estimate the off-diagonal elements of

\tilde{F}

and

\tilde{G}

.
在下文中，我们估计

\tilde{F}

和

\tilde{G}

的非对角线元素。将（25）与（42）组合，我们得到 (44)

\tilde{Σ} - F^{T} \tilde{Σ} - \tilde{Σ} G = T + {\tilde{Δ}}_{5},

其中 (45)

| {\tilde{Δ}}_{5} (i, j) | \leq (χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2} for i \neq j .

此外，从（28）中， (46)

{(F - \tilde{F})}^{T} \tilde{Σ} + \tilde{Σ} (G - \tilde{G}) = - {\tilde{Δ}}_{5}

成立。使用（37）、（38）和（46），我们估计

\tilde{F}

和

\tilde{G}

的非对角线元素。

Recall

{\tilde{f}}_{i j} = {\tilde{f}}_{j i} = r_{i j} ∕ 2

for

n + 1 \leq i, j \leq m

in the proposed algorithm. Hence, from (37), (47)

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{χ (ϵ)}{2} ϵ^{2} for n + 1 \leq i, j \leq m .

Next, for

1 \leq i \leq n, n + 1 \leq j \leq m

, from the bottom part of (46), we have (48)

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{‖ A ‖}{\tilde{σ_{i}}} (χ (ϵ) + 2 η (ϵ) ϵ) ϵ^{2} \leq \frac{(χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2}}{σ_{i} - η (ϵ) ‖ A ‖ ϵ^{2}} .

Combining this with (37), for

n + 1 \leq i \leq m, 1 \leq j \leq n

, we have (49)

| {\tilde{f}}_{i j} - f_{i j} | \leq χ (ϵ) ϵ^{2} + \frac{‖ A ‖}{\tilde{σ_{j}}} (χ (ϵ) + 2 η (ϵ) ϵ) ϵ^{2} \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ - χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{σ_{j} - η (ϵ) ‖ A ‖ ϵ^{2}} .

Moreover, for

1 \leq i, j \leq n, i \neq j

, we have (50)

(f_{i j} - {\tilde{f}}_{i j}) + (f_{j i} - {\tilde{f}}_{j i}) = ϵ_{1, i j}, | ϵ_{1, i j} | \leq χ (ϵ) ϵ^{2},

(51)

(g_{i j} - {\tilde{g}}_{i j}) + (g_{j i} - {\tilde{g}}_{j i}) = ϵ_{2, i j}, | ϵ_{2, i j} | \leq χ (ϵ) ϵ^{2},

(52)

{\tilde{σ}}_{i} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{j} (g_{j i} - {\tilde{g}}_{j i}) = ϵ_{3, i j}, | ϵ_{3, i j} | \leq (χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2}

from (37), (38), (45), and (46).
Recall

{\tilde{f}}_{i j} = {\tilde{f}}_{j i} = r_{i j} ∕ 2

for

n + 1 \leq i, j \leq m

在建议的算法中。因此，从（37）开始， (47)

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{χ (ϵ)}{2} ϵ^{2} for n + 1 \leq i, j \leq m .

接下来，对于

1 \leq i \leq n, n + 1 \leq j \leq m

，从（46）的底部，我们有 (48)

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{‖ A ‖}{\tilde{σ_{i}}} (χ (ϵ) + 2 η (ϵ) ϵ) ϵ^{2} \leq \frac{(χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2}}{σ_{i} - η (ϵ) ‖ A ‖ ϵ^{2}} .

Combine this 与（37），对于

n + 1 \leq i \leq m, 1 \leq j \leq n

，我们有 (49)

| {\tilde{f}}_{i j} - f_{i j} | \leq χ (ϵ) ϵ^{2} + \frac{‖ A ‖}{\tilde{σ_{j}}} (χ (ϵ) + 2 η (ϵ) ϵ) ϵ^{2} \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ - χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{σ_{j} - η (ϵ) ‖ A ‖ ϵ^{2}} .

此外，对于

1 \leq i, j \leq n, i \neq j

，我们有 (50)

(f_{i j} - {\tilde{f}}_{i j}) + (f_{j i} - {\tilde{f}}_{j i}) = ϵ_{1, i j}, | ϵ_{1, i j} | \leq χ (ϵ) ϵ^{2},

(51)

(g_{i j} - {\tilde{g}}_{i j}) + (g_{j i} - {\tilde{g}}_{j i}) = ϵ_{2, i j}, | ϵ_{2, i j} | \leq χ (ϵ) ϵ^{2},

(52)

{\tilde{σ}}_{i} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{j} (g_{j i} - {\tilde{g}}_{j i}) = ϵ_{3, i j}, | ϵ_{3, i j} | \leq (χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2}

来自（37）、（38）、（45）和（46）。

Similarly to (21), all of

f_{i j} - {\tilde{f}}_{i j}

and

g_{i j} - {\tilde{g}}_{i j}

are calculated as follows. By multiplying (52) by

{\tilde{σ}}_{i}

{\tilde{σ}}_{i}^{2} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{i} {\tilde{σ}}_{j} (g_{j i} - {\tilde{g}}_{j i}) = {\tilde{σ}}_{i} ϵ_{3, i j}, {\tilde{σ}}_{j}^{2} (f_{j i} - {\tilde{f}}_{j i}) + {\tilde{σ}}_{i} {\tilde{σ}}_{j} (g_{i j} - {\tilde{g}}_{i j}) = {\tilde{σ}}_{j} ϵ_{3, j i},

where the second equation is due to the symmetry of

i

and

j

. Thus,

{\tilde{σ}}_{i}^{2} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{j}^{2} (f_{j i} - {\tilde{f}}_{j i}) + {\tilde{σ}}_{i} {\tilde{σ}}_{j} ((g_{i j} - {\tilde{g}}_{i j}) + (g_{j i} - {\tilde{g}}_{j i})) = {\tilde{σ}}_{i} ϵ_{3, i j} + {\tilde{σ}}_{j} ϵ_{3, j i} .

Inserting (51) into this yields

{\tilde{σ}}_{i}^{2} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{j}^{2} (f_{j i} - {\tilde{f}}_{j i}) = {\tilde{σ}}_{i} ϵ_{3, i j} + {\tilde{σ}}_{j} ϵ_{3, j i} - {\tilde{σ}}_{i} {\tilde{σ}}_{j} ϵ_{2, j i} .

Combining this and (50), we have

({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) (f_{j i} - {\tilde{f}}_{j i}) = {\tilde{σ}}_{i} ϵ_{3, i j} + {\tilde{σ}}_{j} ϵ_{3, j i} - {\tilde{σ}}_{i} {\tilde{σ}}_{j} ϵ_{2, j i} - {\tilde{σ}}_{i}^{2} ϵ_{1, i j} .

Thus, noting

{\tilde{σ}}_{i} > 0 (i = 1, \dots, n)

as in (43), we have

| ({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) (f_{j i} - {\tilde{f}}_{j i}) | \leq {\tilde{σ}}_{i} | ϵ_{3, i j} | + {\tilde{σ}}_{j} | ϵ_{3, j i} | + {\tilde{σ}}_{i} {\tilde{σ}}_{j} | ϵ_{2, j i} | + {\tilde{σ}}_{i}^{2} | ϵ_{1, i j} | \leq ({\tilde{σ}}_{i} + {\tilde{σ}}_{j}) (χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2} + {\tilde{σ}}_{i} ({\tilde{σ}}_{i} + {\tilde{σ}}_{j}) χ (ϵ) ϵ^{2} \leq ({\tilde{σ}}_{i} + {\tilde{σ}}_{j}) ((χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2} + (‖ A ‖ + η (ϵ) ‖ A ‖ ϵ^{2}) χ (ϵ) ϵ^{2}),

where the second inequality is due to (50), (51), and (52), and the third inequality is due to (42) and

σ_{i} \leq ‖ A ‖

. Therefore, for

1 \leq i, j \leq n, i \neq j

, we obtain (53)

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2} ({\tilde{σ}}_{i} + {\tilde{σ}}_{j})}{| {\tilde{σ}}_{i}^{2} - {\tilde{σ}}_{j}^{2} |} \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{| σ_{i} - σ_{j} | - 2 η (ϵ) ‖ A ‖ ϵ^{2}},

where the second inequality is due to (42). Similarly, for

1 \leq i, j \leq n, i \neq j

, we have (54)

| {\tilde{g}}_{i j} - g_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{| σ_{i} - σ_{j} | - 2 η (ϵ) ‖ A ‖ ϵ^{2}} .

与（21）类似，所有和

f_{i j} - {\tilde{f}}_{i j}

g_{i j} - {\tilde{g}}_{i j}

的计算方式如下。通过将（52）乘以

{\tilde{σ}}_{i}

，

{\tilde{σ}}_{i}^{2} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{i} {\tilde{σ}}_{j} (g_{j i} - {\tilde{g}}_{j i}) = {\tilde{σ}}_{i} ϵ_{3, i j}, {\tilde{σ}}_{j}^{2} (f_{j i} - {\tilde{f}}_{j i}) + {\tilde{σ}}_{i} {\tilde{σ}}_{j} (g_{i j} - {\tilde{g}}_{i j}) = {\tilde{σ}}_{j} ϵ_{3, j i},

其中第二个方程是由于和

j

的

i

对称性。因此，

{\tilde{σ}}_{i}^{2} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{j}^{2} (f_{j i} - {\tilde{f}}_{j i}) + {\tilde{σ}}_{i} {\tilde{σ}}_{j} ((g_{i j} - {\tilde{g}}_{i j}) + (g_{j i} - {\tilde{g}}_{j i})) = {\tilde{σ}}_{i} ϵ_{3, i j} + {\tilde{σ}}_{j} ϵ_{3, j i} .

将（51）代入其中，得到

{\tilde{σ}}_{i}^{2} (f_{i j} - {\tilde{f}}_{i j}) + {\tilde{σ}}_{j}^{2} (f_{j i} - {\tilde{f}}_{j i}) = {\tilde{σ}}_{i} ϵ_{3, i j} + {\tilde{σ}}_{j} ϵ_{3, j i} - {\tilde{σ}}_{i} {\tilde{σ}}_{j} ϵ_{2, j i} .

将 this 和（50）组合起来，我们得到

({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) (f_{j i} - {\tilde{f}}_{j i}) = {\tilde{σ}}_{i} ϵ_{3, i j} + {\tilde{σ}}_{j} ϵ_{3, j i} - {\tilde{σ}}_{i} {\tilde{σ}}_{j} ϵ_{2, j i} - {\tilde{σ}}_{i}^{2} ϵ_{1, i j} .

因此，

{\tilde{σ}}_{i} > 0 (i = 1, \dots, n)

注意在（43）中，我们有

| ({\tilde{σ}}_{j}^{2} - {\tilde{σ}}_{i}^{2}) (f_{j i} - {\tilde{f}}_{j i}) | \leq {\tilde{σ}}_{i} | ϵ_{3, i j} | + {\tilde{σ}}_{j} | ϵ_{3, j i} | + {\tilde{σ}}_{i} {\tilde{σ}}_{j} | ϵ_{2, j i} | + {\tilde{σ}}_{i}^{2} | ϵ_{1, i j} | \leq ({\tilde{σ}}_{i} + {\tilde{σ}}_{j}) (χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2} + {\tilde{σ}}_{i} ({\tilde{σ}}_{i} + {\tilde{σ}}_{j}) χ (ϵ) ϵ^{2} \leq ({\tilde{σ}}_{i} + {\tilde{σ}}_{j}) ((χ (ϵ) + 2 η (ϵ) ϵ) ‖ A ‖ ϵ^{2} + (‖ A ‖ + η (ϵ) ‖ A ‖ ϵ^{2}) χ (ϵ) ϵ^{2}),

第二个不等式是由于（50）、（51）和（52）造成的，第三个不等式是由于（42）和

σ_{i} \leq ‖ A ‖

.因此，对于

1 \leq i, j \leq n, i \neq j

，我们得到 (53)

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2} ({\tilde{σ}}_{i} + {\tilde{σ}}_{j})}{| {\tilde{σ}}_{i}^{2} - {\tilde{σ}}_{j}^{2} |} \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{| σ_{i} - σ_{j} | - 2 η (ϵ) ‖ A ‖ ϵ^{2}},

第二个不等式是由于（42）引起的。同样，对于

1 \leq i, j \leq n, i \neq j

，我们有 (54)

| {\tilde{g}}_{i j} - g_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{| σ_{i} - σ_{j} | - 2 η (ϵ) ‖ A ‖ ϵ^{2}} .

From (39), (47), (48), (49), (53), and (54), we have

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{min_{1 \leq k \leq n} (σ_{k} - σ_{k + 1}) - 2 η (ϵ) ‖ A ‖ ϵ^{2}} for 1 \leq i, j \leq m, | {\tilde{g}}_{i j} - g_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{min_{1 \leq k \leq n} (σ_{k} - σ_{k + 1}) - 2 η (ϵ) ‖ A ‖ ϵ^{2}} for 1 \leq i, j \leq n .

In view of

{‖ \tilde{F} - F ‖}^{2} \leq \sum_{i, j} {| {\tilde{f}}_{i j} - f_{i j} |}^{2}

and

{‖ \tilde{G} - G ‖}^{2} \leq \sum_{i, j} {| {\tilde{g}}_{i j} - g_{i j} |}^{2}

, we obtain (35). □
从（39）、（47）、（48）、（49）、（53）和（54）中，我们有

| {\tilde{f}}_{i j} - f_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{min_{1 \leq k \leq n} (σ_{k} - σ_{k + 1}) - 2 η (ϵ) ‖ A ‖ ϵ^{2}} for 1 \leq i, j \leq m, | {\tilde{g}}_{i j} - g_{i j} | \leq \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ‖ A ‖ ϵ^{2}}{min_{1 \leq k \leq n} (σ_{k} - σ_{k + 1}) - 2 η (ϵ) ‖ A ‖ ϵ^{2}} for 1 \leq i, j \leq n .

鉴于

{‖ \tilde{F} - F ‖}^{2} \leq \sum_{i, j} {| {\tilde{f}}_{i j} - f_{i j} |}^{2}

和

{‖ \tilde{G} - G ‖}^{2} \leq \sum_{i, j} {| {\tilde{g}}_{i j} - g_{i j} |}^{2}

，我们得到（35）。 □

From Lemma 2, the next lemma is readily accessible.
从引理 2 开始，下一个引理很容易获得。

Lemma 3 引理 3

Under the same assumption as in Lemma 2, we obtain (55)

max (‖ \tilde{F} - F ‖, ‖ \tilde{G} - G ‖) < \frac{65}{300} ϵ,

(56)

\underset{ϵ \to 0}{lim sup} \frac{max (‖ \tilde{F} - F ‖, ‖ \tilde{G} - G ‖)}{ϵ^{2}} \leq \frac{6 m ‖ A ‖}{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})} .

在与引理 2 相同的假设下，我们得到 (55)

max (‖ \tilde{F} - F ‖, ‖ \tilde{G} - G ‖) < \frac{65}{300} ϵ,

(56)

\underset{ϵ \to 0}{lim sup} \frac{max (‖ \tilde{F} - F ‖, ‖ \tilde{G} - G ‖)}{ϵ^{2}} \leq \frac{6 m ‖ A ‖}{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})} .

Proof 证明

Noting (36), we have (57)

2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2} \leq 6.360 \dots .

Therefore, we see (58)

‖ \tilde{F} - F ‖ < \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ϵ}{30 (\frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{30 m ‖ A ‖ ϵ} - \frac{2 η (ϵ) ϵ}{30 m})} < \frac{6.4 ϵ}{30 (1 - \frac{7}{1800})} < \frac{65}{300} ϵ .

Since

‖ \tilde{G} - G ‖ < 65 ϵ ∕ 300

also holds, we have (55). Combining (35) with

χ (0) = 3

, we obtain (56). □
注意（36），我们有 (57)

2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2} \leq 6.360 \dots .

因此，我们看到 (58)

‖ \tilde{F} - F ‖ < \frac{(2 χ (ϵ) + 2 η (ϵ) ϵ + χ (ϵ) η (ϵ) ϵ^{2}) ϵ}{30 (\frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{30 m ‖ A ‖ ϵ} - \frac{2 η (ϵ) ϵ}{30 m})} < \frac{6.4 ϵ}{30 (1 - \frac{7}{1800})} < \frac{65}{300} ϵ .

Since

‖ \tilde{G} - G ‖ < 65 ϵ ∕ 300

也成立，我们有（55）。将（35）与

χ (0) = 3

组合在一起，我们得到（56）。 □

On the basis of the above lemmas, we obtain the main theorem that states the quadratic convergence.
根据上述引理，我们得到了陈述二次收敛的主定理。

Theorem 1 定理 1

Let

A \in R^{m \times n}

\hat{U} \in R^{m \times m}

, and

\hat{V} \in R^{n \times n}

with

m \geq n

and

m \geq 2

. Define

σ_{n + 1} ≔ 0

for the sake of convenience. Define

ϵ ≔ max (‖ F ‖, ‖ G ‖)

with

F

G

satisfying

U = \hat{U} (I_{m} + F), V = \hat{V} (I_{n} + G)

. Similarly, define

ϵ^{'} ≔ max (‖ F^{'} ‖, ‖ G^{'} ‖)

with

F^{'}

G^{'}

satisfying

U^{'} = {\hat{U}}^{'} (I_{m} + F^{'})

V = {\hat{V}}^{'} (I_{n} + G^{'})

, where

{\hat{U}}^{'}

{\hat{V}}^{'}

are obtained in Algorithm 1. If (59)

ϵ < \frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{30 m ‖ A ‖}

is satisfied, then (60)

ϵ^{'} < \frac{7}{10} ϵ,

(61)

\underset{ϵ \to 0}{lim sup} \frac{ϵ^{'}}{ϵ^{2}} \leq \frac{18 m ‖ A ‖}{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})} .

设

A \in R^{m \times n}

\hat{U} \in R^{m \times m}

，和

\hat{V} \in R^{n \times n}

和

m \geq n

和

m \geq 2

。为方便起见，定义

σ_{n + 1} ≔ 0

。定义

ϵ ≔ max (‖ F ‖, ‖ G ‖)

F

G

满足

U = \hat{U} (I_{m} + F), V = \hat{V} (I_{n} + G)

.同样，使用 satisfying

U^{'} = {\hat{U}}^{'} (I_{m} + F^{'})

V = {\hat{V}}^{'} (I_{n} + G^{'})

定义

F^{'}

G^{'}

ϵ^{'} ≔ max (‖ F^{'} ‖, ‖ G^{'} ‖)

，其中

{\hat{U}}^{'}

{\hat{V}}^{'}

在算法 1 中获得。如果 (59)

ϵ < \frac{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})}{30 m ‖ A ‖}

满足，则 (60)

ϵ^{'} < \frac{7}{10} ϵ,

(61)

\underset{ϵ \to 0}{lim sup} \frac{ϵ^{'}}{ϵ^{2}} \leq \frac{18 m ‖ A ‖}{min_{1 \leq i \leq n} (σ_{i} - σ_{i + 1})} .

Proof 证明

Define

F_{α}

such that

U = {\hat{U}}^{'} (I_{m} + F_{α})

. Noting

{\hat{U}}^{'} (I_{m} + F_{α}) = \hat{U} (I_{m} + F)

and

{\hat{U}}^{'} = \hat{U} (I_{m} + \tilde{F})

, we have

{\hat{U}}^{'} F_{α} = \hat{U} (I_{m} + F) - {\hat{U}}^{'} = \hat{U} (F - \tilde{F}) = {\hat{U}}^{'} {(I_{m} + \tilde{F})}^{- 1} (F - \tilde{F}) .

It then follows that (62)

F_{α} = {(I_{m} + \tilde{F})}^{- 1} (F - \tilde{F}) .

Noting (55) and

‖ \tilde{F} ‖ \leq ‖ \tilde{F} - F ‖ + ‖ F ‖ < 2 ϵ < 1 ∕ 30

from (33), we have (63)

‖ F_{α} ‖ \leq \frac{‖ F - \tilde{F} ‖}{1 - ‖ \tilde{F} ‖} \leq \frac{\frac{65}{300} ϵ}{1 - \frac{1}{30}} < \frac{7}{30} ϵ .

In Lemma 1, letting

U_{α} ≔ U

, we see (64)

‖ F^{'} ‖ \leq 3 ‖ F_{α} ‖ .

Thus we obtain

‖ F^{'} ‖ < \frac{7}{10} ‖ F ‖ .

Regarding

G^{'}

, it is easy to see that

‖ G^{'} ‖ \leq \frac{‖ G - \tilde{G} ‖}{1 - ‖ \tilde{G} ‖} < \frac{7}{30} ϵ

in the same manner as (63). Therefore, we obtain (60). Moreover, using (56), (62), and (64), we obtain (61). □
定义

F_{α}

，使

U = {\hat{U}}^{'} (I_{m} + F_{α})

.注意

{\hat{U}}^{'} (I_{m} + F_{α}) = \hat{U} (I_{m} + F)

和

{\hat{U}}^{'} = \hat{U} (I_{m} + \tilde{F})

，我们得到

{\hat{U}}^{'} F_{α} = \hat{U} (I_{m} + F) - {\hat{U}}^{'} = \hat{U} (F - \tilde{F}) = {\hat{U}}^{'} {(I_{m} + \tilde{F})}^{- 1} (F - \tilde{F}) .

然后， (62)

F_{α} = {(I_{m} + \tilde{F})}^{- 1} (F - \tilde{F}) .

注意到（55）并从

‖ \tilde{F} ‖ \leq ‖ \tilde{F} - F ‖ + ‖ F ‖ < 2 ϵ < 1 ∕ 30

（33）中，我们有 (63)

‖ F_{α} ‖ \leq \frac{‖ F - \tilde{F} ‖}{1 - ‖ \tilde{F} ‖} \leq \frac{\frac{65}{300} ϵ}{1 - \frac{1}{30}} < \frac{7}{30} ϵ .

在引理 1 中，让

U_{α} ≔ U

，我们看到 (64)

‖ F^{'} ‖ \leq 3 ‖ F_{α} ‖ .

因此我们得到

‖ F^{'} ‖ < \frac{7}{10} ‖ F ‖ .

关于

G^{'}

，很容易看出它与

‖ G^{'} ‖ \leq \frac{‖ G - \tilde{G} ‖}{1 - ‖ \tilde{G} ‖} < \frac{7}{30} ϵ

（63）相同。因此，我们得到（60）。此外，使用（56）、（62）和（64），我们得到（61）。 □

Remark 4 注 4

From (42), singular values are convergent, where the rate can be estimated by

η (ϵ) ‖ A ‖ ϵ^{2}

that is quadratically convergent.
从（42）中，奇异值是收敛的，其中速率可以通过

η (ϵ) ‖ A ‖ ϵ^{2}

来估计是二次收敛的。

4. Numerical results 4. 数值结果

We present numerical results to demonstrate the effectiveness of the proposed algorithm (Algorithm 1). The numerical experiments were conducted using MATLAB R2017b on a PC with 2.5 GHz Intel Core i7 and 16 GB of main memory. To realize multiple-precision arithmetic, we adopt Advanpix Multiprecision Computing Toolbox version 4.6.0 [15], which utilizes well-known, fast, and reliable multiple-precision arithmetic libraries including GMP and MPFR. In the multiple-precision toolbox, we can control the arithmetic precision

d

in decimal digits using the command mp.Digits (

d

).
我们提供了数值结果来证明所提出的算法的有效性（算法 1）。数值实验是在配备 2.5 GHz Intel Core i7 和 16 GB 主内存的 PC 上使用 MATLAB R2017b 进行的。为了实现多精度算术，我们采用了 Advanpix Multiprecision Computing Toolbox 版本 4.6.0 [15]，它利用了众所周知、快速且可靠的多精度算术库，包括 GMP 和 MPFR。在多精度工具箱中，我们可以使用命令 mp 控制以十进制数字为单位的算术精度

d

。数字（

d

）。

4.1. Convergence property
4.1. 收敛属性

First, we confirm the convergence property of the proposed algorithm for various singular value distributions. We generate

m \times n

rectangular real matrices using Higham’s randsvd [16] by the following MATLAB command.

The singular value distribution and condition number of

A

can be controlled by the input arguments

m o d e \in {1, 2, 3, 4, 5}

and

c n d ≕ α \geq 1

, as follows:
首先，我们确认了所提出的算法对各种奇异值分布的收敛特性。我们使用以下 MATLAB 命令使用 Higham 的 randsvd [16] 生成

m \times n

矩形实矩阵。

的奇异值分布和条件数

A

可以通过输入参数

m o d e \in {1, 2, 3, 4, 5}

和

c n d ≕ α \geq 1

来控制，如下所示：

1.
one large: $σ_{1} \approx 1$ , $σ_{i} \approx α^{- 1}$ , $i = 2, \dots, n$
一个大： $σ_{1} \approx 1$ ， $σ_{i} \approx α^{- 1}$ ， $i = 2, \dots, n$
2.
one small: $σ_{n} \approx α^{- 1}$ , $σ_{i} \approx 1$ , $i = 1, \dots, n - 1$
一小： $σ_{n} \approx α^{- 1}$ ， $σ_{i} \approx 1$ ， $i = 1, \dots, n - 1$
3.
geometrically distributed: $σ_{i} \approx α^{- (i - 1) ∕ (n - 1)}$ , $i = 1, \dots, n$
几何分布： $σ_{i} \approx α^{- (i - 1) ∕ (n - 1)}$ ， $i = 1, \dots, n$
4.
arithmetically distributed: $σ_{i} \approx 1 - (1 - α^{- 1}) (i - 1) ∕ (n - 1)$ , $i = 1, \dots, n$
算术分布： $σ_{i} \approx 1 - (1 - α^{- 1}) (i - 1) ∕ (n - 1)$ ， $i = 1, \dots, n$
5.
random with uniformly distributed logarithm: $σ_{i} \approx α^{- r (i)}$ , $i = 1, \dots, n$ , where $r (i)$ are pseudo-random values drawn from the standard uniform distribution on $(0, 1)$ .
具有均匀分布对数的 random： $σ_{i} \approx α^{- r (i)}$ ， $i = 1, \dots, n$ ，其中 $r (i)$ 是从上的 $(0, 1)$ 标准均匀分布中提取的伪随机值。

Here,

κ (A) \approx c n d

for

c n d < u^{- 1} \approx 1 0^{16}

. Note that for

m o d e \in {1, 2}

, there is a cluster of singular values.
在这里，

κ (A) \approx c n d

对于

c n d < u^{- 1} \approx 1 0^{16}

.请注意，对于

m o d e \in {1, 2}

，存在一组奇异值。

We start with small examples such as

m = 10

and

n = 5

to observe the convergence behavior of the algorithm. Moreover, we set

c n d = 1 0^{8}

to generate moderately ill-conditioned problems in binary64. We compute

U^{(0)}

V^{(0)}

as initial approximate left and right singular vector matrices using the MATLAB function svd for the singular value decomposition in binary64 arithmetic. To see the behavior of the proposed algorithm precisely, we use multiple-precision arithmetic with sufficiently long precision to simulate the exact arithmetic in the algorithm. Then, we expect that Algorithm 1 (RefSVD) works effectively for

m o d e \in {3, 4, 5}

, but does not for

m o d e \in {1, 2}

. For reference, we also use the built-in function svd in the multiple-precision toolbox to compute the singular values

σ_{i}

i = 1, 2, \dots, n

. The results are shown in Fig. 1, which provides

{max}_{1 \leq i \leq n} | {\hat{σ}}_{i} - σ_{i} | ∕ | σ_{i} |

as the maximum relative error of the computed singular values

{\hat{σ}}_{i}

max (‖ R ‖, ‖ S ‖)

where

R ≔ I - {\hat{U}}^{T} \hat{U}

and

S ≔ I - {\hat{V}}^{T} \hat{V}

as the orthogonality of computed left and right singular vector matrices,

‖ offdiag ({\hat{U}}^{T} A \hat{V}) ‖ ∕ ‖ A ‖

as the diagonality of

{\hat{U}}^{T} A \hat{V}

, and

max (‖ \tilde{F} ‖, ‖ \tilde{G} ‖)

where

\tilde{F}

and

\tilde{G}

are computed in Algorithm 1. Here,

offdiag (\cdot)

denotes the off-diagonal part. The horizontal axis shows the number of iterations

ν

of Algorithm 1.
我们从

m = 10

和

n = 5

等小例子开始，观察算法的收敛行为。此外，我们设置

c n d = 1 0^{8}

在 binary64 中生成中度病态问题。我们使用 MATLAB 函数 svd 计算

U^{(0)}

，

V^{(0)}

作为初始近似左和右奇异向量矩阵，用于 binary64 算术中的奇异值分解。为了精确地查看所提出的算法的行为，我们使用具有足够长精度的多精度算术来模拟算法中的精确算术。然后，我们期望算法 1 （RefSVD）对有效，

m o d e \in {3, 4, 5}

但

m o d e \in {1, 2}

对无效。作为参考，我们还使用多精度工具箱中的内置函数 svd 来计算奇异值

σ_{i}

。

i = 1, 2, \dots, n

结果如图 1 所示，其中提供

{max}_{1 \leq i \leq n} | {\hat{σ}}_{i} - σ_{i} | ∕ | σ_{i} |

计算奇异值

{\hat{σ}}_{i}

的最大相对误差，

max (‖ R ‖, ‖ S ‖)

其中

R ≔ I - {\hat{U}}^{T} \hat{U}

和

S ≔ I - {\hat{V}}^{T} \hat{V}

作为计算的左和右奇异向量矩阵的正交性，

‖ offdiag ({\hat{U}}^{T} A \hat{V}) ‖ ∕ ‖ A ‖

作为的

{\hat{U}}^{T} A \hat{V}

对角线，

max (‖ \tilde{F} ‖, ‖ \tilde{G} ‖)

其中

\tilde{F}

和

\tilde{G}

在算法 1 中计算。这里，

offdiag (\cdot)

表示非对角线部分。横轴显示算法 1 的迭代

ν

次数。

In the case of

m o d e \in {3, 4, 5}

, all the quantities decrease quadratically in every iteration, i.e., we observe the quadratic convergence of Algorithm 1, as expected. On the other hand, in the case of

m o d e \in {1, 2}

, the algorithm fails to improve the accuracy of approximate singular vector matrices because the test matrices for

m o d e \in {1, 2}

have clustered singular values. In fact, the assumption (59) for the convergence of Algorithm 1 is not satisfied.
在的情况下

m o d e \in {3, 4, 5}

，所有量在每次迭代中都呈二次方递减，即，正如我们预期的那样，我们观察到算法 1 的二次收敛。另一方面，在

m o d e \in {1, 2}

的情况下，该算法无法提高近似奇异向量矩阵的准确率，因为的

m o d e \in {1, 2}

测试矩阵具有聚类奇异值。事实上，算法 1 收敛的假设（59）不满足。

4.2. Computational speed 4.2. 计算速度

To evaluate the computational speed of the proposed algorithm, we compare the computing time of Algorithm 1 to that of an approach using multiple-precision arithmetic, which is called “MP-approach”. In the multiple-precision toolbox, LAPACK’s routine xGESDD, which is based on a divide-and-conquer method, is implemented sophisticatedly with parallelism to solve singular value problems.
为了评估所提出的算法的计算速度，我们将算法 1 的计算时间与使用多精度算术的方法（称为 “MP-approach”）的计算时间进行了比较。在多精度工具箱中，LAPACK 的例程 xGESDD 基于分而治之法，通过并行性复杂地实现，以解决奇异值问题。

We generate a pseudo-random real

n \times n

matrix with

n \in {500, 1000}

using the MATLAB function randn such as A = randn(n). We use the MATLAB function svd in binary64, and iteratively refine the computed left and right singular vectors using Algorithm 1 twice. In Algorithm 1, for matrix multiplication at steps 1 and 8 we adopt a fast and accurate algorithm [11] using IEEE 754 binary64 (double precision) as working precision, and for other parts we use the multiple-precision toolbox with necessary arithmetic precision

d_{ν}

for

ν = 1, 2

, where

ν

denotes the iteration number of Algorithm 1. Since the binary64 arithmetic is used for obtaining initial guesses

{\hat{Σ}}_{0}

{\hat{U}}_{0}

, and

{\hat{V}}_{0}

, it is reasonable for the binary128 (quadruple precision) arithmetic to be used for

ν = 1

in order to achieve the quadratic convergence of the proposed algorithm. In the multiple-precision toolbox, the binary128 arithmetic can be realized when

d = 34

for mp.Digits (

d

), and we set

d_{1} = 34

. For

ν = 2

, we determine

d_{2}

by estimating the error of

{\hat{U}}_{0}

and

{\hat{V}}_{0}

using

ε_{1} ≔ max (‖ {\tilde{F}}_{0} ‖, ‖ {\tilde{G}}_{0} ‖)

where

{\tilde{F}}_{0}

and

{\tilde{G}}_{0}

can be obtained at the first iteration (

ν = 1

). Since we expect that the error of

{\hat{U}}_{1}

and

{\hat{V}}_{1}

is of the order of

ε_{1}^{2}

, the computational precision required in the second iteration should correspond to

{(ε_{1}^{2})}^{2} = ε_{1}^{4}

. Thus, we set

d_{2} = ⌈ 4 {log}_{10} ε_{1}^{- 1} ⌉

. In the MP-approach, we adjust the arithmetic precision

d

d_{1}

and

d_{2}

corresponding to Algorithm 1. Note that the case for

d = 34

is specially tuned in the multiple-precision toolbox and faster than that for

d < 34

, and we do not set

d

such that

d < 34

for timing fairness.

n \in {500, 1000}

我们使用 MATLAB 函数 randn 生成一个伪随机实

n \times n

矩阵，例如 A = randn（n）。我们在 binary64 中使用 MATLAB 函数 svd，并使用算法 1 迭代优化计算出的左和右奇异向量两次。在算法 1 中，对于步骤 1 和 8 的矩阵乘法，我们采用快速准确的算法 [11]，使用 IEEE 754 binary64（双精度）作为工作精度，对于其他部分，我们使用具有必要算术精度

d_{ν}

的多精度工具箱，

ν = 1, 2

其中

ν

表示算法 1 的迭代次数。由于 binary64 算术用于获取初始猜测值

{\hat{Σ}}_{0}

、

{\hat{U}}_{0}

和

{\hat{V}}_{0}

，因此使用

ν = 1

binary128（四倍精度）算术是合理的，以实现所提算法的二次收敛。在多精度工具箱中，当

d = 34

for mp.数字（

d

），我们设置

d_{1} = 34

。对于

ν = 2

，我们

d_{2}

通过估计的误差

{\hat{U}}_{0}

并使用

{\hat{V}}_{0}

ε_{1} ≔ max (‖ {\tilde{F}}_{0} ‖, ‖ {\tilde{G}}_{0} ‖)

where

{\tilde{F}}_{0}

来确定，并且可以

{\tilde{G}}_{0}

在第一次迭代（）时获得

ν = 1

。由于我们预计

{\hat{U}}_{1}

的误差为，

{\hat{V}}_{1}

ε_{1}^{2}

因此第二次迭代所需的计算精度应对应于

{(ε_{1}^{2})}^{2} = ε_{1}^{4}

。因此，我们设置

d_{2} = ⌈ 4 {log}_{10} ε_{1}^{- 1} ⌉

。在 MP 方法中，我们将算术精度

d

调整为

d_{1}

并

d_{2}

对应于算法 1。请注意，case for

d = 34

在 multiple-precision 工具箱中进行了专门调整，并且比 for

d < 34

更快，我们没有设置

d

that

d < 34

以实现 timing fairness。

In Table 1, Table 2, we show

‖ A - \hat{U} \hat{Σ} {\hat{V}}^{T} ‖ ∕ ‖ A ‖

as the relative residual norm,

max (‖ R ‖, ‖ S ‖)

as the orthogonality of

\hat{U}

and

\hat{V}

, and the measured computing time. In addition, we show

max (‖ \tilde{F} ‖, ‖ \tilde{G} ‖)

in Algorithm 1 for each iteration. As can be seen from

max (‖ {\tilde{F}}_{ν} ‖, ‖ {\tilde{G}}_{ν} ‖)

in the tables, Algorithm 1 quadratically improves the accuracy of the computed singular vectors. The residual

‖ A - {\hat{U}}_{ν} {\hat{Σ}}_{ν} {\hat{V}}_{ν}^{T} ‖ ∕ ‖ A ‖

decreases and the orthogonality

max (‖ R_{ν} ‖, ‖ S_{ν} ‖)

is improved when the iteration number

ν

increases. Moreover, Algorithm 1 is considerably faster than the MP-approach.
在表 1 和表 2 中，我们表示

‖ A - \hat{U} \hat{Σ} {\hat{V}}^{T} ‖ ∕ ‖ A ‖

为相对残差范数，

max (‖ R ‖, ‖ S ‖)

\hat{U}

以及和

\hat{V}

的正交性，以及测得的计算时间。此外，我们在算法 1 中显示了

max (‖ \tilde{F} ‖, ‖ \tilde{G} ‖)

每次迭代的 Algorithm 1。从

max (‖ {\tilde{F}}_{ν} ‖, ‖ {\tilde{G}}_{ν} ‖)

表中可以看出，算法 1 二次方提高了计算的奇异向量的精度。当迭代次数

ν

增加时，残差

‖ A - {\hat{U}}_{ν} {\hat{Σ}}_{ν} {\hat{V}}_{ν}^{T} ‖ ∕ ‖ A ‖

减少，正交性

max (‖ R_{ν} ‖, ‖ S_{ν} ‖)

提高。此外，算法 1 比 MP 方法快得多。

Acknowledgments 确认

The authors wish to thank the anonymous referees for their valuable comments. This study was partially supported by JST CREST Grant Number JPMJCR14D4 and JSPS KAKENHI Grant Numbers 16H03917, 17K14143.
作者感谢匿名审稿人的宝贵意见。这项研究得到了 JST CREST Grant Number JPMJCR14D4 和 JSPS 的部分支持 KAKENHI 授权号 16H03917、17K14143。

Table 1. Results for a pseudo-random real 500 × 500 matrix.
表 1.伪随机实数 500 × 500 矩阵的结果。

Algorithm 1 算法 1	$ν = 0$ (svd in binary64) $ν = 0$ （Binary64 中的 SVD）	$ν = 1$ ( $d_{1} = 34$ ) $ν = 1$ （ $d_{1} = 34$ ）	$ν = 2$ ( $d_{2} = 44$ ) $ν = 2$ （ $d_{2} = 44$ ）
$max (‖ {\tilde{F}}_{ν} ‖, ‖ {\tilde{G}}_{ν} ‖)$	$1.73 \times 1 0^{- 11}$	$1.50 \times 1 0^{- 22}$	$3.40 \times 1 0^{- 44}$
$‖ A - {\hat{U}}_{ν} {\hat{Σ}}_{ν} {\hat{V}}_{ν}^{T} ‖ ∕ ‖ A ‖$	$6.73 \times 1 0^{- 15}$	$2.03 \times 1 0^{- 22}$	$4.75 \times 1 0^{- 44}$
$max (‖ R_{ν} ‖, ‖ S_{ν} ‖)$	$6.55 \times 1 0^{- 15}$	$2.99 \times 1 0^{- 22}$	$6.76 \times 1 0^{- 44}$
Accumulated elapsed time (s) 累计运行时间（s）	0.05	1.24	5.54
MP-approach MP 方法	mp.Digits ( $d$ ) MP.数字（ $d$ ）	$d = 34$	$d = 44$
$‖ A - \hat{U} \hat{Σ} {\hat{V}}^{T} ‖ ∕ ‖ A ‖$		$5.96 \times 1 0^{- 33}$	$4.71 \times 1 0^{- 43}$
$max (‖ R ‖, ‖ S ‖)$		$7.72 \times 1 0^{- 33}$	$4.65 \times 1 0^{- 43}$
Elapsed time (s) 经过时间（s）		18.80	73.92

Table 2. Results for a pseudo-random real 1000 × 1000 matrix.
表 2.伪随机实数 1000 × 1000 矩阵的结果。

Algorithm 1 算法 1	$ν = 0$ (svd in binary64) $ν = 0$ （Binary64 中的 SVD）	$ν = 1$ ( $d_{1} = 34$ ) $ν = 1$ （ $d_{1} = 34$ ）	$ν = 2$ ( $d_{2} = 39$ ) $ν = 2$ （ $d_{2} = 39$ ）
$max (‖ {\tilde{F}}_{ν} ‖, ‖ {\tilde{G}}_{ν} ‖)$	$2.1 \times 1 0^{- 10}$	$2.1 \times 1 0^{- 20}$	$8.5 \times 1 0^{- 40}$
$‖ A - {\hat{U}}_{ν} {\hat{Σ}}_{ν} {\hat{V}}_{ν}^{T} ‖ ∕ ‖ A ‖$	$1.0 \times 1 0^{- 14}$	$4.2 \times 1 0^{- 20}$	$1.6 \times 1 0^{- 39}$
$max (‖ R_{ν} ‖, ‖ S_{ν} ‖)$	$1.0 \times 1 0^{- 14}$	$4.2 \times 1 0^{- 20}$	$1.6 \times 1 0^{- 39}$
Accumulated elapsed time (s) 累计运行时间（s）	0.31	6.07	23.48
MP-approach MP 方法	mp.Digits ( $d$ ) MP.数字（ $d$ ）	$d = 34$	$d = 39$
$‖ A - \hat{U} \hat{Σ} {\hat{V}}^{T} ‖ ∕ ‖ A ‖$		$6.61 \times 1 0^{- 33}$	$6.08 \times 1 0^{- 38}$
$max (‖ R ‖, ‖ S ‖)$		$9.77 \times 1 0^{- 33}$	$8.17 \times 1 0^{- 38}$
Elapsed time (s) 经过时间（s）		131.20	1338.50

References

[1]
Biglieri E., Yao K.
Some properties of SVD and their application to digital signal processing
Signal Process., 18 (3) (1989), pp. 277-289
View PDF View article View in Scopus Google Scholar
[2]
Sahidullah M., Kinnunen T.
Local spectral variability features for speaker verification
Digit. Signal Process., 50 (C) (2016), pp. 1-11
View PDF View article View in Scopus Google Scholar
[3]
Alter O., Brown P.O., Botstein D.
Singular value decomposition for genome-wide expression data processing and modeling
Proc. Natl. Acad. Sci. USA, 97 (18) (2000), pp. 10101-10106
View in Scopus Google Scholar
[4]
Wall M.E., Rechtsteiner A., Rocha L.M.
Singular value decomposition and principal component analysis
Berrar D.P., Dubitzky W., Granzow M. (Eds.), A Practical Approach to Microarray Data Analysis, Kluwer Academic Publishers, Norwell, MA, USA (2003), pp. 91-109
Crossref Google Scholar
[5]
Golub G.H., Van Loan C.F.
Matrix Computations
(fourth ed.), The Johns Hopkins University Press, Baltimore (2013)
Google Scholar
[6]
Muller N., Magaia L., Herbst B.M.
Singular value decomposition, eigenfaces, and 3D reconstructions
SIAM Rev., 46 (2004), pp. 518-545
View in Scopus Google Scholar
[7]
Ogita T., Aishima K.
Iterative refinement for symmetric eigenvalue decomposition
Jpn. J. Ind. Appl. Math., 35 (3) (2018), pp. 1007-1035
Crossref View in Scopus Google Scholar
[8]
Ogita T., Aishima K.
Iterative refinement for symmetric eigenvalue decomposition II: clustered eigenvalues
Jpn. J. Ind. Appl. Math. (2019)
published Online, Feb. 22
Google Scholar
[9]
Li X.S., Demmel J.W., Bailey D.H., Henry G., Hida Y., Iskandar J., Kahan W., Kang S.Y., Kapur A., Martin M.C., Thompson B.J., Tung T., Yoo D.
Design, implementation and testing of extended and mixed precision BLAS
ACM Trans. Math. Software, 28 (2002), pp. 152-205
View in Scopus Google Scholar
[10]
Ogita T., Rump S.M., Oishi S.
Accurate sum and dot product
SIAM J. Sci. Comput., 26 (6) (2005), pp. 1955-1988
Crossref View in Scopus Google Scholar
[11]
Ozaki K., Ogita T., Oishi S., Rump S.M.
Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications
Numer. Algorithms, 59 (1) (2012), pp. 95-118
Crossref View in Scopus Google Scholar
[12]
Dongarra J.J.
Improving the accuracy of computed singular values
SIAM J. Sci. Stat. Comput., 4 (4) (1983), pp. 712-719
Crossref Google Scholar
[13]
Davies P.I., Smith M.I.
Updating the singular value decomposition
J. Comput. Appl. Math., 170 (2004), pp. 145-167
View PDF View article View in Scopus Google Scholar
[14]
Davies R.O., Modi J.J.
A direct method for completing eigenproblem solutions on a parallel computer
Linear Algebra Appl., 77 (1986), pp. 61-74
View PDF View article View in Scopus Google Scholar
[15]
Advanpix: Multiprecision computing toolbox for MATLAB, 2019. Code and documentation available at http://www.advanpix.com/.
Google Scholar
[16]
Higham N.J.
Accuracy and Stability of Numerical Algorithms
(second ed.), SIAM, Philadelphia, PA (2002)
Google Scholar

Cited by (7)

Optimizing a vector of shrinkage factors for continuum regression
2020, Chemometrics and Intelligent Laboratory Systems
Continuum regression (CR) provides a promising regression framework encompassing ordinary least squares (OLS), partial least squares (PLS), and principal component regression (PCR). One important parameter of CR, namely shrinkage factor, determines how CR compromises between OLS and PCR. As the factor suggests, the rationale behind CR is that it aims at realizing a balance between achieving a good fit and establishing a stable model. However, traditional CR always uses a single shrinkage factor when extracting successive latent variables. As a consequence, the power of CR is surely limited. Aiming at this problem, we offer a vector of shrinkage factors for CR and one shrinkage factor for each latent variable. But now, identifying the optimal vector of shrinkage factors becomes a non-deterministic polynomial complete problem. As an effective optimization method, genetic algorithm (GA) is utilized to handle this tedious task. Together, the GACR framework is proposed in this study. The experiments on two real-world datasets illustrate the method’s applicability in practice.
Acceleration of iterative refinement for singular value decomposition
2024, Numerical Algorithms
A mixed precision LOBPCG algorithm
2023, Numerical Algorithms
Mixed precision algorithms in numerical linear algebra
2022, Acta Numerica
Mixed-Precision Algorithm for Finding Selected Eigenvalues and Eigenvectors of Symmetric and Hermitian Matrices<sup>1</sup>
2022, Proceedings of ScalAH 2022: 13th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems, Held in conjunction with SC 2022: The International Conference for High Performance Computing, Networking, Storage and Analysis
Model order determination method of stochastic subspace based on S-type function
2020, Dongnan Daxue Xuebao (Ziran Kexue Ban)/Journal of Southeast University (Natural Science Edition)

View all citing articles on Scopus

[1] [1]
Biglieri E., Yao K.
Some properties of SVD and their application to digital signal processing
Signal Process., 18 (3) (1989), pp. 277-289
View PDF View article View in Scopus Google Scholar

[2] [2]
Sahidullah M., Kinnunen T.
Local spectral variability features for speaker verification
Digit. Signal Process., 50 (C) (2016), pp. 1-11
View PDF View article View in Scopus Google Scholar

[3] [3]
Alter O., Brown P.O., Botstein D.
Singular value decomposition for genome-wide expression data processing and modeling
Proc. Natl. Acad. Sci. USA, 97 (18) (2000), pp. 10101-10106
View in Scopus Google Scholar

[4] [4]
Wall M.E., Rechtsteiner A., Rocha L.M.
Singular value decomposition and principal component analysis
Berrar D.P., Dubitzky W., Granzow M. (Eds.), A Practical Approach to Microarray Data Analysis, Kluwer Academic Publishers, Norwell, MA, USA (2003), pp. 91-109
Crossref Google Scholar

[5] [5]
Golub G.H., Van Loan C.F.
Matrix Computations
(fourth ed.), The Johns Hopkins University Press, Baltimore (2013)
Google Scholar

[6] [6]
Muller N., Magaia L., Herbst B.M.
Singular value decomposition, eigenfaces, and 3D reconstructions
SIAM Rev., 46 (2004), pp. 518-545
View in Scopus Google Scholar

[7] [7]
Ogita T., Aishima K.
Iterative refinement for symmetric eigenvalue decomposition
Jpn. J. Ind. Appl. Math., 35 (3) (2018), pp. 1007-1035
Crossref View in Scopus Google Scholar

[8] [8]
Ogita T., Aishima K.
Iterative refinement for symmetric eigenvalue decomposition II: clustered eigenvalues
Jpn. J. Ind. Appl. Math. (2019)
published Online, Feb. 22
Google Scholar

[9] [9]
Li X.S., Demmel J.W., Bailey D.H., Henry G., Hida Y., Iskandar J., Kahan W., Kang S.Y., Kapur A., Martin M.C., Thompson B.J., Tung T., Yoo D.
Design, implementation and testing of extended and mixed precision BLAS
ACM Trans. Math. Software, 28 (2002), pp. 152-205
View in Scopus Google Scholar

[10] [10]
Ogita T., Rump S.M., Oishi S.
Accurate sum and dot product
SIAM J. Sci. Comput., 26 (6) (2005), pp. 1955-1988
Crossref View in Scopus Google Scholar

[11] [11]
Ozaki K., Ogita T., Oishi S., Rump S.M.
Error-free transformations of matrix multiplication by using fast routines of matrix multiplication and its applications
Numer. Algorithms, 59 (1) (2012), pp. 95-118
Crossref View in Scopus Google Scholar

[12] [12]
Dongarra J.J.
Improving the accuracy of computed singular values
SIAM J. Sci. Stat. Comput., 4 (4) (1983), pp. 712-719
Crossref Google Scholar

[13] [13]
Davies P.I., Smith M.I.
Updating the singular value decomposition
J. Comput. Appl. Math., 170 (2004), pp. 145-167
View PDF View article View in Scopus Google Scholar

[14] [14]
Davies R.O., Modi J.J.
A direct method for completing eigenproblem solutions on a parallel computer
Linear Algebra Appl., 77 (1986), pp. 61-74
View PDF View article View in Scopus Google Scholar

[15] [15]
Advanpix: Multiprecision computing toolbox for MATLAB, 2019. Code and documentation available at http://www.advanpix.com/.
Google Scholar

[16] [16]
Higham N.J.
Accuracy and Stability of Numerical Algorithms
(second ed.), SIAM, Philadelphia, PA (2002)
Google Scholar

Outline 大纲

Cited by (7) 被引用次数（7）

Figures (3) 手办（3）

Tables (2)

Journal of Computational and Applied Mathematics

Abstract 抽象

MSC MSC 系列

Keywords 关键字

1. Introduction 1. 引言

2. Proposed algorithm 2. 建议的算法

3. Convergence analysis 3. 收敛分析

4. Numerical results 4. 数值结果

4.1. Convergence property
4.1. 收敛属性

4.2. Computational speed 4.2. 计算速度

Acknowledgments 确认

References

Cited by (7)

Optimizing a vector of shrinkage factors for continuum regression

Acceleration of iterative refinement for singular value decomposition

A mixed precision LOBPCG algorithm

Mixed precision algorithms in numerical linear algebra

Mixed-Precision Algorithm for Finding Selected Eigenvalues and Eigenvectors of Symmetric and Hermitian Matrices<sup>1</sup>

Model order determination method of stochastic subspace based on S-type function

Iterative refinement for singular value decomposition based on matrix multiplication基于矩阵乘法的奇异值分解迭代细化

Abstract 抽象

MSC MSC 系列

Keywords 关键字

1. Introduction 1. 引言

2. Proposed algorithm 2. 建议的算法

3. Convergence analysis 3. 收敛分析

4. Numerical results 4. 数值结果

4.1. Convergence property4.1. 收敛属性

4.2. Computational speed 4.2. 计算速度

Acknowledgments 确认

References

Iterative refinement for singular value decomposition based on matrix multiplication
基于矩阵乘法的奇异值分解迭代细化

4.1. Convergence property
4.1. 收敛属性