Tutorial

Tutorial of Proper Orthogonal Decomposition

Learn the fundamentals of proper orthogonal decomposition theory.

1. Basic Idea of POD

Proper Orthogonal Decomposition (POD) is a widely used method for data reduction and modal decomposition. It extracts the most energetic and representative structures from complex datasets such as flow fields, pressure fields, or experimental measurements.

The core idea of POD is to represent a complex system using a limited number of orthogonal modes that capture the maximum possible energy. For flow field analysis, POD decomposes a transient field into a set of spatial modes and corresponding temporal coefficients, revealing dominant coherent structures in the flow.

Pic-1

Compared with directly analyzing a large number of instantaneous snapshots, POD significantly simplifies the problem by identifying a few dominant modes, making it easier to interpret flow structures, energy distribution, and unsteady behavior.


2. Mathematical Formulation of POD

Let the instantaneous field be denoted as q(x,t)q(\mathbf{x}, t), where x\mathbf{x} is the spatial coordinate and tt is time. POD aims to express it as:

q(x,t)=q(x)+i=1Nai(t)ϕi(x)q(\mathbf{x}, t) = \overline{q}(\mathbf{x}) + \sum_{i=1}^{N} a_i(t)\phi_i(\mathbf{x})

where:

  • q(x)\overline{q}(\mathbf{x}) is the mean field
  • ϕi(x)\phi_i(\mathbf{x}) is the ii-th spatial POD mode
  • ai(t)a_i(t) is the corresponding temporal coefficient
  • NN is the number of modes

In practice, the fluctuating component is first obtained:

q(x,t)=q(x,t)q(x)q'(\mathbf{x}, t) = q(\mathbf{x}, t) - \overline{q}(\mathbf{x})

Then POD is applied to the fluctuating field:

q(x,t)=i=1Nai(t)ϕi(x)q'(\mathbf{x}, t) = \sum_{i=1}^{N} a_i(t)\phi_i(\mathbf{x})

3. Physical Meaning of POD Modes

The spatial modes ϕi(x)\phi_i(\mathbf{x}) represent dominant flow structures. The first mode usually contains the highest energy and corresponds to the most significant coherent structure. Higher-order modes contain progressively less energy and often represent smaller-scale or more complex structures.

The temporal coefficients ai(t)a_i(t) describe how each spatial mode evolves over time.

Pic-1

Each POD mode is associated with an eigenvalue λi\lambda_i, which represents its energy contribution. The energy ratio of the ii-th mode is:

ηi=λij=1Nλj\eta_i = \frac{\lambda_i}{\sum_{j=1}^{N} \lambda_j}

The cumulative energy of the first rr modes is:

ηcum(r)=i=1rλij=1Nλj\eta_{\mathrm{cum}}(r) = \frac{\sum_{i=1}^{r} \lambda_i}{\sum_{j=1}^{N} \lambda_j}

If a few modes capture most of the energy, the system exhibits low-dimensional characteristics.


4. Basic Procedure of POD

The general procedure of POD analysis is as follows:

  1. Collect transient data (velocity, pressure, vorticity, etc.)
  2. Convert each snapshot into a column vector
  3. Construct the snapshot matrix
  4. Perform decomposition to obtain modes, coefficients, and energies
  5. Analyze dominant structures based on energy contribution

For mm spatial points and nn snapshots, the snapshot matrix is:

X=[x1x2xn]Rm×n\mathbf{X} = \begin{bmatrix} | & | & & | \\ \mathbf{x}_1 & \mathbf{x}_2 & \cdots & \mathbf{x}_n \\ | & | & & | \end{bmatrix} \in \mathbb{R}^{m \times n}

SVD-Based POD (SVD-POD)

5. Basic Idea of SVD-POD

SVD-POD is a numerical implementation of POD using Singular Value Decomposition (SVD). Instead of explicitly forming the covariance matrix, SVD directly decomposes the snapshot matrix, making it computationally efficient and stable.

For the snapshot matrix X\mathbf{X}, SVD is given by:

X=UΣVT\mathbf{X} = \mathbf{U}\mathbf{\Sigma}\mathbf{V}^T

where:

  • U\mathbf{U} contains left singular vectors (spatial modes)
  • Σ\mathbf{\Sigma} is a diagonal matrix of singular values
  • V\mathbf{V} contains right singular vectors (time information)
Pic-1


6. Derivation of SVD-POD

Given the snapshot matrix:

X=[x1x2xn]\mathbf{X} = \begin{bmatrix} \mathbf{x}_1 & \mathbf{x}_2 & \cdots & \mathbf{x}_n \end{bmatrix}

Perform SVD:

X=UΣVT\mathbf{X} = \mathbf{U}\mathbf{\Sigma}\mathbf{V}^T

where:

Σ=diag(σ1,σ2,,σr)\mathbf{\Sigma} = \mathrm{diag}(\sigma_1, \sigma_2, \cdots, \sigma_r)

with:

σ1σ2σr>0\sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_r > 0

The POD modes are given by:

Φ=U\mathbf{\Phi} = \mathbf{U}

The energy associated with each mode is proportional to the square of the singular values:

λi=σi2\lambda_i = \sigma_i^2

Thus, the energy ratio is:

ηi=σi2j=1rσj2\eta_i = \frac{\sigma_i^2}{\sum_{j=1}^{r} \sigma_j^2}

and the cumulative energy is:

ηcum(r)=i=1rσi2j=1Nσj2\eta_{\mathrm{cum}}(r) = \frac{\sum_{i=1}^{r} \sigma_i^2}{\sum_{j=1}^{N} \sigma_j^2}

7. Temporal Coefficients

In SVD-POD, the temporal coefficients can be obtained as:

A=ΣVT\mathbf{A} = \mathbf{\Sigma}\mathbf{V}^T

Thus, the snapshot matrix can be reconstructed as:

X=UA\mathbf{X} = \mathbf{U}\mathbf{A}

or in summation form:

X=i=1rσiuiviT\mathbf{X} = \sum_{i=1}^{r} \sigma_i \mathbf{u}_i \mathbf{v}_i^T

For the ii-th mode, the temporal coefficient at time tkt_k is:

ai(tk)=σivi(tk)a_i(t_k) = \sigma_i v_i(t_k)

8. Low-Order Reconstruction

One key application of POD is reduced-order reconstruction. By retaining only the first rr dominant modes:

Xr=i=1rσiuiviT\mathbf{X}_r = \sum_{i=1}^{r} \sigma_i \mathbf{u}_i \mathbf{v}_i^T

or:

Xr=UrΣrVrT\mathbf{X}_r = \mathbf{U}_r \mathbf{\Sigma}_r \mathbf{V}_r^T
Pic-1

A small number of modes can capture the main structures while filtering out noise. Increasing rr improves reconstruction accuracy.


9. Relationship Between POD and SVD

POD is a theoretical framework for optimal energy-based decomposition, while SVD is a numerical tool used to compute POD modes.

In practice:

POD provides the concept, SVD provides the computation\text{POD provides the concept, SVD provides the computation}

10. Applications in Flow Analysis

POD is widely used in fluid mechanics and turbomachinery:

  1. Extraction of dominant coherent structures
  2. Analysis of energy distribution
  3. Identification of large-scale vortices
  4. Data dimensionality reduction
  5. Low-order reconstruction
  6. Separation of dominant structures and disturbances

For example, in pump flow analysis, low-order modes often correspond to major vortices, recirculation zones, or large-scale unsteady structures, while higher-order modes may represent small-scale turbulence or noise.


11. Advantages and Limitations

Advantages:

  • Optimal energy representation
  • Strong data compression capability
  • Clear identification of dominant structures

Limitations:

  • Modes are ranked by energy, not frequency
  • Multiple physical phenomena may be mixed in one mode

Therefore, additional analysis (e.g., FFT, HHT, or wavelet transform) is often needed to interpret modal frequency characteristics.


12. Summary

Proper Orthogonal Decomposition (POD) is an energy-based modal decomposition method that extracts dominant structures from complex datasets:

q(x,t)=i=1Nai(t)ϕi(x)q'(\mathbf{x}, t) = \sum_{i=1}^{N} a_i(t)\phi_i(\mathbf{x})

SVD-POD implements POD using singular value decomposition:

X=UΣVT\mathbf{X} = \mathbf{U}\mathbf{\Sigma}\mathbf{V}^T

Here, U\mathbf{U} represents spatial modes, Σ\mathbf{\Sigma} represents modal energy, and V\mathbf{V} describes temporal evolution.

POD is highly effective for extracting coherent structures, analyzing energy distribution, and performing reduced-order modeling, especially in unsteady flow analysis.

Tutorial of Proper Orthogonal Decomposition | Hongchi Zhong (Darian)