temo

Okay, let’s break down these two steps in detail. The goal here is to transform the non-linear projection equation into a system of linear equations that we can solve to find the unknown projection matrix $M$ .

Step 1: Formulate Linear Equations (for a single point i)

Starting Point: The fundamental relationship between a 3D world point $P_{w, i}$ and its 2D image projection $p_{i}$ is given by the perspective projection equation: $λ p_{i} = M P_{w, i}$
- $P_{w, i} = [X_{i}, Y_{i}, Z_{i}, 1]^{T}$ : Homogeneous coordinates of the known 3D point.
- $p_{i} = [u_{i}, v_{i}, 1]^{T}$ : Homogeneous coordinates of the measured 2D image point. $u_{i}$ and $v_{i}$ are the pixel coordinates we observe.
- $M$ : The $3 \times 4$ projection matrix we want to find. Its 12 elements are the unknowns.
- $λ$ : An unknown, non-zero scalar representing the projective depth. It varies for each point $i$ .
Expand the Matrix Equation: Let’s represent the rows of the unknown matrix $M$ as $m_{1}^{T}$ , $m_{2}^{T}$ , and $m_{3}^{T}$ . Each $m_{j}^{T}$ is a $1 \times 4$ row vector (or $m_{j}$ is a $4 \times 1$ column vector).
$= m_{1}^{T} m_{2}^{T} m_{3}^{T} X_{i} Y_{i} Z_{i} 1 = m_{1}^{T} P_{w, i} m_{2}^{T} P_{w, i} m_{3}^{T} P_{w, i}$
The terms $m_{j}^{T} P_{w, i}$ are just dot products. For example, $m_{1}^{T} P_{w, i} = m_{11} X_{i} + m_{12} Y_{i} + m_{13} Z_{i} + m_{14}$ .
Write out Scalar Equations: This matrix equation gives us three separate scalar equations:
- (1) $λ u_{i} = m_{1}^{T} P_{w, i}$
- (2) $λ v_{i} = m_{2}^{T} P_{w, i}$
- (3) $λ = m_{3}^{T} P_{w, i}$
Eliminate the Unknown Scale $λ$ : Our goal is to find the elements of $M$ . The scale factor $λ$ is different for every point and is also unknown. We can eliminate it using substitution. Substitute equation (3) into equations (1) and (2):
- Substitute into (1): $(m_{3}^{T} P_{w, i}) u_{i} = m_{1}^{T} P_{w, i}$
- Substitute into (2): $(m_{3}^{T} P_{w, i}) v_{i} = m_{2}^{T} P_{w, i}$
Rearrange into Linear Homogeneous Form: Move all terms to one side to get equations equal to zero. This form ( $A x = 0$ ) is suitable for solving with methods like SVD.
- From (1): $u_{i} (m_{3}^{T} P_{w, i}) - (m_{1}^{T} P_{w, i}) = 0$
- From (2): $v_{i} (m_{3}^{T} P_{w, i}) - (m_{2}^{T} P_{w, i}) = 0$
Crucially, these two equations are linear in the unknown elements of $M$ (the $m_{jk}$ values that make up the row vectors $m_{1}^{T}, m_{2}^{T}, m_{3}^{T}$ ). All the terms involving the known 3D point $P_{w, i}$ and the measured 2D point $(u_{i}, v_{i})$ act as known coefficients multiplying the unknown matrix elements.

Step 2: Set up Homogeneous Linear System (for n points)

The Goal: We have two linear equations for each point correspondence. If we have $n$ correspondences (where $n \geq 6$ ), we have a total of $2 n$ linear equations. We want to write this entire system in a single matrix equation form: $P m = 0$
The Unknown Vector $m$ : We need to arrange the 12 unknown elements of $M$ into a single column vector $m$ . A standard way is to stack the row vectors (transposed):
$m = m_{1} m_{2} m_{3} = m_{11} m_{12} m_{13} m_{14} m_{21} m_{22} m_{23} m_{24} m_{31} m_{32} m_{33} m_{34} (a 12 \times 1 vector)$
The Coefficient Matrix $P$ : We need to construct the large matrix $P$ such that when multiplied by $m$ , it reproduces the $2 n$ equations we derived. Let’s look at the two equations for a single point $i$ again:
- Equation 1: $- m_{1}^{T} P_{w, i} + 0 \cdot (m_{2}^{T} P_{w, i}) + u_{i} (m_{3}^{T} P_{w, i}) = 0$
- Equation 2: $0 \cdot (m_{1}^{T} P_{w, i}) - m_{2}^{T} P_{w, i} + v_{i} (m_{3}^{T} P_{w, i}) = 0$
Now, let’s express the dot products using the vector $P_{w, i}^{T} = [X_{i}, Y_{i}, Z_{i}, 1]$ and the blocks of $m$ :
- $m_{1}^{T} P_{w, i} = P_{w, i}^{T} m_{1}$
- $m_{2}^{T} P_{w, i} = P_{w, i}^{T} m_{2}$
- $m_{3}^{T} P_{w, i} = P_{w, i}^{T} m_{3}$
Substituting back:
- Equation 1: $- P_{w, i}^{T} m_{1} + 0^{T} m_{2} + u_{i} P_{w, i}^{T} m_{3} = 0$
- Equation 2: $0^{T} m_{1} - P_{w, i}^{T} m_{2} + v_{i} P_{w, i}^{T} m_{3} = 0$
These two equations correspond to two rows in the matrix $P$ . The structure provided in the notes matches these equations (possibly with an overall sign flip on one or both, which doesn’t affect the solution of $P m = 0$ ):
- Row for Eq 2: $[0^{T} - P_{w, i}^{T} v_{i} P_{w, i}^{T}]$ . This is a $1 \times 12$ row vector. When multiplied by $m = [m_{1}^{T}, m_{2}^{T}, m_{3}^{T}]^{T}$ , it gives Eq 2. This corresponds to row1 in the provided structure.
- Row for Eq 1: $[P_{w, i}^{T} 0^{T} - u_{i} P_{w, i}^{T}]$ . This is a $1 \times 12$ row vector. When multiplied by $m$ , it gives Eq 1 (with a sign flip). This corresponds to row2 in the provided structure.
(Note: The naming row1/row2 in the notes might seem reversed relative to the $u, v$ equations, but the structure is correct for generating the necessary constraints.)

The full $P$ matrix is constructed by stacking these two rows for each point $i$ from $1$ to $n$ :
$P = 0^{T} P_{w, 1}^{T} 0^{T} P_{w, 2}^{T} ⋮ 0^{T} P_{w, n}^{T} - P_{w, 1}^{T} 0^{T} - P_{w, 2}^{T} 0^{T} ⋮ - P_{w, n}^{T} 0^{T} v_{1} P_{w, 1}^{T} - u_{1} P_{w, 1}^{T} v_{2} P_{w, 2}^{T} - u_{2} P_{w, 2}^{T} ⋮ v_{n} P_{w, n}^{T} - u_{n} P_{w, n}^{T} (a 2 n \times 12 matrix)$
Each block like $P_{w, i}^{T}$ is $1 \times 4$ , and $0^{T}$ is also $1 \times 4$ .
The Result: We now have the system $P m = 0$ . The matrix $P$ contains only known quantities (from the 3D points and 2D measurements). The vector $m$ contains the 12 unknowns we want to find. This system can be solved using Singular Value Decomposition (SVD) to find the non-trivial solution for $m$ (which corresponds to the vector associated with the smallest singular value).

Quartz 4

Explorer

temo

Graph View

Backlinks