Definition of Quaternion
Quaternions can be represented as:
\[\begin{gather*} q = [s, x, y, z] = s + xi + yj + zk = [s, v] \\ v = (x, y, z) \end{gather*}\]
i, j, k are the three imaginary units, where:
\[\begin{gather*} i^2 = j^2 = k^2 = ijk = -1 \end{gather*}\]
i, j, k Are Not Commutative
\[\begin{gather*} ij = -ij(kk) = (-ijk)k = k \\ jk = -(ii)jk = -i(ijk) = i \\ ki = -ki(jj) = -k(ij)j = -(kk)j = j \end{gather*}\]
Reversing the order of each product flips its sign, so i, j, k are NOT commutative:
\[\begin{gather*} ji = j(jk) = (jj)k = -k \\ kj = k(ki) = (kk)i = -i \\ ik = i(ij) = (ii)j = -j \end{gather*}\]
Product of Quaternions
The product of two quaternions is:
\[\begin{gather*} q_a q_b = s_a s_b - x_a x_b - y_a y_b - z_a z_b \\ + (s_a x_b + x_a s_b + y_a z_b - z_a y_b) \, i \\ + (s_a y_b - x_a z_b + y_a s_b + z_a x_b) \, j \\ + (s_a z_b + x_a y_b - y_a x_b + z_a s_b) \, k. \end{gather*}\]
This can also be written in terms of the scalar parts s and the vector parts v:
\[\begin{gather*} q_a q_b = [s_a s_b - v_a^T v_b, \; s_a v_b + s_b v_a + v_a \times v_b] \end{gather*}\]
Note that the cross product $v_a \times v_b$ is not commutative, so quaternion multiplication is not commutative either, unless $v_a$ and $v_b$ are collinear (in which case $v_a \times v_b = 0$).
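The component formula above can be sketched in a few lines of Python. This is a minimal illustration, assuming quaternions are stored as `(s, x, y, z)` tuples; the name `quat_mul` is hypothetical and not taken from any library.

```python
def quat_mul(a, b):
    """Hamilton product of two quaternions given as (s, x, y, z) tuples."""
    sa, xa, ya, za = a
    sb, xb, yb, zb = b
    return (
        sa * sb - xa * xb - ya * yb - za * zb,
        sa * xb + xa * sb + ya * zb - za * yb,
        sa * yb - xa * zb + ya * sb + za * xb,
        sa * zb + xa * yb - ya * xb + za * sb,
    )

# The imaginary units as quaternions.
i = (0.0, 1.0, 0.0, 0.0)
j = (0.0, 0.0, 1.0, 0.0)
k = (0.0, 0.0, 0.0, 1.0)

ij = quat_mul(i, j)  # equals k
ji = quat_mul(j, i)  # equals -k, so the order matters

# Quaternions with collinear vector parts do commute.
a = (1.0, 2.0, 0.0, 0.0)
b = (3.0, -4.0, 0.0, 0.0)
ab = quat_mul(a, b)
ba = quat_mul(b, a)
```

Here `ij` and `ji` differ only in sign, while `ab` and `ba` are identical because the cross-product term vanishes.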
Length of q
\[\begin{gather*} |q| = \sqrt{s^2 + x^2 + y^2 + z^2} = \sqrt{s^2 + v^T v} \end{gather*}\]
One can prove that the length of a product is the product of the lengths:
\[\begin{gather*} |q_a q_b| = |q_a| |q_b| \end{gather*}\]
q conjugate: $q^{*}$
\[\begin{gather*} q^{*} = s - xi - yj - zk = [s, -v] \\ q^{*}q = [s^2 + v^T v, 0] = q q^{*} \end{gather*}\]
Because $q^{*}q = [s^2 + v^T v, 0] = |q|^2$, we can see that the inverse of q is
\[\begin{gather*} q^{-1} = \frac{q^{*}}{|q|^2} \\ q q^{-1} = \frac{[s^2 + v^T v, 0]}{|q|^2} = 1 \end{gather*}\]
Product’s Inverse Is the Reversed Product of Inverses
Since the conjugate of a product reverses the order, $(q_a q_b)^{*} = q_b^{*} q_a^{*}$, the same holds for the inverse:
\[\begin{gather*} (q_a q_b)^{-1} = \frac{(q_a q_b)^{*}}{|q_a q_b|^2} = \frac{q_b^{*}}{|q_b|^2} \frac{q_a^{*}}{|q_a|^2} = q_b^{-1} q_a^{-1} \end{gather*}\]
Rotation Quaternion To Rotation Matrix
A general quaternion does NOT have to have unit length; a rotation quaternion, however, must be a unit quaternion.
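The length, conjugate, and inverse of a quaternion, together with the normalization that turns a general quaternion into a unit one, can be sketched as follows. This is a minimal illustration using `(s, x, y, z)` tuples; all function names are hypothetical.

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions given as (s, x, y, z) tuples."""
    sa, xa, ya, za = a
    sb, xb, yb, zb = b
    return (
        sa * sb - xa * xb - ya * yb - za * zb,
        sa * xb + xa * sb + ya * zb - za * yb,
        sa * yb - xa * zb + ya * sb + za * xb,
        sa * zb + xa * yb - ya * xb + za * sb,
    )

def quat_norm(q):
    """Length |q| = sqrt(s^2 + x^2 + y^2 + z^2)."""
    return math.sqrt(sum(c * c for c in q))

def quat_conj(q):
    """Conjugate q* = [s, -v]."""
    s, x, y, z = q
    return (s, -x, -y, -z)

def quat_inv(q):
    """Inverse q^{-1} = q* / |q|^2 (q must be non-zero)."""
    n2 = sum(c * c for c in q)
    return tuple(c / n2 for c in quat_conj(q))

def quat_normalize(q):
    """Scale q to unit length so it can represent a rotation."""
    n = quat_norm(q)
    return tuple(c / n for c in q)

q = (1.0, 2.0, 3.0, 4.0)             # arbitrary non-unit quaternion
identity = quat_mul(q, quat_inv(q))  # approximately (1, 0, 0, 0)
unit_q = quat_normalize(q)           # same direction, length 1
```

For a unit quaternion the inverse reduces to the conjugate, which is why rotation code often uses `quat_conj` in place of `quat_inv`.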
1. Rotation is $p'=qpq^{-1}$
For a point (x, y, z), a rotation is p' = Rp. If we want to do the rotation with quaternions, we write the point as p = [0, x, y, z] and compute $p' = qpq^{-1}$. The result p' is again a pure imaginary quaternion. Here is why:
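- Let’s define 2 matrices for a quaternion $q = [s, v]$ (the only difference is the sign of $v^{\land}$):
\[\begin{gather*} q^{+} = \begin{bmatrix} s & -\mathbf{v}^\top \\ \mathbf{v} & s \mathbf{I} + \mathbf{v}^\wedge \end{bmatrix}, \quad q^{\oplus} = \begin{bmatrix} s & -\mathbf{v}^\top \\ \mathbf{v} & s \mathbf{I} - \mathbf{v}^\wedge \end{bmatrix} \end{gather*}\]
We can prove that $q_1^{+} q_2 = q_1 q_2$: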
\[\begin{gather*} \begin{bmatrix} s_1 & -\mathbf{v}_1^\top \\ \mathbf{v}_1 & s_1 \mathbf{I} + \mathbf{v}_1^\wedge \end{bmatrix} \begin{bmatrix} s_2 \\ \mathbf{v}_2 \end{bmatrix} = \begin{bmatrix} -\mathbf{v}_1^\top \mathbf{v}_2 + s_1 s_2 \\ s_1 \mathbf{v}_2 + s_2 \mathbf{v}_1 + \mathbf{v}_1^\wedge \mathbf{v}_2 \end{bmatrix} = q_1 q_2 \end{gather*}\]
Similarly, $q_1 q_2 = q_2^{\oplus} q_1$. So
\[\begin{gather*} p' = qpq^{-1} = q^{+} p^{+} q^{-1} = q^{+} (q^{-1})^{\oplus} p \end{gather*}\]
Then, we can write out $q^{+} (q^{-1})^{\oplus}$, using that a rotation quaternion has unit length:
\[\begin{gather*} q^+ \left( q^{-1} \right)^\oplus = \begin{bmatrix} s & -\mathbf{v}^\top \\ \mathbf{v} & s \mathbf{I} + \mathbf{v}^\wedge \end{bmatrix} \begin{bmatrix} s & \mathbf{v}^\top \\ -\mathbf{v} & s \mathbf{I} + \mathbf{v}^\wedge \end{bmatrix} = \begin{bmatrix} 1 & \mathbf{0}^\top \\ \mathbf{0} & \mathbf{v}\mathbf{v}^\top + s^2 \mathbf{I} + 2s\mathbf{v}^\wedge + (\mathbf{v}^\wedge)^2 \end{bmatrix} \end{gather*}\]
Here, it’s easy to see that $p' = qpq^{-1} = q^{+} (q^{-1})^{\oplus} p$ has a zero real part, because the real part of $p$ is zero and the first row of the matrix is $[1, \mathbf{0}^\top]$.
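A small numerical check of $p' = qpq^{-1}$ can make this concrete. The sketch below assumes `(s, x, y, z)` tuples and a unit quaternion (so the inverse equals the conjugate); the names `quat_mul` and `rotate` are hypothetical.

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions given as (s, x, y, z) tuples."""
    sa, xa, ya, za = a
    sb, xb, yb, zb = b
    return (
        sa * sb - xa * xb - ya * yb - za * zb,
        sa * xb + xa * sb + ya * zb - za * yb,
        sa * yb - xa * zb + ya * sb + za * xb,
        sa * zb + xa * yb - ya * xb + za * sb,
    )

def rotate(q, point):
    """Return q p q^{-1} as a full quaternion, for a unit quaternion q."""
    s, x, y, z = q
    q_inv = (s, -x, -y, -z)  # unit length: inverse equals conjugate
    p = (0.0, point[0], point[1], point[2])  # embed the point as [0, x, y, z]
    return quat_mul(quat_mul(q, p), q_inv)

# Unit quaternion for a 90 degree rotation about the z axis.
half = math.pi / 4.0
q = (math.cos(half), 0.0, 0.0, math.sin(half))

p_rot = rotate(q, (1.0, 0.0, 0.0))
# The real part p_rot[0] is zero (up to rounding) and the vector part is
# approximately (0, 1, 0): the x axis is mapped onto the y axis.
```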
2. Rotation Matrix is $R = v v^{T} + s^2 I + 2sv^{\land} + (v^{\land})^2$
Since the rotated point p' is purely imaginary, the rotation matrix is the bottom-right $3 \times 3$ block of $q^{+} (q^{-1})^{\oplus}$:
\[\begin{gather*} p' = qpq^{-1} = \left( v v^{T} + s^2 I + 2sv^{\land} + (v^{\land})^2 \right) p = Rp \end{gather*}\]
So that leads to $R = v v^{T} + s^2 I + 2sv^{\land} + (v^{\land})^2$.
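The formula for $R$ can be evaluated directly. The following is a minimal pure-Python sketch, assuming a unit quaternion stored as an `(s, x, y, z)` tuple and 3x3 matrices stored as nested lists; `skew`, `matmul3`, and `quat_to_matrix` are hypothetical helper names.

```python
import math

def skew(v):
    """Skew-symmetric matrix v^ such that (v^) w equals the cross product v x w."""
    x, y, z = v
    return [[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]]

def matmul3(a, b):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(a[r][k] * b[k][c] for k in range(3)) for c in range(3)]
            for r in range(3)]

def quat_to_matrix(q):
    """R = v v^T + s^2 I + 2 s v^ + (v^)^2 for a unit quaternion q = [s, v]."""
    s, v = q[0], q[1:]
    v_hat = skew(v)
    v_hat_sq = matmul3(v_hat, v_hat)
    return [[v[r] * v[c]                      # v v^T
             + (s * s if r == c else 0.0)     # s^2 I
             + 2.0 * s * v_hat[r][c]          # 2 s v^
             + v_hat_sq[r][c]                 # (v^)^2
             for c in range(3)] for r in range(3)]

# Unit quaternion for a 90 degree rotation about the z axis.
half = math.pi / 4.0
q = (math.cos(half), 0.0, 0.0, math.sin(half))
R = quat_to_matrix(q)
# R is approximately [[0, -1, 0], [1, 0, 0], [0, 0, 1]].
```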
3. From Quaternion To Rotation Angle, $\theta$ Using $tr(R)$
Now, we can express the trace of R in terms of s, the real part of the quaternion q.
Since, with $a = v / |v|$ the unit vector along $v$,
\[\begin{gather*} v^{\land} = |v| a^{\land} \Rightarrow (v^{\land})^2 = |v|^2 (a a^T - I) \end{gather*}\]
We have the trace of $(v^{\land})^2$:
\[\begin{gather*} tr((v^{\land})^2) = |v|^2 (1 - 3) = -2 |v|^2 \end{gather*}\]
So ultimately, using $tr(v^{\land}) = 0$ and $|v|^2 = 1 - s^2$:
\[\begin{gather*} tr(R) = (v_1^2 + v_2^2 + v_3^2) + 3s^2 + 0 - 2(v_1^2 + v_2^2 + v_3^2) \\ = (1-s^2) + 3s^2 - 2(1-s^2) \\ \Rightarrow \\ tr(R) = 4s^2 - 1 \end{gather*}\]
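As a quick sanity check of $tr(R) = 4s^2 - 1$, take the unit quaternion $q = [\frac{\sqrt{2}}{2}, 0, 0, \frac{\sqrt{2}}{2}]$ (an example chosen here for illustration). Evaluating $R = v v^{T} + s^2 I + 2sv^{\land} + (v^{\land})^2$ term by term gives:
\[\begin{gather*} R = \begin{bmatrix} 0 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & \frac{1}{2} \end{bmatrix} + \frac{1}{2} I + \begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix} + \begin{bmatrix} -\frac{1}{2} & 0 & 0 \\ 0 & -\frac{1}{2} & 0 \\ 0 & 0 & 0 \end{bmatrix} = \begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 1 \end{bmatrix} \\ tr(R) = 0 + 0 + 1 = 1 = 4 \cdot \frac{1}{2} - 1 = 4s^2 - 1 \end{gather*}\]
The resulting $R$ is the rotation by 90 degrees about the z axis.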
Since we know that:
\[\begin{gather*} \theta = \arccos\left(\frac{tr(R) - 1}{2}\right) = \arccos(2s^2 - 1) \end{gather*}\]
We always require that $\theta \in [0, \pi]$. For cases where $\theta$ falls outside $[0, \pi]$, we change the sign of q, since q and -q represent the same rotation. So we can find the rotation angle $\theta$:
\[\begin{gather*} \cos\theta = 2s^2 - 1 = 2 \cos^2 \frac{\theta}{2} - 1 \end{gather*}\]
So:
\[\begin{gather*} s = \cos \frac{\theta}{2} \\ \Rightarrow \\ \theta = 2\arccos(s) \end{gather*}\]
4. From Quaternion To Rotation Axis
The imaginary part v of the rotation quaternion q lies on the rotation axis. Here is the proof: since $[0, v]$ and $q = [s, v]$ have collinear vector parts, they commute, so
\[\begin{gather*} [0, Rv] = q [0, v] q^{-1} = [0, v] q q^{-1} = [0, v] \end{gather*}\]
Only a point on the rotation axis itself will not move by rotation. So the rotation axis (unit vector) is simply:
\[\begin{gather*} [n_x, n_y, n_z] = \frac{v}{|v|} = \frac{v}{\sqrt{1 - s^2}} \\ \Rightarrow \\ [n_x, n_y, n_z] = \frac{v}{\sin(\frac{\theta}{2})} \end{gather*}\]
Most libraries use quaternions for rotations because they require only 4 numbers. However, as users, we don’t need to worry too much about the lower-level implementation details.
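The two results above, $\theta = 2\arccos(s)$ and $n = v / \sin(\frac{\theta}{2})$, can be combined into one small conversion routine. This is a sketch under stated assumptions: a unit quaternion stored as `(s, x, y, z)`, a hypothetical function name, and an arbitrary choice of axis when the rotation angle is zero (the axis is undefined in that case).

```python
import math

def quat_to_axis_angle(q):
    """Return (theta, axis) for a unit quaternion q = (s, x, y, z)."""
    s, x, y, z = q
    if s < 0.0:
        # q and -q encode the same rotation; flipping keeps theta in [0, pi].
        s, x, y, z = -s, -x, -y, -z
    s = min(1.0, s)  # guard against rounding slightly above 1
    theta = 2.0 * math.acos(s)
    sin_half = math.sqrt(1.0 - s * s)  # equals sin(theta / 2) = |v|
    if sin_half < 1e-12:
        # No rotation: the axis is undefined, so return an arbitrary one.
        return 0.0, (1.0, 0.0, 0.0)
    return theta, (x / sin_half, y / sin_half, z / sin_half)

# Unit quaternion for a 60 degree rotation about the y axis.
q = (math.cos(math.pi / 6.0), 0.0, math.sin(math.pi / 6.0), 0.0)
theta, axis = quat_to_axis_angle(q)
# theta is approximately pi / 3 and axis is approximately (0, 1, 0).

# The negated quaternion yields the same angle and axis.
theta_neg, axis_neg = quat_to_axis_angle(tuple(-c for c in q))
```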
Rotation Matrix To Rotation Quaternion
1. Define “Quaternion Angular Velocity” $q^{*}q' = \bar{w}$
Since we have $q^{*}q = 1$, differentiating both sides with respect to time gives:
\[\begin{gather*} q^{*}q' + q'^{*}q = 0 \\ \Rightarrow \\ q^{*}q' = -q'^{*}q = -(q^{*}q')^{*} \end{gather*}\]
So we can see that $q^{*}q'$ must be a pure imaginary quaternion: $q^{*}q' = \bar{w} = [0, w_1, w_2, w_3]$
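The reason is that a quaternion equal to the negative of its own conjugate cannot have a real part. Writing $q^{*}q' = [a, b]$ with real part $a$ and vector part $b$ (symbols introduced here only for illustration):
\[\begin{gather*} [a, b] = -[a, b]^{*} = -[a, -b] = [-a, b] \\ \Rightarrow \\ a = -a \Rightarrow a = 0 \end{gather*}\]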
2. Purely Imaginary Quaternion’s Exponential $Exp(\bar{w})$ is A Unit Quaternion
With pure imaginary $\bar{w} = [0, w]$, the derivative of the rotation quaternion is
\[\begin{gather*} q^{*}q' = \bar{w} \Rightarrow (qq^{*})q' = q \bar{w} \Rightarrow q' = q \bar{w} \end{gather*}\]
So, for a constant $\bar{w}$, we can write the solution for q as:
\[\begin{gather*} q(t) = q(t_0) \exp(\bar{w} \Delta t) \end{gather*}\]
Now the question is: what does this exponential look like? The quaternion exponential can be defined with a Taylor series as well:
\[\begin{gather*} \exp(\bar{w}) = \sum_{k=0}^{\infty} \frac{1}{k!} (\bar{w})^k \end{gather*}\]
Similar to the derivation of the Rodrigues formula, pure imaginary quaternions have a useful property:
\[\begin{gather*} \bar{w} = \theta u \\ u^2 = -1 \\ u^3 = -u \end{gather*}\]
where $u$ is a unit pure imaginary quaternion and $\theta = |w|$ is the magnitude of $w$ (step 3 relates it to the rotation angle). This leads to the extension of Euler’s Formula to quaternions.
Refresher: in the complex plane, Euler’s Formula is:
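\[\begin{gather*} \exp(i \theta) = \cos \theta + i \sin \theta \end{gather*}\]
The exponential of a pure imaginary quaternion, $\exp(\bar{w})$, is a unit quaternion, by the definition of u:
\[\begin{gather*} \exp(\bar{w}) = \exp(\theta u) = \cos \theta + u \sin \theta = [\cos \theta, u \sin \theta] \\ |\exp(\bar{w})|^2 = \cos^2 \theta + \sin^2 \theta = 1 \end{gather*}\]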
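As a numerical sanity check of this quaternion version of Euler’s formula, the sketch below sums the Taylor series of $\exp(\bar{w})$ term by term and compares it with the closed form $[\cos \theta, u \sin \theta]$, where $\theta = |w|$ and $u = w / \theta$. The function names, the sample value of `w`, and the number of series terms are assumptions made for illustration.

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions given as (s, x, y, z) tuples."""
    sa, xa, ya, za = a
    sb, xb, yb, zb = b
    return (
        sa * sb - xa * xb - ya * yb - za * zb,
        sa * xb + xa * sb + ya * zb - za * yb,
        sa * yb - xa * zb + ya * sb + za * xb,
        sa * zb + xa * yb - ya * xb + za * sb,
    )

def quat_exp_series(w, terms=30):
    """Approximate exp([0, w]) by summing the first `terms` Taylor terms."""
    w_bar = (0.0, w[0], w[1], w[2])
    term = (1.0, 0.0, 0.0, 0.0)  # w_bar^0 / 0!
    total = term
    for k in range(1, terms):
        # Turn w_bar^(k-1) / (k-1)! into w_bar^k / k!.
        term = tuple(c / k for c in quat_mul(term, w_bar))
        total = tuple(a + b for a, b in zip(total, term))
    return total

def quat_exp_closed(w):
    """Closed form exp([0, w]) = [cos(theta), u sin(theta)], theta = |w|."""
    theta = math.sqrt(sum(c * c for c in w))
    if theta < 1e-12:
        return (1.0, 0.0, 0.0, 0.0)
    scale = math.sin(theta) / theta  # u sin(theta) = w sin(theta) / theta
    return (math.cos(theta), scale * w[0], scale * w[1], scale * w[2])

w = (0.3, -0.2, 0.5)
series = quat_exp_series(w)
closed = quat_exp_closed(w)
# Both results agree to rounding error, and both have unit length.
```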
3. Rotation Quaternion Is Half of Rotation Vector: $\bar{w} = [0, \frac{\phi}{2}]$
We have defined R with a Cartesian angular velocity $\phi$.
To go from rotation matrix to rotation quaternion $\exp(\bar{w}) = [s, v]$, we know
\[\begin{gather*} \theta = 2\arccos(s) \\ [n_x, n_y, n_z] = \frac{v}{\sin \frac{\theta}{2}} \end{gather*}\]
So the rotation quaternion is:
\[\begin{gather*} s = \cos \frac{\theta}{2} \\ v = n \sin \frac{\theta}{2} \\ \Rightarrow \\ \exp(\bar{w}) = [\cos \frac{\theta}{2}, n \sin \frac{\theta}{2}] \end{gather*}\]
Since we define $q^{*}q' = \bar{w} = [0, \theta_w u]$ with $\exp(\bar{w}) = [\cos \theta_w, u \sin \theta_w]$, comparing the two forms shows that the “quaternion rotation velocity” $\bar{w}$ is half of that in so(3):
\[\begin{gather*} \theta_w = \frac{\theta}{2}, \quad u = n \\ \bar{w} = [0, \frac{\theta}{2} n] = [0, \frac{\phi}{2}], \quad \phi = \theta n \end{gather*}\]
So in $q' = q \bar{w}$:
\[\begin{gather*} q' = q [0, \frac{\phi}{2}] \end{gather*}\]
4. Quaternion Rotation Update
4.1 Quaternion Rotation Update Can Be Approximated As $q(t) \approx q(t_0)[1, \frac{\phi^i}{2}]$
We know that $q(t) = q(t_0) \exp(\bar{w} \Delta t)$. How do we approximate $\exp(\bar{w} \Delta t)$? If we slightly change the definition above from Cartesian angular velocity to angular increment, $\theta^i = \theta \Delta t$, $\phi^i = \phi \Delta t$, we get
\[\begin{gather*} \exp(\bar{w} \Delta t) = [\cos \theta_w^i, u \sin \theta_w^i] = [\cos(\frac{\theta^i}{2}), u \sin (\frac{\theta^i}{2})] \end{gather*}\]
When $\theta^i \rightarrow 0$, we have
\[\begin{gather*} \exp(\bar{w} \Delta t) \approx [1, u \frac{\theta^i}{2}] = [1, \frac{1}{2} \phi^i] \\ \Rightarrow \\ q(t) \approx q(t_0)\exp(\bar{w} \Delta t) \approx q(t_0)[1, \frac{\phi^i}{2}] \end{gather*}\]
This shows that when we measure the Cartesian angular increment $\phi^i$ over $\Delta t$, the rotation quaternion update is approximately $q(t_0)[1, \frac{\phi^i}{2}]$.
However, one can see that $[1, \frac{\phi^i}{2}]$ is not a unit quaternion, so after some updates we need to normalize q, that is, divide q by its length $|q|$ so that it has unit length again.
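The approximate update followed by normalization can be sketched as below. This is an illustration under stated assumptions: `(s, x, y, z)` tuples, hypothetical function names, a made-up increment of 0.01 rad about the z axis per step, and normalization after every step (the text only says it is needed "after some updates").

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions given as (s, x, y, z) tuples."""
    sa, xa, ya, za = a
    sb, xb, yb, zb = b
    return (
        sa * sb - xa * xb - ya * yb - za * zb,
        sa * xb + xa * sb + ya * zb - za * yb,
        sa * yb - xa * zb + ya * sb + za * xb,
        sa * zb + xa * yb - ya * xb + za * sb,
    )

def normalize(q):
    """Divide q by its length so that it is a unit quaternion again."""
    n = math.sqrt(sum(c * c for c in q))
    return tuple(c / n for c in q)

def update_first_order(q, phi_inc):
    """Approximate update q <- normalize(q [1, phi_inc / 2])."""
    dq = (1.0, 0.5 * phi_inc[0], 0.5 * phi_inc[1], 0.5 * phi_inc[2])
    return normalize(quat_mul(q, dq))

q = (1.0, 0.0, 0.0, 0.0)     # start at the identity rotation
phi_inc = (0.0, 0.0, 0.01)   # hypothetical increment: 0.01 rad about z

for _ in range(100):
    q = update_first_order(q, phi_inc)

total_angle = 2.0 * math.acos(q[0])
# q still has unit length, and total_angle is close to 100 * 0.01 = 1 rad.
```

Without the `normalize` call, the length of `q` would grow slightly with every step, because $|[1, \frac{\phi^i}{2}]| > 1$ for any non-zero increment.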
4.2 “Accurate” Quaternion Update Without Approximation and Normalization
To avoid normalization, we can use
\[\begin{gather*} \exp(\bar{w} \Delta t) = [\cos(\frac{\theta^i}{2}), u \sin (\frac{\theta^i}{2})] \\ q(t) = q(t_0)\exp(\bar{w} \Delta t) = q(t_0)[\cos(\frac{\theta^i}{2}), u \sin (\frac{\theta^i}{2})] \end{gather*}\]
5. Update SO(3) or Rotation Quaternion Using The Same Update Function
Since we have established the relationship between quaternion updates and the Cartesian angular increment, we can update SO(3) or the rotation quaternion using the same update function in filters and graph optimization.
\[\begin{gather*} q(t) \approx q(t_0)\exp(\bar{w} \Delta t) \approx q(t_0)[1, \frac{\phi^i}{2}] \\ \text{or} \\ q(t) = q(t_0)\exp(\bar{w} \Delta t) = q(t_0)[\cos(\frac{\theta^i}{2}), u \sin (\frac{\theta^i}{2})] \end{gather*}\]
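Both variants can sit behind one update function that takes the Cartesian angular increment $\phi^i$. The sketch below is one possible arrangement, not a reference implementation: the names `delta_quat` and `update`, the `exact` flag, and the small-angle threshold are assumptions made for illustration.

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions given as (s, x, y, z) tuples."""
    sa, xa, ya, za = a
    sb, xb, yb, zb = b
    return (
        sa * sb - xa * xb - ya * yb - za * zb,
        sa * xb + xa * sb + ya * zb - za * yb,
        sa * yb - xa * zb + ya * sb + za * xb,
        sa * zb + xa * yb - ya * xb + za * sb,
    )

def delta_quat(phi_inc, exact=True):
    """Quaternion increment for a Cartesian angular increment phi_inc."""
    theta = math.sqrt(sum(c * c for c in phi_inc))
    if not exact or theta < 1e-12:
        # First-order form [1, phi_inc / 2]; not exactly unit length.
        return (1.0, 0.5 * phi_inc[0], 0.5 * phi_inc[1], 0.5 * phi_inc[2])
    # Exact form [cos(theta / 2), u sin(theta / 2)] with u = phi_inc / theta.
    scale = math.sin(0.5 * theta) / theta
    return (math.cos(0.5 * theta),
            scale * phi_inc[0], scale * phi_inc[1], scale * phi_inc[2])

def update(q, phi_inc, exact=True):
    """Right-multiply q by the increment: q(t) = q(t0) * delta_quat."""
    return quat_mul(q, delta_quat(phi_inc, exact))

q0 = (1.0, 0.0, 0.0, 0.0)

# Large increment: 90 degrees about z, using the exact form.
q_exact = update(q0, (0.0, 0.0, math.pi / 2.0))
# q_exact is approximately (cos(pi/4), 0, 0, sin(pi/4)) with unit length.

# Small increment: the two forms agree closely.
small = (0.001, 0.0, 0.0)
q_small_exact = update(q0, small, exact=True)
q_small_approx = update(q0, small, exact=False)
```

A caller that uses the approximate form would still need to normalize the result from time to time, as discussed in 4.1.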