Adversarial Tracking: Attacking 3D Multi-Object Tracking in Real Time

Han Wu, Dr. Johan Wahlström and Dr. Sareh Rowlands

Source Code

Monocular 3D Object Tracking

Step 0: The Ground Truth (Training Set)

Step 1: 2D Object Detection (FRCNN)

Step 2: 3D Object Detection (Orientation)

Step 3: Motion Tracking (LSTM + GPS / IMU)

We do not use Lidar Data (No depth information)
Not easy to label 3D bounding boxes by human. (No training set)

Step 0: How can we generate 3D Bounding Boxes?

(Ground Truth / Training set)

Method 1: Detouring - Extract rendering resources from Commercial Games (Direct3D 11)

FSV 2018 [URL]

GTA5 2016 [URL]

To intercept the communication between the game and the graphics hardware.

Method 2: Develop a Simlator using Game Engine (Unity3D)

Virtual KITTI 2016 (Unity3D) [URL]

SYNTHIA 2016 (Unity3D) [PDF]

Method 3: Develop a Simlator using Game Engine (Unreal)

4 DoF - (x, y) (w, h)

7 DoF - (x, y, z) (w, h, l) and local yaw

CARLA API Tutorial [Blog Post]

Step 0: From 3D global coordinates to 2D image coordinates

(Ground Truth / Training set)

Camera Projective Geometry

Projection from world coordinates (x, y, z) to image coordinates (w, h):

$O_{image} = K[R|t] O_{world}\ \rightarrow\ O_{image}^{'} = KM^{'}[R|t] O_{world}$

where $M^{'} = \begin{bmatrix} cos(\pi) & -sin(\pi) & 0\\ sin(\pi) & cos(\pi) & 0\\ 0 & 0 & 1\\ \end{bmatrix} = \begin{bmatrix} -1 & 0 & 0\\ 0 & -1 & 0\\ 0 & 0 & 1\\ \end{bmatrix}$ for vehicles partially behind the camera (truncation).

Intrinsic Matrix: $K = \begin{bmatrix} f & 0 & \frac{w}{2}\\ 0 & f & \frac{h}{2} \\ 0 & 0 & 1 \end{bmatrix} $ where $f = \frac{w}{2.0 * tan(fov\ *\frac{\pi}{360})}$

Extrinsic Matrix: $[R | t] = \begin{bmatrix} cos(k) & -sin(k) & 0 & x\\ sin(k) & cos(k) & 0 & y\\ 0 & 0 & 1 & z \\ 0 & 0 & 0 & 1 \\ \end{bmatrix}^{-1}$

$\begin{aligned}O_{image}^{'} &= KM^{'}[R|t] O_{world}\\ &= K\begin{bmatrix} -1 & 0 & 0\\ 0 & -1 & 0\\ 0 & 0 & 1\\ \end{bmatrix}\ [R|t] = \begin{bmatrix} f & 0 & \frac{w}{2}\\ 0 & f & \frac{h}{2} \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} -1 & 0 & 0\\ 0 & -1 & 0\\ 0 & 0 & 1\\ \end{bmatrix} [R|t] = \begin{bmatrix} -f & 0 & \frac{w}{2}\\ 0 & -f & \frac{h}{2} \\ 0 & 0 & 1 \end{bmatrix} [R|t] = K^{'}\ [R|t]\end{aligned}$