Skip to content
Language
GLOSSARY

Motion Tracking

The process of analyzing pixel movement across frames to attach graphics, stabilize footage, or extract 3D camera data — and the reason half your VFX shots are late when the track fails.

What motion tracking actually does

Motion tracking analyzes the movement of pixels, patterns, or features across sequential frames to derive position, rotation, scale, and sometimes perspective data. There are three fundamentally different approaches, and using the wrong one is the most common tracking mistake. Point tracking (used in DaVinci Resolve's Tracker node, After Effects' built-in tracker) follows a small rectangular region — a user-defined search area around a feature with high contrast. It works by comparing the pixel pattern in the search region frame-to-frame, finding the best match, and recording the offset. Point tracking is fast and sufficient for simple stabilization or attaching a logo to a flat surface. It fails when the tracked point rotates significantly (the pattern changes shape beyond the tracker's ability to match), gets occluded (something passes in front of it), or leaves the frame. Planar tracking (Mocha Pro, Resolve's Planar Tracker, After Effects with Mocha AE) doesn't track a point — it tracks a surface. You define a region (a plane), and the algorithm uses the entire pattern within that region to calculate affine or perspective transforms. This is dramatically more robust than point tracking for surfaces that rotate, tilt, or change perspective. Want to replace a phone screen that rotates 45 degrees and partially goes off-screen? Planar track. The Mocha Pro plugin for After Effects and Nuke is the industry standard — its surface tracking handles reflections, partial occlusion, and out-of-frame surfaces better than any alternative. 3D camera tracking (After Effects' 3D Camera Tracker, Nuke's CameraTracker, PFTrack, SynthEyes) is a completely different beast. Instead of tracking a specific point or surface, it analyzes the parallax of features across the entire frame to reverse-engineer the 3D camera position, orientation, and lens properties for every frame. This gives you a virtual camera in 3D space that matches the real camera's movement — you can place 3D objects, text, or effects into the scene with correct perspective and depth.

When tracking breaks and how to fix it

Tracking fails for predictable reasons, and most have workarounds. Motion blur is enemy number one — fast-moving objects create streaked pixels that the tracker can't match between frames. Solution: increase the tracker's search region to be larger than the blur trail, or manually keyframe the track points during blurry sections and let the tracker fill in the rest. Low contrast is the second failure mode. Trackers work by matching pixel patterns — if there's no pattern (a uniform wall, a blurry sky, a dark shadow), there's nothing to match. The fix is to find a nearby feature with contrast and offset the attachment point. Occlusion is the third — when something passes in front of your tracked point. Mocha Pro handles this with its occlusion layer system (you tell Mocha what's in front, and it ignores those pixels), but point trackers simply lose the track. The fix is to split the track into segments and use different track points for each segment, blending the results. Rolling shutter is a sneaky failure mode on CMOS cameras — fast pans create wobble and skew that point trackers interpret as false motion. Planar trackers handle this better because they model the distortion as a perspective change, but it still introduces drift over long tracks. The honest workflow for difficult tracks: start with the automated tracker, evaluate the result frame-by-frame, manually correct drift at keyframes, and accept that about 30% of tracks need manual cleanup on complex shots. No tracking algorithm — including the AI-powered ones in the latest Resolve and After Effects versions — is 100% reliable on footage with heavy motion blur, low contrast, and frequent occlusions. That's why experienced compositors still know how to hand-track.

Motion Tracking FAQ

Point track, planar track, or 3D track — which do I use?
Point track for simple position data (stabilization, attaching a 2D element to a moving object). Planar track for surface replacement (screens, signs, photos in frames). 3D camera track for inserting 3D objects or text into a scene with correct perspective. When in doubt, start with planar — it handles 80% of professional tracking tasks.
Is Mocha Pro worth the cost?
For any studio doing regular screen replacements or complex 2D tracking, yes. Mocha's planar tracker is significantly more robust than built-in alternatives, its Roto and Remove modules add value, and the Insert module handles perspective-correct screen replacement in one step. If you only do occasional tracking, the free Mocha AE (bundled with After Effects) covers most needs.
Why does my tracked element drift over time?
Cumulative subpixel error. Each frame's track position has a tiny margin of error (0.1–0.5 pixels). Over 500 frames, these accumulate into visible drift. Fix by keyframing correction points every 50–100 frames, or use a planar tracker which has built-in drift correction through its surface model.

Need Professional Help With This?

Complex tracking is where experience makes the difference. We've tracked shots that automated tools couldn't handle — from handheld footage with heavy motion blur to aerial plates with rolling shutter.

Get a Free Consultation