Buiding a 3D perception stack - Part 0
When we talk about an autonomous vehicle or any robot navigating its environment, perception fundamentally is about understanding the 3D space around it. For a self-driving car, this boils down to identifying what areas are drivable or navigable, and which are not. We need to build an instantaneous 3D understanding of the world relative to our own vehicle – the “ego” vehicle. Automotive computer vision stacks typically use machine learning (ML) based techniques to identify objects of interest, understand behavioral intent, and track dynamic objects in the scene.