Vignesh Ravikumar | Multi-Task Learning HydraNets for Autonomous Driving

Objective

This project refers to the paper Real-Time Joint Semantic Segmentation and Depth Estimation Using Asymmetric Annotations to implement a model that can perform multiple tasks such as semantic segmentation and depth estimation at once.

Architecture

Multi-task learning (MTL) is a technique in deep learning where a single neural network is trained to perform multiple tasks simultaneously. This approach can improve the performance of the network on all tasks, as the network learns to share information and representations among the tasks.

One common way to implement MTL is to use a shared base network, with task-specific layers attached on top. The shared base network learns a general representation of the input data, while the task-specific layers learn the task-specific features. This allows the network to reuse the learned features across different tasks, reducing the amount of data and computation required for each task.

Another way to implement MTL is to use a network with multiple heads, where each head is responsible for a different task. The multiple heads can be trained together, allowing the network to learn shared representations across the tasks. This approach can be useful when the tasks are highly related and the data for each task is limited.

Code

Click here to view the code on Github