Instance segmentors combine the advantages of semantic segmentors and object detectors. On the one hand, they are able to predict pixel-perfect masks like semantic segmentors achieving greater performance than object detectors. On the other hand, they are also able to distinguish between objects and counting them.
The performance is usually evaluated using mAP.
It is possible to combine semantic segmentation and instance segmentation in one image, then we talk about panoptic segmentation. The idea would be to label the road ahead as a semantic class and the cars as instances. This is not often used in practice, though (yet).
The wiki entry for Mask R-CNN, one of the most common architectures
The wiki entry for mAP, the typical metric to measure the performance