Foundation models break free from predefined and restricted ontologies, enabling them to perceive a wider range of elements in their surroundings. By pretraining these foundation models in an unsupervised way, we can adapt them to a diverse set of tasks.