Explore the opportunities of instruction-tuning for vision-language models. Research how such models can learn about affordances and physical properties of objects. Investigate how instruction-tuning can be leveraged to learn such affordances for the specific robot with its given, limited capabilities. Search for possibilities to combine affordance recognition with the physical properties.