Exploring Spatial Reasoning Abilities of LLMs.
This is my bachelor's thesis project. I answered following questions:
- Can LLMs understand 2D and 3D worlds?
- How can we measure the performance?
- How do we curate a dataset for this problem?
- What kind of prompting is optimal?
- What about fine-tunining a small language model?