AI · Bullish · arXiv – CS AI · 8h ago · 6/10
Scalable Object Relation Encoding for Better 3D Spatial Reasoning in Large Language Models
Researchers introduce QuatRoPE, a novel positional embedding method that improves 3D spatial reasoning in Large Language Models by encoding object relations more efficiently. The method maintains linear scalability with the number of objects and preserves LLMs' original capabilities through the Isolated Gated RoPE Extension.