y0news
#architecture-agnostic
1 article
AI · Bullish · arXiv – CS AI · 6h ago

NNiT: Width-Agnostic Neural Network Generation with Structurally Aligned Weight Spaces

Researchers have introduced Neural Network Diffusion Transformers (NNiTs), an approach that generates neural network parameters in a width-agnostic manner by treating weight matrices as sequences of tokenized patches. The method achieves a success rate of over 85% on unseen network architectures in robotics tasks, addressing key challenges in generative modeling of neural networks.
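
To make the "weight matrices as tokenized patches" idea concrete, here is a minimal Python sketch of one way a matrix of any width could be cut into fixed-size tokens. It is an illustration only, not the paper's implementation: the `patchify` helper, the patch size of 2, the row-major ordering, and the zero-padding at the edges are all assumptions made for this example.

```python
from typing import List

Matrix = List[List[float]]


def patchify(weights: Matrix, patch: int) -> List[List[float]]:
    """Split a 2-D weight matrix into flattened patch x patch tokens.

    Hypothetical helper: edges are zero-padded (an assumption) so that
    every token has the same length regardless of the matrix width.
    """
    rows = len(weights)
    cols = len(weights[0]) if rows else 0
    # Round each dimension up to the next multiple of the patch size.
    padded_rows = -(-rows // patch) * patch
    padded_cols = -(-cols // patch) * patch

    tokens: List[List[float]] = []
    for r0 in range(0, padded_rows, patch):
        for c0 in range(0, padded_cols, patch):
            token: List[float] = []
            for r in range(r0, r0 + patch):
                for c in range(c0, c0 + patch):
                    inside = r < rows and c < cols
                    token.append(weights[r][c] if inside else 0.0)
            tokens.append(token)
    return tokens


# Two layers with the same height but different widths.
narrow = [[1.0] * 4 for _ in range(4)]
wide = [[1.0] * 8 for _ in range(4)]

narrow_tokens = patchify(narrow, 2)
wide_tokens = patchify(wide, 2)
```

In this sketch a wider layer yields more tokens rather than longer ones, which is the property that would let a sequence model such as a transformer handle different widths with a single token format.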