Why RNNs Can't Match Transformers' Efficiency: The Exponential Parameter Gap Revealed
New research proves that while RNNs can theoretically replicate Transformer capabilities, they require exponentially more parameters—a discovery that explains why we've been chasing the wrong metrics for a decade.