PanGu-π is a new large language model architecture that enhances nonlinearity and addresses the 'feature collapse' problem without significantly increasing complexity. It matches the performance of top language models with a 10% faster inference.

3m read timeFrom marktechpost.com
Post cover image

Sort: