2月12日,面壁智能正式發布稀疏-線性注意力混合架構SALA,以及基于該架構的文本模型MiniCPM-SALA,模型僅有9B參數。
特別聲明:以上內容(如有圖片或視頻亦包括在內)為自媒體平臺“網易號”用戶上傳并發布,本平臺僅提供信息存儲服務。
Notice: The content above (including the pictures and videos if any) is uploaded and posted by a user of NetEase Hao, which is a social media platform and only provides information storage services.