Collection
Subscribe
DeepSeek-V4 preview: two MoE models with 1M context and new attention mechanism
#llm
#deepseek
#mixture-of-experts
Last updated May 22
•
14 sources
2 Upvotes
Comment
Bookmark
Copy
Sort:
Oldest first
Share your thoughts
Post
2