Hacker News | qtwhat's submissions
1. DeepSeek-V3: Achieving Efficient LLM Scaling with 2,048 GPUs (arxiv.org)
7 points by qtwhat 10 months ago | 1 comment
