Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Sophia: Stochastic Second-Order Optimizer for Language Model Pre-Training (arxiv.org)
7 points by alechammond on May 27, 2023 | hide | past | favorite


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: