☆ Yσɠƚԋσʂ ☆ to TechnologyEnglish · 1 year agoDeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learningarxiv.orgexternal-linkmessage-square0linkfedilinkarrow-up17arrow-down10cross-posted to: technology@hexbear.netmachinelearning@lemmy.ml
arrow-up17arrow-down1external-linkDeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learningarxiv.org☆ Yσɠƚԋσʂ ☆ to TechnologyEnglish · 1 year agomessage-square0linkfedilinkcross-posted to: technology@hexbear.netmachinelearning@lemmy.ml