Lemmygrad
☆ Yσɠƚԋσʂ ☆ to Technology · English · 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

arxiv.org

  • cross-posted to:
  • technology@hexbear.net
  • machinelearning@lemmy.ml
