Comment on Should have seen it coming
theunknownmuncher@lemmy.world 2 days ago
arxiv.org/abs/2405.20304 they invented their own reinforcement learning framework called Group Relative Policy Optimization
Sanctus@lemmy.world 1 day ago
Yeah, the original comment in this chain describes US telcos and shit more than this particular instance.