Keynotes from the 2024 Reinforcement Learning Conference (www.youtube.com) · howrar@lemmy.ca · English · 2 months ago
OpenAI: Learning to Reason with LLMs (openai.com) · howrar@lemmy.ca · English · 2 months ago (edited)
Introducing SIMA, a Scalable Instructable Multiworld Agent (deepmind.google) · howrar@lemmy.ca · English · 8 months ago