They were doing this on youtube years ago. I remember watching this during quarentine. I’d watch for 2-3 minutes, and mario would die in the same place every time.
But it would try something new each time. You’d watch it run into the same goomba each time for 2-3 minutes, but with very slight variation. Then you’d see a few days later it got to halfway through the stage.
Then by the end of the month, it was to the 3rd world. I think it took 6 months to beat the game. Then they’d save that file, start a NEW file, do it again, and then they’d combine the two files. Supposedly each generation of combining AIs would find the most efficient way to win. Meaning each generation is smarter than the last.
They were doing this on youtube years ago. I remember watching this during quarentine. I’d watch for 2-3 minutes, and mario would die in the same place every time.
But it would try something new each time. You’d watch it run into the same goomba each time for 2-3 minutes, but with very slight variation. Then you’d see a few days later it got to halfway through the stage.
Then by the end of the month, it was to the 3rd world. I think it took 6 months to beat the game. Then they’d save that file, start a NEW file, do it again, and then they’d combine the two files. Supposedly each generation of combining AIs would find the most efficient way to win. Meaning each generation is smarter than the last.
I believe that was a genetic algorithm. Something that would be a lot more successful than an LLM in this context.
yes. MarI/O by seth bling
Probably reinforcement learning? LLMs are a bad architecture for something like real time video games
Yes Timmy, we spent months finding the best Marios and then made them have offspring. The wonders of AI!