@ jb55
2025-02-11 05:19:03

If DeepSeek used reinforcement learning over chain-of-thought reasoning to train R1, and AlphaGo used reinforcement learning to find superhuman strategies in Go, maybe scaling up reinforcement learning on chain-of-thought reasoning will get us closer to superhuman reasoning and, dare I say, AGI? Feels like we’re at the beginning of something huge.
