-
@ someone
2025-02-26 18:08:04
Thanks! RL over nostr will be fun! I thought about using reactions when determining the pretraining dataset, but right now I don't use them. For RL they can be useful: reactions to answers can be another signal. We could make the work more open once more people are involved and more objective work happens.