I don't know the title of this post.
Suddenly I have time. I need to structure my life again. I was clever enough to book an intensive German course before suspending my work, so my life has some structure. That's a temporary solution. The course ends this week.
I dig around. When something makes more sense, I dig deeper. I also don't want to miss other possibilities. So I'm facing some exploration vs exploitation problem. I heard there are bunch of algorithms like UCB1.
I can make a special software for managing my life based on UCB1. There is going to be a list of activities, each taking 40 minutes. After each try, I say it was good or bad. The software chooses the next activity, trying to hit the balance between having good time (exploitation) and trying newer activities (exploration).
I'm not sure about one thing. Is UCB1 a good algorithm in my use case? I'd be adding more and more activities while I'm figuring things out. I'd be reading new codebase, so I'd be adding another entry in the list. If I add more and more activities on the go, I might be spending all the time on the newer additions. That's not good. To be honest, I tried organizing my life according to UCB1 a while ago, and I had precisely this problem.
In that case, perhaps “randomized probability matching” might feel fairer. Assuming each activity is a Bernoulli distribution, how can I learn the parameters? It looks like it's doable, assuming that the parameter follows Beta distribution.
In the next post, I'll find a convenient implementation of this parameter estimation. I'd be surprized if there is none.
The title of this post is “life organization.”