Summary: Advice on Reinforcement Learning Experimentation
During the BeNeRL seminar talks researchers also briefly share their approach to RL experimentation. The below list keeps track of their main advice.
(BE = Benjamin Eysenbach)
Managing experiments
Maintain a lab/experiment journal (BE)
Number every experiment you perform
Rule of thumb: a paper requires in total ~200-250 experiments (unfruitful research directions will stop earlier)
Before each experiment, write down your hypotheses (BE)
Determine what you want to get out of the next experiment
Think beyond "does my method outperform the baseline?" --> there are many more (interesting) questions
Add reminders to yourself (BE)
In your journal, add notes to yourself: when to check back to a certain experiment (with a date), what to look for, what to do next, etc.Â
Interpreting experiments
Log as much data as possible (BE)
Don't only log learning curves. The more information you log, the more you can analyse
Dig deep into your results (BE)
Analysis > Coding. Spend a significant part of your time on analyzing the output of an experiment
Be curious, try to learn what is going on. Don't only look at learning curves
Visualize (BE)
Try to visualize your results as much as possible