next up previous
Next: Regularization Up: Results and Discussion Previous: Results and Discussion

Noise

To find default noise settings, we ran 50 random seeds for all combinations of nine annealing schedules and seven noise values. Average performance over the runs was typically 5% better than without noise, while the best NLL score over the 50 runs was 12% better than without noise. Given that the typical mode of running SAM is to generate many models and pick the best, this 12% value is quite an improvement. Our chosen default is 5 sequences worth of noise using an exponential annealing schedule with factor 0.8. This is a somewhat arbitrary choice based on the range of scores obtained -- no clear winner among the settings emerged. The tested setpoints added between 20% and 350% more reestimation cycles over the noiseless case. If less time is available, we suggest a linear schedule with 1 noise sequence. In general, as many models should be created as possible and then the best one further refined. This procedure is automated in SAM.


   figure3137

Figure 5: Test NLL scores (average over 117 test sequences) from running SAM 1000 times on 50 other globin sequences with (a) default noise, (b) random starting model lengths and (c) all heuristics including surgery. The solid vertical line at 334 is the average test sequence score without any random heuristics (in this case, surgery has no effect on the non-random training routine).

The histograms in Figure 5 show average test set NLL scores for 1000 training runs on 50 training globins with just default noise, random model lengths without noise, and all heuristics (noise, random model lengths, and surgery). The vertical bar at 334 indicates the NLL score for training without noise. Note in particular how the combination of noise and surgery both improves the test set scores and sharpens their distribution, indicating that far fewer than 1000 runs are needed to generate good models.


next up previous
Next: Regularization Up: Results and Discussion Previous: Results and Discussion

Rey Rivera
Thu Aug 29 15:28:54 PDT 1996