
For the most part there is a fairly linear distribution between 1,250 and 2,250 generations. Below 1,250, the frequency of occurance drops off slightly, never seeming to get below 900 (941 was the lowest of the 500 samples). Above 2,250 there is a sort of long tail where the algorithm took two or three times the average number of generations.
The first of these interval graphs shows the intervals for samples 1 through 483. I chose that range because the greatest interval in that range is 31. Even in this range, you can see that the intervals greater than 10 occur primarily at the two ends.
Now let's look at the intervals in the tail, above sample 484, where you see the intervals range up to several hundred, far outside the previous range. In fact you can see that the very last sample required more than 500 additional generations to complete than its nearest sample. In fact within the tail, more than half (nine of seventeen) of the intervals are greater than any single interval in the other range.
This exposes a potential weakness of naive random genetic algorithms: sometimes the randomness just doesn't work out for you, requiring many additional generations. The worst sample in my set took took 2.5 times longer than the average, and nearly five times as long as the best sample. If I ran 50,000 instances instead of 500, I would probably get more pronounced tails on each end.
No comments:
Post a Comment