If there are any problems with the queue which affect the performance
of any entries, after correcting the problem we will automatically
rerun the affected contest entries to produce valid scores.
We will release the test suite at the end of the contest, at which
time you can check the entries that failed to see what happened.
However, we don't plan to release any other information about the test
cases during the contest.
Your score is the average of the scores for each individual test point. The time and performance statistic displayed on the contest rankings are the average time and average blank space ratings over all the test points. They are listed to give you an idea of how your entry performed, but they are not the numbers used to compute your score. It may be that the highest ranking entry does not have the lowest average time or lowest average performance rating since in general:
score = sum(f(score,time)) != f(sum(score),sum(time)).
From previous contests we've seen a recurring pattern - a particular algorithm is submitted and then "optimized", and then suddenly a contestant submits a better algorithm to take the top spot. It's then optimized, and the cycle continues. The top entries typically have been repeatedly modified by several contestants.
Though tweaking of entries that you see can be frustrating, it's part of the evolution of the entries. The tweaking portion doesn't represent the end of the contest but is just an intermediate phase; the brainstorming and tweaking never really ends!
At the end of the contest we'll post an analysis of the entries
showing how the entries developed over the course of the contest.
This analysis includes highlights of the tweaking that went on in
successful entries. At this point you can more clearly see if
allowing code tweaking has hindered or helped the creativity involved
in the contest. From what we've seen, the tweaking has made the
contest more of a learning experience.