Hello OONI,
The source tree for Chromebot contains a list of "top million URLs" for stability testing [1]. The Chromebot list might present an interesting complement to Alexa for the HTTP tests, since it would be more likely resemble real-world conditions through triggering URL-specific responses from censors -- rather than exclusively measuring filtering based on hostname.
[1] http://src.chromium.org/svn/trunk/tools/chromebot/top-million
Cordially, Collin