I'm cleaning out papers from my files, tired of carrying around all these dead trees. Here are notes on nifty resources mentioned at the JITP conference a few weeks ago.
TDT: topic detection and tracking (http://projects.ldc.upenn.edu/TDT/)
Socrata, the Open data company (http://www.socrata.com/)
Google's Data Liberation Front (http://www.dataliberation.org/)
TESS: Time sharing experiments in the social sciences (http://www.tessexperiments.org/)
TREC (Text retrieval conference) benchmark data sets (http://trec.nist.gov/data.html)
And the good old American National Election Study (ANES) (http://www.electionstudies.org/)