<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-7374448075509148245</id><updated>2011-12-06T08:07:17.917-08:00</updated><category term='randomness'/><category term='decentralization'/><category term='dissertation'/><category term='media'/><category term='education'/><category term='tools'/><category term='missing middle'/><category term='workflow'/><category term='movies'/><category term='mechanism design'/><category term='tikz'/><category term='latex'/><category term='lists'/><category term='poker'/><category term='web spiders'/><category term='text mining'/><category term='methodology'/><category term='game theory'/><category term='open source'/><category term='time management'/><category term='big data'/><category term='breaking news'/><category term='text classification'/><category term='psychology'/><category term='graphic design'/><category term='computational social science'/><category term='data visualization'/><category term='word cloud'/><category term='python'/><category term='new media'/><category term='AI'/><category term='amazon'/><category term='polling'/><category term='goodbye'/><category term='resources'/><category term='social choice theory'/><category term='Benkler'/><category term='video'/><category term='Web utilities'/><category term='code'/><category term='data science'/><category term='prediction'/><category term='EC2'/><category term='institutions'/><category term='blogs'/><category term='papers'/><category term='fraud'/><category term='agent based models'/><category term='notes'/><category term='merge'/><category term='presentations'/><category term='miscellaneous'/><category term='math'/><category term='NLP'/><category term='cloud computing'/><category term='politics'/><category term='snowcrawl'/><category term='policy'/><category term='sentiment analysis'/><category term='opinion mining'/><category term='complex systems'/><category term='networks'/><category term='research design'/><category term='information flow'/><category term='flow charts'/><category term='economics'/><category term='software'/><category term='OPSP'/><category term='surveys'/><category term='voice recognition'/><category term='information technology'/><category term='scientific method'/><category term='statistics'/><category term='content analysis'/><category term='n-gram'/><category term='machine learning'/><category term='social science'/><category term='disagreement'/><category term='crowdsourcing'/><category term='state of the union'/><category term='data'/><category term='conferences'/><category term='computing'/><category term='sampling'/><category term='R'/><title type='text'>The Lowly Wonk</title><subtitle type='html'>Politics, lifehacking, data mining, and a dash of the scientific method from an up-and-coming policy wonk.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><link rel='next' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default?start-index=101&amp;max-results=100'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>154</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-9052142583492657743</id><published>2011-10-24T10:38:00.000-07:00</published><updated>2011-10-24T10:38:44.595-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='goodbye'/><title type='text'>Moving to a new home...</title><content type='html'>I've been blogging at &lt;a href="http://lowlywonk.blogspot.com/"&gt;http://lowlywonk.blogspot.com&lt;/a&gt;  for most of my time in grad school, and now it's time for a change.&amp;nbsp;  I'm still interested in public policy, but my so much of work is  computational now.&amp;nbsp; It just makes more sense to have "computational" in  the name of the blog.&lt;br /&gt;&lt;br /&gt;So please point your blog readers to &lt;a href="http://compsocsci.blogspot.com/"&gt;http://compSocSci.blogspot.com&lt;/a&gt; so the adventure can continue.&lt;br /&gt;&lt;br /&gt;(The contents of this blog will remain online, searchable, etc. for the forseeable future.)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-9052142583492657743?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/9052142583492657743/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=9052142583492657743' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9052142583492657743'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9052142583492657743'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/10/moving-to-new-home.html' title='Moving to a new home...'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5750795276850723610</id><published>2011-09-07T05:52:00.000-07:00</published><updated>2011-09-07T05:52:00.216-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='tikz'/><category scheme='http://www.blogger.com/atom/ns#' term='latex'/><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><title type='text'>Latex: tikz</title><content type='html'>In the runup to APSA, I discovered the absolutely fantastic &lt;a href="http://www.ctan.org/tex-archive/graphics/pgf/"&gt;tikz packag&lt;/a&gt;e for diagrams in LaTex.&lt;br /&gt;&lt;br /&gt;See the &lt;a href="http://www.texample.net/tikz/examples/"&gt;gallery of example here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Here are some links that were helpful for preparing my presentation:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.jmilne.org/not/CDGuide.html"&gt;http://www.jmilne.org/not/CDGuide.html&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://heather.cs.ucdavis.edu/%7Ematloff/beamer.html"&gt;http://heather.cs.ucdavis.edu/~matloff/beamer.html&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.latex-community.org/forum/viewtopic.php?f=5&amp;amp;t=1152"&gt;http://www.latex-community.org/forum/viewtopic.php?f=5&amp;amp;t=1152&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.texample.net/tikz/examples/"&gt;http://www.texample.net/tikz/examples/&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.uncg.edu/cmp/reu/presentations/Charles%20Batts%20-%20Beamer%20Tutorial.pdf"&gt;http://www.uncg.edu/cmp/reu/presentations/Charles%20Batts%20-%20Beamer%20Tutorial.pdf&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://faq.ktug.or.kr/wiki/uploads/beamer_guide.pdf"&gt;http://faq.ktug.or.kr/wiki/uploads/beamer_guide.pdf&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5750795276850723610?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5750795276850723610/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5750795276850723610' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5750795276850723610'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5750795276850723610'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/09/latex-tikz.html' title='Latex: tikz'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8345912821909709214</id><published>2011-09-06T12:43:00.000-07:00</published><updated>2011-09-06T12:43:00.263-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='notes'/><category scheme='http://www.blogger.com/atom/ns#' term='game theory'/><category scheme='http://www.blogger.com/atom/ns#' term='conferences'/><title type='text'>Topics that came up at ASPA, part 1.</title><content type='html'>I'm looking over my notes from APSA, highlighting papers, ideas, etc. that looked worth passing along.  As usual, some of these were new to me, and others are just worth knowing about.&lt;br /&gt;&lt;br /&gt;From a section on game theory and experimentation:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Quantal_response_equilibrium"&gt;Quantal response equilibrium&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.google.com/url?sa=t&amp;amp;source=web&amp;amp;cd=1&amp;amp;ved=0CB4QFjAA&amp;amp;url=http%3A%2F%2Ffaculty.haas.berkeley.edu%2Fhoteck%2FPAPERS%2FEWA.pdf&amp;amp;rct=j&amp;amp;q=experience%20weighted%20attraction&amp;amp;ei=JidlTtaFJpHVsgbmvNH2CQ&amp;amp;usg=AFQjCNH-T0DSGaz_sc6DE7uM9lQ07Dg24w&amp;amp;cad=rja"&gt;Experience-weighted attraction&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Keynesian_beauty_contest"&gt;Beauty contests and level-k reasoning&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://web.cs.gc.cuny.edu/%7Ekgb/course/crawford.pdf"&gt;"Strategic Information Transmission," Crawford and Sobel&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.iew.uzh.ch/ztree/index.php"&gt;z-tree software for economic experiments&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.jstor.org/stable/3592804"&gt;Maskin and Tirole on pandering in democracy&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://books.google.com/books?id=6WIn6hoKXLEC&amp;amp;printsec=frontcover&amp;amp;dq=Bendor+Diermeier&amp;amp;hl=en&amp;amp;ei=3ihlTtaDI8zIsgajq8SRCg&amp;amp;sa=X&amp;amp;oi=book_result&amp;amp;ct=result&amp;amp;resnum=1&amp;amp;ved=0CCoQ6AEwAA"&gt;&lt;span style="font-style: italic;"&gt;A Behavioral Theory of Elections&lt;/span&gt;, by Bendor, Diermeier, and Siegel&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://books.google.com/books?id=hXSKQgAACAAJ&amp;amp;dq=al+roth+handbook&amp;amp;hl=en&amp;amp;ei=XSllTpr3DsaLswb_6onDCg&amp;amp;sa=X&amp;amp;oi=book_result&amp;amp;ct=result&amp;amp;resnum=1&amp;amp;ved=0CCoQ6AEwAA"&gt;&lt;span style="font-style: italic;"&gt;A Handbook of Experimental Economics&lt;/span&gt;, Roth&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="Groves-Ledyard%20Mechanism"&gt;Groves-Ledyard mechanism&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://books.google.com/books?hl=en&amp;amp;lr=&amp;amp;id=qqZ_FDFoDcMC&amp;amp;oi=fnd&amp;amp;pg=PA60&amp;amp;dq=cyert-march+rule&amp;amp;ots=9UZLNjEr9R&amp;amp;sig=v2C5__yfo3s9WxuEXe3SLpf3AWM#v=onepage&amp;amp;q&amp;amp;f=false"&gt;Cyert-March rule&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://books.google.com/books?id=ro7X8HRyuEIC"&gt;Predictably Irrational, Ariely&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://en.wikipedia.org/wiki/Global_game"&gt;Global Games&lt;/a&gt;&lt;/li&gt;&lt;li&gt;&lt;a href="http://web.ics.purdue.edu/%7Etreimer/Rieskamp_Reimer_2007.pdf"&gt;Ecological rationality&lt;/a&gt;&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;More to come soon...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8345912821909709214?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8345912821909709214/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8345912821909709214' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8345912821909709214'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8345912821909709214'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/09/topics-that-came-up-at-aspa-part-1.html' title='Topics that came up at ASPA, part 1.'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-82700947710188041</id><published>2011-09-05T10:52:00.001-07:00</published><updated>2011-09-05T11:05:13.352-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='presentations'/><category scheme='http://www.blogger.com/atom/ns#' term='blogs'/><title type='text'>Slides from APSA: Why do people blog about politics</title><content type='html'>Back from APSA!&lt;br /&gt;&lt;br /&gt;Here is the &lt;a href="http://www-personal.umich.edu/%7Eagong/presentations/why_blog_politics_Gong_APSA2011_presentation-FINAL.pdf"&gt;final version&lt;/a&gt; of my presentation slides.  I'm not going to post the paper because it's in the process of splitting into three, but I'm happy to share by email if you're interested.&lt;br /&gt;&lt;br /&gt;On with the job search!&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-82700947710188041?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/82700947710188041/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=82700947710188041' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/82700947710188041'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/82700947710188041'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/09/slides-from-apsa-why-do-people-blog.html' title='Slides from APSA: Why do people blog about politics'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6029775636589458915</id><published>2011-08-30T10:26:00.000-07:00</published><updated>2011-08-30T10:29:14.687-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='merge'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>Hard-learned lessons on data merging</title><content type='html'>For my APSA paper this year, I had to merge two survey data sets, and do a bunch of work with imputation and instrumental variables.  To complicate things, I was working with a student on the merge.&lt;br /&gt;&lt;br /&gt;Here are some lessons learned:&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;1. Agree on an outline before doing the data work.&lt;/span&gt;&lt;br /&gt;Know the main steps going in, and write them as comments in the code.  This will make the logic of the data work more clear, which is helpful for debugging. For long merges, it can be motivating to see your progress as you go along.  Also, if the steps need to change, it's good to realize why the original plan didn't work.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;2. Stick to an intuitive naming convention for variables, data sets, etc.&lt;/span&gt;&lt;br /&gt;Yes, you'll have to type out longer variable names, but you'll save a ton of time in debugging.  For instance, "october_cleaned_dataset" instead of "d2.10c".&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;3. Store files as .csv, not .dat.&lt;/span&gt;&lt;br /&gt;R's "save" command isn't fully compatible across versions and operating systems. Instead, save small-to-moderate datasets as .csv files.  That way, compatibility won't be an issue.  (Plus you can open the same files in other programs, like STATA or excel.)&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;4. Use names, not numbers.&lt;/span&gt;&lt;br /&gt;Don't use commands based on row or column numbers: data_set[,c(1:4,7,8,23)].  They're hard to read, and they're brittle: if a new column gets inserted in data_set, that code isn't going to work anymore.  Instead, use variable names: data_set[,c("time1","time2","time3","time4","age","gender","year")]. Regular expressions can be very helpful here: data_set[,append(grep("time.", names(data_set)), c("age","gender","year"))].  Yes, it takes longer to write, but you'll save a huge amount of time debugging.&lt;br /&gt;&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6029775636589458915?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6029775636589458915/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6029775636589458915' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6029775636589458915'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6029775636589458915'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/08/hard-learned-lessons-on-data-merging.html' title='Hard-learned lessons on data merging'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8081864810106564165</id><published>2011-08-08T11:05:00.000-07:00</published><updated>2011-08-08T11:18:24.581-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><category scheme='http://www.blogger.com/atom/ns#' term='EC2'/><category scheme='http://www.blogger.com/atom/ns#' term='R'/><title type='text'>The world's fastest demo for RStudio in EC2</title><content type='html'>Last week I posted &lt;a href="http://lowlywonk.blogspot.com/2011/08/mining-and-visualizing-twitter-from.html"&gt;instructions&lt;/a&gt; to get started with &lt;a href="http://rstudio.org/"&gt;RStudio&lt;/a&gt; on &lt;a href="http://aws.amazon.com/ec2/"&gt;Amazon's Elastic Compute Cloud&lt;/a&gt;.  Here are new and improved* instructions (&lt;a href="http://www-personal.umich.edu/%7Eagong/docs/Using%20RStudio%20on%20Amazon%20EC2%20under%20the%20Free%20Usage%20Tier.docx"&gt;docx&lt;/a&gt;, &lt;a href="http://www-personal.umich.edu/%7Eagong/docs/Using%20RStudio%20on%20Amazon%20EC2%20under%20the%20Free%20Usage%20Tier.pdf"&gt;pdf&lt;/a&gt;).  These steps should be enough to get you into the cloud in 15 minutes or less, for free.&lt;br /&gt;&lt;br /&gt;Please let me know in the comments if you have any trouble or questions with this demo.  I'm trying to lower the startup costs for people to do computational social science, so I'm happy to be a resource for others working their way down the cloud computing path.&lt;br /&gt;&lt;br /&gt;Cheers!&lt;br /&gt;&lt;br /&gt;PS - The instance comes with several fun R libraries pre-installed: &lt;a href="http://cran.r-project.org/web/packages/tm/index.html"&gt;tm&lt;/a&gt;, &lt;a href="http://igraph.sourceforge.net/"&gt;igraph&lt;/a&gt;, and &lt;a href="http://cran.r-project.org/web/packages/twitteR/index.html"&gt;twitteR&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;*I've 1) dropped several steps that aren't necessary for running RStudio, 2) added a few screenshots, and 3) clarified a steps that were giving people trouble. Thanks again to Kevin J. for putting together the original slides.&lt;br /&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8081864810106564165?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8081864810106564165/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8081864810106564165' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8081864810106564165'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8081864810106564165'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/08/worlds-fastest-demo-for-rstudio-in-ec2.html' title='The world&apos;s fastest demo for RStudio in EC2'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1516724066041199385</id><published>2011-08-01T11:23:00.000-07:00</published><updated>2011-08-01T11:51:45.885-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='big data'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><category scheme='http://www.blogger.com/atom/ns#' term='R'/><title type='text'>Mining and visualizing twitter from RStudio in EC2</title><content type='html'>Here's code I'm going to use for my &lt;a href="http://computationallegalstudies.com/icpsr-class/"&gt;ICPSR class&lt;/a&gt; today.  This is set up to run immediately from an &lt;a href="http://aws.amazon.com/ec2/"&gt;EC2&lt;/a&gt; instance of my AMI &lt;span class="value"&gt;agongRStudio2 (ID:&lt;/span&gt;&lt;span class="value"&gt;ami-1bb47272).  It's the simplest introduction to text mining I've been able to pull together so far.&lt;br /&gt;&lt;br /&gt;Step-by-step instructions for getting started in EC2 are here (&lt;a href="http://www-personal.umich.edu/%7Eagong/docs/Using%20R%20on%20Amazon%20EC2%20under%20the%20Free%20Usage%20Tier.pdf"&gt;pdf&lt;/a&gt; and &lt;a href="http://www-personal.umich.edu/%7Eagong/docs/Using%20R%20on%20Amazon%20EC2%20under%20the%20Free%20Usage%20Tier.docx"&gt;docx&lt;/a&gt;). These are intended to get you started in command-line R.  For this exercise, we want to use the &lt;a href="http://rstudio.org/"&gt;RStudio&lt;/a&gt; GUI instead, so there are a few changes.&lt;br /&gt;&lt;br /&gt;1. On step 6, use this Community AMI: &lt;/span&gt;&lt;span class="value"&gt;agongRStudio2 /&lt;/span&gt;&lt;span class="value"&gt; ami-1bb47272&lt;br /&gt;&lt;br /&gt;2. On step 8, you don't need to download the keypair.  "Proceed without a keypair" instead.&lt;br /&gt;&lt;br /&gt;2. On step 9, you also need to enable port 8787, the port the RStudio server uses.&lt;br /&gt;&lt;br /&gt;3. On step 11 stop following the tutorial.  Instead, open up your EC2 URL in your browser, with port 8787.  It will look something like this:&lt;br /&gt;&lt;blockquote&gt;http://ec2-123-45-67-890.compute-1.amazonaws.com:8787/&lt;/blockquote&gt;&lt;br /&gt;4. I'll give out the username and password in class.  If you're not in the class, email me and I can clue you in.&lt;br /&gt;&lt;/span&gt;&lt;span class="value"&gt;&lt;br /&gt;5. Here's a first script to run&lt;/span&gt;&lt;br /&gt;&lt;blockquote&gt;library(twitteR)&lt;br /&gt;library(tm)&lt;br /&gt;library(wordcloud)&lt;br /&gt;&lt;br /&gt;#Grab the 200 most recent tweets about #bachmann&lt;br /&gt;#http://www.slideshare.net/jeffreybreen/r-by-example-mining-twitter-for&lt;br /&gt;k = 200&lt;br /&gt;my_tweets &amp;lt;- searchTwitter("#bachmann", n=k)&lt;br /&gt;&lt;br /&gt;#Convert tweet status objects to text&lt;br /&gt;#http://www.r-bloggers.com/word-cloud-in-r/&lt;br /&gt;my_text &amp;lt;- data.frame( text=unlist( lapply( c(1:k), function(x){my_tweets[[x]]$text} ) ) )&lt;br /&gt;&lt;br /&gt;#Convert text to a tm corpus object&lt;br /&gt;my_corpus &amp;lt;- Corpus( DataframeSource( my_text ) )&lt;br /&gt;my_corpus &amp;lt;- tm_map(my_corpus, removePunctuation)&lt;br /&gt;my_corpus &amp;lt;- tm_map(my_corpus, tolower)&lt;br /&gt;my_corpus &amp;lt;- tm_map(my_corpus, function(x) removeWords(x, stopwords("english")))&lt;br /&gt;&lt;br /&gt;#Convert corpus to matrix&lt;br /&gt;tdm &amp;lt;- TermDocumentMatrix(my_corpus)#, control = list(weighting = weightTfIdf))&lt;br /&gt;m &amp;lt;- as.matrix(tdm)&lt;br /&gt;&lt;br /&gt;#Get features and frequencies&lt;br /&gt;v &amp;lt;- sort(rowSums(m),decreasing=TRUE)&lt;br /&gt;d &amp;lt;- data.frame(word = names(v),freq=v)&lt;br /&gt;&lt;br /&gt;#Display as a word cloud&lt;br /&gt;wordcloud(d$word,d$freq,min.freq=5,use.r.layout=T,vfont=c("sans serif","plain"))&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;#Basic bluster analysis of words&lt;br /&gt;#From: http://www.statmethods.net/advstats/cluster.html&lt;br /&gt;m2 &amp;lt;- m[colSums(m)&amp;gt;15,]&lt;br /&gt;&lt;br /&gt;dist_matrix &amp;lt;- dist(m2, method = "euclidean") # distance matrix&lt;br /&gt;fit &amp;lt;- hclust(dist_matrix, method="ward")&lt;br /&gt;plot(fit) # display dendogram&lt;br /&gt;&lt;/blockquote&gt;&lt;span class="value"&gt;We're going to be trying this &lt;span style="font-style: italic;"&gt;in class&lt;/span&gt;.  &lt;/span&gt;&lt;span class="value"&gt;I have 20 minutes budgeted,&lt;/span&gt;&lt;span class="value"&gt; so hopefully it's really this easy.&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;PS - Don't forget to terminate your EC2 instance when you're done, or you will use up your free hours, then run up a smallish (~50 cents/day) Amazon bill until you remember&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1516724066041199385?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1516724066041199385/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1516724066041199385' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1516724066041199385'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1516724066041199385'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/08/mining-and-visualizing-twitter-from.html' title='Mining and visualizing twitter from RStudio in EC2'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8542046453199367228</id><published>2011-07-25T19:50:00.000-07:00</published><updated>2011-07-25T19:50:31.443-07:00</updated><title type='text'>Brand sentiment showdown...</title><content type='html'>From FlowingData.  I may use this in my ICPSR class on computational social science next week.  What do you think?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://flowingdata.com/2011/07/25/brand-sentiment-showdown/"&gt;Brand sentiment showdown&lt;/a&gt;: "&lt;p&gt;&lt;a href="http://flowingdata.com/2011/07/25/brand-sentiment-showdown/"&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2011/07/movies-revised.png" alt="movies-revised" title="movies-revised" height="443" width="450" /&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;There are many brands on Twitter that exist to uphold an image of the company they represent. As consumers, we can communicate with these accounts, voicing praise or displeasure (usually the latter). Using a simple sentiment classifier&lt;sup&gt;&lt;a href="http://flowingdata.com/2011/07/25/brand-sentiment-showdown/#footnote1"&gt;1&lt;/a&gt;&lt;/sup&gt;, I scored feelings towards major brands from 0 (horrible) to 100 (excellent) once a day for five days. &lt;/p&gt;&lt;br /&gt;&lt;p&gt;The above for example, shows scores for Netflix, Hulu, and Redbox. Netflix had the lowest scores, whereas Redbox had the highest. I suspect Netflix started low with people still upset over the price hike, but it got better the next couple of days. Then on Saturday, there was a score drop, which I'm guessing was from their downtime for most of Saturday. Hulu and Redbox, on the other hand, held more steady scores.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;As for auto brands, Toyota clearly had the lowest scores. However, Lexus, which is actually a luxury vehicle division of Toyota had the highest scores in the high 90s to 100.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2011/07/cars-revised.png" alt="" title="Car showdown" height="765" width="450" /&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;How about the major mobile phone companies, AT&amp;amp;T, Verizon, and Sprint? Verizon scored better initially, but had lower scores during the weekend. Not sure what was going on with Sprint.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2011/07/phone-revised.png" alt="" title="Phone showdown" height="443" width="450" /&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;Between Twitter and Facebook, there was obviously some bias, but Twitter faired slightly better. Twitter scored lower than I expected, but it probably has to do with bug reports directed towards @twitter.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2011/07/social-revised.png" alt="" title="social-revised" height="356" width="450" /&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;Is Domino's Pizza good now? Papa John's stayed fairly steady while Pizza Hut scores were sub-par.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2011/07/pizza-revised.png" alt="" title="Pizza" height="443" width="450" /&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;Finally, as a sanity check, I compared airlines like Breen did in his tutorial. Results were similar with JetBlue and Southwest clearly in the positive and the others picking up the rear.&lt;/p&gt;&lt;br /&gt;&lt;p&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2011/07/airlines-revised.png" alt="" title="Airlines ranked" height="796" width="450" /&gt;&lt;/p&gt;&lt;br /&gt;&lt;p&gt;Any of these scores seem surprising to you?&lt;/p&gt;&lt;br /&gt;&lt;hr /&gt;&lt;br /&gt;&lt;ol&gt;&lt;br /&gt;&lt;li&gt;Jeffrey Breen provides an &lt;a href="http://jeffreybreen.wordpress.com/2011/07/04/twitter-text-mining-r-slides/"&gt;easy-to-follow tutorial&lt;/a&gt; on Twitter sentiment in R. The scoring system is pretty basic. All you do is load tweets with a given search phrase, and then find all the 'good' words and 'bad' words. Good words give +1, and bad words give -1. Then a tweet is classified good or bad based on the total. Then to get a final score, only tweets with total of +2 or more or -2 or less are counted. The final score is computed by dividing number of negative tweets divided by total number of 'extreme' tweets. Obviously this won't pick up on sarcasm, but the scoring seems to still do a decent job. I wouldn't make any important business decisions based on these results though.&lt;/li&gt;&lt;br /&gt;&lt;/ol&gt;&lt;br /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;hr /&gt;&lt;p&gt;&lt;/p&gt;&lt;p&gt;The &lt;a href="http://www.amazon.com/gp/product/0470944889"&gt;new FlowingData book&lt;/a&gt; is available now.&lt;/p&gt;&lt;p&gt;&lt;/p&gt;&lt;br /&gt;&lt;div&gt;&lt;br /&gt;&lt;a href="http://feeds.feedburner.com/%7Eff/FlowingData?a=C2MuF6ttGxE:Micry4QunYI:yIl2AUoC8zA"&gt;&lt;img src="http://feeds.feedburner.com/%7Eff/FlowingData?d=yIl2AUoC8zA" border="0" /&gt;&lt;/a&gt; &lt;a href="http://feeds.feedburner.com/%7Eff/FlowingData?a=C2MuF6ttGxE:Micry4QunYI:7Q72WNTAKBA"&gt;&lt;img src="http://feeds.feedburner.com/%7Eff/FlowingData?d=7Q72WNTAKBA" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;/div&gt;&lt;img src="http://feeds.feedburner.com/%7Er/FlowingData/%7E4/C2MuF6ttGxE" height="1" width="1" /&gt;"&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8542046453199367228?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://flowingdata.com/2011/07/25/brand-sentiment-showdown/' title='Brand sentiment showdown...'/><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8542046453199367228/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8542046453199367228' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8542046453199367228'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8542046453199367228'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/07/brand-sentiment-showdown.html' title='Brand sentiment showdown...'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7125002179287632136</id><published>2011-07-15T17:12:00.001-07:00</published><updated>2011-07-15T17:14:15.834-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='papers'/><category scheme='http://www.blogger.com/atom/ns#' term='snowcrawl'/><title type='text'>Automated snowball census</title><content type='html'>After a lot of work, I pushed the &lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1832024"&gt;new version of the paper&lt;/a&gt; to SSRN today.  Getting ready to pack it up, send it off for review, and move on to the next thing.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7125002179287632136?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7125002179287632136/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7125002179287632136' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7125002179287632136'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7125002179287632136'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/07/automated-snowball-census.html' title='Automated snowball census'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4402705831411229091</id><published>2011-07-12T09:53:00.000-07:00</published><updated>2011-07-13T13:18:26.766-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='new media'/><category scheme='http://www.blogger.com/atom/ns#' term='information flow'/><category scheme='http://www.blogger.com/atom/ns#' term='Benkler'/><title type='text'>How are new media reshaping politics?  Take 2</title><content type='html'>&lt;span&gt;Yesterday I posted on Hidman's "missing middle" hypothesis.  Kudos to my good friend &lt;a href="http://petersbenjamin.wordpress.com/"&gt;Ben Peters&lt;/a&gt;&lt;/span&gt;&lt;span style="font-weight: bold;"&gt; &lt;/span&gt;&lt;span&gt;for some great, thought-provoking responses.&lt;span style="font-weight: bold;"&gt;  &lt;/span&gt;Today, I'm going to push forward&lt;/span&gt;&lt;span&gt; and respond to another great thinker in this area: Benkler's theory of the "networked public sphere."&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;&lt;br /&gt;&lt;br /&gt;Benkler and the Networked Public Sphere&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;On the other side of the debate, Yochai Benkler is an Internet optimist. He argues that many-to-many communication will invigorate the public sphere, leading to broader intake of ideas, better discussion, and ultimately better governance. Benkler is very critical of the media oligopoly of the mid-20th century, which he says was heavily influenced by money and ideology, and excersized outsized control on public access to information.  According to his account, the current proliferation of online information sources is certainly better than being dependent on a handful of corporate broadcasters, even if it still falls short of utopia.&lt;br /&gt;&lt;br /&gt;This picture of the public sphere is appealing and not entirely untrue. I want to believe it. However, Benkler fails to take into important and well-established facts about American political system.&lt;br /&gt;&lt;br /&gt;First, most citizens in the U.S. are poorly equipped to deal with political information. Converse's half-century-old finding that as many as 90 percent of Americans are "innocent of ideology" (i.e. they have no idea what "liberal" and "conservative" mean) has been replicated and extended many times.  Most voters don't know how government works, they don't know how it's supposed to work, and they don't care to find out.  True, partisan cues, endorsements, and heuristics can sometimes bring voters up to speed enough to fill out a ballot, but these heuristic strategies cannot inform most citizens for participation in the public sphere the way Benkler imagines.  We must distinguish between the handful of citizens who are motivated and equipped to reason about politics, and the majority who are not so prepared or inclined.  Benkler's optimism really only extends as far as the electorate is capable of reasoning about democracy.&lt;br /&gt;&lt;br /&gt;Second, Benkler ignores the structure of government and policymaking. He treats "government" as a unitary actor, and makes only passing reference to elections and political parties. Benkler is painting with a broad brush, so perhaps he can be forgiven for ignoring the institutional details of representation and government in American politics.  However, those details are likely to matter, deeply.&lt;br /&gt;&lt;br /&gt;Consider: primary responsibility for lawmaking in the U.S.  falls to elected legislators.  These legislators are influenced not only by the ebb and flow of ideas in public debate, but by their ability win in zero-sum, partisan elections. Proliferation of information sources may affects public debate for the better, but it also affects the electoral pressures faced by public officials. We have strong reason to believe that access to additional channels, selective exposure, and ideological pandering are leading to increased polarization in the electorate.  What if this polarizing electoral effect dominates the enriching discursive effect that Benkler outlines?&lt;br /&gt;&lt;br /&gt;I'm sympathetic to the the idea of a networked public sphere.  As I said earlier, I really want it to be true.  But Benkler's picture ignores key institutions in American politics, like elections and parties, so I have a hard time placing much faith in his predictions.  We need to think carefully about the interplay of partisanship, ignorance, and representative government with technologies that allow cheap, many-to-many communication.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4402705831411229091?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4402705831411229091/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4402705831411229091' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4402705831411229091'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4402705831411229091'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/07/how-are-new-media-reshaping-politics_12.html' title='How are new media reshaping politics?  Take 2'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8039371876726415555</id><published>2011-07-12T09:35:00.000-07:00</published><updated>2011-07-12T09:51:11.639-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='blogs'/><category scheme='http://www.blogger.com/atom/ns#' term='information flow'/><category scheme='http://www.blogger.com/atom/ns#' term='missing middle'/><title type='text'>How are new media reshaping politics?  Take 1</title><content type='html'>I've been studying political blogging for a couple years now, and I'm getting ready to bring it all together into a dissertation.  That means it's time to move past statistics and data, and start thinking in terms of Big Ideas.&lt;br /&gt;&lt;br /&gt;As I see it, the pressing question is &lt;span style="font-style: italic;"&gt;"How are new media (including blogs) reshaping American politics?"&lt;/span&gt;  This is a big question -- one that certainly matters outside of academia.  But that won't stop me from writing about it in a dry, academic way. :)  To my mind, Matthew Hindman, Yochai Benkler, and Cass Sunstein have put forward the three leading, competing theories for answering this question.  This week, I'm going to make a first attempt at responding to and synthesizing their ideas.&lt;span style="font-style: italic;"&gt;&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;Feedback and constructive criticism are very welcome.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Hindman and the Missing Middle&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.matthewhindman.com/"&gt;Matthew Hindman&lt;/a&gt; is an Internet pessimist.  In his book, &lt;a href="http://www.amazon.com/Myth-Digital-Democracy-Matthew-Hindman/dp/0691138680/"&gt;The Myth of Digital Democracy&lt;/a&gt;, he argues that the web has exacerbated the "rich get richer" tendencies of media markets, leading to greater inequality.  To back up his assertion, he shows that links and traffic to web pages follows a &lt;a href="http://en.wikipedia.org/wiki/Power_law"&gt;power law distribution&lt;/a&gt;.  He also interviews top 40 bloggers and claims that they are overwhelmingly white, male, high-income, and educated.  His analysis suggests that the people with big audiences online are no different from those offline.  Hindman labels this dramatic inequality between popular and unpopular sites "the missing middle."&lt;br /&gt;&lt;br /&gt;However, Hindman's line of attack has two important weaknesses.  First, he has no counterfactual. The distribution of online audiences is dramatically unequal, but the same is (and was) probably true offline as well.  Certainly, Barack Obama, Michelle Bachman, and Thomas Friedman have daily audiences that are orders of magnitude larger than mine or yours.  The same was true of their counterparts before the Internet. Audiences online are distributed unequally, but are they &lt;span style="font-style: italic;"&gt;more&lt;/span&gt; unequal than those that existed offline, before the Internet?  Hindman does not answer this question, and I suspect the answer is no.&lt;br /&gt;&lt;br /&gt;Second, Hindman ignores the potential for indirect influence. The &lt;a href="http://drudgereport.com/"&gt;Drudge Report&lt;/a&gt; is one of the most heavily trafficked blogs* on the Web, but Drudge himself writes almost no content. Instead, the site features links to stories elsewhere on the Internet. How then do we think about Drudge's influence?  He inserts no new ideas into public debate, but exercises some ability to influence which ideas get attention.  By linking to other authors' stories, Drudge allows those authors to exercise &lt;span style="font-style: italic;"&gt;indirect influence&lt;/span&gt; on his readers.&lt;br /&gt;&lt;br /&gt;Drudge is an extreme case of the common online practice of linking.  Linked content intrinsically gives others indirect influence.  It is not unique to the online world (think of citations, endorsements, recommendations), but it is probably more common there. Network theory shows us that all else equal, more re-linking leads to more egalitarian distribution of indirect influence.  By focusing only on direct readership, Hindman misses this possibility.&lt;br /&gt;&lt;br /&gt;The bottom line: Hindman is the skeptic in this debate, arguing that the Internet means business as usual for participation, voice, and influence. He's only right as long as we assume that 1) offline participation is not also unequal, and 2) only direct influence (i.e. readership and web traffic) matters.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8039371876726415555?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8039371876726415555/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8039371876726415555' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8039371876726415555'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8039371876726415555'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/07/how-are-new-media-reshaping-politics.html' title='How are new media reshaping politics?  Take 1'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5713139784028548</id><published>2011-07-11T10:35:00.000-07:00</published><updated>2011-07-11T12:03:53.109-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='amazon'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><title type='text'>+Computation: Got an AWS in Education grant!</title><content type='html'>I just received a generous grant for usage on Amazon's Web Services -- cloud computing, storage space, and bandwidth. This is just in time for a bunch of heavy-duty text crunching I've been planning to do.  Thank you, Amazon!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5713139784028548?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5713139784028548/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5713139784028548' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5713139784028548'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5713139784028548'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/07/computation-got-aws-in-education-grant.html' title='+Computation: Got an AWS in Education grant!'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4166612375753701409</id><published>2011-06-22T06:42:00.000-07:00</published><updated>2011-06-22T06:42:00.463-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='resources'/><category scheme='http://www.blogger.com/atom/ns#' term='big data'/><category scheme='http://www.blogger.com/atom/ns#' term='conferences'/><title type='text'>Political science  and big data resources from JITP</title><content type='html'>I'm cleaning out papers from my files, tired of carrying around all these dead trees.  Here are notes on nifty resources mentioned at the JITP conference a few weeks ago.&lt;br /&gt;&lt;br /&gt;TDT: topic detection and tracking (&lt;a href="http:///"&gt;http://projects.ldc.upenn.edu/TDT/&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;Socrata, the Open data company (&lt;a href="http://www.socrata.com/"&gt;http://www.socrata.com/&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;Google's Data Liberation Front (&lt;a href="http://www.dataliberation.org/"&gt;http://www.dataliberation.org/&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;TESS: Time sharing experiments in the social sciences (&lt;a href="http://www.tessexperiments.org/"&gt;http://www.tessexperiments.org/&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;TREC (Text retrieval conference) benchmark data sets (&lt;a href="http://trec.nist.gov/data.html"&gt;http://trec.nist.gov/data.html&lt;/a&gt;)&lt;br /&gt;&lt;br /&gt;And the good old American National Election Study (ANES) (&lt;a href="http://www.electionstudies.org/"&gt;http://www.electionstudies.org/&lt;/a&gt;)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4166612375753701409?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4166612375753701409/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4166612375753701409' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4166612375753701409'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4166612375753701409'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/06/political-science-and-big-data.html' title='Political science  and big data resources from JITP'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6070789695543926634</id><published>2011-06-20T08:28:00.000-07:00</published><updated>2011-06-20T08:28:00.231-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='information technology'/><category scheme='http://www.blogger.com/atom/ns#' term='lists'/><title type='text'>Laptop vs Clipboard</title><content type='html'>Things that could destroy my:&lt;br /&gt;&lt;br /&gt;&lt;table&gt;&lt;br /&gt;&lt;tbody style="text-align: center;"&gt;&lt;tr&gt;&lt;td style="font-weight: bold;"&gt;Laptop&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Both&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Clipboard&lt;/td&gt;&lt;/tr&gt;&lt;tr&gt;&lt;td style="vertical-align:top;"&gt;Coffee&lt;br /&gt;Falling&lt;br /&gt;Power surges&lt;br /&gt;Worms and viruses&lt;br /&gt;Rain&lt;br /&gt;Babies&lt;br /&gt;EMPs&lt;br /&gt;Losing the power cable&lt;br /&gt;Tripping on the power cable&lt;br /&gt;Planned obsolescence&lt;br /&gt;Stomping&lt;br /&gt;Yogurt&lt;br /&gt;Crumbs&lt;br /&gt;The AllSpark&lt;br /&gt;Big magnets&lt;br /&gt;&lt;/td&gt;&lt;td style="vertical-align:top;"&gt;Fire&lt;br /&gt;Karate experts&lt;br /&gt;Garbage compactors&lt;br /&gt;Lightsabers&lt;br /&gt;&lt;/td&gt;&lt;td style="vertical-align:top;"&gt;Termites&lt;br /&gt;Beavers&lt;br /&gt;Very strong wind&lt;br /&gt;&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;PS: This list was created as my flight into Detroit was descending, and "all personal electronic items" had to be switched off.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6070789695543926634?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6070789695543926634/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6070789695543926634' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6070789695543926634'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6070789695543926634'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/06/laptop-vs-clipboard.html' title='Laptop vs Clipboard'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6937767672071492514</id><published>2011-06-18T08:19:00.000-07:00</published><updated>2011-06-21T06:22:40.206-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='resources'/><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='networks'/><title type='text'>Resources from PolNets</title><content type='html'>Note from Saturday, at the Political Networks conference.  The presentation went swimmingly. I think there are more sociologists than political scientists here today.&lt;br /&gt;&lt;br /&gt;Here are links to nifty resources referenced in talks:&lt;br /&gt;&lt;br /&gt;Affective Norms for English Words (ANEW): &lt;a href="http://csea.phhp.ufl.edu/Media.html#bottommedia"&gt;http://csea.phhp.ufl.edu/Media.html#bottommedia&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Networks, Computation, and Social Dynamics Lab at UC Irvine: &lt;a href="http://www.ncasd.org/public_html/"&gt;http://www.ncasd.org/public_html/&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;OpenCongress RSS feeds: &lt;a href="http://www.opencongress.org/about/rss"&gt;http://www.opencongress.org/about/rss&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Public.Resource.Org: &lt;a href="https://public.resource.org/"&gt;https://public.resource.org/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6937767672071492514?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6937767672071492514/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6937767672071492514' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6937767672071492514'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6937767672071492514'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/06/resources-from-polnets.html' title='Resources from PolNets'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1390536870974696182</id><published>2011-06-16T22:05:00.000-07:00</published><updated>2011-06-16T22:14:37.158-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='blogs'/><category scheme='http://www.blogger.com/atom/ns#' term='media'/><category scheme='http://www.blogger.com/atom/ns#' term='conferences'/><title type='text'>Netroots Nation</title><content type='html'>I'm at the &lt;a href="http://www.blogger.com/www.netrootsnation.org"&gt;Netroots Nation&lt;/a&gt; conference in Minneapolis, really enjoying it.  Some reactions from day one:&lt;br /&gt;&lt;br /&gt;* Lots of discussion of messaging and issue ownership.  This is an area that academic researchers haven't really embraced, especially us quants.  We should.&lt;br /&gt;&lt;br /&gt;* The community is really open and egalitarian.  I sat in the same panel with &lt;a href="http://en.wikipedia.org/wiki/Markos_Moulitsas" title="Markos Moulitsas"&gt;Markos Moulitsas&lt;/a&gt; for half an hour without realizing it.&lt;br /&gt;&lt;br /&gt;* I've been impressed by the sophistication of research design in analytics and message experiments.  It's got me thinking about what's the next step after AB testing...  There's room for innovation here.&lt;br /&gt;&lt;br /&gt;* The left-meets-right happy hour was kind of a flop.  Where were all the Republicans?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1390536870974696182?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1390536870974696182/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1390536870974696182' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1390536870974696182'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1390536870974696182'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/06/netroots-nation.html' title='Netroots Nation'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-9212046981281151698</id><published>2011-05-17T07:06:00.000-07:00</published><updated>2011-05-17T07:17:12.475-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='presentations'/><category scheme='http://www.blogger.com/atom/ns#' term='text classification'/><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='sampling'/><title type='text'>Slides from JITP: The Future of Computational Social Science</title><content type='html'>I'm at &lt;a href="http://www.umass.edu/jitp/2011.htm"&gt;JITP's conference on the future of computational social science&lt;/a&gt; this week.  Really interesting gathering of social scientists (and a smattering of CS people) interested in computational social science.  I'll blog more thoughts on the conference once it's over (tonight).&lt;br /&gt;&lt;br /&gt;For now, I'll post the resource I already have put together: my conference slides.&lt;br /&gt;&lt;br /&gt;&lt;div style="width: 425px;" id="__ss_7995320"&gt; &lt;strong style="display: block; margin: 12px 0pt 4px;"&gt;&lt;a href="http://www.slideshare.net/AbeGong/an-automated-snowball-census-of-the-political-web-jitp-2011" title="An Automated Snowball Census of the Political Web - JITP 2011"&gt;An Automated Snowball Census of the Political Web - JITP 2011&lt;/a&gt;&lt;/strong&gt; &lt;iframe src="http://www.slideshare.net/slideshow/embed_code/7995320" marginwidth="0" marginheight="0" width="425" frameborder="0" height="355" scrolling="no"&gt;&lt;/iframe&gt; &lt;div style="padding: 5px 0pt 12px;"&gt; View more &lt;a href="http://www.slideshare.net/"&gt;presentations&lt;/a&gt; from &lt;a href="http://www.slideshare.net/AbeGong"&gt;Abe Gong&lt;/a&gt; &lt;/div&gt; &lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-9212046981281151698?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/9212046981281151698/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=9212046981281151698' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9212046981281151698'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9212046981281151698'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/05/slides-from-jitp-future-of.html' title='Slides from JITP: The Future of Computational Social Science'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2472222774841691473</id><published>2011-05-12T05:58:00.000-07:00</published><updated>2011-05-13T13:50:59.124-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='surveys'/><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='word cloud'/><title type='text'>Computational social scientists: a draft directory and basic survey results</title><content type='html'>Last week, some of us* at Michigan's &lt;a href="http://www.cscs.umich.edu/"&gt;Center for Complex Systems&lt;/a&gt; circulated a survey of  computational social scientists -- trying to find out who  self-identifies as a compSocSci person and what they study, so that they can be  in touch with each other.&lt;br /&gt;&lt;br /&gt;We had just under 100 responses, from &lt;a href="http://www-personal.umich.edu/%7Eagong/docs/compSocSci_directory_noEmail_5112011.pdf"&gt;people&lt;/a&gt; at many different institutions, working in a wide variety of areas.  Here are some early results.&lt;br /&gt;&lt;br /&gt;First, the obligatory word cloud.  This isn't particularly scientific, but it illustrates the concepts that people find important in this space.  Not surprisingly, we had a strong showing from network people and agent-based modelers.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://1.bp.blogspot.com/-9B989GmLFBM/Tcvdcc2iCoI/AAAAAAAAAHw/rvquHYDafSw/s1600/compSocSci_keywords.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 244px;" src="http://1.bp.blogspot.com/-9B989GmLFBM/Tcvdcc2iCoI/AAAAAAAAAHw/rvquHYDafSw/s400/compSocSci_keywords.png" alt="" id="BLOGGER_PHOTO_ID_5605817642043705986" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;We also asked about broad areas where people had formal training and were currently working.  The two are pretty similar, so I'll just show the graph on training.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/-c77G387wvIA/TcvdcGbW4fI/AAAAAAAAAHo/9M1pRVTFZ3Q/s1600/training_areas.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 400px;" src="http://4.bp.blogspot.com/-c77G387wvIA/TcvdcGbW4fI/AAAAAAAAAHo/9M1pRVTFZ3Q/s400/training_areas.jpg" alt="" id="BLOGGER_PHOTO_ID_5605817636024148466" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;More results, and a revised version of the directory will be forthcoming in a couple weeks.  Please let us know if you have any questions.  We hope these will be useful resources for the community.&lt;br /&gt;&lt;br /&gt;Click &lt;a href="http://umichlsa.qualtrics.com/SE/?SID=SV_4PjqeJXTMryG63i"&gt;here&lt;/a&gt; to take the survey.  We'll keep it open for another couple weeks, so that responses can continue to trickle in.&lt;br /&gt;&lt;br /&gt;Click &lt;a href="http://www-personal.umich.edu/%7Eagong/docs/compSocSci_directory_noEmail_5112011.pdf"&gt;here&lt;/a&gt; for the directory in pdf format.  (To avoid spam, this doesn't include email addresses.  Email me if you want a copy that includes emails.)&lt;br /&gt;&lt;br /&gt;* &lt;a href="http://www.cscs.umich.edu/%7Espage/"&gt;Scott Page&lt;/a&gt;, &lt;a href="http://computationallegalstudies.com/"&gt;Dan Katz&lt;/a&gt;, and &lt;a href="http://www-personal.umich.edu/%7Eagong/"&gt;I&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2472222774841691473?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2472222774841691473/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2472222774841691473' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2472222774841691473'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2472222774841691473'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/05/computational-social-scientists-draft.html' title='Computational social scientists: a draft directory and basic survey results'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/-9B989GmLFBM/Tcvdcc2iCoI/AAAAAAAAAHw/rvquHYDafSw/s72-c/compSocSci_keywords.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7284674654701313617</id><published>2011-05-11T12:16:00.000-07:00</published><updated>2011-05-13T13:50:59.387-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='surveys'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><category scheme='http://www.blogger.com/atom/ns#' term='R'/><title type='text'>R code to remove the second line of a Qualtrics .csv</title><content type='html'>I love Qualtrics, but its data export does this obnoxious thing.  Instead of exporting a regular .csv file, it exports a csv with two header rows.  The first one contains short variable names (e.g. Q1, Q2.2, Q3_TEXT) and the second one contains labels ("How old are you?", "What is your email address?")&lt;br /&gt;&lt;br /&gt;I keep having to figure out how to tell R how to deal with this messiness.  It's not complicated, but I have to look up the read.csv documentation every time.&lt;br /&gt;&lt;br /&gt;No more.  Here's my code:&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;DF &amp;lt;- read.csv("my_file.csv", skip=2, header=F)&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;DF2 &amp;lt;- read.csv( "my_file.csv" )&lt;/span&gt;&lt;br /&gt;&lt;span style="font-family: courier new;"&gt;names(DF) &amp;lt;- names(DF2)&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7284674654701313617?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7284674654701313617/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7284674654701313617' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7284674654701313617'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7284674654701313617'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/05/r-code-to-remove-second-line-of.html' title='R code to remove the second line of a Qualtrics .csv'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2009473687896194667</id><published>2011-05-10T11:20:00.001-07:00</published><updated>2011-05-10T11:26:55.010-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='surveys'/><category scheme='http://www.blogger.com/atom/ns#' term='blogs'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>I'm number one on Amazon turk!</title><content type='html'>I just maxed out my credit card to get a whole bunch of work done on mTurk.  For the moment, I'm the number one requester on the site!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Screen shot:&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/-3y_Pb9jTqEs/TcmCi-WjxxI/AAAAAAAAAHQ/jyaEGbml91c/s1600/number_one_on_mturk.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 250px;" src="http://2.bp.blogspot.com/-3y_Pb9jTqEs/TcmCi-WjxxI/AAAAAAAAAHQ/jyaEGbml91c/s400/number_one_on_mturk.png" alt="" id="BLOGGER_PHOTO_ID_5605154748604073746" border="0" /&gt;&lt;/a&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/-02N7W194Rs8/TcmCCyyQi5I/AAAAAAAAAHI/LFsnVr7tSkE/s1600/top_of_mturk.png"&gt;&lt;br /&gt;&lt;/a&gt;Early results look pretty good.  A few turkers cheat, but I've been pleased with the good-faith effort most people seem to put into their work.&lt;br /&gt;&lt;br /&gt;PS - I'm curious about what Randolph Stevenson (in the number 3 slot) is doing...&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2009473687896194667?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2009473687896194667/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2009473687896194667' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2009473687896194667'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2009473687896194667'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/05/im-number-one-on-amazon-turk.html' title='I&apos;m number one on Amazon turk!'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/-3y_Pb9jTqEs/TcmCi-WjxxI/AAAAAAAAAHQ/jyaEGbml91c/s72-c/number_one_on_mturk.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5799793727065653235</id><published>2011-05-05T07:43:00.000-07:00</published><updated>2011-05-05T07:49:52.214-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web spiders'/><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='sampling'/><title type='text'>Working paper: An automated snowball census of the political web</title><content type='html'>Here's &lt;a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1832024"&gt;my paper&lt;/a&gt; for the &lt;a href="http://www.umass.edu/jitp/2011.htm"&gt;JITP Future of Computational Social Science conference&lt;/a&gt; in a couple weeks.  This paper describes my process for using &lt;a href="http://lowlywonk.blogspot.com/2011/04/announcing-snowcrawl.html"&gt;SnowCrawl&lt;/a&gt; and a &lt;a href="http://lowlywonk.blogspot.com/2010/11/starting-to-get-some-dissertation.html"&gt;highly trained text classifier&lt;/a&gt; to search out political web sites -- pretty much all of them -- on the web.&lt;br /&gt;&lt;br /&gt;Final census results are available &lt;a href="http://www-personal.umich.edu/~agong/resources.html"&gt;here&lt;/a&gt;.  I'm planning to run another iteration of this census before too long.  I welcome comments and suggestions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5799793727065653235?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5799793727065653235/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5799793727065653235' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5799793727065653235'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5799793727065653235'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/05/working-paper-automated-snowball-census.html' title='Working paper: An automated snowball census of the political web'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-636829686154996763</id><published>2011-05-01T20:00:00.000-07:00</published><updated>2011-05-01T20:13:34.262-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='breaking news'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>The NYTimes is reporting Bin Laden dead</title><content type='html'>Story &lt;a href="http://atwar.blogs.nytimes.com/2011/05/01/bin-laden-is-dead-u-s-official-says/"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;UPDATE 11:13: Whitehouse stream here:&lt;br /&gt;&lt;br /&gt;&lt;object height="300" width="480"&gt;&lt;param name="movie" value="http://www.whitehouse.gov/sites/default/modules/wh_multimedia/EOP_OVP_player.swf"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;param name="wmode" value="opaque"&gt;&lt;param name="bgcolor" value="#FFFFFF"&gt;&lt;param name="scale" value="showall"&gt;&lt;param name="quality" value="best"&gt;&lt;param name="align" value="l"&gt;&lt;param name="allowfullscreen" value="true"&gt;&lt;param name="play" value="false"&gt;&lt;param name="menu" value="false"&gt;&lt;param name="loop" value="false"&gt;&lt;param name="flashvars" value="player=http://www.whitehouse.gov/sites/default/modules/wh_multimedia/EOP_OVP_player.swf&amp;amp;src=rtmp://cp68969.live.edgefcs.net/live/WHLive1@4853&amp;amp;scaleMode=stretch&amp;amp;link=&amp;amp;path_to_image=http://www.whitehouse.gov/sites/default/themes/whitehouse/img/facebook_bubble.gif&amp;amp;width=480&amp;amp;height=300"&gt;&lt;embed src="http://www.whitehouse.gov/sites/default/modules/wh_multimedia/EOP_OVP_player.swf" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" flashvars="player=http://www.whitehouse.gov/sites/default/modules/wh_multimedia/EOP_OVP_player.swf&amp;amp;src=rtmp://cp68969.live.edgefcs.net/live/WHLive1@4853&amp;amp;scaleMode=stretch&amp;amp;link=&amp;amp;path_to_image=http://www.whitehouse.gov/sites/default/themes/whitehouse/img/facebook_bubble.gif&amp;amp;width=480&amp;amp;height=300" height="300" width="480"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;!-- LIVE CHAT --&gt;&lt;div style="background-color: rgb(40, 40, 40); width: 480px; font-size: 10px; font-family: Arial,Helvetica,Sans-Serif;"&gt;&lt;div style="border-top: 1px solid rgb(102, 102, 102); margin: 0pt 10px; height: 40px; display: block;"&gt;&lt;div style="background: url(&amp;quot;http://www.whitehouse.gov/sites/default/themes/whitehouse/img/facebook_bubble.gif&amp;quot;) no-repeat scroll 0% 0% transparent; padding-top: 13px; height: 30px; float: left;"&gt;&lt;a href="http://apps.facebook.com/whitehouselive/" style="text-decoration: none; color: rgb(171, 171, 171); margin-left: 28px;"&gt;JOIN THE LIVE CHAT&lt;/a&gt;&lt;/div&gt;&lt;div style="padding-top: 13px; height: 30px; float: right;"&gt;&lt;a href="http://www.whitehouse.gov" style="text-decoration: none; color: rgb(171, 171, 171);"&gt;VISIT WHITEHOUSE.GOV&lt;/a&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;!-- END LIVE CHAT --&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-636829686154996763?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/636829686154996763/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=636829686154996763' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/636829686154996763'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/636829686154996763'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/05/nytimes-is-reporting-bin-laden-dead.html' title='The NYTimes is reporting Bin Laden dead'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3550769711363575215</id><published>2011-04-29T13:39:00.000-07:00</published><updated>2011-04-29T13:41:32.273-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='computational social science'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>Computational politics: U.S. House legislation may move to XML?</title><content type='html'>http://thehill.com/blogs/hillicon-valley/technology/158339-boehner-cantor-want-house-to-use-open-data-formats&lt;br /&gt;&lt;br /&gt;This would be a huge boon to computational social scientists and groups like the &lt;a href="http://sunlightfoundation.com/"&gt;Sunlight Project&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3550769711363575215?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3550769711363575215/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3550769711363575215' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3550769711363575215'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3550769711363575215'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/computational-politics-us-house.html' title='Computational politics: U.S. House legislation may move to XML?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6492111053123483933</id><published>2011-04-29T04:42:00.000-07:00</published><updated>2011-05-01T20:02:10.229-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='economics'/><category scheme='http://www.blogger.com/atom/ns#' term='movies'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>The economics and politics of the Death Star</title><content type='html'>&lt;a href="http://www.overthinkingit.com/2011/04/25/star-wars-death-star-economics/2/"&gt;The economics and politics of the Death Star&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;"What’s the economic calculus behind the Empire’s tactic of A) building a  Death Star, B) intimidating planets into submission with the threat of  destruction, and C) actually carrying through with said destruction if  the planet doesn’t comply?"&lt;br /&gt;&lt;br /&gt;Fun discussion with lots of analogies to history and current politics&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6492111053123483933?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6492111053123483933/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6492111053123483933' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6492111053123483933'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6492111053123483933'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/economics-and-politics-of-death-star.html' title='The economics and politics of the Death Star'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3814671297674795185</id><published>2011-04-26T10:54:00.000-07:00</published><updated>2011-04-26T10:54:00.335-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='complex systems'/><category scheme='http://www.blogger.com/atom/ns#' term='decentralization'/><title type='text'>Is bitcoin a decentralizing, democratizing agent?</title><content type='html'>A response to &lt;a href="http://canadaduane.posterous.com/is-the-bitconomy-the-frontier-of-economic-inn#comment"&gt;Duane on the importance of bitcoin&lt;/a&gt; as a new, decentralized currency. (This has been a fun running debate among friends over the last couple weeks.)&lt;br /&gt;&lt;br /&gt;I read his recent post as claiming "decentralized is better."  My response: "sometimes, but not always."  Here's my reasoning.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Point 1: Money is a figment of our shared imagination.&lt;/span&gt;  It has value because we all collectively accept that it has value.  There's nothing particularly special about green paper, or gold rocks, or any given set of bits, except for the common knowledge that other people will also accept those currencies in exchange for goods and services. In that sense, the valuation of any currency is already "democratic" -- its real value exists in the minds of a distributed network of people. People were using rare stones as currency long before governments got involved.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Point 2: Enlightened self-interest leads us to centralize some of the responsiblity for maintaining currencies.&lt;/span&gt; One of the main threats to any currency is counterfeiting, so it makes sense to centralize responsibility for preventing counterfeiting to mints and enforcement agencies like the secret service.  Another threat is inflation (or deflation), which is why every modern state with a large economy uses a central bank to manage inflation via the money supply.  A third "threat" is transaction costs, which is why we use credit cards for so many things -- the artificial "currencies" maintained by these corporations are so convenient that they have displaced state-authorized currency in many transactions*.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Point 3: As a side-effect of centralization, governments gain some power to regulate other uses of money.&lt;/span&gt;  (Note that governments, like currencies, are also a figment of collective imagination.  The Constitution is law because we all agree that it's the law.)  Thus, the imposition of taxes, tariffs, and embargoes; and continuing efforts by organized criminals to circumvent the system by laundering money.  These things are not necessarily good or bad. They depend on whether we approve of the use of government power in those areas.  By and large, I imagine most people approve of governments shutting down mobsters and sex trafficking rings.  Using the same suite of tools for defense spending, planned parenthood, agricultural subsidies, "bailouts," etc. is more controversial.&lt;br /&gt;&lt;br /&gt;On the whole, I'd argue that people are better off because governments have these capabilities at their disposal -- especially people living in places where government is reasonably transparent and accountable to its citizens.  We're better off because we've decided to pay taxes for roadways, police stations, and schools.  We're better off because the value of a dollar is reasonably stable. We're better off because corporations (especially publically traded firms) are forced to keep a strict accounting of their transactions. We're better off because the FBI and CIA can use financial information to crack down on terrorists and organized criminals.&lt;br /&gt;&lt;br /&gt;Putting that all together, I see &lt;a href="http://www.bitcoin.org/"&gt;bitcoin&lt;/a&gt; as an attempt to float an unregulateable currency using P2P technology and cryptography. By construction, such a currency would make it very difficult for government to intervene in the ways discussed in points two and three. Although I don't agree with everything our government does**, I don't see compelling reasons to deny ourselves the ability to use those tools as a society. Together, we're made better off by careful use of centralized financial regulation and lawmaking. That being the case, bitcoin seems partly radical, partly old hat, and partly a step backwards.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*Semantic question: are credit cards a centralized or decentralized currency? What about frequent flyer miles?  What about Subway sandwich discount punch cards?  At some level, the labels "centralized" and "decentralized" are too blunt to be really useful.  This seems related to Duane's comments about needing both more and less regulation.&lt;br /&gt;&lt;br /&gt;**Nobody does.  That's the nature of compromise.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3814671297674795185?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3814671297674795185/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3814671297674795185' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3814671297674795185'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3814671297674795185'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/is-bitcoin-decentralizing-democratizing.html' title='Is bitcoin a decentralizing, democratizing agent?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-465774971817071672</id><published>2011-04-25T11:39:00.000-07:00</published><updated>2011-04-25T11:39:00.586-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='amazon'/><category scheme='http://www.blogger.com/atom/ns#' term='cloud computing'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>The best $5 I've spent all year</title><content type='html'>I finally started using Amazon's EC2 yesterday.  I've been meaning to learn it forever, but assumed it would be time-consuming to get registered, set up an instance, and so on.&lt;br /&gt;&lt;br /&gt;Not true.  Thursday morning at 10am, I registered for EC2.  By 10:30 I had an instance of Drew Conway's Py/R AMI up and running, with several additional libraries installed, and a few GB of data I wanted to crunch uploaded to the server.  Very fast turnaround.&lt;br /&gt;&lt;br /&gt;Eight hours and $4.77 later, I'd crunched a lot of numbers -- by far my most productive workflow all week.  Highly recommend it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-465774971817071672?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/465774971817071672/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=465774971817071672' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/465774971817071672'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/465774971817071672'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/best-5-ive-spent-all-year.html' title='The best $5 I&apos;ve spent all year'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-188619935466733247</id><published>2011-04-22T10:48:00.000-07:00</published><updated>2011-04-22T11:29:14.101-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='education'/><category scheme='http://www.blogger.com/atom/ns#' term='policy'/><category scheme='http://www.blogger.com/atom/ns#' term='information technology'/><title type='text'>Scalable education</title><content type='html'>I got a lot of great responses on my previous post about &lt;a href="http://lowlywonk.blogspot.com/2011/03/scalable-models-for-education.html"&gt;scalable education&lt;/a&gt;, and wanted to share them back out here.  I also got a lot of questions on what I meant by "scalable" education.  Let me speak to that first.&lt;br /&gt;&lt;br /&gt;Thesis: the number of students getting a good education today is about  20 times the number of good teachers.  Under our current classroom  model, "number of high-quality teachers" is the limiting variable.  By  scalable education, I mean models of teaching that could feasibly grow  to orders of magnitude greater than 20 -- systems that would allow one  good teacher (plus the right support system) to teach 2,000 or 2,000,000  kids.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Here's a loosely annotated list of links to proposed models of "scalable education," sorted from least to most:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://nyti.ms/eO7BW6"&gt;Jump&lt;/a&gt; - New curriculum deployment.  Not bad, but not revolutionary.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.pbs.org/teachers/digitallearners/watch/"&gt;PBS documentary on "digital learning"&lt;/a&gt; - This is a mixed bag.  Some models (e.g. the Smithsonian scavenger hunt) are neat, engaging students in new ways.  I was less impressed with several segments that are just trading classrooms for classrooms plus computers.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.teachforamerica.org/"&gt;Teach for America&lt;/a&gt; - TFA's  teachers do a lot of good, but they also tend move on quickly.  (This  may be changing.) On the other hand, TFA is very proactive about  maintaining their alumni network.  If this is going the be  revolutionary, it will be as a policymaking network, more than a  teaching force.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.apple.com/education/itunes-u/"&gt;itunes-U&lt;/a&gt; - Possibly revolutionary.  But content delivery &amp;lt; education.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.ted.com/"&gt;TED&lt;/a&gt; - "The first major educational brand to emerge in a century."  Very cool topics, but see previous.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.extension.harvard.edu/DistanceEd/"&gt;Harvard's Distance Education&lt;/a&gt;  - Lectures are free; course credit with the Harvard brand cost $. Many  universities are doing this; Harvard is probably the best known.  This is changing things, but I'm not sure how competition is going to play out in this space.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.khanacademy.org/"&gt;The Khan Academy&lt;/a&gt; - My number one vote for a potentially disruptive model of education.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-188619935466733247?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/188619935466733247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=188619935466733247' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/188619935466733247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/188619935466733247'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/scalable-education.html' title='Scalable education'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-516903356400569746</id><published>2011-04-21T13:46:00.000-07:00</published><updated>2011-04-21T13:46:00.407-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='presentations'/><category scheme='http://www.blogger.com/atom/ns#' term='agent based models'/><title type='text'>Angry birds and agent-based modeling</title><content type='html'>"How +1 modeling can strengthen the logic of simulation and keep your pigs intact."&lt;br /&gt;&lt;br /&gt;&lt;object id="prezi_439e4f67c1571e91f81e498aafd3a91bbbd878ab" name="prezi_439e4f67c1571e91f81e498aafd3a91bbbd878ab" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" width="550" height="400"&gt;&lt;param name="movie" value="http://prezi.com/bin/preziloader.swf"/&gt;&lt;param name="allowfullscreen" value="true"/&gt;&lt;param name="allowscriptaccess" value="always"/&gt;&lt;param name="bgcolor" value="#ffffff"/&gt;&lt;param name="flashvars" value="prezi_id=439e4f67c1571e91f81e498aafd3a91bbbd878ab&amp;amp;lock_to_path=0&amp;amp;color=ffffff&amp;amp;autoplay=no&amp;amp;autohide_ctrls=0"/&gt;&lt;embed id="preziEmbed_439e4f67c1571e91f81e498aafd3a91bbbd878ab" name="preziEmbed_439e4f67c1571e91f81e498aafd3a91bbbd878ab" src="http://prezi.com/bin/preziloader.swf" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="550" height="400" bgcolor="#ffffff" flashvars="prezi_id=439e4f67c1571e91f81e498aafd3a91bbbd878ab&amp;amp;lock_to_path=0&amp;amp;color=ffffff&amp;amp;autoplay=no&amp;amp;autohide_ctrls=0"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-516903356400569746?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/516903356400569746/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=516903356400569746' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/516903356400569746'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/516903356400569746'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/angry-birds-and-agent-based-modeling.html' title='Angry birds and agent-based modeling'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3954863472763459358</id><published>2011-04-20T12:09:00.000-07:00</published><updated>2011-04-20T12:09:00.331-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='video'/><title type='text'>What I've been watching lately</title><content type='html'>Ralph Lagner's fascinating TED talk on reverse-engineering the Stuxnet worm.  This is the best presentation on cyber-terrorism/security I've seen so far.&lt;br /&gt;&lt;object width="446" height="326"&gt;&lt;param name="movie" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true" /&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/param&gt;&lt;param name="bgColor" value="#ffffff"&gt;&lt;/param&gt; &lt;param name="flashvars" value="vu=http://video.ted.com/talks/dynamic/RalphLangner_2011-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/RalphLangner-2011.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=1107&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=ralph_langner_cracking_stuxnet_a_21st_century_cyberweap;year=2011;theme=a_taste_of_ted2011;theme=war_and_peace;theme=bold_predictions_stern_warnings;theme=numbers_at_play;theme=what_s_next_in_tech;event=What%27s+Next+in+Tech;tag=Iran;tag=Science;tag=Technology;tag=computers;tag=government;tag=nuclear+energy;tag=politics;tag=war;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" /&gt;&lt;embed src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" pluginspace="http://www.macromedia.com/go/getflashplayer" type="application/x-shockwave-flash" wmode="transparent" bgColor="#ffffff" width="446" height="326" allowFullScreen="true" allowScriptAccess="always" flashvars="vu=http://video.ted.com/talks/dynamic/RalphLangner_2011-medium.flv&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/RalphLangner-2011.embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=1107&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=ralph_langner_cracking_stuxnet_a_21st_century_cyberweap;year=2011;theme=a_taste_of_ted2011;theme=war_and_peace;theme=bold_predictions_stern_warnings;theme=numbers_at_play;theme=what_s_next_in_tech;event=What%27s+Next+in+Tech;tag=Iran;tag=Science;tag=Technology;tag=computers;tag=government;tag=nuclear+energy;tag=politics;tag=war;"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;The bubble sort algorithm, illustrated as a Hungarian line dance.  (via flowingdata) Fun, but a little long.  What sort of dance could illustrate a heap sort?&lt;br /&gt;&lt;object style="height: 390px; width: 640px"&gt;&lt;param name="movie" value="http://www.youtube.com/v/lyZQPjUT5B4?version=3"&gt;&lt;param name="allowFullScreen" value="true"&gt;&lt;param name="allowScriptAccess" value="always"&gt;&lt;embed src="http://www.youtube.com/v/lyZQPjUT5B4?version=3" type="application/x-shockwave-flash" allowfullscreen="true" allowScriptAccess="always" width="640" height="390"&gt;&lt;/object&gt;&lt;br /&gt;&lt;br /&gt;A quick (4min) TED talk on Google's driverless cars.  Nifty.&lt;br /&gt;&lt;object width="446" height="326"&gt;&lt;param name="movie" value="http://video.ted.com/assets/player/swf/EmbedPlayer.swf"&gt;&lt;/param&gt;&lt;param name="allowFullScreen" value="true" /&gt;&lt;param name="allowScriptAccess" value="always"/&gt;&lt;param name="wmode" value="transparent"&gt;&lt;/param&gt;&lt;param name="bgColor" value="#ffffff"&gt;&lt;/param&gt; &lt;param name="flashvars" value="vu=http://video.ted.com/talk/stream/2011/Blank/SebastianThrun_2011-320k.mp4&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/SebastianThrun_2011-embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=1109&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=sebastian_thrun_google_s_driverless_car;year=2011;theme=design_like_you_give_a_damn;theme=what_s_next_in_tech;theme=tales_of_invention;theme=new_on_ted_com;theme=a_taste_of_ted2011;event=A+Taste+of+TED2011;&amp;preAdTag=tconf.ted/embed;tile=1;sz=512x288;" /&gt;&lt;embed src="http://video.ted.com/assets/player/swf/EmbedPlayer.swf" pluginspace="http://www.macromedia.com/go/getflashplayer" type="application/x-shockwave-flash" wmode="transparent" bgColor="#ffffff" width="446" height="326" allowFullScreen="true" allowScriptAccess="always" flashvars="vu=http://video.ted.com/talk/stream/2011/Blank/SebastianThrun_2011-320k.mp4&amp;su=http://images.ted.com/images/ted/tedindex/embed-posters/SebastianThrun_2011-embed_thumbnail.jpg&amp;vw=432&amp;vh=240&amp;ap=0&amp;ti=1109&amp;lang=eng&amp;introDuration=15330&amp;adDuration=4000&amp;postAdDuration=830&amp;adKeys=talk=sebastian_thrun_google_s_driverless_car;year=2011;theme=design_like_you_give_a_damn;theme=what_s_next_in_tech;theme=tales_of_invention;theme=new_on_ted_com;theme=a_taste_of_ted2011;event=A+Taste+of+TED2011;"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3954863472763459358?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3954863472763459358/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3954863472763459358' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3954863472763459358'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3954863472763459358'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/what-ive-been-watching-lately.html' title='What I&apos;ve been watching lately'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3515018586157909123</id><published>2011-04-19T06:35:00.000-07:00</published><updated>2011-04-19T06:35:00.526-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='web spiders'/><category scheme='http://www.blogger.com/atom/ns#' term='python'/><category scheme='http://www.blogger.com/atom/ns#' term='code'/><category scheme='http://www.blogger.com/atom/ns#' term='sampling'/><title type='text'>Code: a random web crawler</title><content type='html'>This code crawls the web to generate a pseudo-random sample of web sites.  Not my prettiest code, but it works and may save somebody an afternoon of coding.&lt;br /&gt;&lt;br /&gt;random_crawler.tar.gz (14MB, zipped)&lt;br /&gt;&lt;blockquote&gt;This script explores the web for a pseudo-random sample of sites.  The crawl proceeds in a series of 100 (by default) waves.  In each wave, 2,000 (by default) crawlers attempt to download pages.  When page downloads are successful, all the unique outbound hyperlinks are stored to a master list.  Then sites for the next round are sampled.&lt;br /&gt;&lt;br /&gt;Sites from common domain names are undersampled, to avoid getting stuck (e.g. within specific social networking sites).  When sites are selected for the next round, weights are equal x^(3/4), where x is the number of sites in the same domain.&lt;br /&gt;&lt;br /&gt;After several waves, the sample should be well-mixed -- a pseudo-random sample of the web.&lt;br /&gt;&lt;br /&gt;Note: This code is kind of sloppy, does not scale well, and is poorly commented.  Bad code!  Bad!&lt;br /&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3515018586157909123?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3515018586157909123/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3515018586157909123' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3515018586157909123'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3515018586157909123'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/code-random-web-crawler.html' title='Code: a random web crawler'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-64324980809670814</id><published>2011-04-18T12:03:00.000-07:00</published><updated>2011-04-18T12:03:00.308-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><title type='text'>Another data resource</title><content type='html'>I'm on a real data kick this week.  Here's another one that should be useful for people training language classifiers.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.cscs.umich.edu/%7Eagong/resources/lang_samples.tar.gz"&gt;lang_samples.tar.gz (1.5G gzipped)&lt;/a&gt;&lt;br /&gt;&lt;blockquote&gt;This archive contains language samples from the 2008 static wikipedia dumps available at http://static.wikipedia.org/downloads/2008-06/.  I downloaded all 261 archives, and extracted samples of text from each.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-64324980809670814?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/64324980809670814/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=64324980809670814' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/64324980809670814'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/64324980809670814'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/another-data-resource.html' title='Another data resource'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8582865680361324370</id><published>2011-04-16T07:27:00.000-07:00</published><updated>2011-04-16T07:39:53.760-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='scientific method'/><category scheme='http://www.blogger.com/atom/ns#' term='data science'/><category scheme='http://www.blogger.com/atom/ns#' term='big data'/><category scheme='http://www.blogger.com/atom/ns#' term='social science'/><title type='text'>What does it take to be a data scientist?</title><content type='html'>&lt;div&gt;Conway on what it takes to be a data scientist (@ &lt;a href="http://www.drewconway.com/zia/"&gt;ZIA&lt;/a&gt;, ht: &lt;a href="http://computationallegalstudies.com/"&gt;Mike B&lt;/a&gt;).&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;a href="http://4.bp.blogspot.com/-YdsOqNSxOVk/TamnVzr9KKI/AAAAAAAAAHA/9mQ3ZVoGQzg/s1600/conway_data_science_Venn_diagram.png" onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}"&gt;&lt;img style="display:block; margin:0px auto 10px; text-align:center;cursor:pointer; cursor:hand;width: 320px; height: 306px;" src="http://4.bp.blogspot.com/-YdsOqNSxOVk/TamnVzr9KKI/AAAAAAAAAHA/9mQ3ZVoGQzg/s320/conway_data_science_Venn_diagram.png" border="0" alt="" id="BLOGGER_PHOTO_ID_5596188005078214818" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;div&gt;The full article is &lt;a href="http://www.drewconway.com/zia/wp-content/uploads/2011/04/IQT-Quarterly_Spring-2011_Conway.pdf"&gt;here&lt;/a&gt;. It's short and sweet, and offers a nice counterpoint to some of the claims made by people with a more &lt;a href="http://www.google.com/research/pubs/archive/35179.pdf"&gt;computer-science-centric view of the world&lt;/a&gt;.  Turns out that modeling assumptions (i.e. math and statistics) and theory (i.e. substantive expertise) matter. You ignore them at your own risk.&lt;/div&gt;&lt;div&gt;&lt;br /&gt;&lt;/div&gt;&lt;div&gt;PS: The title makes it sound like this is about U.S. intelligence, but almost all the points in the article apply to business and academia as well.&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8582865680361324370?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8582865680361324370/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8582865680361324370' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8582865680361324370'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8582865680361324370'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/what-does-it-take-to-be-data-scientist.html' title='What does it take to be a data scientist?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/-YdsOqNSxOVk/TamnVzr9KKI/AAAAAAAAAHA/9mQ3ZVoGQzg/s72-c/conway_data_science_Venn_diagram.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3995276127101092862</id><published>2011-04-15T12:18:00.000-07:00</published><updated>2011-04-18T11:17:11.124-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='web spiders'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>More text archives</title><content type='html'>&lt;a href="http://www.cscs.umich.edu/%7Eagong/resources/text_archives-4-2011.tar.gz"&gt;Three more data sets&lt;/a&gt; (108MB, gzipped) for training classifiers.  All these files are text-only, crawled in the week of 12/17/2010.  It just took me a while to get around to releasing them.&lt;br /&gt;&lt;blockquote&gt;political (115M):&lt;br /&gt;Front pages from ~2,500 political sites.  These sites were rated extremely likely to be political (p&amp;gt;.99) in an early version of my census of the political web.  I revisited the same sites several months later and downloaded the new front pages to create this dataset.  They should be appropriate for case-control training of a political classifier.&lt;br /&gt;&lt;br /&gt;random (471M):&lt;br /&gt;Front pages from ~8,000 "random" sites.  These are a pseudo-representative sample of the web.  I ran several web spiders in parallel, recording all outbound links from visited sites.  I deliberately undersampled common namespaces, in order to avoid getting trapped in densely-linked social networking sites.  The 8,000 sites are a random sample from a list of ~2 million sites generated in a crawl of this type.&lt;br /&gt;&lt;br /&gt;porn (43M):&lt;br /&gt;Front pages from ~1,200 pornography sites.  I never thought I'd be curating pornography as part of my dissertation, but so much of the web is porn that I've had to build special classifiers to screen it out before sending to my undergrad research team for coding.  These 1,200 sites were all linked from http://research.vision-options.com/research.php.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3995276127101092862?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3995276127101092862/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3995276127101092862' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3995276127101092862'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3995276127101092862'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/more-text-archives.html' title='More text archives'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4855536089192464472</id><published>2011-04-14T12:07:00.000-07:00</published><updated>2011-04-14T12:09:15.149-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>It's data dump week!</title><content type='html'>In case this wasn't clear already, it's data dump week.  I'm gearing up for another crawl of the political web, and posting lots of bits of code and data along the way.  If nothing else, I'll be able to find them here in the future.  If these (or similar) resources are useful to you, please let me know.&lt;br /&gt;&lt;br /&gt;Cheers!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4855536089192464472?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4855536089192464472/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4855536089192464472' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4855536089192464472'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4855536089192464472'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/its-data-dump-week.html' title='It&apos;s data dump week!'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-74481216197665361</id><published>2011-04-14T08:28:00.000-07:00</published><updated>2011-04-21T08:33:55.241-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='web spiders'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>Data release: 75K political web sites</title><content type='html'>Data release:&lt;br /&gt;&lt;a href="http://www.cscs.umich.edu/%7Eagong/resources/pol_site_sample.tar.gz"&gt;pol_site_sample.tar.gz&lt;/a&gt; (1.2GB, gzipped)&lt;br /&gt;&lt;blockquote&gt;This archive contains results from a front-page-only crawl of ~75,000 political web sites on 4/9/2011.&lt;br /&gt;&lt;br /&gt;These sites span a large portion of the political web, but they are not a representative sample.  Each was classified as very likely to be political (p &amp;gt; 0.9) in a crawl performed in August 2010. In other words, these are pages from sites that featured political content 8 months ago.  Presumably, most -- but certainly not all -- of the sites still feature political content today.&lt;br /&gt;&lt;/blockquote&gt;I'm going to use this to train content classifiers.  Might be useful to others as well.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-74481216197665361?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/74481216197665361/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=74481216197665361' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/74481216197665361'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/74481216197665361'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/data-release-75k-political-web-sites.html' title='Data release: 75K political web sites'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-799998547259269777</id><published>2011-04-13T07:42:00.000-07:00</published><updated>2011-04-13T07:42:01.026-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><title type='text'>Links on crowdsourcing</title><content type='html'>Point: &lt;a href="http://www.cond.org/eadar-crowdsourcing-workshop.pdf"&gt;Why I hate mturk (research)&lt;/a&gt;&lt;br /&gt;Counterpoint: &lt;a href="http://www.slideshare.net/ipeirotis/managing-crowdsourced-human-computation"&gt;Managing crowdsourced human computation&lt;/a&gt;&lt;span style="display: block;" id="formatbar_Buttons"&gt;&lt;span class=" on down" style="display: block;" id="formatbar_CreateLink" title="Link"&gt;&lt;img src="http://www.blogger.com/img/blank.gif" alt="Link" class="gl_link" border="0" /&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-799998547259269777?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/799998547259269777/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=799998547259269777' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/799998547259269777'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/799998547259269777'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/links-on-crowdsourcing.html' title='Links on crowdsourcing'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3153618023723169513</id><published>2011-04-12T08:53:00.000-07:00</published><updated>2011-04-12T08:53:00.716-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='software'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>Lightweight pdf renderers</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.astro.columbia.edu/%7Esavin/images/pdficon.png"&gt;&lt;img style="float: right; margin: 0pt 0pt 10px 10px; cursor: pointer; width: 192px; height: 192px;" src="http://www.astro.columbia.edu/%7Esavin/images/pdficon.png" alt="" border="0" /&gt;&lt;/a&gt;I'm finishing up dissertation data collection in the next ~6 weeks, which means I'm going to be spending a lot less time writing code, and a lot more time analyzing data and writing papers.  So R and laTex are going to be my new best friends.&lt;br /&gt;&lt;br /&gt;Taking a good look at my workflow around these packages, I realized that viewing pdfs was really slowing me down.  Every time I generate a graph or paper, I have to open up the pdf version and see what it looks like.  Adobe's very bulky software takes several seconds to load -- very frustrating when you're playing with margins or table formating and want to iterate quickly.&lt;br /&gt;&lt;br /&gt;So I went out looking for a lightweight pdf viewer.  Here's what I found:&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.downloadmunkey.net/2008/04/random-monday-foxit-reader-vs-pdf-xchange-viewer-vs-sumatra/"&gt;http://www.downloadmunkey.net/2008/04/random-monday-foxit-reader-vs-pdf-xchange-viewer-vs-sumatra/&lt;br /&gt;&lt;/a&gt;&lt;a href="http://portableapps.com/node/17260"&gt;http://portableapps.com/node/17260&lt;br /&gt;&lt;/a&gt;&lt;a href="http://www.techsupportalert.com/best-free-non-adobe-pdf-reader.htm"&gt;http://www.techsupportalert.com/best-free-non-adobe-pdf-reader.htm&lt;br /&gt;&lt;/a&gt;&lt;br /&gt;Any other advice?&lt;br /&gt;&lt;br /&gt;Based on those reviews, I'm going to give &lt;a href="http://www.tracker-software.com/"&gt;PDF-XChange&lt;/a&gt; a shot.  I'll let you know how it goes.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3153618023723169513?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3153618023723169513/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3153618023723169513' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3153618023723169513'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3153618023723169513'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/lightweight-pdf-renderers.html' title='Lightweight pdf renderers'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1751824940201260750</id><published>2011-04-11T07:40:00.000-07:00</published><updated>2011-04-11T07:42:03.736-07:00</updated><title type='text'>Redesigned my layout</title><content type='html'>I've been meaning to do this for a while.  tpmotd's nudge finally got me to make the neccesary 15 minutes.&lt;br /&gt;&lt;br /&gt;Now all my pictures should actually fit!&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1751824940201260750?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1751824940201260750/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1751824940201260750' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1751824940201260750'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1751824940201260750'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/redesigned-my-layout.html' title='Redesigned my layout'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8494588019341938766</id><published>2011-04-09T10:48:00.000-07:00</published><updated>2011-04-09T10:48:00.198-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='AI'/><category scheme='http://www.blogger.com/atom/ns#' term='presentations'/><category scheme='http://www.blogger.com/atom/ns#' term='game theory'/><title type='text'>Prezi: AI and games</title><content type='html'>A few months ago I &lt;a href="http://lowlywonk.blogspot.com/2010/12/prezi-quick-review.html"&gt;first tried out&lt;/a&gt; prezi.  Since then, I've seen it trickling into presentations here and there.  The novelty is great, and the usability is a little better than last time I played with it.  So I went ahead and&lt;br /&gt;&lt;br /&gt;These are slides for my (mock) TED talk from the Hill Street TED activity yesterday.&lt;br /&gt;&lt;br /&gt;&lt;object id="prezi_2625faafb2552485640a79aee4b02bae76ae08e9" name="prezi_2625faafb2552485640a79aee4b02bae76ae08e9" classid="clsid:D27CDB6E-AE6D-11cf-96B8-444553540000" width="550" height="400"&gt;&lt;param name="movie" value="http://prezi.com/bin/preziloader.swf"/&gt;&lt;param name="allowfullscreen" value="true"/&gt;&lt;param name="allowscriptaccess" value="always"/&gt;&lt;param name="bgcolor" value="#ffffff"/&gt;&lt;param name="flashvars" value="prezi_id=2625faafb2552485640a79aee4b02bae76ae08e9&amp;amp;lock_to_path=0&amp;amp;color=ffffff&amp;amp;autoplay=no&amp;amp;autohide_ctrls=0"/&gt;&lt;embed id="preziEmbed_2625faafb2552485640a79aee4b02bae76ae08e9" name="preziEmbed_2625faafb2552485640a79aee4b02bae76ae08e9" src="http://prezi.com/bin/preziloader.swf" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="550" height="400" bgcolor="#ffffff" flashvars="prezi_id=2625faafb2552485640a79aee4b02bae76ae08e9&amp;amp;lock_to_path=0&amp;amp;color=ffffff&amp;amp;autoplay=no&amp;amp;autohide_ctrls=0"&gt;&lt;/embed&gt;&lt;/object&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8494588019341938766?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8494588019341938766/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8494588019341938766' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8494588019341938766'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8494588019341938766'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/prezi-ai-and-games.html' title='Prezi: AI and games'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-797229404780656871</id><published>2011-04-07T07:14:00.000-07:00</published><updated>2011-04-07T07:50:10.313-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><category scheme='http://www.blogger.com/atom/ns#' term='mechanism design'/><title type='text'>Crowdsourcing and buzzword lumping</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.synergizedsolutions.com/simpsons/pics/homer/homer_dreamcar.gif"&gt;&lt;img style="float: right; margin: 0pt 0pt 10px 10px; cursor: pointer; width: 315px; height: 180px;" src="http://www.synergizedsolutions.com/simpsons/pics/homer/homer_dreamcar.gif" alt="" border="0" /&gt;&lt;/a&gt;In general, I'm a big supporter of crowdsourcing, but I worry about &lt;span dir="ltr" id=":24g"&gt;lumping together too many things under one popular buzzword.&lt;/span&gt;  A few NYTimes articles have spoken to this issue recently (ht Gloria).  &lt;a href="http://opinionator.blogs.nytimes.com/2011/03/28/crowdsourcing-a-better-world/"&gt;This one&lt;/a&gt; is pretty starry-eyed. &lt;a href="http://www.nytimes.com/2009/07/19/technology/internet/19unboxed.html"&gt;This one&lt;/a&gt; unpacks things (a little) more.&lt;br /&gt;&lt;span dir="ltr" id=":24g"&gt;&lt;br /&gt;Let me push on this idea of lumping.  Wikipedia defines crowdsourcing as &lt;/span&gt;&lt;blockquote&gt;&lt;span dir="ltr" id=":24g"&gt;&lt;/span&gt;the act of outsourcing tasks, traditionally performed by an employee or contractor, to an undefined, large group of people or community (a "crowd"), through an open call.&lt;/blockquote&gt;As I read that definition, all of these are crowdsourcing:&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://upload.wikimedia.org/wikipedia/en/thumb/3/3e/FettbobaJB.png/220px-FettbobaJB.png"&gt;&lt;img style="float: right; margin: 0pt 0pt 10px 10px; cursor: pointer; width: 220px; height: 276px;" src="http://upload.wikimedia.org/wikipedia/en/thumb/3/3e/FettbobaJB.png/220px-FettbobaJB.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Wikipedia&lt;/li&gt;&lt;li&gt;&lt;a href="http://openideo.com/"&gt;Ideo&lt;/a&gt;'s open design lab&lt;/li&gt;&lt;li&gt;&lt;a href="http://www.innocentive.com/"&gt;Innocentive&lt;/a&gt;'s innovation contests&lt;/li&gt;&lt;li&gt;&lt;a href="http://99designs.com/"&gt;99design&lt;/a&gt;'s graphic design sweatshop&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Elections&lt;/li&gt;&lt;li&gt;Spam farms&lt;/li&gt;&lt;li&gt;Penny stock pump-and-dump marketing&lt;/li&gt;&lt;li&gt;Bounty hunters and &lt;a href="http://en.wikipedia.org/wiki/Privateer"&gt;privateers&lt;/a&gt;&lt;/li&gt;&lt;/ul&gt;&lt;span dir="ltr" id=":24g"&gt;Trick question: so is crowdsourcing a good thing or a bad thing?&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;My position: We're at a place where technology is enabling new institutions.  It would be backward to ignore that potential.&lt;span dir="ltr" id=":g"&gt;  But  there are all kinds of issues with corruption, lack of expertise, bias  in who participates, etc. that "crowdsourcing" doesn't solve automatically.&lt;/span&gt;  Just like other institutions, crowdsourced institutions have to be designed carefully to head off those problems.  &lt;span dir="ltr" id=":24g"&gt;I don't think it's a magic bullet, but I do think it can help.&lt;br /&gt; &lt;br /&gt;&lt;/span&gt;&lt;span dir="ltr" id=":24g"&gt;&lt;br /&gt;  &lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-797229404780656871?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/797229404780656871/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=797229404780656871' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/797229404780656871'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/797229404780656871'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/crowdsourcing-and-buzzword-lumping.html' title='Crowdsourcing and buzzword lumping'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4194701199429792223</id><published>2011-04-05T09:21:00.000-07:00</published><updated>2011-04-05T09:24:36.214-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Web utilities'/><category scheme='http://www.blogger.com/atom/ns#' term='open source'/><category scheme='http://www.blogger.com/atom/ns#' term='software'/><category scheme='http://www.blogger.com/atom/ns#' term='snowcrawl'/><title type='text'>Announcing SnowCrawl!</title><content type='html'>A&lt;a href="http://code.google.com/p/snowcrawl/logo?cct=0"&gt;&lt;img style="float: left; margin: 0pt 0pt 10px 10px; cursor: pointer; width: 88px; height: 55px;" src="http://code.google.com/p/snowcrawl/logo?cct=0" alt="" border="0" /&gt;&lt;/a&gt;nnouncing the beta release of &lt;a href="http://code.google.com/p/snowcrawl/"&gt;SnowCrawl&lt;/a&gt;, a python library for directed webcrawls. Nice features include: saved  state for backup, support for threading and client-server architecture,  lots of flexibility.&lt;br /&gt;&lt;br /&gt;The project is open source, hosted at google code.  More details to follow.&lt;br /&gt;&lt;span style="display: block;" id="formatbar_Buttons"&gt;&lt;span class=" on down" style="display: block;" id="formatbar_CreateLink" title="Link"&gt;&lt;img src="http://www.blogger.com/img/blank.gif" alt="Link" class="gl_link" border="0" /&gt;&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4194701199429792223?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4194701199429792223/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4194701199429792223' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4194701199429792223'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4194701199429792223'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/04/announcing-snowcrawl.html' title='Announcing SnowCrawl!'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7036919151620510489</id><published>2011-03-14T09:08:00.000-07:00</published><updated>2011-03-14T09:08:01.269-07:00</updated><title type='text'>Wanted: A mystery science theater for stats and programming books.</title><content type='html'>Wanted: &lt;a href="http://en.wikipedia.org/wiki/Mystery_Science_Theater_3000"&gt;A mystery science theater&lt;/a&gt; for stats and programming books.&lt;br /&gt;&lt;br /&gt;Looking for good introductory R books for non-programmers, (Know any?  Books, I mean, not non-programmers.)  I ran across this &lt;a href="http://blog.rtwilson.com/review-statistical-analysis-with-r-beginners-guide-by-john-m-quick/"&gt;wonderful review&lt;/a&gt; of an apparently ludicrous book.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-weight: bold;"&gt;Statistical Analysis with R: Beginner’s Guide by John M. Quick&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;strong&gt;Summary:&lt;/strong&gt; If you can get past the strange underlying  story, then this gives a good introduction to R to someone with no  programming experience.&lt;br /&gt;&lt;br /&gt;[...]&lt;br /&gt;&lt;br /&gt;[B]efore describing the data analysis section of the book, I  should explain the underlying story used throughout the book. The  introductory chapter gives a bit of &lt;span style="font-weight: bold;"&gt;ancient Chinese history&lt;/span&gt;, and states  that &lt;span style="font-weight: bold;"&gt;you, the reader, have been chosen to succeed the famous military  leader Zhuge Liang &lt;/span&gt;and need to &lt;span style="font-weight: bold;"&gt;learn how to use R to analyse his data  and plan the future of the military campaign&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;The rest of the book takes  on this theme, both in the data analysis (comparing the Shu and Wei  armies, and predicting battle outcomes using regression) and the general  phrasing (headings like “&lt;span style="font-weight: bold;"&gt;Have a go hero!&lt;/span&gt;” and emphasis that &lt;span style="font-weight: bold;"&gt;if you fail  the Chinese kingdom will collapse&lt;/span&gt;).&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Wow.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://pressthebuttons.typepad.com/photos/uncategorized/2008/03/21/mst3kshadow.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 300px; height: 92px;" src="http://pressthebuttons.typepad.com/photos/uncategorized/2008/03/21/mst3kshadow.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7036919151620510489?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7036919151620510489/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7036919151620510489' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7036919151620510489'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7036919151620510489'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/03/wanted-mystery-science-theater-for.html' title='Wanted: A mystery science theater for stats and programming books.'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-9105470723881059082</id><published>2011-03-11T09:17:00.000-08:00</published><updated>2011-03-11T09:17:00.552-08:00</updated><title type='text'>Scalable models for education</title><content type='html'>via my good friend Steve Stay:&lt;br /&gt;&lt;br /&gt;The White House has a new initiative out: &lt;a href="http://www.ed.gov/sites/default/files/arpa-ed-background.pdf"&gt;Winning the Education Future - The Role of ARPA-ED (16 pages)&lt;/a&gt;&lt;br /&gt;&lt;blockquote&gt;To address the under-investment in learning technology R and D, the President’s FY2012 budget proposes to invest $90 million to create an Advanced Research Projects Agency for Education (ARPA-ED). ARPA-ED will fund projects performed by industry, universities, or other innovative organizations, selected based on their potential to create a dramatic breakthrough in learning and teaching.&lt;/blockquote&gt;&lt;blockquote&gt;ARPA-ED is trying to catalyze development of:&lt;br /&gt;*  Digital tutors as effective as personal tutors.&lt;br /&gt;* Courses that improve the more students use them.&lt;br /&gt;* Educational software as compelling as the best videogame.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;The intuition here is really interesting -- building technologies for education that scale.  Our ed system is based on a 150-year-old chalk-and-talk delivery system.  One teacher, ~20 students, one lecture at a time, interspersed with homework, projects, and some small group discussion.  I'm stylizing here, but you know I'm not too far off.&lt;br /&gt;&lt;br /&gt;The major &lt;a href="http://www.jstor.org/stable/2138807"&gt;bottleneck in this system&lt;/a&gt; is &lt;a href="http://www.politicalscience.uncc.edu/godwink/PPOL8687/WK11March%2029%20Teachers/Darling-Hammond%20Review%20essay%20on%20teacher%20quality%20and%20outcomes.pdf"&gt;effective teachers&lt;/a&gt;.  Roughly speaking, the number of students getting a good classroom education is equal to the number of good teachers times twenty.  Given the &lt;a href="http://bul.sagepub.com/content/86/631/16.short"&gt;tremendous difficulty of recruiting, training, and retaining teachers&lt;/a&gt;, this is a serious constraint.&lt;br /&gt;&lt;br /&gt;Therefore, I am a huge fan of the idea of deploying better education via scalable technology.  But I don't think we've seen this done right yet.  Some also-rans:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.apple.com/education/itunes-u/"&gt;i-tunes U&lt;/a&gt; and similar platforms strike me as a partial first step -- maybe a good replacement for lectures.  Suppose I'm a middling public speaker with reasonably good subject knowledge in an area.  Why should I lecture when my students can hear from a top-of-the field virtuoso?   But education is a lot more than content delivery.  itunes can't give you feedback, answer questions, or hold office hours.&lt;/li&gt;&lt;br /&gt;&lt;li&gt;In the long run, private online universities may be able step up to provide scalable education.  But so far I've been pretty unimpressed.  For instance, University of Phoenix seems to be running a &lt;a href="http://www.newamerica.net/blog/higher-ed-watch/2009/memo-secretary-duncan-investigate-university-phoenix-9601"&gt;pump-and-dump model based on government subsidies through student loans&lt;/a&gt;.  (Maybe I'm out of line on this claim, but that's my impression.)&lt;/li&gt;&lt;/ul&gt;I think we're still waiting for the real thing in scalable education.  What possibilities am I missing?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-9105470723881059082?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/9105470723881059082/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=9105470723881059082' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9105470723881059082'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9105470723881059082'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/03/scalable-models-for-education.html' title='Scalable models for education'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2372684994123588589</id><published>2011-03-09T16:12:00.000-08:00</published><updated>2011-03-09T16:13:14.416-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='crowdsourcing'/><title type='text'>Three cheers for crowd source/ open source</title><content type='html'>Google is &lt;a href="http://datavizchallenge.org/"&gt;offering $10k in prizes&lt;/a&gt; for visualizations of the U.S. Federal budget.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://blog.revolutionanalytics.com/2011/02/r-overtakes-sas-and-matlab-in-programming-language-popularity.html"&gt;R has overtaken Matlab and SAS in popularity&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.zdnet.com/blog/open-source/hp-the-linux-desktop-company/8423?tag=nl.e550"&gt;HP is now including a Linux-based OS with every PC it ships.&lt;br /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2372684994123588589?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2372684994123588589/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2372684994123588589' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2372684994123588589'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2372684994123588589'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/03/three-cheers-for-crowd-source-open.html' title='Three cheers for crowd source/ open source'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2563276165419058610</id><published>2011-03-09T09:24:00.000-08:00</published><updated>2011-03-09T09:24:00.269-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='information flow'/><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><title type='text'>Information visualization and the Battle of the Atlantic</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://t2.gstatic.com/images?q=tbn:ANd9GcQZOUuIxGi77yoxMWjV6uxMZZCl0ZL3jceEGaNwHiZia7T9bEk&amp;amp;t=1&amp;amp;usg=__q4LG1oJW0Fofefl1ii7BIeaymZg="&gt;&lt;img style="display: block; float: right; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 228px; height: 221px;" src="http://t2.gstatic.com/images?q=tbn:ANd9GcQZOUuIxGi77yoxMWjV6uxMZZCl0ZL3jceEGaNwHiZia7T9bEk&amp;amp;t=1&amp;amp;usg=__q4LG1oJW0Fofefl1ii7BIeaymZg=" alt="" border="0" /&gt;&lt;/a&gt;I've been reading Churchill's 6-volume history of WWII.  Fascinating reading if you're into this kind of stuff.  Some scattered thoughts on history, war, and information:&lt;br /&gt;&lt;br /&gt;WWII was arguably the first war fought through information as much as weaponry.  One of Neal Stephenson's characters in &lt;a href="http://en.wikipedia.org/wiki/Cryptonomicon"&gt;Cryptonomicon&lt;/a&gt; has &lt;a href="http://markpasc.org/blog/gems/athena.html"&gt;a great monologue&lt;/a&gt; on this point.  He claims that Nazi Germany typifies the values of Ares (you know, the Greek god of war), and the U.S./U.K. typify the values of Athena.  In this telling, WWII Germany had an advantage in guns and regimentation, but the proto-hacker cryptographers of &lt;a href="http://en.wikipedia.org/wiki/Bletchley_Park"&gt;Bletchley Park&lt;/a&gt;, etc. ran rings around them with information.  I recommend the monologue, but not the whole book.&lt;br /&gt;&lt;br /&gt;This comes out in Churchill's narrative.  Exhibit A is a set of &lt;a href="http://flowingdata.com/"&gt;FlowingData&lt;/a&gt;-esque maps of merchant ships sunk by U-boats throughout the war.&lt;br /&gt;&lt;br /&gt;A little background: in the middle part of the war (once France had been  defeated, but before  Russia and the U.S. had entered) the "Battle of  the Atlantic" was  probably the single most important "front" in the  war.  As long as England  was connected to her colonies by convoys of  merchant ships, she could  continue to fight.  If bombing and U-boat  action could constrict this  flow of trade sufficiently, the little  island would have no chance.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-style: italic;"&gt;Exhibit A: (scanned on the cheap with my pocket digital camera)&lt;/span&gt;&lt;br /&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/_zUbnTFdqWyg/TJy4cI3T_1I/AAAAAAAAAFw/dNCv9p4Qw6M/s1600/IMG_1443.jpg"&gt;&lt;img style="display: block; float: left; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 150px; height: 200px;" src="http://2.bp.blogspot.com/_zUbnTFdqWyg/TJy4cI3T_1I/AAAAAAAAAFw/dNCv9p4Qw6M/s200/IMG_1443.jpg" alt="" id="BLOGGER_PHOTO_ID_5520490036804714322" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/_zUbnTFdqWyg/TJy4uquh1fI/AAAAAAAAAF4/JGC8e67hyTs/s1600/IMG_1444.jpg"&gt;&lt;img style="display: block; float: left; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 150px; height: 200px;" src="http://2.bp.blogspot.com/_zUbnTFdqWyg/TJy4uquh1fI/AAAAAAAAAF4/JGC8e67hyTs/s200/IMG_1444.jpg" alt="" id="BLOGGER_PHOTO_ID_5520490355132323314" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;a href="http://3.bp.blogspot.com/_zUbnTFdqWyg/TJy5A1B9PLI/AAAAAAAAAGA/g7PELtE9zKc/s1600/IMG_1445.jpg"&gt;&lt;img style="display: block; float: left; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 150px; height: 200px;" src="http://3.bp.blogspot.com/_zUbnTFdqWyg/TJy5A1B9PLI/AAAAAAAAAGA/g7PELtE9zKc/s200/IMG_1445.jpg" alt="" id="BLOGGER_PHOTO_ID_5520490667135810738" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Charts like these make it clear that Churchill was interacting on  wartime data on a  day-to-day basis, and that that flow of information  was crucial to war  effort. Churchill likes to attribute success to the  bulldog-like grit  and willpower of the British people, but it's clear  from his narrative that the flow of information was at least as  important.  In war, grit doesn't matter much without gunpowder.&lt;br /&gt;&lt;br /&gt;In addition to maps, Churchill gives statistics and monthly trends for  various gains and losses in shipping.  They remind me of post-game trend  plots in &lt;a href="http://us.battle.net/sc2/en/buynow?ref=/sc2/"&gt;Starcraft II&lt;/a&gt;.  The general tension between military and  economy is the same.  They also remind me of the &lt;a href="http://www.google.com/images?q=business+dashboard&amp;amp;oe=utf-8&amp;amp;rls=org.mozilla:en-US:official&amp;amp;client=firefox-a&amp;amp;um=1&amp;amp;ie=UTF-8&amp;amp;source=univ&amp;amp;sa=X&amp;amp;ei=9u52TaS1EsTorAHZxoCUCg&amp;amp;ved=0CFEQsAQ&amp;amp;biw=922&amp;amp;bih=522"&gt;dashboards&lt;/a&gt; that are all the rage in business process management these days.  50 years ago, you had to be a superpower at war to devote these kinds of resources to information gathering.  Now, any reasonable-sized business has them.  Heck, even this blog is hooked up to &lt;a href="http://www.sitemeter.com/?a=stats&amp;amp;s=s49abegong"&gt;sitemeter&lt;/a&gt;.  Map of the world, populated with little dots?  Check.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2563276165419058610?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2563276165419058610/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2563276165419058610' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2563276165419058610'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2563276165419058610'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/03/information-visualization-and-battle-of.html' title='Information visualization and the Battle of the Atlantic'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zUbnTFdqWyg/TJy4cI3T_1I/AAAAAAAAAFw/dNCv9p4Qw6M/s72-c/IMG_1443.jpg' height='72' width='72'/><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6073566199634082691</id><published>2011-03-08T17:10:00.001-08:00</published><updated>2011-03-08T17:26:58.469-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='AI'/><category scheme='http://www.blogger.com/atom/ns#' term='randomness'/><title type='text'>AI for rock, paper, scissors</title><content type='html'>A really simple and clever demonstration of another place AI is surprisingly good at beating people: a &lt;a href="http://www.nytimes.com/interactive/science/rock-paper-scissors.html"&gt;rock-paper-scissors game on NYTimes&lt;/a&gt;.  Don't laugh.  The computer analyzes your past play and looks for weaknesses to exploit.  I played it 100 times and lost 23 to 35, with 42 ties.  (I said don't laugh!)&lt;br /&gt;&lt;br /&gt;Nifty application.  Nice design.  The "see what the computer is thinking" is a great  way of explaining how the AI works.  Much better than saying it's a  4th-order Markov process with backing off.&lt;br /&gt;&lt;br /&gt;The small print on the side of the screen is revealing (emphasis mine):&lt;br /&gt;&lt;blockquote&gt;&lt;span style="font-weight: bold;"&gt;&lt;blockquote&gt;&lt;/blockquote&gt;&lt;/span&gt;&lt;span style="font-weight: bold;"&gt;Note: &lt;/span&gt;A truly random game of  rock-paper-scissors would result in a statistical tie with each player  winning, tying and losing one-third of the time.  However, &lt;span style="font-weight: bold;"&gt;people are  not truly random and thus can be studied and analyzed.&lt;/span&gt;  While this  computer won't win all rounds, over time it can exploit a person's  tendencies and patterns to gain an advantage over its opponent. &lt;/blockquote&gt;This is nice evidence that people are surprisingly bad at being unpredictable.&lt;br /&gt;&lt;br /&gt;PS. I ran the numbers.  The odds of me doing so poorly against the computer in an even game (one third wins, one third losses, one third ties) are about one in 60, a p value of 0.015.  Not impossible, but extremely unlikely.  So the smart money is on the computer.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6073566199634082691?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6073566199634082691/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6073566199634082691' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6073566199634082691'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6073566199634082691'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/03/ai-for-rock-paper-scissors.html' title='AI for rock, paper, scissors'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-84642946256259625</id><published>2011-02-28T09:11:00.000-08:00</published><updated>2011-02-28T09:38:41.582-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='scientific method'/><category scheme='http://www.blogger.com/atom/ns#' term='prediction'/><category scheme='http://www.blogger.com/atom/ns#' term='social science'/><title type='text'>What examples for successful prediction do we have in social science?</title><content type='html'>A question that's been bugging me: as social scientists, what can we predict reasonably well?&lt;br /&gt;&lt;br /&gt;Following &lt;a href="http://en.wikipedia.org/wiki/Thomas_Kuhn"&gt;Kuhn's definition for a scientific paradigm&lt;/a&gt;, I'm focusing on:&lt;br /&gt;1) social phenomena&lt;br /&gt;2) that we can predict with enough accuracy to be useful&lt;br /&gt;3) using technical skills that require special training.&lt;br /&gt;&lt;br /&gt;I've found surprisingly few examples that satisfy all of these criteria.  Only three, in fact.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Aptitude testing&lt;br /&gt;Prediction: How well will a person perform in school, a given job, the Army, etc.?&lt;br /&gt;Technical skill: Psychometrics&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Microeconomics&lt;br /&gt;Prediction: What effect will various market interventions have on the price, quantity supplied, etc. for a specific good?&lt;br /&gt;Technical skill: Producer, consumer, and game theory&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li&gt;Election polling&lt;br /&gt;Prediction: Who will win in a given election?&lt;br /&gt;Technical skills: Survey design, sampling theory&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;/ol&gt;Can you think of any others?  I've got to be missing some.&lt;br /&gt;&lt;br /&gt;What areas *should* we be able to predict?  We have all kinds of new tools as social scientists.  It seems like we should be ready to tackle some new challenges.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-84642946256259625?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/84642946256259625/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=84642946256259625' title='4 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/84642946256259625'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/84642946256259625'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/what-exampels-for-successful-prediction.html' title='What examples for successful prediction do we have in social science?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>4</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1479237575134620612</id><published>2011-02-26T07:46:00.000-08:00</published><updated>2011-02-26T11:05:36.114-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='social science'/><category scheme='http://www.blogger.com/atom/ns#' term='research design'/><title type='text'>How to do research that has an impact, and still survive</title><content type='html'>Fascinating talk by Richard Hamming, of hamming code fame: &lt;a href="http://www.paulgraham.com/hamming.html"&gt;You and Your Research&lt;/a&gt;.  It's a very long and detailed discussion of how to build a scientific career in a way that will maximize the long-term impact of your work.&lt;br /&gt;&lt;br /&gt;And here's an interesting counterpoint from the other end of a scientific career: how to survive grad school.  &lt;a href="http://yang.entomology.ucdavis.edu/assets/pdfs/StearnsandHuey.pdf"&gt;Some Modest Advice for Graduate Students&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1479237575134620612?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1479237575134620612/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1479237575134620612' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1479237575134620612'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1479237575134620612'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/how-to-do-research-that-has-impact-and.html' title='How to do research that has an impact, and still survive'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7523951072555086872</id><published>2011-02-21T09:10:00.000-08:00</published><updated>2011-02-26T11:11:48.490-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='media'/><category scheme='http://www.blogger.com/atom/ns#' term='NLP'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>Glenn Beck conspiracy generator</title><content type='html'>A &lt;a href="http://politicalhumor.about.com/library/bl-glenn-beck-conspiracy.htm?PS=683%2C601%2C701%2C242%3A19"&gt;Glenn Beck conspiracy generator&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;How does this thing work?  I'm guessing mturk or some mailing list.  The phrases don't seem quite formulaic enough for &lt;a href="http://www.in-vacua.com/markov_gen.html"&gt;Markov generation&lt;/a&gt; or &lt;a href="http://www.lavarnd.org/cgi-bin/corpspeak.cgi"&gt;automated madlibs&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7523951072555086872?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7523951072555086872/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7523951072555086872' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7523951072555086872'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7523951072555086872'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/glenn-beck-conspiracy-generator.html' title='Glenn Beck conspiracy generator'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1557390021298571594</id><published>2011-02-19T20:51:00.000-08:00</published><updated>2011-02-19T20:55:09.183-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><title type='text'>Best. Weather. Forecast. Ever.</title><content type='html'>&lt;a href="http://weatherspark.com/"&gt;Weatherspark&lt;/a&gt; uses the same data as everyone else, but they make it so much more usable.  Gorgeous interactive maps, trends, and predictions.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1557390021298571594?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1557390021298571594/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1557390021298571594' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1557390021298571594'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1557390021298571594'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/best-weather-forecast-ever.html' title='Best. Weather. Forecast. Ever.'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7089425161852947409</id><published>2011-02-15T14:39:00.000-08:00</published><updated>2011-02-18T05:11:57.259-08:00</updated><title type='text'>Link mishmash</title><content type='html'>Here's some of what I've been reading/watching online lately.&lt;br /&gt;&lt;br /&gt;1. From TED, on &lt;a href="http://www.ted.com/talks/patricia_kuhl_the_linguistic_genius_of_babies.html"&gt;babies and how they "take statistics" to learn language&lt;/a&gt;.  Super cute.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Comprehension question: at what age did you stop being able to understand Mandarin?&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;Presentation question: why does using FMRI/MEG/etc. increase one's credibility as a scientist?  Several minutes of this 10-minute clip talk about brain activity and MEG output.  But none of the important findings here are based on that data -- only speculation.&lt;/li&gt;&lt;/ul&gt;2. Via marginal revolution, a &lt;a href="http://splittherent.blogspot.com/p/rent-calculator.html"&gt;fair rent calculator&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;This is for roommates deciding how much the big room with the extra window is worth.  Based on &lt;a href="http://splittherent.blogspot.com/2011/02/survey-results.html"&gt;this analysis&lt;/a&gt;, which uses surveys and &lt;a href="http://en.wikipedia.org/wiki/Hedonic_regression"&gt;hedonic regression&lt;/a&gt; to put numbers on preferences.  (PS - "Hedonic Regression" would be a fantastic name for a nerd band.)&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;3.  Two articles on credentialism in academia: from &lt;a href="http://www.businessinsider.com/how-to-short-the-bubble-in-higher-education-2011-2"&gt;businessinsider&lt;/a&gt;, via Mike Bommarito; and from &lt;a href="http://www.stat.columbia.edu/%7Egelman/blog/"&gt;Steven Hsu&lt;/a&gt;, at Andrew Gelman's blog.&lt;br /&gt;&lt;br /&gt;Interesting points made here.  Has the value of formal, certified education increased or decreased?  What about the quality of education?  I've grown increasingly pessimistic about this topic of late.&lt;br /&gt;&lt;br /&gt;4. And last, but not least &lt;a href="http://wondermark.com/cc1/"&gt;a caption contest at wondermark&lt;/a&gt; (I love this web comic).  Here's the pic.  Submissions wanted.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://wondermark.com/wp-content/uploads/2011/02/cc-001turkey-252x300.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 252px; height: 300px;" src="http://wondermark.com/wp-content/uploads/2011/02/cc-001turkey-252x300.png" alt="" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7089425161852947409?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7089425161852947409/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7089425161852947409' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7089425161852947409'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7089425161852947409'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/link-mishmash.html' title='Link mishmash'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8809665920419937563</id><published>2011-02-15T06:29:00.002-08:00</published><updated>2011-02-15T14:39:39.383-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='AI'/><category scheme='http://www.blogger.com/atom/ns#' term='game theory'/><title type='text'>How to win the final wager in Jeopardy</title><content type='html'>Here's a fun sideline to the human-vs-AI Jeopardy showdown: optimal wagering strategies for final Jeopardy.  The challenge: given what you know about yourself, your opponents, their bankrolls, and the question, choose the betting strategy most likely to secure a win or at least a tie.  What do you do?&lt;br /&gt;&lt;br /&gt;As it happens, some smart game theorists have already worked on this problem. &lt;span style="text-decoration: underline;"&gt;&lt;/span&gt;&lt;a href="http://www.jstor.org.proxy.lib.umich.edu/stable/2690846"&gt;Work in the 1990's&lt;/a&gt; shows that against your average human, it's not too complicated.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;If you're winning, assume your opponents will bet everything, and react accordingly by betting enough to double the number two player's score, plus one dollar.&lt;/li&gt;&lt;li&gt;If you're in the middle it can be tricky, but a good bet is to triple your score and subtract double the first player's score.&lt;/li&gt;&lt;li&gt;If you're losing by a lot, bet everything.&lt;/li&gt;&lt;/ul&gt;Following these rules should increase your chances of winning, regardless of your starting position.  (See pg 271 in the paper.)&lt;br /&gt;&lt;br /&gt;Against a human (or computer) who knows some game theory, the ideal strategy is complicated, because your strategizing will prompt strategizing by your opponents, and vice versa, ad infinitum. The Gilbert and Hatcher paper gives a solution for the two-player scenario.  This situation happens when one contestant has no money to wager in the final round -- it's actually not all that unusual.&lt;br /&gt;&lt;br /&gt;The three-player scenario is unsolved, but it's almost certainly a &lt;a href="http://en.wikipedia.org/wiki/Strategy_%28game_theory%29#Mixed_strategy"&gt;mixed strategy&lt;/a&gt;.  Sounds like a good problem for a different kind of AI: computational game theory.  Given that simple versions of poker have been completely solved, a mixed strategy over wagers should be child's play.&lt;br /&gt;&lt;br /&gt;With all that as background, here's &lt;a href="http://ibmresearchnews.blogspot.com/2011/02/watsons-wagering-strategies.html"&gt;IBM's page on how Watson wagers&lt;/a&gt;.  They cite &lt;a href="http://www.j-archive.com/help.php"&gt;the ridiculously detailed Jeopardy folklore of betting strategies&lt;/a&gt; --- the entries  for "two-thirds" and "Shore's conjecture" are particularly on point --- but evidently, they haven't worked through the game theory.  Knowing this,  Ken and Brad should be able to outflank Watson in the final round,  giving them slightly better odds at cinching a win.&lt;br /&gt;&lt;br /&gt;Punchline: in the world of AI, Watson is a verbal genius in need of some remedial math.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8809665920419937563?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8809665920419937563/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8809665920419937563' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8809665920419937563'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8809665920419937563'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/how-to-win-final-wager-in-jeopardy.html' title='How to win the final wager in Jeopardy'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1318830292695418615</id><published>2011-02-09T08:36:00.000-08:00</published><updated>2011-02-09T09:50:51.441-08:00</updated><title type='text'>Realtime Congress API</title><content type='html'>Just ran across this &lt;a href="http://sunlightlabs.com/blog/2011/the-real-time-congress-api/"&gt;new API, for Congress, in real time&lt;/a&gt;.  Put out by the &lt;a href="http://sunlightlabs.com/"&gt;Sunlight Foundation&lt;/a&gt;, an organization using "cutting-edge technology and ideas to make government  transparent and accountable," the API allows you to track, floor actions, video, committee hearings, bills, amendments, votes, and other documents in real time.&lt;br /&gt;&lt;br /&gt;Really looking forward to playing with this.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1318830292695418615?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1318830292695418615/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1318830292695418615' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1318830292695418615'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1318830292695418615'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/realtime-congress-api.html' title='Realtime Congress API'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7001649262921280940</id><published>2011-02-08T16:54:00.000-08:00</published><updated>2011-02-08T17:22:23.706-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='social science'/><title type='text'>Hard problems in social science</title><content type='html'>A Harvard conference on "Hard problems in social science" just wrapped up, with voting on the most important and most difficult problems.  The ran the poll on facebook with 10,000 people weighing in.&lt;br /&gt;&lt;br /&gt;Frontrunners include:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;World peace&lt;/li&gt;&lt;li&gt;Sustainable population growth&lt;/li&gt;&lt;li&gt;Collective decision making&lt;/li&gt;&lt;li&gt;Stable institutions&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;The pdf with survey results is &lt;a href="http://www.wjh.harvard.edu/%7Ekwn/Documents/SocSci/HardProbsSocSciSummaryResultsandPollText_1Feb2011.pdf"&gt; here&lt;/a&gt;.  Videos of speakers are &lt;a href="http://socialscience.fas.harvard.edu/icb/icb.do?keyword=socialsciencedivision&amp;amp;pageid=icb.page336066"&gt;here&lt;/a&gt;.  Very interesting stuff&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7001649262921280940?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7001649262921280940/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7001649262921280940' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7001649262921280940'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7001649262921280940'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/hard-problems-in-social-science.html' title='Hard problems in social science'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2280806784118595032</id><published>2011-02-05T08:19:00.001-08:00</published><updated>2011-02-06T16:48:42.400-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><title type='text'>Cracking the Ontario lottery - via Wired</title><content type='html'>A great story on the statistician who &lt;a href="http://www.wired.com/magazine/2011/01/ff_lottery/all/1"&gt;cracked the Ontario lottery&lt;/a&gt; -- how he cracked it, why the lottery didn't believe him, how other lotteries are probably breakable as well, and why the mob likes it that way.  A great read, all the way through.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2280806784118595032?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2280806784118595032/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2280806784118595032' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2280806784118595032'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2280806784118595032'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/cracking-ontario-lottery-via-wired.html' title='Cracking the Ontario lottery - via Wired'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2087730013481493046</id><published>2011-02-05T07:48:00.000-08:00</published><updated>2011-02-05T07:48:40.881-08:00</updated><title type='text'>Model Makers' Hippocratic Oath</title><content type='html'>&lt;a href="http://feedproxy.google.com/%7Er/StatisticalModelingCausalInferenceAndSocialScience/%7E3/ZKcJuco1k2s/model_makers_hi.html"&gt;Model Makers' Hippocratic Oath&lt;/a&gt;:&lt;br /&gt; &lt;ul&gt;&lt;li&gt;I will remember that I didn't make the world and that it doesn't satisfy my equations.&lt;/li&gt;&lt;br /&gt; &lt;li&gt;Though I will use models boldly to estimate value, I will not be overly impressed by mathematics.&lt;/li&gt;&lt;br /&gt; &lt;li&gt;I will never sacrifice reality for elegance without explaining why I have done so. Nor will I give the people who use my model false comfort about its accuracy. Instead, I will make explicit its assumptions and oversights.&lt;/li&gt;&lt;br /&gt; &lt;li&gt;I understand that my work may have enormous effects on society and the economy, many of them beyond my comprehension. &lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;Via &lt;a href="https://www.google.com/reader/view/feed/http%3A%2F%2Ffeeds.feedburner.com%2FStatisticalModelingCausalInferenceAndSocialScience"&gt;Statistical Modeling, Causal Inference, and Social Science&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2087730013481493046?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://feedproxy.google.com/~r/StatisticalModelingCausalInferenceAndSocialScience/~3/ZKcJuco1k2s/model_makers_hi.html' title='Model Makers&apos; Hippocratic Oath'/><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2087730013481493046/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2087730013481493046' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2087730013481493046'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2087730013481493046'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/02/model-makers-hippocratic-oath.html' title='Model Makers&apos; Hippocratic Oath'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2290786380990400770</id><published>2011-01-31T14:39:00.000-08:00</published><updated>2011-01-31T14:44:48.803-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data'/><category scheme='http://www.blogger.com/atom/ns#' term='methodology'/><category scheme='http://www.blogger.com/atom/ns#' term='research design'/><title type='text'>QMD - A framework for prioritizing research ideas</title><content type='html'>This from a recent exchange with a research assistant.  He's trying to decide what to do for his undergraduate honors thesis, and I was trying to give him some sage advice.  Here's the practical framework I use when deciding what research ideas to pursue:&lt;br /&gt;&lt;br /&gt;Every  research paper is a combination of three main parts: 1) a question, 2) a  methodology, and 3) data.  For instance, the &lt;a href="http://faculty.wcas.northwestern.edu/%7Ejnd260/pub/Druckman%20Nelson%20AJPS%202003.pdf"&gt;Druckman paper&lt;/a&gt; [from this week's reading] asks 1)  when elite framing can influence opinion using 2) an experiment where 3)  college students read and discuss news articles.  ...&lt;br /&gt;&lt;br /&gt;(By the way, "data" includes any kinds of facts you'd use to build  your  argument, not just numbers.  Surveys, content analysis, government  statistics, experimental results, interviews, historical case studies,  etc. are all data -- they're just analyzed differently.)&lt;br /&gt;&lt;br /&gt;The combination of &lt;span class="il"&gt;QMD&lt;/span&gt; that you choose  largely determines how hard it will be to carry out the project, the  kinds of conclusions you will be able to draw, and the threats to  validity that critics can use against you.  Choosing the &lt;span class="il"&gt;QMD&lt;/span&gt; for a research project is like choosing a major in college -- in a lot of ways, all the later decisions are just details.&lt;br /&gt;&lt;br /&gt;When you choose which Q, M, and D to use, you can either come up  with something completely new, or borrow ideas from past studies.  It is  much, much harder to come up with new questions, methods and data than  it is to borrow from past research.  As a strong rule of thumb, you only  want one of your three parts to be new.  So if you're asking a new  question, use familiar methods and easy-to-find data.  (This isn't  plagiarism because you're completely up front about the idea that you're  building on previous work.)&lt;br /&gt;&lt;br /&gt;Probably the easiest kind of study to pull off is a replication  study, using new data with old methods and questions.  For example, to  deal with one of the threats to validity in Druckman et. al's study, you  might do the same experiment with a representative sample of people,  instead of just college students.  Same question, same method, different  data.  This would still take work, but there aren't a lot of unknowns  in the process, and anyone who believes Druckman and Nelson will  probably believe your results as well.  Replication studies are good for  boosting ("Not only are Druckman and Nelson right about framing effects  for college students -- they're right about all American adults as  well!") or cutting back ("Druckman and Nelson's findings only apply to  people under 30.  Everyone else is much less susceptible to elite  framing.") the scope of previous findings.&lt;br /&gt;&lt;br /&gt;The next hardest is applying a new method to an old question.  For  example, you might ask when elite framing influences public opinion  using a series of carefully timed surveys asking the same questions that  Druckman et al used.  Similar question, different method, similar  data.  This is a little harder, because you'd have to use different  skills, build a new kind of argument, and deal with new threats to  validity.  On the plus side, different methods have different threats to  validity, so if you use a new method, you can probably address some of  the threats to validity that previous studies couldn't.  For example,  timed surveys could deal with the "recency effects" that Druckman can't  do much about.&lt;br /&gt;&lt;br /&gt;Introducing new questions is the hardest of all, for at least three  reasons.  First, you have to convince people that your  question is worth asking.  This is harder than it sounds.  In my  experience, people like to think they already understand the world  pretty well, so they will resist the notion that you've found a blind  spot in their worldview.  Second, you have to be able to convince people  that you're the first one to ask the question.  Third, you have to  build an entirely new line of evidence to defend your reasoning.  With a  one-semester deadline, I'd advise against trying to introduce a new  question, just because it's so hard to do.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;Anyway, that might be more than you want to know, but it's the way I decide what research projects to take on.  &lt;span class="il"&gt;QMD&lt;/span&gt; is my framework for figuring out what topics are in the overlap of "interesting" and "doable."&lt;br /&gt;&lt;br /&gt;Thoughts?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2290786380990400770?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2290786380990400770/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2290786380990400770' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2290786380990400770'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2290786380990400770'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/01/qmd-framework-for-prioritizing-research.html' title='QMD - A framework for prioritizing research ideas'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4417561306422951877</id><published>2011-01-27T17:32:00.001-08:00</published><updated>2011-01-27T17:39:02.249-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='state of the union'/><category scheme='http://www.blogger.com/atom/ns#' term='word cloud'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>Inspiring, hopeful, education, and salmon -- Not in that order</title><content type='html'>This is everywhere already, so why not put it up here too?  NPR asked  people what they heard in the state of the union, then put the 4,000+  responses into a word cloud.  Here it is.&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://media.npr.org/assets/img/2011/01/25/wordle_final_all_custom.jpg?t=1296014462&amp;amp;s=4"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 351px; height: 214px;" src="http://media.npr.org/assets/img/2011/01/25/wordle_final_all_custom.jpg?t=1296014462&amp;amp;s=4" alt="" border="0" /&gt;&lt;/a&gt;Remind me to include a couple of jokes next time I give a talk.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4417561306422951877?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4417561306422951877/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4417561306422951877' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4417561306422951877'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4417561306422951877'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/01/inspiring-hopeful-education-and-salmon.html' title='Inspiring, hopeful, education, and salmon -- Not in that order'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8402353789764878452</id><published>2011-01-27T07:39:00.000-08:00</published><updated>2011-01-27T07:41:55.367-08:00</updated><title type='text'>Pennsylvania Policy Database Project</title><content type='html'>I just ran across &lt;a href="http://www.temple.edu/papolicy/DatasetDescriptions.htm"&gt;this project&lt;/a&gt;, with some &lt;a href="http://www.temple.edu/papolicy/DatasetDescriptions.htm"&gt;interesting data sets&lt;/a&gt; on policy making in Pennsylvania.  I'll come back later to investigate in more detail.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8402353789764878452?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8402353789764878452/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8402353789764878452' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8402353789764878452'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8402353789764878452'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/01/pennsylvania-policy-database-project.html' title='Pennsylvania Policy Database Project'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4579899112214188348</id><published>2011-01-26T19:00:00.000-08:00</published><updated>2011-01-26T19:17:11.077-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='flow charts'/><category scheme='http://www.blogger.com/atom/ns#' term='tools'/><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>bubbl.us: A great little tool for flowcharts</title><content type='html'>As I've gotten serious about this whole dissertation thing, I've realized that the process can get pretty complicated.  "Before I can launch the survey, I need to draw the sample and write the questionnaire.  But before I can write the questionnaire, I have to read the three papers that talk about measuring ideology, and ..."  To cope, I've found myself drawing a lot of flowcharts to map dependencies.&lt;br /&gt;&lt;br /&gt;Two days ago, I found this great little tool for flowcharts: &lt;a href="https://bubbl.us/"&gt;bubbl.us&lt;/a&gt;.  It's free, unless you want the premium license, and it has a really nice interface.  Once you learn how to tab and shift-click and everything, you can sketch out complicated charts very quickly.  If you allow it some room on your hard drive, you can even save maps between sessions.  As I said, I'm using it primarily for mapping workflow, but I could see it being useful for brainstorming, simple org charts, or flow diagrams too.&lt;br /&gt;&lt;br /&gt;And with that said, here's the map through the dissertation quagmire to the end of the semester.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_zUbnTFdqWyg/TUDjWB2_pVI/AAAAAAAAAG0/aOH7KJbQj5E/s1600/dissertation_workflow.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 200px; height: 188px;" src="http://2.bp.blogspot.com/_zUbnTFdqWyg/TUDjWB2_pVI/AAAAAAAAAG0/aOH7KJbQj5E/s200/dissertation_workflow.png" alt="" id="BLOGGER_PHOTO_ID_5566699107025659218" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4579899112214188348?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4579899112214188348/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4579899112214188348' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4579899112214188348'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4579899112214188348'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/01/bubblus-great-little-tool-for.html' title='bubbl.us: A great little tool for flowcharts'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zUbnTFdqWyg/TUDjWB2_pVI/AAAAAAAAAG0/aOH7KJbQj5E/s72-c/dissertation_workflow.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3742375605707703186</id><published>2011-01-13T09:45:00.000-08:00</published><updated>2011-01-13T09:52:49.246-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='social choice theory'/><category scheme='http://www.blogger.com/atom/ns#' term='information flow'/><category scheme='http://www.blogger.com/atom/ns#' term='disagreement'/><title type='text'>An Interview with Ronald Coase</title><content type='html'>A fascinating snippet of &lt;a href="http://english.unirule.org.cn/Html/Unirule-News/20110101130956819.html"&gt;an interview with the famous Ronald Coase:&lt;br /&gt;&lt;/a&gt;&lt;p style="padding-left: 30px;"&gt;RC: Nothing guarantees success. Given human fallibility, we are bound to make mistakes all the time.&lt;/p&gt; &lt;p style="padding-left: 30px;"&gt;WN: So the question is how we can learn  from experiments at minimal cost. Or, how could we structure our economy  and society in such a way that collective learning can be facilitated  at a bearable price?&lt;/p&gt; &lt;p style="padding-left: 30px;"&gt;RC: That’s right. Hayek made a good point that knowledge was diffused in society and that made central planning impossible.&lt;/p&gt; &lt;p style="padding-left: 30px;"&gt;WN: The diffusion of knowledge creates  another social problem: conflict between competing ideas. To my  knowledge, only people fight for ideas (religious or ideological), only  people are willing to die for their ideas. The animal world might be  bloody and uncivilized. But animals, as far as we know, do not fight  over ideas.&lt;/p&gt; &lt;p style="padding-left: 30px;"&gt;RC:  That’s probably right. That’s why we  need a market for ideas. Ideas can compete; people with different ideas  do not need to slaughter each other.&lt;/p&gt;Really interesting tension.  I'm still working through what to make of it.  Collectively, we're smarter because we disagree?  And vice versa?&lt;br /&gt;&lt;br /&gt;Snipped &lt;a href="http://knowledgeproblem.com/2011/01/12/ronald-coase-interview/"&gt;here&lt;/a&gt; (Knowledge Problem just made my blogroll), with a hat tip to &lt;a href="http://www.marginalrevolution.com/marginalrevolution/2011/01/assorted-links-11.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed%3A+marginalrevolution%2FhCQh+%28Marginal+Revolution%29"&gt;marginal revolution&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3742375605707703186?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3742375605707703186/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3742375605707703186' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3742375605707703186'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3742375605707703186'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/01/interview-with-ronald-coase.html' title='An Interview with Ronald Coase'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7675977403855094853</id><published>2011-01-01T09:31:00.000-08:00</published><updated>2011-01-01T10:44:42.724-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='poker'/><category scheme='http://www.blogger.com/atom/ns#' term='AI'/><category scheme='http://www.blogger.com/atom/ns#' term='game theory'/><category scheme='http://www.blogger.com/atom/ns#' term='computing'/><title type='text'>Optimal Poker AI -- I told you so!</title><content type='html'>A couple years ago, I had a long debate about &lt;a href="http://en.wikipedia.org/wiki/Texas_hold_%27em"&gt;Texas Hold'em&lt;/a&gt; with my brother, a long-time aficionado of the game.  He's quite good and he's roomed with friends who actually payed the bills playing online poker.  So you can see why he'd get defensive when I claimed that there is an optimal strategy for heads-up Texas Hold'em.&lt;br /&gt;&lt;br /&gt;I've thought back to that conversation several times, mainly because it was really frustrating for both of us.  I had a proof backing me up -- Von Neuman's 80-year-old &lt;a href="http://en.wikipedia.org/wiki/Minimax"&gt;minimax theorem&lt;/a&gt;* -- an absolutely airtight mathematical proof, but I still couldn't convince him.  The problem is that it's an existence proof, so it demonstrates that the strategy must exist, but doesn't tell you what the strategy is.  And of course, arguing that "my math says there's an optimal strategy" rings pretty hollow when you can't answer the question, "so what's the strategy?"&lt;br /&gt;&lt;br /&gt;So I was very happy to run across &lt;a href="http://www.cs.cmu.edu/%7Esandholm/RIHoldEm.ISD.aaai05proceedings.pdf"&gt;this paper&lt;/a&gt; and &lt;a href="http://www.cs.cmu.edu/%7Egilpin/gsi.html"&gt;demo&lt;/a&gt; presenting the optimal strategy for Rhode Island Hold'em.  Rhode Island poker is a stripped down version of Texas Hold'em (get it?).  You play with a smaller deck, only one hole card, a one-card flop, and a one-card turn, but no river.  There are a few limits on betting, but the game is essentially the same.  According the abstract, "Some features of the [optimal strategy] include poker techniques such as bluffing, slow-playing, check-raising, and semi-bluffing."&lt;br /&gt;&lt;br /&gt;Very nifty proof of concept.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;* A little bit more on the math.  The minmax theorem states that &lt;span style="font-style: italic;"&gt;every &lt;/span&gt;one-on-one winner-take-all game has an optimal strategy profile.  A quick FAQ on what it means to be an optimal strategy.&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;&lt;/span&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;So what does it mean that the strategy is optimal?&lt;/span&gt;&lt;br /&gt;The technical definition is that it's an equilibrium strategy.  If both players are playing the optimal strategy, neither has an incentive to deviate and play anything else.  As one of my profs would say, it's prediction proof.  Even if we both know each others' strategies, neither of us has an incentive to deviate.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;So this strategy always wins?&lt;/span&gt;&lt;br /&gt;No.  There is some luck in the game.  But if you played it repeatedly, there's no way you would win more than half the time.  Almost any non-optimal strategy you'd play, the computer would beat you more than half the time.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Wouldn't an "optimal strategy" be really predictable?&lt;/span&gt;&lt;br /&gt;No. In poker, it's often useful to bluff and keep your opponent guessing, so the optimal strategy is partly random.  That means that when facing a given situation (e.g. After the flop, I have X cards, I did Y pre-flop, and my opponent did Z) the computer will flip a weighted coin to make its decision (e.g. 3/4 of the time I fold, 1/8 of the time I raise, and 1/8 of the time I check.)  So the strategy is perfectly well-defined, but still unpredictable at times.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Does it know how to play short-stack and long-stack poker?&lt;/span&gt;&lt;br /&gt;I don't think so.  As far as I can tell, Rhode Island poker is played one hand at a time, with limits on betting.  So the optimal strategy doesn't need to know anything about stack length, escalating blinds, going all in, etc.&lt;br /&gt;&lt;br /&gt;If you introduced those elements, you'd be playing a different game.  According to the min-max theorem, there would still be an optimal strategy, but it would probably be at least a little different.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7675977403855094853?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7675977403855094853/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7675977403855094853' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7675977403855094853'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7675977403855094853'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2011/01/optimal-poker-ai-i-told-you-so.html' title='Optimal Poker AI -- I told you so!'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1254118588016962304</id><published>2010-12-20T05:54:00.000-08:00</published><updated>2010-12-20T07:29:00.674-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='OPSP'/><category scheme='http://www.blogger.com/atom/ns#' term='graphic design'/><title type='text'>A graphic design challenge...</title><content type='html'>Next semester, I'm going to be conducting a large-scale survey of bloggers for my dissertation.  (I've had a pilot wave in the field for the last several weeks.)  For credibility's sake, I need a public face for the project -- a web site people can visit to reassure themselves that my survey request isn't some kind of phishing scam.&lt;br /&gt;&lt;br /&gt;I'm branding the project as the Online Political Speech Project, with the goal of "understanding the voices of the Internet."  I've worked out some rough ideas for visual motifs, but I'm no graphic designer.  If you are, or know someone who is, I'd love to be in touch.  I'm hoping this can be a fun project for some talented graphic designer...&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://2.bp.blogspot.com/_zUbnTFdqWyg/TQ9nEEnYGEI/AAAAAAAAAGo/c0Yuvp26tY8/s1600/OPSP_logo.png"&gt;&lt;img style="margin: 0px auto 10px; center; cursor: pointer; width: 200px; height: 166px;" src="http://2.bp.blogspot.com/_zUbnTFdqWyg/TQ9nEEnYGEI/AAAAAAAAAGo/c0Yuvp26tY8/s200/OPSP_logo.png" alt="" id="BLOGGER_PHOTO_ID_5552770185227147330" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Project specs: I need to design and lay out this web page by the second week in January.  I'm imagining about five pages (splash page, FAQ, contact, blog, data  repository).  Scope of work would be limited to designing templates -- I can plug in content later.&lt;br /&gt;&lt;br /&gt;This should be reasonably quick work, with a lot of creative control for the designer.  I can offer a little cash, but probably a lot less than going professional rates.  (That's because, as a grad student, I get paid a lot less than going professional rates, too.)  Maybe a good portfolio piece for a student web designer...?  Or else a fun project for someone who wants to play with some of the styles and themes that have emerged lately on the web...?&lt;br /&gt;&lt;br /&gt;Please email me (agong at umich dot edu) if you're interested.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1254118588016962304?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1254118588016962304/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1254118588016962304' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1254118588016962304'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1254118588016962304'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/graphic-design-challenge.html' title='A graphic design challenge...'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zUbnTFdqWyg/TQ9nEEnYGEI/AAAAAAAAAGo/c0Yuvp26tY8/s72-c/OPSP_logo.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2285587238394410684</id><published>2010-12-17T06:42:00.000-08:00</published><updated>2010-12-17T06:58:02.072-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><category scheme='http://www.blogger.com/atom/ns#' term='n-gram'/><title type='text'>Google's n-gram viewer - pros and cons</title><content type='html'>Google just released &lt;a href="http://ngrams.googlelabs.com/"&gt;an n-gram viewing tool&lt;/a&gt; for tracking trends in half a trillion words across 400 years of text.  Check it out.  It's super fast and interesting, kind of like &lt;a href="http://www.google.com/trends"&gt;Google trends&lt;/a&gt; on timescale steroids.  Even better, they released &lt;a href="http://ngrams.googlelabs.com/datasets"&gt;the data&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Assorted thoughts poached from conversations with friends:&lt;br /&gt;&lt;div class="commentContent UIImageBlock_Content UIImageBlock_SMALL_Content"&gt;&lt;ul&gt;&lt;li&gt;&lt;span jsid="text"&gt;Someone should tell Fox about "Christmas"&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;span jsid="text"&gt;I predict that in the year 165166 AD, fully one hundred percent of the  words written in English will be the word "dance."  (This will make  500-word APSA applications much more straightforward.)&lt;/span&gt;&lt;/li&gt;&lt;/ul&gt;&lt;/div&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;My take&lt;br /&gt;&lt;/span&gt;(In 18 words: For scientists, a small, representative sample usually beats a large, non-random sample.  Google has given us the latter.)&lt;br /&gt;&lt;br /&gt;The n-gram viewer will be a lot of fun for journalists, who  often want a quick graph to illustrate a trend.  It's a gold mine for  linguists, who want to understand how syntax and semantics have changed  over time.  For both of these fields, lots of data equals better.&lt;br /&gt;&lt;br /&gt;As a  bonus for me, maybe this tool will popularize the term &lt;a href="http://en.wikipedia.org/wiki/N-gram"&gt;n-gram&lt;/a&gt;, so I  won't have to keep explaining it in conference talks.&lt;br /&gt;&lt;br /&gt;I'm not sure about the impact the n-gram viewer will have on the  areas where the researchers seem to want to apply it most: culture,  sociology, and social science in general.  The reason is that in those  fields we tend to care a lot about &lt;span style="font-style: italic;"&gt;who is speaking&lt;/span&gt;, not just &lt;span style="font-style: italic;"&gt;how much  speech&lt;/span&gt; we have on record.  This is why the field of sampling theory has  been a consistent cornerstone in social science for almost a century.   From the press releases, we can see Google's claim that this is 5.2% of  all books ever written.  But we don't know if it's a random sample.&lt;br /&gt;&lt;br /&gt;And in any case, books have never been written by a random sample  of people.  I suspect that the n-gram viewer will have very little to  offer to researchers hoping to study the culture and sociology of, say,  blacks in the Jim Crow south, or the early phases of Lenin's  revolution.  By and large, we already know what cultural elites were  saying in those periods, and the rest weren't writing books.&lt;br /&gt;&lt;br /&gt;  This means that there are two filters in place for any social  scientist who wants to pull cultural trends out of these data.  First,  there's Google's method for sampling books, for which I haven't seen any  documentation.  (No offense, but this is pretty typical of computer  scientists, who think in terms of scale more than sampling).  Second,  there's the authorship filter: you have to keep in mind that any trends  are derived from written language, produced by whatever subset of the  population was literate in a given period&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;Example&lt;/span&gt;&lt;br /&gt;As a political scientist, I'm interested in conflict.  If  you go to the site and punch in "war," from 1500 to 2000, you get a  graph showing quite a few interesting trends.  Here are some stories  that I could naively tell from this data.  I suspect that many are  false.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;In general, humanity has been far more interested in war since 1750.&lt;/li&gt;&lt;li&gt;That said, interest in war has generally declined since then.&lt;/li&gt;&lt;li&gt;Interest in wars spikes during wars.&lt;/li&gt;&lt;li&gt;Proportionally, wars since 1750 affect people far less than they did  previously -- the use of the term "war" jumped up to 5 times the period  baseline during the English civil war, but less than double the period  baseline during the World Wars.&lt;/li&gt;&lt;li&gt;World wars I and II were the historical high points for human  interests in war, but only slightly higher than the sustained interest  of the peak period in 1750.&lt;/li&gt;&lt;/ul&gt;Interesting conjectures, and I can  spin a story for each of these.  In several cases, I'm pretty sure we're  seeing the effects of cultural or linguistic artifacts in the data.  Potential confounders include: changes in literacy rates, introduction of public schooling, the American revolution, improvements in printing technology, linguistic drift, etc.&lt;br /&gt;&lt;br /&gt;I'm cautiously pessimistic here.  Given all the potential confounding variables, it's hard to see how we can sort out what's really going on in the line graphs.  Maybe we can do it with fancy statistics.  But we're not there yet.&lt;br /&gt;&lt;br /&gt;I'll give the last word to BJP:&lt;br /&gt;&lt;blockquote&gt;My instinct suggests that what is talking here is less the folds of  culture and more the wrinkles of the record of the technology expressing  that culture.&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2285587238394410684?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2285587238394410684/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2285587238394410684' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2285587238394410684'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2285587238394410684'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/googles-n-gram-viewer-pros-and-cons.html' title='Google&apos;s n-gram viewer - pros and cons'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1298346451423126079</id><published>2010-12-16T05:42:00.001-08:00</published><updated>2010-12-16T06:07:13.583-08:00</updated><title type='text'>Link mish-mash</title><content type='html'>All kinds of interesting stuff on the web today.  Not that this wasn't also true yesterday, or the day before...&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;IBM says its Watson AI is &lt;a href="http://www.bloomberg.com/news/2010-12-14/ibm-computer-to-match-wits-with-jeopardy-champs-in-1-million-showdown.html?cmpid=yhoo"&gt;ready to take on human Jeopardy champs&lt;/a&gt; for a $1 million prize.  The showdown is scheduled for Feb 14.  This is reminiscent of the Kasparov/Deep Blue showdown, except that Watson will be competing on human home turf: making sense of the linguistic ambiguity in the hints, phrasing, puns, etc. of Jeopardy prompts.   (The AI has one advantage: I bet Watson will always remember to phrase its answer in the form of a question.)&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.miller-mccune.com/culture-society/the-physics-of-terror-25955/"&gt;A rehash &lt;/a&gt;of physicist Aaron Clauset's work on "the physics of terrorism." I'm not a big fan of his stuff, to be honest.  My view: Clauset showed that the severity of terrorist attacks follow a powerlaw distribution, and has been wildly extrapolating from that single finding ever since.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;A Jeremiad about the &lt;a href="http://www.nieman.harvard.edu/reports/article/102499/Its-Scary-Out-There-in-Reporting-Land.aspx"&gt;state of journalism&lt;/a&gt;, from Pulitzer prize winner David Cay Johnson.  He talks trends (reporters know less and less about government; papers keep cutting content and raising prices), and hints at causes.  The last paragraph is especially intriguing to me.  Read it as a claim about how good reporting is supposed to uncover truth.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://www.abnormaluse.com/2010/12/views-of-2011-from-1931.html"&gt;Predictions about 2011 from 1931&lt;/a&gt;.  Eighty years ago, the NYTimes gathered a brain trust of experts in various fields and asked them what the world would look like today.  Follow the link for predictions and some commentary.  (Hat tip Marginal Revolution.)&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://econlog.econlib.org/archives/2010/12/the_psychology_1.html"&gt;This paper&lt;/a&gt; should depress my libertarian friends.  Evidently, profit is evil, with an r-value of -.62.&lt;/li&gt;&lt;/ul&gt;&lt;ul&gt;&lt;li&gt;The best &lt;a href="http://flowingdata.com/2010/12/14/10-best-data-visualization-projects-of-the-year-%E2%80%93-2010/"&gt;data visualization projects of 2010&lt;/a&gt;, from FlowingData.  Beautiful and informative.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1298346451423126079?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1298346451423126079/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1298346451423126079' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1298346451423126079'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1298346451423126079'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/link-mish-mash.html' title='Link mish-mash'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6689118057856194118</id><published>2010-12-13T16:48:00.000-08:00</published><updated>2010-12-13T16:51:19.664-08:00</updated><title type='text'>HPN health prize: $3,000,000</title><content type='html'>&lt;span style="font-family: verdana,geneva; font-size: small;"&gt;The Heritage Provider Network is &lt;a href="https://www.theheritagehealthprize.com/Page/Home"&gt;offering a $3 million prize&lt;/a&gt; for predicting hospitalization -- which patients are likely to be hospitalized or rehospitalized?&lt;br /&gt;&lt;/span&gt;&lt;blockquote&gt;&lt;span style="font-family: verdana,geneva; font-size: small;"&gt;In 2006  well over $30 billion was spent on unnecessary hospital admissions.  How  many of those hospital admissions could have been avoided if only we  had real-time information as to which patients were at risk for future  hospitalization?&lt;/span&gt;&lt;br /&gt;&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6689118057856194118?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6689118057856194118/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6689118057856194118' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6689118057856194118'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6689118057856194118'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/hpn-health-prize-3000000.html' title='HPN health prize: $3,000,000'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-912970570674189854</id><published>2010-12-11T12:36:00.000-08:00</published><updated>2010-12-11T12:50:40.488-08:00</updated><title type='text'>Programming Language Popularity Contest</title><content type='html'>I keep telling people that python is the language to learn.  Here's proof.  Half a million github and StackOverflow programmers can't be wrong, right?&lt;br /&gt;&lt;br /&gt;(Ignore javascript.  It's an in-browser language only.  Fun if you want to animate a web page; useless if you want to write a file.)&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.dataists.com/wp-content/uploads/2010/12/rank_scatter1.png"&gt;&lt;img src="http://www.dataists.com/wp-content/uploads/2010/12/rank_scatter1.png" width="600px" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;via : &lt;a href="http://feedproxy.google.com/%7Er/ZeroIntelligenceAgents/%7E3/2kkFcyWZZ54/"&gt;Programming Language Popularity Contest&lt;/a&gt;&lt;p&gt;&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-912970570674189854?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/912970570674189854/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=912970570674189854' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/912970570674189854'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/912970570674189854'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/programming-language-popularity-contest.html' title='Programming Language Popularity Contest'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3668027194867461150</id><published>2010-12-07T12:32:00.000-08:00</published><updated>2010-12-07T12:34:21.422-08:00</updated><title type='text'>Journalism and activism: Stylized facts</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_zUbnTFdqWyg/TP6aCcGPyQI/AAAAAAAAAGg/ud6cVQ2RIdY/s1600/AJ.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 154px;" src="http://1.bp.blogspot.com/_zUbnTFdqWyg/TP6aCcGPyQI/AAAAAAAAAGg/ud6cVQ2RIdY/s320/AJ.png" alt="" id="BLOGGER_PHOTO_ID_5548041157659838722" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Trying this idea on for size.  Reactions?&lt;br /&gt;&lt;br /&gt;(Also, I'm attempting to post to facebook via Notes.  Let's hope the feed actually works now...)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3668027194867461150?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3668027194867461150/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3668027194867461150' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3668027194867461150'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3668027194867461150'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/journalism-and-activism-stylized-facts.html' title='Journalism and activism: Stylized facts'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zUbnTFdqWyg/TP6aCcGPyQI/AAAAAAAAAGg/ud6cVQ2RIdY/s72-c/AJ.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-9183855554488465600</id><published>2010-12-06T07:13:00.000-08:00</published><updated>2010-12-06T07:13:17.503-08:00</updated><title type='text'>The United States of Autocomplete</title><content type='html'>Hat tip &lt;a href="http://flowingdata.com/"&gt;FlowingData&lt;/a&gt;:  &lt;a href="http://flowingdata.com/2010/12/06/the-united-states-of-autocomplete/"&gt;The United States of Autocomplete&lt;/a&gt;&lt;br /&gt;&lt;p&gt;&lt;a href="http://flowingdata.com/2010/12/06/the-united-states-of-autocomplete/"&gt;&lt;img src="http://flowingdata.com/wp-content/uploads/2010/12/United-States-of-Autocomplete-575x422.jpg" alt="United States of Autocomplete" title="United States of Autocomplete" height="422" width="575" /&gt;&lt;/a&gt;&lt;/p&gt;&lt;p&gt;Very Small Array has &lt;a href="http://www.verysmallarray.com/?p=1056"&gt;some fun Google's autocomplete&lt;/a&gt;. Utah... Jazz. Kentucky... Fried Chicken. New York... Times.&lt;/p&gt;&lt;p&gt;[&lt;a href="http://www.verysmallarray.com/?p=1056"&gt;Very Small Array&lt;/a&gt; via @&lt;a href="http://twitter.com/#%21/mericson/status/10909657245683712"&gt;mericson&lt;/a&gt;]&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-9183855554488465600?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://flowingdata.com/2010/12/06/the-united-states-of-autocomplete/' title='The United States of Autocomplete'/><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/9183855554488465600/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=9183855554488465600' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9183855554488465600'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9183855554488465600'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/united-states-of-autocomplete.html' title='The United States of Autocomplete'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1983596275932691834</id><published>2010-12-02T18:15:00.000-08:00</published><updated>2010-12-04T15:21:09.976-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='complex systems'/><title type='text'>Arsenic-based life and autocatalytic sets</title><content type='html'>NASA has just announced the discovery of a new kind of extremophile bacteria.  If I'm reading the (sketchy) early release info correctly, these critters use arsenic in their DNA, instead of phosphorus.  This is a biggish deal because &lt;span style="font-style: italic;"&gt;every other known life form&lt;/span&gt; uses phosphorus.&lt;br /&gt;&lt;br /&gt;How big a deal?  Setting aside shades of the &lt;a href="http://en.wikipedia.org/wiki/The_Andromeda_Strain"&gt;Andromeda Strain&lt;/a&gt;*, people seem pretty underwhelmed.  Partly it's a bait-and-switch involving the NASA brand: the microbe in question, Gammaproteobacteria GFAJ-1, isn't an alien visitor.  It's a mutant strain of a run-of-the-mill terrestrial bug.&lt;br /&gt;&lt;br /&gt;So the real question is: How surprised should we be that little GFAJ-1 managed to assimilate arsenic into its DNA?  Or the converse: if it's so easy, should we be surprised that no other life form bothered to do the same thing?&lt;br /&gt;&lt;br /&gt;Taking a conversational step sideways, this seems like a good moment to put in a plug for a really fascinating theory on the &lt;a href="http://en.wikipedia.org/wiki/Abiogenesis"&gt;origin of life&lt;/a&gt;: &lt;a href="http://en.wikipedia.org/wiki/Autocatalytic_set"&gt;autocatalytic sets&lt;/a&gt;.  This theory--which I find persuasive--argues that life isn't rare or unexpected -- it's virtually inevitable.  I can't find a good non-technical description of this underappreciated set of ideas online, so I thought I'd take a crack at it here . (My explanation is based largely on the opening chapters of Kauffman's &lt;a href="http://www.amazon.com/At-Home-Universe-Self-Organization-Complexity/dp/0195111303"&gt;At Home in the Universe&lt;/a&gt;).&lt;br /&gt;&lt;br /&gt;The puzzle: all forms of life we know of are pretty complicated.  There are a couple hundred cell differentiations in a human body; thousands of enzymes in a working cell; millions of base pairs in even the simplest DNA.  (Prions and viruses, which are often simpler, don't count because they can't replicate without hijacking a cell's machinery.)  With these kinds of numbers, the odds of getting the just right combination for life are astronomical.  Try flipping 220 million coins until they line up with the base pairs in a single human chromosome.  Practically speaking, it will never happen.&lt;br /&gt;&lt;br /&gt;For most explanations for the origins of life, complexity is a stumbling block.  We just got lucky, or God intervened, or maybe an asteroid seeded the planet to produce something capable of self-replication.  There's a leap in the logic at the point where we try to explain the incredibly low probability of life emerging spontaneously from a lifeless chemical soup.&lt;br /&gt;&lt;br /&gt;The autocatalytic theory turns this logic around: it argues that life exists &lt;span style="font-style: italic;"&gt;because &lt;/span&gt;of its chemical complexity, not in spite of it.&lt;br /&gt;&lt;br /&gt;The theory builds from three simple assumptions.  First, some chemicals react; most don't, with roughly constant odds for any given pair of chemicals reacting.  Second, chemical reactions produce new chemicals.  Third, the number of possible chemicals is very large.&lt;br /&gt;&lt;br /&gt;Thought experiment: suppose you find yourself in front of chemistry set containing a rack of &lt;span style="font-style: italic;"&gt;n &lt;/span&gt;beakers, each filled with an unknown chemicals.  Channeling your inner mad scientist, you begin mixing them at random.  Suppose 1 in 100 pairs creates a reaction.  How many total reactions should you be able to get out of your chemistry set?&lt;br /&gt;&lt;br /&gt;If you have two chemicals, the odds of a reaction are just 1 in 100.&lt;br /&gt;&lt;br /&gt;If you have three chemicals, you have three potential pairs, and the odds of a reaction are about 3 in 100.&lt;br /&gt;&lt;br /&gt;If you have four chemicals, you actually have six potential pairs, so the odds of a reaction are a little better than 6 in 100.&lt;br /&gt;&lt;br /&gt;At this point, exponentials start working rapidly in your favor.  With five chemicals, you have 10 potential pairs, for a 9.5% chance of at least one reaction.  Twelve chemicals gets you 66 pairs with 48.4% odds of at least one reaction.  The deluxe 30-chemical starter kit has 435 potential pairs, with 98.7% odds of at least one reaction.&lt;br /&gt;&lt;br /&gt;What does this prove?  The number of likely reactions in a pool increases faster than the number of chemicals in the pool.&lt;br /&gt;&lt;br /&gt;It keeps going, too.  With 1 in 100 odds, you would expect to get about 4 reactions out of your 30-chemical kit.  If each reaction creates a new chemical, you now have 34 chemicals in your pool, with correspondingly greater odds of additional reactions.  Eventually, you pass a tipping point, and the expected number of compounds becomes infinite.  If I've got my math right, it happens around 80 chemicals in this scenario, because the expected number of new reactions exceeds the number of reactions in the existing set.  The more you mix, the more potential mixtures you get.&lt;br /&gt;&lt;br /&gt;A quick pause: when we talk about chemicals, we're not talking about atoms in the periodic table of the elements.  Except during fission and fusion, atoms themselves don't combine and react.  Instead, we're talking about molecules.&lt;br /&gt;&lt;br /&gt;In particular, organic molecules -- the ones that ended up supporting life -- fit this model very well.  Enzymes, RNA, and DNA are all organic molecules.  Believe it or not, a strand of DNA is one enormous molecule.   Organic molecules are all built mainly from the same base atoms: carbon, hydrogen, oxygen, nitrogen, phosphorus -- and now arsenic(!)   These atoms happen to be good at linking to form long chains.  Because of the way these chains fold, they react with each other in often unpredictable ways.  Most organic molecules don't react with each other, but quite a few do.  And because the chains can get very long, the set of potential molecules made of CHONP atoms is essentially infinite.&lt;br /&gt;&lt;br /&gt;Now getting back to the main story...  We're halfway through.  We've shown how simple rules for reactions can get you from a small-ish starting set to an infinite variety of chemicals to play with.  It seems very reasonable to suppose that the primordial organic soup  included enough organic reactants to pass the tipping point into  infinite variety.  But that just means a more flavorful soup.  How do we get to life?&lt;br /&gt;&lt;br /&gt;Setting aside the transcendental, life is defined by  sustainable reproduction.  A cell is a bag of chemicals and reactions  that keeps working long enough to make at least one copy of itself.  As  part of the deal, the cell has to be in the right sort of environment,  with whatever energy sources and nutrients are necessary.&lt;br /&gt;&lt;br /&gt;Our cells achieve sustainability by using enzymes to catalyze other reactions.  It turns out that the same logic that applies to pairs of reactions also applies to catalyzers: the probability of catalysts in a pool increases faster than the number of chemicals in the pool.  Once you get enough chemicals, it's virtually certain that you'll have quite a few catalyzed reactions.&lt;br /&gt;&lt;br /&gt;Here's the really nifty bit.  As the number of catalyzed reactions in the set increases, eventually some of them will form an autocatalytic set -- a loop of reactions catalyzing each other.  Reaction A creates the catalyst that enables reaction B, creating the catalyst for C, and so on back to catalyst Z that enables reaction A.&lt;br /&gt;&lt;br /&gt;Based on the same logic we saw earlier, these loops &lt;span style="font-style: italic;"&gt;always &lt;/span&gt;appear once the pool of chemicals gets large enough.  They are typically long and complicated, cycling through a seemingly random group of chemicals among a much larger set of nutrients and byproducts. They tap nutrients and energy sources in the environment, increasing themselves the longer they run.  In other words, autocatalytic sets look a whole lot like life as we know it.&lt;br /&gt;&lt;br /&gt;I find this theory compelling.  It takes the biggest objection to prevailing theories -- the inherent complexity of life -- and makes it the cornerstone of a different approach.&lt;br /&gt;&lt;br /&gt;And as a bonus, it makes arsenic-based life forms seem very plausible. Given NASA's results, it seems reasonable to say that arsenic-based DNA is another unexplored evolutionary path for viable autocatalytic sets.  Bill Bryson says it well in &lt;a href="http://books.google.com/books?id=o7XH-v_j2XIC&amp;amp;pg=PT319&amp;amp;lpg=PT319&amp;amp;dq=bill+bryson+ammonia&amp;amp;source=bl&amp;amp;ots=OP6NAUeTlf&amp;amp;sig=Wn6yRBDfv1aJc00W15P1Kt3M9pE&amp;amp;hl=en&amp;amp;ei=8Mf6TIzcCYylnwfJwYnJCg&amp;amp;sa=X&amp;amp;oi=book_result&amp;amp;ct=result&amp;amp;resnum=1&amp;amp;ved=0CBYQ6AEwAA#v=onepage&amp;amp;q&amp;amp;f=false"&gt;A Short History of Nearly Everything&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;"It may be that the things that make [Earth] so splendid to us---well-proportioned Sun, doting Moon, sociable carbon, more molten magma than you can shake a stick at and all the rest---seem splendid simply because they are what we were born no count on.  No one can altogether say. ... Other worlds may harbor beings thankful for their silvery lakes of mercury and drifting clouds of ammonia."&lt;/blockquote&gt;&lt;br /&gt;&lt;br /&gt;PS: Autocatalytic sets don't have much to do with the evolution/intelligent design debate. They propose a mechanism that could be responsible for jumpstarting evolution.  So if you're comfortable with the idea that God would choose cosmological constants and direct evolutionary processes with some goal in mind, it probably won't bother you to add the idea that he would use chemical soups and catalysis networks along the way.&lt;br /&gt;&lt;br /&gt;PPS: The main difference between an autocatalytic set and life as we know it is the absence of a cell wall.  It's not hard to close the gap conceptually.  Once a catalytic loop gets started, other loops usually form as well.  At this point, competition and natural selection between autocatalytic cycles can kick in.  If one autocatalytic loop happened to produce a &lt;a href="http://en.wikipedia.org/wiki/Hydrophobe"&gt;hydrophobic &lt;/a&gt;byproduct (like a fat or lipid), it could easily act as a primitive cell wall.  This kind of barrier would enable the autocatalytic loop to increase its concentration, and therefore its reaction rate.  This kind of pseudocell would reproduce faster and very likely evolve into more sophisticated organisms.&lt;br /&gt;&lt;br /&gt;*A ten word review of the Andromeda Strain: Typical Crichton--some interesting ideas; mildly annoying narration; mostly plotless.&lt;br /&gt;&lt;br /&gt;** A wonderful book for putting some color on the messy process of scientific discovery.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1983596275932691834?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1983596275932691834/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1983596275932691834' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1983596275932691834'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1983596275932691834'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/arsenic-based-life-and-autocatalytic.html' title='Arsenic-based life and autocatalytic sets'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8090282865776239360</id><published>2010-12-01T08:00:00.000-08:00</published><updated>2010-12-01T08:00:06.757-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><category scheme='http://www.blogger.com/atom/ns#' term='software'/><title type='text'>Prezi - A quick review</title><content type='html'>I spent a couple hours playing around with &lt;a href="http://prezi.com/"&gt;Prezi&lt;/a&gt;, an online presentation builder being promoted as an alternative to the slideshow format of powerpoint, keynote, etc.  Instead of a series of slides, Prezi presentations consist of a series of views over one large image.  The format parallels drawing on a whiteboard, instead of clicking through slides on a projector.  A good concept, but the execution is a little clunky.&lt;br /&gt;&lt;br /&gt;My review:  Preparing good presentations is time consuming, for two reasons: 1) it takes some trial and error to figure out the best way to express an idea, verbally and visually, and 2) presentation software is clunky, requiring a lot of fiddling to get things right.  In my experience, spending time on (1) is fun and creative; spending time on (2) is frustrating and stressful.&lt;br /&gt;&lt;br /&gt;Because of the "whiteboard" metaphor and brand emphasis on good design, I was hoping Prezi would deliver a slick and streamlined user experience.  Being free from interface hassles and able focus on creative expression would be wonderful.  Alas, I quickly ran into many GUI annoyances.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;The interface for importing images is very clunky.  You have to download or save the image to your desktop, then upload.  On the plus side, you can batch upload several images at a time.&lt;/li&gt;&lt;li&gt;The whole image is static, which means that you can't mark up images over the course of a presentation.  To some extent this makes sense -- dynamic images would mess up the concept of arranging your display in space rather than time.  However, it breaks the whiteboard metaphor.  When I do whiteboard presentations, I often have an agenda that I revisit, adding checkmarks and lines to relevant content.  I can't do that in Prezi.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Rudimentary tools for grouping object are not available.  This one really gets me.  You can accomplish the same thing (visually) by putting several objects together in an invisible frame.  But every time you want to move the group, it takes several extra clicks to select everything and drag it around.  Poor usability.&lt;/li&gt;&lt;li&gt;You can only use a handful of presentation styles.  Your only  alternative is to hire Prezi staff to build a custom style for $450.&lt;/li&gt;&lt;/ul&gt;Summary: I would really like a tool that lets me express myself clearly, fast.   Prezi offers some advantages for clarity, but not really for speed.  Overall, I'm mildly impressed, but not overwhelmed.  For the moment, the main benefit of Prezi seems to be novelty.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8090282865776239360?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8090282865776239360/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8090282865776239360' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8090282865776239360'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8090282865776239360'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/12/prezi-quick-review.html' title='Prezi - A quick review'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7903028431816931455</id><published>2010-11-30T08:07:00.000-08:00</published><updated>2010-11-30T08:15:51.700-08:00</updated><title type='text'>Forums and platforms for crowdsourcing</title><content type='html'>I've always thought of crowdsourcing as a narrow, niche thing.  But then I ran across this &lt;a href="http://www.openinnovators.net/list-open-innovation-crowdsourcing-examples/"&gt;excellent list of forums and platforms for crowdsourcing&lt;/a&gt;.  Skimming down the list, I realized that crowdsourcing is actually starting to play an important role in quite a few industries.&lt;br /&gt;&lt;br /&gt;It looks like we're just starting to climb the adoption curve here.  Where should we expect crowdsourcing to stop?  What sorts of problems will it solve (and not solve)?  Who &lt;span style="font-style: italic;"&gt;won't&lt;/span&gt; ever use it?&lt;br /&gt;&lt;br /&gt;PS: In case you haven't run across the term before, here's &lt;a href="http://en.wikipedia.org/wiki/Crowdsourcing"&gt;wikipedia's definition of crowdsourcing&lt;/a&gt;:&lt;br /&gt;&lt;blockquote&gt;Crowdsourcing is the act of outsourcing tasks, traditionally performed by an employee or contractor, to an undefined, large group of people or community (a crowd), through an open call.&lt;/blockquote&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7903028431816931455?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7903028431816931455/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7903028431816931455' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7903028431816931455'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7903028431816931455'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/forums-and-platforms-for-crowdsourcing.html' title='Forums and platforms for crowdsourcing'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5364209728764017651</id><published>2010-11-24T06:53:00.000-08:00</published><updated>2010-11-24T07:14:31.956-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>How long are books in political science?</title><content type='html'>As a polisci student who has passed his prelims, I have a very good idea how long books take to read.  But I realized this week that I didn't have a good idea how long books take to &lt;i&gt;write&lt;/i&gt;.&lt;br /&gt;&lt;br /&gt;As I was setting up the template to write a book-format dissertation, I decided to give it a look.  Here are some rough stats for five well-known polisci books*, pulled from my reading shelf more or less at random:&lt;br /&gt;&lt;br /&gt;&lt;table style="text-align: center;"&gt;&lt;br /&gt;&lt;tbody&gt;&lt;tr&gt;&lt;td&gt;&lt;br /&gt;&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Hindman&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Zaller&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Mutz&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Lupia and&lt;br /&gt;McCubbins&lt;/td&gt;&lt;td style="font-weight: bold;"&gt;Huber and&lt;br /&gt;Shipan&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td style="text-align: right; font-weight: bold;"&gt;Lines/page&lt;/td&gt;&lt;td&gt;38&lt;/td&gt;&lt;td&gt;45&lt;/td&gt;&lt;td&gt;35&lt;/td&gt;&lt;td&gt;42&lt;/td&gt;&lt;td&gt;39&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td style="font-weight: bold; text-align: right;"&gt;Words/line&lt;/td&gt;&lt;td&gt;13&lt;/td&gt;&lt;td&gt;13&lt;/td&gt;&lt;td&gt;11&lt;/td&gt;&lt;td&gt;12&lt;/td&gt;&lt;td&gt;11&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td style="font-weight: bold; text-align: right;"&gt;Total pages&lt;/td&gt;&lt;td&gt;141&lt;/td&gt;&lt;td&gt;310&lt;/td&gt;&lt;td&gt;125&lt;/td&gt;&lt;td&gt;229&lt;/td&gt;&lt;td&gt;210&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td style="font-weight: bold; text-align: right;"&gt;Total chapters&lt;/td&gt;&lt;td&gt;7&lt;/td&gt;&lt;td&gt;12&lt;/td&gt;&lt;td&gt;5&lt;/td&gt;&lt;td&gt;10&lt;/td&gt;&lt;td&gt;8&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td style="font-weight: bold; text-align: right;"&gt;~Pages/chapter&lt;/td&gt;&lt;td&gt;20.1&lt;/td&gt;&lt;td&gt;25.8&lt;/td&gt;&lt;td&gt;25&lt;/td&gt;&lt;td&gt;22.9&lt;/td&gt;&lt;td&gt;26.2&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;tr&gt;&lt;td style="font-weight: bold; text-align: right;"&gt;~Total words&lt;/td&gt;&lt;td&gt;69,654&lt;/td&gt;&lt;td&gt;18,1350&lt;/td&gt;&lt;td&gt;48,125&lt;/td&gt;&lt;td&gt;11,5416&lt;/td&gt;&lt;td&gt;90,090&lt;/td&gt;&lt;/tr&gt;&lt;br /&gt;&lt;/tbody&gt;&lt;/table&gt;&lt;br /&gt;&lt;br /&gt;These estimates are probably biased upwards, because I based them on pages of full text without any tables, graphs, or chapter breaks.  But still, it's a pretty good idea for the rough scale of the project.&lt;br /&gt;&lt;br /&gt;To put it in perspective, the current working draft for my dissertation includes 10,653 words.  If I'm shooting for 70,000 total, that means I'm 15% done already!&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-size:85%;"&gt;*These are the books I used for estimates:&lt;br /&gt;Matthew Hindman, &lt;span style="font-style: italic;"&gt;The Myth of Digital Democracy&lt;/span&gt;&lt;br /&gt;John Zaller, &lt;span style="font-style: italic;"&gt;The Nature and Origins of Mass Opinion&lt;/span&gt;&lt;br /&gt;Diana Mutz, &lt;span style="font-style: italic;"&gt;Hearing the Other Side&lt;/span&gt;&lt;br /&gt;Arthur Lupia and Matthew McCubbins, &lt;span style="font-style: italic;"&gt;The Democratic Dilemma&lt;/span&gt;&lt;br /&gt;John Huber and Chuck Shipan, &lt;span style="font-style: italic;"&gt;Deliberate Discretion&lt;/span&gt;&lt;/span&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5364209728764017651?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5364209728764017651/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5364209728764017651' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5364209728764017651'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5364209728764017651'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/how-long-are-books-in-political-science.html' title='How long are books in political science?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5475561855381970275</id><published>2010-11-23T07:15:00.000-08:00</published><updated>2010-11-23T07:49:48.948-08:00</updated><title type='text'>An update on voice recognition software</title><content type='html'>Thanks to everyone who sent comments about voice recognition via Dragon, Microsoft, and Mac.  I'm still playing with different options to see what will work.&lt;br /&gt;&lt;br /&gt;In the meantime, I've discovered another option: &lt;a href="http://www.google.com/googlevoice/landing/index.html#utm_source=en-ha-na-us-sk&amp;amp;utm_medium=ha&amp;amp;utm_term=google%20voice&amp;amp;utm_campaign=en"&gt;Google Voice&lt;/a&gt;.  It turns out there's an easy way to set up Google Voice so that you can dictate messages to yourself (info &lt;a href="http://markstout.blogspot.com/2009/08/transcribing-my-own-notes-with-google.html"&gt;here&lt;/a&gt;).  GV will then automatically transcribe them and send them to your email, or mobile device.  Nifty!&lt;br /&gt;&lt;br /&gt;Since it's free and didn't require any software installation or training, I decided to give this a shot first.  The result: laughably bad transcription (see below), but a substantial boost in productivity. How's that?  For me, drafting is the most time-consuming part of writing.  Once I have some basic ideas on paper, I can edit and elaborate reasonably quickly.  But it takes me &lt;a href="http://thelaughinghousewife.wordpress.com/2010/10/02/5443/"&gt;a long time to get that first version out&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;As a result, dictating an early version has been very helpful.  It forces me to say &lt;span style="font-style: italic;"&gt;something&lt;/span&gt;.  Since I'll be editing soon, it doesn't matter if that something is bad (it is, usually).  So dictation, even with GV's horribly inaccurate transcription has worked pretty well for me.  Bottom line: I'm certainly going to be investing in a voice recognition package in the near future.&lt;br /&gt;&lt;br /&gt;An example of GV transcription.  Here's what I said:&lt;br /&gt;&lt;blockquote&gt; This chapter describes methods and data sources for the book.  My goal is to describe the logic behind the research design.  The focus here is validity: what types of conclusions can we draw from these data?  Technical details---of which there are many---are saved for appendices.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Here's what Google thought I said:&lt;br /&gt;&lt;blockquote&gt;This chapter just tries messages and data sources for the book. My goal is to describe the logic behind the research design. The focus here is validity. What types of conclusions. Can we draw from the Yeah, technical details. I wish there are many her saved for the appendicitis.&lt;br /&gt;&lt;/blockquote&gt;In fairness, Google just got into this business in the last year or so.  I'm sure their transcription will get better over time.  But for the moment, "I wish there are many her saved for the appendicitis."&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5475561855381970275?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5475561855381970275/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5475561855381970275' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5475561855381970275'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5475561855381970275'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/update-on-voice-recognition-software.html' title='An update on voice recognition software'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-852991364638111093</id><published>2010-11-20T18:09:00.000-08:00</published><updated>2010-11-20T18:57:52.971-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><category scheme='http://www.blogger.com/atom/ns#' term='machine learning'/><category scheme='http://www.blogger.com/atom/ns#' term='voice recognition'/><title type='text'>Anybody have any experience with dictation software?</title><content type='html'>Faced with writing a 200-page thesis in the next year, I'm toying with the idea of getting myself a voice recognition software package.  Dragon seems to be the industry leader.  Here's &lt;a href="http://www.voicerecognition.com.au/dragon_naturallyspeaking_demo.html"&gt;a demo&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The software seems to work, is getting &lt;a href="http://www.amazon.com/Dragon-NaturallySpeaking-Home-Version-11/dp/B003VNCRNQ/ref=dp_ob_title_sw"&gt;good reviews on Amazon&lt;/a&gt;, and is not terribly expensive.  On the other hand, it seems like it might be hard to find a place to talk loudly to oneself  and a computer for long periods of time.  It also seems likely that  training the software and editing its mistakes could be pretty time consuming.  Also, Dragon seems built with MS Office in mind.  Given that I lean open source (and don't even have Office on my laptop), would it work for me?&lt;br /&gt;&lt;br /&gt;Bottom line: would this be more like the digital stylus I got a few months ago (and use all the time)? Or the extra external hard drives I got a year before that (and have only booted up to make sure they work)?&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;So I'm on the fence.  I really wish Dragon had a trial version.  Anybody have anything to add here?&lt;br /&gt;&lt;br /&gt;PS: Yes, I do know that one episode of The Office.  (Dwight: "Cancel card.  Can-suhl car-Duh.")&lt;br /&gt;&lt;br /&gt;PPS: How long do you think until a viable open-source option opens up?  Probably at least a couple years, unless someone like Google decides to  launch a free version.  They seem to be making moves in that direct (see &lt;a href="http://googlemobile.blogspot.com/2010/08/just-speak-it-introducing-voice-actions.html"&gt;here for an easy application&lt;/a&gt;, and &lt;a href="http://www.beet.tv/2010/02/google-voice-voicemail-youtube-transcriptions-via-google-voice-recognition-research.html"&gt;here for a broader one&lt;/a&gt;).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-852991364638111093?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/852991364638111093/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=852991364638111093' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/852991364638111093'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/852991364638111093'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/anybody-have-any-experience-with.html' title='Anybody have any experience with dictation software?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2344203844586276793</id><published>2010-11-19T10:55:00.001-08:00</published><updated>2010-11-19T11:00:27.318-08:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='text classification'/><category scheme='http://www.blogger.com/atom/ns#' term='time management'/><title type='text'>Training a text classifier</title><content type='html'>I've been doing a lot of work with &lt;a href="http://en.wikipedia.org/wiki/Document_classification"&gt;text classifiers&lt;/a&gt; lately.  (See &lt;a href="http://lowlywonk.blogspot.com/2010/11/starting-to-get-some-dissertation.html"&gt;this post&lt;/a&gt; for an example.)  It's an interesting process blending intuition, text, and math in some nifty ways.&lt;br /&gt;&lt;br /&gt;Anyway, I was going to spell out how it all works in a blog post, but I got side tracked making an illustration, and now I'm out of time for blogging.  But I like the illustration.  More to come...&lt;br /&gt;&lt;br /&gt;&lt;a href="http://2.bp.blogspot.com/_zUbnTFdqWyg/TObJC6UiUOI/AAAAAAAAAGY/psYVEdfxnso/s1600/training_a_text_classifier.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 252px;" src="http://2.bp.blogspot.com/_zUbnTFdqWyg/TObJC6UiUOI/AAAAAAAAAGY/psYVEdfxnso/s320/training_a_text_classifier.png" alt="" id="BLOGGER_PHOTO_ID_5541337443378286818" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2344203844586276793?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2344203844586276793/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2344203844586276793' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2344203844586276793'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2344203844586276793'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/training-text-classifier.html' title='Training a text classifier'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://2.bp.blogspot.com/_zUbnTFdqWyg/TObJC6UiUOI/AAAAAAAAAGY/psYVEdfxnso/s72-c/training_a_text_classifier.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5500648057641131348</id><published>2010-11-17T13:58:00.000-08:00</published><updated>2010-11-17T13:58:49.474-08:00</updated><title type='text'>TED talk: Teaching kids real math with computers | Video on TED.com</title><content type='html'>&lt;a href="http://www.ted.com/talks/conrad_wolfram_teaching_kids_real_math_with_computers.html"&gt;Conrad Wolfram: Teaching kids real math with computers | Video on TED.com&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;I like this.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5500648057641131348?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='related' href='http://www.ted.com/talks/conrad_wolfram_teaching_kids_real_math_with_computers.html' title='TED talk: Teaching kids real math with computers | Video on TED.com'/><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5500648057641131348/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5500648057641131348' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5500648057641131348'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5500648057641131348'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/ted-talk-teaching-kids-real-math-with.html' title='TED talk: Teaching kids real math with computers | Video on TED.com'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1647821263734436550</id><published>2010-11-04T18:59:00.000-07:00</published><updated>2010-11-04T19:16:08.461-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><category scheme='http://www.blogger.com/atom/ns#' term='word cloud'/><category scheme='http://www.blogger.com/atom/ns#' term='NLP'/><title type='text'>Starting to get some dissertation results...</title><content type='html'>Apologies for the long delay between posts.  Stock excuse: "Dissertation... blah blah blah..."&lt;br /&gt;&lt;br /&gt;Actually, I'm starting to get some nifty results from my dissertation.  I've spent a long summer writing surveys and software, and in the next few weeks I hope to have something to show for it.  Exhibit A: a word cloud for an automated classifier of political content.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://4.bp.blogspot.com/_zUbnTFdqWyg/TNNn_dJ1DFI/AAAAAAAAAGQ/yYSiCjfFrHU/s1600/political_classifier.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 400px; height: 173px;" src="http://4.bp.blogspot.com/_zUbnTFdqWyg/TNNn_dJ1DFI/AAAAAAAAAGQ/yYSiCjfFrHU/s400/political_classifier.png" alt="" id="BLOGGER_PHOTO_ID_5535882706823744594" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Orange words are associated with political content, and blue words are  disassociated.  The size of a word denotes the strength of association  -- essentially, the size of each word corresponds to the absolute value  of the beta value of the word in a logistic regression with  "political-ness" as the dependent variable.  The layout of the words is  done by computer algorithm to conserve space; it doesn't carry any  important information.&lt;br /&gt;&lt;br /&gt;I used &lt;a href="http://www.wordle.net/"&gt;wordle&lt;/a&gt; for the layout.  The classifier runs regularized logistic regression using the &lt;a href="http://scikit-learn.sourceforge.net/"&gt;scikits.learn&lt;/a&gt; package for python.  The training data is from a team of undergraduate research assistants.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1647821263734436550?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1647821263734436550/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1647821263734436550' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1647821263734436550'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1647821263734436550'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/11/starting-to-get-some-dissertation.html' title='Starting to get some dissertation results...'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_zUbnTFdqWyg/TNNn_dJ1DFI/AAAAAAAAAGQ/yYSiCjfFrHU/s72-c/political_classifier.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-8783541213452361425</id><published>2010-10-08T15:59:00.000-07:00</published><updated>2010-10-08T16:09:31.058-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='sentiment analysis'/><category scheme='http://www.blogger.com/atom/ns#' term='opinion mining'/><title type='text'>Daily Beast's Election Oracle</title><content type='html'>Here's a nifty new application in &lt;a href="http://en.wikipedia.org/wiki/Sentiment_analysis"&gt;sentiment analysis&lt;/a&gt;: the Daily Beast's "&lt;a href="http://www.thedailybeast.com/election-oracle/?cid=hp:beastoriginalsC1"&gt;Election Oracle&lt;/a&gt;."  It's based on an engine called &lt;a href="http://www.wisewindow.com/"&gt;WiseWindow&lt;/a&gt;, which apparently scrapes the Web in real time, searching for content relevant to all the midterm elections, then makes predictions.  One interesting touch is that DailyBeast pipes in &lt;a href="http://www.realclearpolitics.com/"&gt;RealClearPolitics&lt;/a&gt;' latest poll numbers, inviting head-to-head comparisons.&lt;br /&gt;&lt;br /&gt;If the screen-scraping works the way it's intended, you  would expect WiseWindow's numbers to change a little faster than the polls, since polls are only conducted every so often, and usually include a rolling average.&lt;br /&gt;&lt;br /&gt;I'm skeptical.  Not because I don't believe that it can be done, but because I've read up on the top-of-the-line opinion mining algorithms, and they're still full of holes.  Computer scientists are smart people, but they haven't absorbed the lessons that pollsters have spent almost 100 years learning.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-8783541213452361425?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/8783541213452361425/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=8783541213452361425' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8783541213452361425'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/8783541213452361425'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/10/daily-beasts-election-oracle.html' title='Daily Beast&apos;s Election Oracle'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2842159536425458126</id><published>2010-10-06T17:16:00.001-07:00</published><updated>2010-10-06T17:21:08.562-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>My particular brand of procrastination</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_zUbnTFdqWyg/TK0RpLS5y9I/AAAAAAAAAGI/BEbbB6C5BsY/s1600/procrastination.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 230px;" src="http://1.bp.blogspot.com/_zUbnTFdqWyg/TK0RpLS5y9I/AAAAAAAAAGI/BEbbB6C5BsY/s320/procrastination.png" alt="" id="BLOGGER_PHOTO_ID_5525091716957850578" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2842159536425458126?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2842159536425458126/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2842159536425458126' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2842159536425458126'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2842159536425458126'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/10/my-particular-brand-of-procrastination.html' title='My particular brand of procrastination'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zUbnTFdqWyg/TK0RpLS5y9I/AAAAAAAAAGI/BEbbB6C5BsY/s72-c/procrastination.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7470894756429403559</id><published>2010-10-05T05:42:00.000-07:00</published><updated>2010-10-05T05:45:44.285-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='psychology'/><category scheme='http://www.blogger.com/atom/ns#' term='workflow'/><title type='text'>On procrastination</title><content type='html'>&lt;div&gt;&lt;div style="overflow: hidden; color: rgb(0, 0, 0); background-color: transparent; text-align: left; text-decoration: none; border: medium none;"&gt;"Victor  Hugo would write naked and tell his valet to hide his clothes so that  he’d be unable to go outside when he was supposed to be writing."&lt;span&gt;&lt;br /&gt;&lt;br /&gt;If only I didn't have to do so much work from the CAP lab...&lt;br /&gt;&lt;br /&gt;&lt;a href="http://www.newyorker.com/arts/critics/books/2010/10/11/101011crbo_books_surowiecki?currentPage=1"&gt;Here's&lt;/a&gt; a great, &lt;/span&gt;&lt;span&gt;well-sourced&lt;/span&gt;&lt;span&gt; article in the &lt;/span&gt;&lt;span&gt;New Yorker &lt;/span&gt;&lt;span&gt;on the psychology, philosophy, economics, and history of procrastination.  Worth a read for all grad students.&lt;br /&gt;&lt;br /&gt;&lt;/span&gt;&lt;/div&gt;&lt;/div&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7470894756429403559?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7470894756429403559/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7470894756429403559' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7470894756429403559'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7470894756429403559'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/10/on-procrastination.html' title='On procrastination'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2094249098853559907</id><published>2010-09-16T17:05:00.000-07:00</published><updated>2010-09-16T17:38:50.025-07:00</updated><title type='text'>A minute to learn, a lifetime to master</title><content type='html'>My family was in town last weekend (new baby--very exciting) and my brother Sam and I got to talking about board games, and what makes a good game.  One reasonable standard comes from Othello's tagline: "a minute to learn, a lifetime to master."  For strategy games, at least, we like the idea of complex gameplay emerging from simple rules.&lt;br /&gt;&lt;br /&gt;Following that idea, we pulled up a list of ~30 well-known games and rated them for "game complexity" (How complicated are the rules?) and "strategy complexity" (How complicated are strategies for winning?).  We came up with our ratings separately, but they were strongly correlated (R-squared values of .68 and .73, respectively).&lt;br /&gt;&lt;br /&gt;The strength of this correlation surprised me at first.  In retrospect, board games have been such an important part of brotherly bonding that I shouldn't have been taken aback.  Case in point: we had agreed to use 1 to 10 scales, but we both independently decided that War and Candyland should score zero for strategy complexity, because neither game ever presents players with a choice.&lt;br /&gt;&lt;a href="http://4.bp.blogspot.com/_zUbnTFdqWyg/TJK4cMI3VxI/AAAAAAAAAFo/3CurPvSa1Lg/s1600/gameplot.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 320px; height: 320px;" src="http://4.bp.blogspot.com/_zUbnTFdqWyg/TJK4cMI3VxI/AAAAAAAAAFo/3CurPvSa1Lg/s320/gameplot.jpg" alt="" id="BLOGGER_PHOTO_ID_5517675287916533522" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;With data in hand, I averaged our scores and then plotted game complexity against strategy complexity.  The two dimensions of complexity are correlated (R2 = .494).  Note that both my favorite game (Go), and his favorite game (Texas holdem) are strong positive outliers.&lt;br /&gt;&lt;br /&gt;Apologies for the ugly graph.  If I had the time, I'd have plugged in logos for all the games.  Still, the picture mostly comes through.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2094249098853559907?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2094249098853559907/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2094249098853559907' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2094249098853559907'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2094249098853559907'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/09/minute-to-learn-lifetime-to-master.html' title='A minute to learn, a lifetime to master'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://4.bp.blogspot.com/_zUbnTFdqWyg/TJK4cMI3VxI/AAAAAAAAAFo/3CurPvSa1Lg/s72-c/gameplot.jpg' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-544779814527466355</id><published>2010-09-08T13:55:00.000-07:00</published><updated>2010-09-08T14:36:10.883-07:00</updated><title type='text'>Link mishmash</title><content type='html'>Interesting post by &lt;a href="http://people.iq.harvard.edu/%7Edhopkins/"&gt;Dan Hopkins&lt;/a&gt; on the Monkey Cage: &lt;a href="http://www.themonkeycage.org/2010/09/spin_and_the_economy.html"&gt;Spin and the economy&lt;/a&gt; -- Can newspapers convince people the economy is better or worse than it really is?  No.&lt;br /&gt;&lt;br /&gt;Also from the MC: &lt;a href="http://www.themonkeycage.org/2010/09/political_science_forecasts_fo.html"&gt;predictions for the 2010 elections&lt;/a&gt;.  We're short of complete consensus, but the Dems are likely to lose the House.&lt;br /&gt;&lt;br /&gt;From FlowingData: &lt;a href="http://flowingdata.com/2010/09/07/tracking-firefly-trails-in-the-forest/"&gt;a beautiful delayed-shutter picture of fireflies in a forest&lt;/a&gt;.  See also the &lt;a href="http://flowingdata.com/2009/10/27/this-would-be-perfect-for-a-roomba-commercial/"&gt;delayed-shutter picture of a Roomba&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a href="http://feeds.nytimes.com/click.phdo?i=62c82575507b095e0428b65f9b4dc5ad"&gt;From NYTimes&lt;/a&gt;: folk wisdom about study habits -- some of it masquerading as scientific -- is often dead wrong.  This whole article is worth reading, but let me spoil a few of the punch lines.  Designated study rooms -- bad.  (What about designated study times?)  Learning styles (e.g. "I'm a visual learner") -- very little supporting evidence.  Testing -- often beneficial.&lt;br /&gt;&lt;br /&gt;An &lt;a href="http://feeds.pinkcakebox.com/%7Er/PinkCakeBoxPastries/%7E3/NdOO32tB9Wk/xkcd-comic-wedding-cake-2010-09-05.htm"&gt;xkcd wedding cake&lt;/a&gt;.  I love this web comic.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-544779814527466355?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/544779814527466355/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=544779814527466355' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/544779814527466355'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/544779814527466355'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/09/link-mishmash.html' title='Link mishmash'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-9069183388327692034</id><published>2010-08-18T19:05:00.001-07:00</published><updated>2010-08-18T19:10:00.844-07:00</updated><title type='text'>Please link/subscribe to me!</title><content type='html'>A cordial request: if you read or feed this blog, please link, subscribe, etc. in whatever format is right for you.  I'd like to bump up my PageRank etc. a little.  I promise I won't do this often.&lt;br /&gt;&lt;br /&gt;Thank you!  Here's a puppy:&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.diypuppytraining.com/puppy.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 300px; height: 207px;" src="http://www.diypuppytraining.com/puppy.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;We now return to our regularly scheduled social/computer-sciency blogging.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-9069183388327692034?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/9069183388327692034/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=9069183388327692034' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9069183388327692034'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9069183388327692034'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/please-linksubscribe-to-me.html' title='Please link/subscribe to me!'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5050348211758698745</id><published>2010-08-17T11:26:00.000-07:00</published><updated>2010-08-17T11:28:58.141-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='computing'/><title type='text'>Probabilistic computing?</title><content type='html'>Hat tip &lt;a href="http://www.stat.columbia.edu/%7Ecook/movabletype/archives/2010/08/probability-pro_1.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed%3A+StatisticalModelingCausalInferenceAndSocialScience+%28Statistical+Modeling%2C+Causal+Inference%2C+and+Social+Science%29"&gt;SMCISS&lt;/a&gt;.&lt;br /&gt;&lt;p&gt;&lt;/p&gt;&lt;blockquote&gt;&lt;p&gt;For over 60 years, computers have been based on digital computing  principles. Data is represented as bits (0s and 1s). Boolean logic gates  perform operations on these bits. A processor steps through many of  these operations serially in order to perform a function. However,  today’s most interesting problems are not at all suited to this  approach. &lt;/p&gt;           Here at Lyric Semiconductor, we are redesigning information  processing circuits from the ground up to natively process  probabilities: from the gate circuits to the processor architecture to  the programming language.  As a result, many applications that today  require a thousand conventional processors will soon run in just one  Lyric processor, providing 1,000x efficiencies in cost, power, and size.&lt;br /&gt;&lt;/blockquote&gt;&lt;br /&gt;Hat tip &lt;a href="http://www.stat.columbia.edu/%7Ecook/movabletype/archives/2010/08/probability-pro_1.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed%3A+StatisticalModelingCausalInferenceAndSocialScience+%28Statistical+Modeling%2C+Causal+Inference%2C+and+Social+Science%29"&gt;SMCISS&lt;/a&gt;.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5050348211758698745?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5050348211758698745/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5050348211758698745' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5050348211758698745'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5050348211758698745'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/probabilistic-computing.html' title='Probabilistic computing?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-9090204874040881684</id><published>2010-08-15T08:05:00.000-07:00</published><updated>2010-08-16T20:20:52.832-07:00</updated><title type='text'>Psychologists say that power corrupts, via WSJ</title><content type='html'>&lt;a href="http://online.wsj.com/article/SB10001424052748704407804575425561952689390.html?mod=wsj_share_facebook"&gt;Psychologists say that power corrupts, via WSJ.&lt;/a&gt;  I know that this thesis is pretty widely accepted, but I want to snipe at it.  I think we may be mistaking correlation for causation.  The general arc of my argument is that thuggishness may  (sometimes) lead to power, instead of the other way around.&lt;br /&gt;&lt;br /&gt;First, all the cited psych studies showing "nice guys finish  first" are from areas with dense social networks (dorms, sororities,  chimp clans) and repeated person-to-person interactions.  Not  surprisingly, likability translates to popularity here.  But I don't see  any particular reason why that conclusion should translate to  accumulation of power through job interviews or elections.  I'm going out on a limb here, but perhaps the qualities that make frat boys popular are not the same qualities that put CEOs in power.&lt;br /&gt;&lt;br /&gt;Second, at risk of alienating a lot of my friends in political psychology, I'm not a big fan of gimmicky priming studies.  For instance: tell me about a time when you felt powerful, then write  the letter E on your head.  You wrote it backwards?  See -- power  corrupts.&lt;br /&gt;&lt;br /&gt;Err... run that by me again?  At what point does this experiment measure "power," and "corruption," and where does it demonstrate a casual relationship between the two?  Sure, the researchers who did this study can do a song and dance about it, but at the end of the day, there are a lot of leaps in the logic.  The study is gimmicky.  Interesting maybe, but nowhere close to conclusive.&lt;br /&gt;&lt;br /&gt;I completely accept the notion that our minds move between different mental states and that primed manipulations can nudge us from one state to another.  But I'm not persuaded that we've mapped the set of states and priming cues in enough detail to draw general conclusions -- particularly when treatments and responses are measured in ways that never occur in normal human experience.&lt;br /&gt;&lt;br /&gt;Third, the idea that transparency might fix things is nice, but lacking evidence.  This is a place where careful study of incentives and strategy could add a lot of leverage.  Go game theory!&lt;br /&gt;&lt;br /&gt; Parting shot: I'm not anti-niceness or pro-corruption, but I think we often demonize  authority and authority figures.  There are an awful lot  of social problems that best resolved through the benevolent use of  authority, and I know plenty of people who live up to that charge.  To the extent that power has a corrupting tendency, we should be looking for, promoting, and celebrating those who are proof against it.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-9090204874040881684?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/9090204874040881684/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=9090204874040881684' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9090204874040881684'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/9090204874040881684'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/psychologists-say-that-power-corrupts.html' title='Psychologists say that power corrupts, via WSJ'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-272870001014217522</id><published>2010-08-14T14:05:00.000-07:00</published><updated>2010-08-14T14:41:12.969-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='software'/><category scheme='http://www.blogger.com/atom/ns#' term='research design'/><title type='text'>Wacom tablet: post-purchase rationalization</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zUbnTFdqWyg/TGcNRWJAtdI/AAAAAAAAAFQ/vLfKGhpR6Ng/s1600/targets.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 340px; height: 249px;" src="http://3.bp.blogspot.com/_zUbnTFdqWyg/TGcNRWJAtdI/AAAAAAAAAFQ/vLfKGhpR6Ng/s200/targets.png" alt="" id="BLOGGER_PHOTO_ID_5505383661136033234" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;So I bought a &lt;a href="http://www.amazon.com/Wacom-CTL460-Bamboo-Pen-Tablet/dp/B002OOWC3I/ref=cm_cr_pr_product_top"&gt;Wacom Bamboo tablet&lt;/a&gt; last week.  It's a nice digital pen tablet, with a drawing area about the size of a postcard -- a good size for sketching &amp;amp; gesture-based interfaces.  It plugs into a USB port and (after some fiddling with the drivers) works very nicely with Windows, Firefox, Inkscape, etc.&lt;br /&gt;&lt;br /&gt;That said, I'm not really sure why I bought the thing.  I do a lot of writing and programming, which leads to a lot of typing, but not much clicking and dragging.  Sure, the touchpad on my notebook is small, but it's not really $60 small.  I blame postpartum sleep deprivation.  It might also have something to do with watching this &lt;a href="http://www.ted.com/talks/john_underkoffler_drive_3d_data_with_a_gesture.html"&gt;TED talk.&lt;br /&gt;&lt;br /&gt;&lt;/a&gt;Anyway, in an effort to assuage my &lt;a href="http://en.wikipedia.org/wiki/Post-purchase_rationalization"&gt;post-purchase cognitive dissonance&lt;/a&gt;, I've been telling myself that if I can improve the speed and accuracy of my clicks by just a fraction, this tablet thing will easily pay itself off in productivity in the long run.  Right?&lt;br /&gt;&lt;br /&gt;To bring some proof to that claim, I dug up this &lt;a href="http://www.2dplay.com/rapid-fire-2/rapid-fire-2-play.htm"&gt;flash-based target practice game&lt;/a&gt;.  Little targets fly around the screen and you try to click on them: score points for every target you hit; bonus points for consecutive bulls-eyes; miss too many times and the game is over.  Great sound effects.  This is high brainpower stuff.&lt;br /&gt;&lt;br /&gt;I played three rounds in time trial mode, using the touchpad.  Mean score 438.  Then I played thrice with the tablet.  Mean score 685!&lt;br /&gt;&lt;br /&gt;To make sure this wasn't just an improvement in my reflexes and strategy (shoot the spring targets at the apex; don't waste a shot on a yo-yo target that might be yanked), I employed an &lt;a href="http://www.statsoft.com/textbook/time-series-analysis/#interrupted"&gt;interrupted time series design &lt;/a&gt;and played six more rounds.  Mean score with the touchpad: 407.  Mean score with the tablet: 977!&lt;br /&gt;&lt;br /&gt;With that kind of performance improvement, the tablet was clearly worth it.  Minority report, here I come.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-272870001014217522?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/272870001014217522/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=272870001014217522' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/272870001014217522'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/272870001014217522'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/wacom-tablet-post-purchase.html' title='Wacom tablet: post-purchase rationalization'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zUbnTFdqWyg/TGcNRWJAtdI/AAAAAAAAAFQ/vLfKGhpR6Ng/s72-c/targets.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3059170350175443883</id><published>2010-08-12T11:10:00.000-07:00</published><updated>2010-08-12T11:19:30.416-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='math'/><title type='text'>TED talk on origami</title><content type='html'>James Lang, physicist and origami master, &lt;a href="http://www.ted.com/talks/robert_lang_folds_way_new_origami.html"&gt;speaks at TED&lt;/a&gt;.  He talks about a fascinating intersection of art and math: origami.&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.ethanzuckerman.com/blog/wp-content/2008/02/ibex.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 378px; height: 283px;" src="http://www.ethanzuckerman.com/blog/wp-content/2008/02/ibex.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;This is a great, visual talk that links ideas you would never expect to come together: water bombs, ibex, and spiders turn into satellites, heart valves, and airbags, using the math of trees and circle packing.  A really great example of the unexpected practical value of mathematics.&lt;br /&gt;&lt;br /&gt;Hat tip to my brother Matthew for putting me onto Lang's work.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3059170350175443883?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3059170350175443883/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3059170350175443883' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3059170350175443883'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3059170350175443883'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/ted-talk-on-origami.html' title='TED talk on origami'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7763197040331655895</id><published>2010-08-09T05:02:00.000-07:00</published><updated>2010-08-09T08:35:14.661-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='math'/><title type='text'>P != NP?  The biggest mathematical problem in computer science may be solved</title><content type='html'>Banks, businesses, and credit card users can rest easy: public key encryption is still secure.  Mathematicians breathe a sigh of relief: computers aren't going to replace you by churning out brute force proofs.  Vinay Deolalikar, a researcher at HP Labs, is claiming that he has verified the most important conjecture in mathematical computer science: P != NP.&lt;br /&gt;&lt;br /&gt;Some links, and then onto an explanation of why this is all a big deal.&lt;br /&gt;&lt;ul&gt;&lt;li&gt;&lt;a href="http://gregbaker.ca/blog/author/greg/"&gt;Greg Baker&lt;/a&gt; broke the story.&lt;/li&gt;&lt;li&gt;The &lt;a href="http://science.slashdot.org/story/10/08/08/226227/Claimed-Proof-That-P--NP"&gt;slashdot&lt;/a&gt; squib.&lt;/li&gt;&lt;li&gt;Here's &lt;a href="http://rjlipton.wordpress.com/2010/08/08/a-proof-that-p-is-not-equal-to-np/"&gt;a very mathy site&lt;/a&gt; with some good context and historical background.&lt;/li&gt;&lt;li&gt;Here's a &lt;a href="http://en.wikipedia.org/wiki/P_versus_NP_problem#Consequences_of_proof"&gt;wiki section on consequences of a proof&lt;/a&gt; one way or the other.  I don't like it much, so I'm going to try to do better here.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;What's the difference between P and NP?&lt;/span&gt;&lt;br /&gt;Suppose you have a friend -- a real whiz at math and logic.  "I can answer any question you ask*," she promises, "it just might take some time."  Before you ask her to solve your sudoku, route the world's airline traffic, or crack your bank's electronic encryption key, it's worth figuring out "Just how long is it going to take?"&lt;br /&gt;&lt;br /&gt;For many kinds of problems, the answer depends on how many moving parts there are.  To cite a famous example, suppose I'm a &lt;a href="http://en.wikipedia.org/wiki/Travelling_salesman_problem"&gt;traveling salesman&lt;/a&gt;, using planes, trains, and automobiles to get between cities.  I have &lt;span style="font-style: italic;"&gt;n&lt;/span&gt; cities to visit.  What is the fastest route that lets me visit every city?&lt;br /&gt;&lt;br /&gt;Intuitively, the more cities there are, the harder this problem is going to be.  If &lt;span style="font-style: italic;"&gt;n&lt;/span&gt;=1, the answer is trivial.  I start in the single city and just stay there -- no travel time.  If &lt;span style="font-style: italic;"&gt;n&lt;/span&gt;=2, the answer is easy.  It's equal to the amount of time between the two cities (or double that, if I'm required to make a round trip.)  If &lt;span style="font-style: italic;"&gt;n&lt;/span&gt;=3, it starts to get complicated because I have several paths to check.  If &lt;span style="font-style: italic;"&gt;n&lt;/span&gt;=100, then solving the problem could take a long time.  Even your math whiz friend might have to chew on it for a while.&lt;br /&gt;&lt;br /&gt;In the world of computer science, P-type problems are the fast, easy, tractable ones.  They can be solved in "polynomial time": if the problem has more moving parts, it will take longer to solve, but not a whole lot longer.    Mathematically speaking, a problem belongs to complexity class P if you can write a function &lt;span style="font-style: italic;"&gt;f(n) =&lt;/span&gt; &lt;span style="font-style: italic;"&gt;n^k&lt;/span&gt; and prove that a problem with &lt;span style="font-style: italic;"&gt;n &lt;/span&gt;elements can be solved in &lt;span style="font-style: italic;"&gt;f(n)&lt;/span&gt; operations or less.  Constructing proofs of this kind is a big part of what mathematical computer scientists do.&lt;br /&gt;&lt;br /&gt;If we could prove that k=3 for the traveling salesman problem -- we can't, by the way -- then we could guarantee that a computer could solve the 100-city problem in 1 million time steps or less.  If your math whiz friend uses an Intel chip running millions of operations per second, that doesn't look so bad.  For P problems, we can put a bound on running time, and know that as &lt;span style="font-style: italic;"&gt;n&lt;/span&gt; increases, the performance bound increases at worst by the exponent &lt;span style="font-style: italic;"&gt;k&lt;/span&gt;.&lt;br /&gt;&lt;br /&gt;NP-type problems can get harder, because the definition is looser.  Instead of requiring your friend to come up with the answer in polynomial time, we only require her to &lt;span style="font-style: italic;"&gt;verify&lt;/span&gt; an answer in polynomial time.  ("My boss says I should go to Albuquerque, then Boston, then Chicago, then Denver.  Is that really the fastest route?"  "Hang on.  I'll get back to you in &lt;span style="font-style: italic;"&gt;n^k&lt;/span&gt; seconds.")  If a problem is of type NP, then we can find the right answer using a "brute force," guess-and-check strategy.  Unfortunately, as the number of moving parts increases, the number of possible answers explodes, and the the time required to solve the problem can get &lt;span style="font-style: italic;"&gt;a lot&lt;/span&gt; longer.&lt;br /&gt;&lt;br /&gt;Finding the solution to NP problems when &lt;span style="font-style: italic;"&gt;n&lt;/span&gt; is large can take a ridiculously long time.  Some NP problems are so computationally intensive that you could take all the computers in the world and multiply their processing power by a billion, run them in parallel for the lifespan of the universe, and still not find the solution.  Problems or this type are perfectly solvable, but too complicated to be solved perfectly.&lt;br /&gt;&lt;br /&gt;There's a third class of problems, called "NP-hard," that are even worse.  Things are bad enough already that I'm not going to get into them here.   You may also hear about "NP-complete" problems.  These are problems that are both NP and NP-hard.   (See this &lt;a href="http://en.wikipedia.org/wiki/P_versus_NP_problem#NP-complete"&gt;very helpful Venn diagram&lt;/a&gt; from wikipedia.)&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zUbnTFdqWyg/TGAeo1OxMmI/AAAAAAAAAFA/4MlUdwPvuAw/s1600/800px-P_np_np-complete_np-hard.svg.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 339px; height: 241px;" src="http://3.bp.blogspot.com/_zUbnTFdqWyg/TGAeo1OxMmI/AAAAAAAAAFA/4MlUdwPvuAw/s200/800px-P_np_np-complete_np-hard.svg.png" alt="" id="BLOGGER_PHOTO_ID_5503432431479370338" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;The P !=NP Conjecture&lt;/span&gt;&lt;br /&gt;But there's an intriguing catch.  Until last week, nobody was sure whether any truly NP problems existed.**  Maybe, just maybe, some mathematical or computational trick would allow us to convert tricky NP problems into much easier P problems.&lt;br /&gt;&lt;br /&gt;Most computer scientists didn't believe that this was possible -- but they weren't sure.  Thus, P!=NP was a longstanding conjecture in the field.  (By the way, "!=" means "does not equal" in many programming languages).  Many proofs had been attempted to show that P=NP, or P!=NP, but all of them had failed.  In 2000, the Clay Mathematics Institute posed P!=NP as one of seven of the most important unsolved problems in mathematics.  They offered &lt;a href="http://en.wikipedia.org/wiki/Millennium_Prize_Problems"&gt;a million dollar prize&lt;/a&gt; for a proof or refutation of the conjecture.  The prize has gone unclaimed for a decade.&lt;br /&gt;&lt;br /&gt;Last week, Vinay Deolalikar (cautiously) began circulating a 100-page paper claiming to prove that P!=NP.  His proof hasn't been published or confirmed yet, but if he's right, he's solved one of the great open problems in mathematics.&lt;br /&gt;&lt;br /&gt;&lt;span style="font-weight: bold;"&gt;So what?&lt;/span&gt;&lt;br /&gt;Let's assume for the moment that Deolalikar has dotted his i's and crossed his t's.  What would a P!=NP proof mean?&lt;br /&gt;&lt;br /&gt;One immediate implication is in code-breaking.  &lt;a href="http://en.wikipedia.org/wiki/Public-key_cryptography"&gt;Public key encryption&lt;/a&gt; -- the basis for most online banking, e-commerce, and secure file transfers -- relies on the assumption that certain mathematical problems are more than P-difficult.  Until last week, that was just an assumption.  There was always the possibility that someone would crack the P-NP problem and be able to break these codes easily.  A P!=NP proof would eliminate that threat.  (Other threats to security could still come from number theory or quantum computing.)  For wikileaks, credit card companies, and video pirates, this is reason to celebrate.&lt;br /&gt;&lt;br /&gt;On the downside, a proof would put a computationally cheap solution to the traveling salesman problem (and every other NP-but-not-P problem) permanently out of reach.  This includes a host of practical problems in logistics, engineering, supply chain management, communications, and network theory.  Would-be optimizers in those areas will sigh, accept some slack in their systems, and continue looking for "good enough" solutions and heuristics.  Expect a little more attention to clever approximations, parallel processing, and statistical approaches to problem solving.&lt;br /&gt;&lt;br /&gt;A P!=NP proof can also help explain the messiness and confusion of politics and history.  &lt;a href="http://www.eecs.berkeley.edu/%7Echristos/papers/cacmDGP-2.pdf"&gt;Recent work in game theory&lt;/a&gt; demonstrates that finding stable solutions in strategic interactions with multiple players (e.g. solving a game) is a computationally difficult problem.  If it turns out that reaching equilibrium is NP, that would put bounds on the quality of likely policy solutions.  For instance, we can pose the problem of writing legislation and building a coalition to ratify it as an optimization problem over policy preferences, constrained by preferences.  If finding an optimal solution takes more than polynomial time, all the polling, deliberation, and goodwill in the world might still not be enough to arrive at a perfect (i.e. Nash, and/or Pareto-optimal) solution.  In the meantime, politicians will make mistakes, politics will stay messy, and history will stay unpredictable.&lt;br /&gt;&lt;br /&gt;Finally, a P!=NP proof puts us in an interesting place philosophically, especially when it comes to understanding intelligence.  As with engineering, optimal solutions to many problems will always out of reach for artificial intelligence.  Unless computation becomes much cheaper, a computer will never play the perfect game of chess, let alone the perfect game of &lt;a href="http://en.wikipedia.org/wiki/Go_%28game%29#Computers_and_Go"&gt;go&lt;/a&gt;.  Other difficult (and perhaps more useful) problems will also stay out of reach.&lt;br /&gt;&lt;br /&gt;For those who take a &lt;a href="http://en.wikipedia.org/wiki/Materialism"&gt;materialistic&lt;/a&gt; view of human intelligence -- that is, your mind/consciousness operates within a brain and body composed of matter -- a P!=NP proof puts the same constraint on &lt;span style="font-style: italic;"&gt;our&lt;/span&gt; intelligence.  As fancy and impressive as our neural hardware is, it's bound by the laws of physics and therefore the laws of algorithmic computation.  If that's true, then there are difficult problems that no amount of information, expertise, or intuition will ever solve.&lt;br /&gt;&lt;br /&gt;The jury is still out on Deolalikar's paper.  But if he's right, this isn't guesswork or extrapolation -- it's a matter of proof.&lt;br /&gt;&lt;br /&gt;* Your friend solves logic puzzles; she's not an oracle.  To give you  the right answer, she needs to have all the relevant information when  she begins, so it's no good asking her to pick winning horses in the  Kentucky Derby, or guess how many fingers you're holding up behind your  back.&lt;br /&gt;&lt;br /&gt;** To be precise here, all P problems are also NP.  If I can give you the solution in polynomial time, I can also verify the solution in polynomial time.  When I say "truly NP," I mean problems that are NP but not P, such as NP-complete problems.  This is the class of problems whose existence has been unclear.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7763197040331655895?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7763197040331655895/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7763197040331655895' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7763197040331655895'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7763197040331655895'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/p-np-biggest-mathematical-problem-in.html' title='P != NP?  The biggest mathematical problem in computer science may be solved'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zUbnTFdqWyg/TGAeo1OxMmI/AAAAAAAAAFA/4MlUdwPvuAw/s72-c/800px-P_np_np-complete_np-hard.svg.png' height='72' width='72'/><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5066255395989248299</id><published>2010-08-08T16:36:00.000-07:00</published><updated>2010-08-08T16:59:44.476-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='software'/><title type='text'>Inkscape</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://inkscapetips.files.wordpress.com/2007/09/inkscape_logo_2.png?w=468"&gt;&lt;img style="float: right; display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 200px; height: 200px;" src="http://inkscapetips.files.wordpress.com/2007/09/inkscape_logo_2.png?w=468" alt="" border="0" /&gt;&lt;/a&gt;If you haven't tried &lt;a href="http://www.inkscape.org/"&gt;Inkscape &lt;/a&gt;yet, you should.  It's an open-source graphics program, like Illustrator, but cheap.  Very functional, totally free, the built-in documentation is pretty solid.  Since it's open source with a good user base, I expect development to come along rapidly.  Worth a look.&lt;br /&gt;Here's a link to the Inkscape &lt;a href="http://www.inkscape.org/showcase/"&gt;showcase&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Here's a link to an &lt;a href="http://www.inkscape.org/doc/tracing/tutorial-tracing.html"&gt;essential tool&lt;/a&gt; in Inkscape: converting bitmaps to vector graphics.&lt;br /&gt;&lt;br /&gt;Here's me showing off the result of an hour playing around with bitmaps, fonts, and vector paths: a spoof on the &lt;a href="http://www.seeklogo.com/images/B/Baby_Einstein-logo-CB16219F0C-seeklogo.com.gif"&gt;Baby Einstein logo&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zUbnTFdqWyg/TF9EqBPNHsI/AAAAAAAAAE4/R3tmiS3hY_0/s1600/baby_genghis2.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 348px; height: 234px;" src="http://3.bp.blogspot.com/_zUbnTFdqWyg/TF9EqBPNHsI/AAAAAAAAAE4/R3tmiS3hY_0/s200/baby_genghis2.png" alt="" id="BLOGGER_PHOTO_ID_5503192758347439810" border="0" /&gt;&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5066255395989248299?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5066255395989248299/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5066255395989248299' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5066255395989248299'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5066255395989248299'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/inkscape.html' title='Inkscape'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zUbnTFdqWyg/TF9EqBPNHsI/AAAAAAAAAE4/R3tmiS3hY_0/s72-c/baby_genghis2.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2472637058175581341</id><published>2010-08-07T11:33:00.000-07:00</published><updated>2010-08-07T12:01:24.146-07:00</updated><title type='text'>"Fail small and fast"</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://img.jesper.nu/view/900/fail28.jpg"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 450px; height: 330px;" src="http://img.jesper.nu/view/900/fail28.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Here's &lt;a href="http://www.slate.com/blogs/blogs/thewrongstuff/archive/2010/08/03/error-message-google-research-director-peter-norvig-on-being-wrong.aspx"&gt;an interesting interview with Google's Peter Norvig&lt;/a&gt; on Slate's &lt;a href="http://www.slate.com/blogs/blogs/thewrongstuff/"&gt;The Wrong Stuff&lt;/a&gt; (found via &lt;a href="http://www.marginalrevolution.com/"&gt;Marginal Revolution&lt;/a&gt;).  The topic is Google's approach to innovation, error, and failure.  I really like the admonition to "fail small and fast."&lt;br /&gt;&lt;br /&gt;Just to chip in my own two cents -- I've discovered that academic culture* is intolerant towards failure, errors, and mistakes in general.  (You learn this fast when you have a lot of bad ideas.)  We treat research like building a space shuttle, not looking for a decent restaurant.  We do a great job critiquing ideas and tearing apart arguments, but we're often not very good about brainstorming or being willing to fail publicly.  We're very reluctant to accept the null hypothesis.&lt;br /&gt;&lt;br /&gt;Why?  It's always struck me as really odd that an occupation that is ostensibly about new ideas, innovation, and pushing the status quo ends taking such a slow-moving, conservative approach.  I'm still not sure why.  Some working hypotheses:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Like politicians, &lt;span style="font-weight: bold;"&gt;professors' careers live and die by reputation&lt;/span&gt;.  We can't afford to fail publicly.  Those who do lose credibility and are forgotten.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Grad school selects for the risk-averse&lt;/span&gt;.  Bigger bets and more daring would benefit the field, but those sorts of people channel themselves to jobs that pay more and don't take 5 extra years of schooling to&lt;br /&gt;&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;Committee decisions favor the cautious&lt;/span&gt;.  Decisions about funding, tenure, and publication are all made by committee, usually anonymously.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;The old guard kills innovation&lt;/span&gt;.  If you made your name building up theory X, and some upstart comes along arguing that theory Y is better, it's only natural to resist.  I don't believe the strong, cynical version of this hypothesis, but I've seen a number of examples of generational inertia.&lt;/li&gt;&lt;li&gt;&lt;span style="font-weight: bold;"&gt;The publication scale makes big bets risky&lt;/span&gt;.  It takes several years to publish a book.  If you have to write one book to graduate and another to get tenure, you don't have a lot of time to write failed books.  Smaller increments might allow for more failure and therefore more innovation.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;*This may be limited to political science; I'm not as immersed in the norms of other disciplines.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2472637058175581341?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2472637058175581341/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2472637058175581341' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2472637058175581341'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2472637058175581341'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/fail-small-and-fast.html' title='&quot;Fail small and fast&quot;'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-2411658854933180221</id><published>2010-08-05T08:45:00.001-07:00</published><updated>2010-08-15T08:05:21.221-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='text mining'/><category scheme='http://www.blogger.com/atom/ns#' term='miscellaneous'/><category scheme='http://www.blogger.com/atom/ns#' term='data visualization'/><title type='text'>Link mishmash</title><content type='html'>Four links worth a quick visit.&lt;br /&gt;&lt;br /&gt;1. "Minimum parking requirements act like a fertility drug for cars."  Interesting &lt;a href="http://www.nytimes.com/2010/08/15/business/economy/15view.html?_r=2&amp;amp;src=busln"&gt;NYTimes editorial&lt;/a&gt;  on the consequences of government-mandated parking.  I'm interesting in  getting the libertarian take on this one.  Do minimum parking  requirements distort the market for land use by overproducing parking  lots and roads, or do they facilitate commerce by lower transaction  costs?  Hat tip: David Smith.&lt;br /&gt;&lt;br /&gt;2. An &lt;a href="http://images.businessweek.com/mz/10/34/1034endorsments22.pdf?chan=magazine+channel_news+-+companies+%2B+industries"&gt;infographic from Bloomberg Businessweek&lt;/a&gt;  on negative buzz, doping, and Lance Armstrong's reputation.  The  interesting thing here is the source:  automated buzz tracking services  applying sentiment analysis to the blogosphere.  This is a technology I  plan to use in my dissertation.&lt;br /&gt;&lt;br /&gt;3. &lt;a href="http://ideologicalcartography.com/2010/07/12/citizens-united-and-the-myth-of-a-conservative-corporate-america/"&gt;The myth of a conservative corporate America&lt;/a&gt; - A nice infographic on corporate donations to politics.  Good use of FEC data.&lt;br /&gt;&lt;br /&gt;4. &lt;a href="http://www.stat.columbia.edu/%7Ecook/movabletype/archives/2010/08/turning_pages_i.html?utm_source=feedburner&amp;amp;utm_medium=feed&amp;amp;utm_campaign=Feed%3A+StatisticalModelingCausalInferenceAndSocialScience+%28Statistical+Modeling%2C+Causal+Inference%2C+and+Social+Science%29"&gt;Some good resources for screen scraping&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-2411658854933180221?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/2411658854933180221/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=2411658854933180221' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2411658854933180221'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/2411658854933180221'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/link-mishmash.html' title='Link mishmash'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-4983326774045923326</id><published>2010-08-05T08:43:00.000-07:00</published><updated>2010-08-05T09:27:22.022-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='information technology'/><category scheme='http://www.blogger.com/atom/ns#' term='media'/><title type='text'>"Information is good"</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://1.bp.blogspot.com/_zUbnTFdqWyg/TFrmWDwIe2I/AAAAAAAAAEw/IZEYSNLGUho/s1600/information.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 301px; height: 132px;" src="http://1.bp.blogspot.com/_zUbnTFdqWyg/TFrmWDwIe2I/AAAAAAAAAEw/IZEYSNLGUho/s200/information.png" alt="" id="BLOGGER_PHOTO_ID_5501963161425181538" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;&lt;br /&gt;Picking up a thread from &lt;a href="http://www.themonkeycage.org/2010/07/information_is_good.html"&gt;the monkey cage&lt;/a&gt; (great blog--worth a look), Slate has &lt;a href="http://www.slate.com/id/2262108/"&gt;an article&lt;/a&gt; by Washington Post reporter Anne Applebaum arguing that "WikiLeaks' data-dump reporting simply makes a case for the existence of the mainstream media."&lt;br /&gt;&lt;br /&gt;The article is a quick read.  I want to pick it apart, and see where it takes us.&lt;br /&gt;&lt;br /&gt;Things Applebaum gets:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Facts need interpretation to be useful.&lt;/li&gt;&lt;li&gt;In this instance WikiLeaks provided lots of facts and very little interpretation.&lt;/li&gt;&lt;/ul&gt;Things Applebaum misses:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;"Mainstream media" is not the only institution that can provide useful interpretation, context, expertise, etc.&lt;/li&gt;&lt;li&gt;Transparency (making data public) breaks up mainstream media's monopoly on interpretation by allowing others (e.g. bloggers) to tell different stories.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Transparency allows savvy readers to come to their own conclusions, instead of relying on reporters to decide which facts are important.  Most people won't bother to do this.  But it's useful to have multiple perspectives.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;My takeaways:&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Data dumps don't replace mainstream media -- but they do threaten its monopoly on interpretation.  The slightly defensive tone of the article suggests that Applebaum agrees, whether or not she wants to admit it.  Don't get me wrong.  I'm not a Big Media hater, but I am a big fan of transparent reporting.&lt;/li&gt;&lt;li&gt;On a more philosophical level, the difference between raw data and interpretation leads to some interesting ideas about "information."  Raw data is not actionable.  Is it still information?  For most people, the dense, jargon-laden text of military reports don't mean anything.  Reading them doesn't give you any traction to change your opinion or do anything differently -- they are "uninformative."  A few experts (e.g. troops in Afghanistan, military experts) know how to unpack and sift through these reports.  Apparently, what constitutes "information" is in the eye of the beholder.&lt;br /&gt;&lt;/li&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-4983326774045923326?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/4983326774045923326/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=4983326774045923326' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4983326774045923326'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/4983326774045923326'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/08/information-is-good.html' title='&quot;Information is good&quot;'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://1.bp.blogspot.com/_zUbnTFdqWyg/TFrmWDwIe2I/AAAAAAAAAEw/IZEYSNLGUho/s72-c/information.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7290358143046432985</id><published>2010-07-23T06:07:00.000-07:00</published><updated>2010-07-23T06:10:42.135-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='institutions'/><category scheme='http://www.blogger.com/atom/ns#' term='information technology'/><title type='text'>Mutiny, information technology, and technocracy</title><content type='html'>&lt;ul style="padding: 0px; margin: 0px;"&gt;&lt;p&gt;Responding to David Brooks, &lt;a href="http://www.nytimes.com/2010/07/20/opinion/20brooks.html?_r=2&amp;amp;ref=davidbrooks"&gt;on the recent upsurge in technocracy and its risks&lt;/a&gt;.  (This post was originally part of a discussion thread with friends, but I got into it enough to decide to put it up here.)&lt;br /&gt;&lt;/p&gt;&lt;p&gt;I'm convinced technology is increasing the marginal  truthfulness of many progressive claims.  I don't believe it  fundamentally changes the relationship among individuals, economies,  government, and other social institutions.&lt;/p&gt;&lt;p&gt;The example that  comes to mind is mutiny.  Mutiny was a huge worry for captains and  navies for hundreds of years, right up until the invention of the radio.   See &lt;a target="_blank" href="http://www.google.com/url?sa=D&amp;amp;q=http%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMutiny"&gt;mutiny&lt;/a&gt; for interesting reading, and watch &lt;a target="_blank" href="http://www.google.com/url?sa=D&amp;amp;q=http%3A%2F%2Fwww.imdb.com%2Ftitle%2Ftt0056264%2F"&gt;Mutiny on the Bounty&lt;/a&gt; and &lt;a target="_blank" href="http://www.google.com/url?sa=D&amp;amp;q=http%3A%2F%2Fwww.imdb.com%2Ftitle%2Ftt0046816%2F"&gt;The Caine Mutiny&lt;/a&gt; (starring &lt;span style="border-bottom: 1px solid rgb(204, 0, 0);"&gt;Humprey&lt;/span&gt;&lt;span contenteditable="false"&gt;&lt;span&gt;&lt;/span&gt;&lt;/span&gt; Bogart!) for good fictional accounts of the psychology and institutions that shape mutiny.&lt;/p&gt;&lt;p&gt;Radio was a game changer.  Since the invention and adoption of radio, mutiny has been almost unheard of, especially among large naval powers.  (The Vietnam-era &lt;a target="_blank" href="http://www.google.com/url?sa=D&amp;amp;q=http%3A%2F%2Fen.wikipedia.org%2Fwiki%2FSS_Columbia_Eagle_incident"&gt;SS_Columbia_Eagle_incident&lt;/a&gt; is  the exception that proves the rule.)  Shipboard radios tighten the link  between the captain's authority and the worldwide chain of command.  It  makes escape extremely unlikely for mutineers.  However, desertion, disobedience, cowardice, incompetence, corruption, theft, etc. are still problems for ships and navies.&lt;/p&gt;&lt;p&gt;As I see it, mutiny was already a marginal activity -- very risky  for the mutineers, with a low probability of success -- and radio  pushed the marginal success rate much lower.  But mutiny is just one  act.  From the perspective of naval efficiency, radio changed the  balance of power, but didn't fix the underlying social problems of  enforcing discipline and coordinating action.  Radio caused changes in the social structure of ships, but they didn't fundamentally alter the problems that navies face.&lt;/p&gt;&lt;p&gt;Information technology is doing a similar thing today.  It lowers the cost of storing, transmitting, aggregating, and manipulating data.  Where lower transaction costs can solve social problems, the progressives (and I'm one of them, cautiously) are right to be optimistic.&lt;/p&gt;&lt;p&gt;But  many kinds of information have been cheap for a long time.  Socially,  we haven't solved the problems of greed, lying, bureaucratic turf wars,  bullying, corruption, graft, incompetence...  When *those* are the real  causes, changes in information technology can't be expected to help  nearly as much.  We need to invent better institutions first.&lt;span class="__wave_paste" annotations="513,519,link%2Fmanual,http%3A%2F%2Fen.wikipedia.org%2Fwiki%2FMutiny:555,575,link%2Fmanual,http%3A%2F%2Fwww.imdb.com%2Ftitle%2Ftt0056264%2F:580,596,link%2Fmanual,http%3A%2F%2Fwww.imdb.com%2Ftitle%2Ftt0046816%2F:869,895,link%2Fmanual,http%3A%2F%2Fen.wikipedia.org%2Fwiki%2FSS_Columbia_Eagle_incident:" xml=""&gt;&lt;/line&gt;&lt;line&gt;&lt;/line&gt;The example that comes to mind is mutiny.  Mutiny was a huge worry for captains and navies for hundreds of years, right up until the invention of the radio.  See mutiny for interesting reading, and watch Mutiny on the Bounty and The Caine Mutiny (starring Humprey Bogart!) for good fictional accounts of the psychology and institutions that shape mutiny.&lt;line&gt;&lt;/line&gt;&lt;line&gt;&lt;/line&gt;Radio was a game changer.  Since the invention and adoption of radio, mutiny has been almost unheard of, especially among large naval powers.  (The Vietnam-era SS_Columbia_Eagle_incident is the exception that proves the rule.)  Shipboard radios tighten the link between the captain's authority and the worldwide chain of command.  It makes escape extremely unlikely for mutineers.  However, desertion, disobedience, cowardice, incompetence, corruption, theft, etc. are still problems for ships and navies.&lt;line&gt;&lt;/line&gt;&lt;line&gt;&lt;/line&gt;As I see it, mutiny was already a marginal activity -- very risky for the mutineers, with a low probability of success -- and radio pushed the marginal success rate much lower.  But mutiny is just one act.  From the perspective of naval efficiency, radio changed the balance of power, but didn't fix the underlying social problems of enforcing discipline and coordinating action.  Radio caused changes in the social structure of ships, but they didn't fundamentally alter the problems that navies face.&lt;line&gt;&lt;/line&gt;&lt;line&gt;&lt;/line&gt;Information technology is doing a similar thing today.  It lowers the cost of storing, transmitting, aggregating, and manipulating data.  Where lower transaction costs can solve social problems, the progressives (and I'm one of them, cautiously) are right to be optimistic.&lt;line&gt;&lt;/line&gt;&lt;line&gt;&lt;/line&gt;But many kinds of information have been cheap for a long time.  Socially, we haven't solved the problems of greed, lying, bureaucratic turf wars, bullying, corruption, graft, incompetence...  When *those* are the real causes, changes in information technology can't be expected to help nearly as much.  We need to invent better institutions first."&gt;&lt;/span&gt;&lt;span&gt;&lt;br /&gt;&lt;/span&gt;&lt;/p&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7290358143046432985?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7290358143046432985/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7290358143046432985' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7290358143046432985'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7290358143046432985'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/07/mutiny-information-technology-and.html' title='Mutiny, information technology, and technocracy'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7142754130892830337</id><published>2010-07-19T06:16:00.000-07:00</published><updated>2010-07-19T06:47:46.031-07:00</updated><title type='text'>Top secret America: Simultaneously sinister and incompetent</title><content type='html'>The Washington Post just put out an &lt;a href="http://projects.washingtonpost.com/top-secret-america/articles/"&gt;investigative report on "Top Secret America"&lt;/a&gt; -- federal departments and agencies with top secret security clearance.&lt;br /&gt;&lt;br /&gt;Here's &lt;a href="http://projects.washingtonpost.com/top-secret-america/"&gt;the trailer&lt;/a&gt;.  Yes, a 1:47 Flash video trailer for a report put out by a newspaper.  It has a &lt;a href="http://www.imdb.com/title/tt0440963/"&gt;Bourne Ultimatum&lt;/a&gt; feel to it, right down to the percussion-heavy, digital soundtrack and gritty urban imagery.  Anybody want to talk about the blurring lines between news and entertainment?&lt;br /&gt;&lt;br /&gt;WPost claims that a fourth, secret branch of government has opened up since 9-11.  They make this new branch out to be simultaneously sinister and incompetent (like &lt;a href="http://en.wikipedia.org/wiki/Vogon"&gt;Vogons&lt;/a&gt;).  Watch the trailer and see what they're hinting at.&lt;br /&gt;&lt;br /&gt;Some counterclaims: government is still learning how to make use of recent (i.e. within the last 30 years) advances in information technology.  (Take &lt;a href="http://www.economist.com/node/16590962?story_id=16590962&amp;amp;fsrc=scn/tw/te/rss/pe"&gt;this article in the Economist on the alleged obsolescence of the census&lt;/a&gt;, and &lt;a href="data.gov"&gt;data.gov&lt;/a&gt; as examples). &lt;br /&gt;&lt;ul&gt;&lt;li&gt;Since the technology is new, it makes sense that new agencies would develop to handle the load.&lt;/li&gt;&lt;li&gt;Information technology is useful, and the government has a lot of data  to process, so it makes sense that there would be a lot of people  involved.&lt;/li&gt;&lt;li&gt;Since business tends to be more nimble about technology adoption, it makes sense that a lot of the work would be outsourced to private firms, at least initially.&lt;/li&gt;&lt;li&gt;Since the process is unfamiliar, it makes sense that there would be some inefficiency.&lt;/li&gt;&lt;li&gt;Since we want the system to be robust to failure, it makes sense that there would be some redundancy.&lt;/li&gt;&lt;/ul&gt;I agree with the Post that "we want to get this right," but let's back off the hype and the alarmism a little bit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7142754130892830337?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7142754130892830337/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7142754130892830337' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7142754130892830337'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7142754130892830337'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/07/top-secret-america-simultaneously.html' title='Top secret America: Simultaneously sinister and incompetent'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-7852034759250570486</id><published>2010-07-13T07:49:00.000-07:00</published><updated>2010-07-13T09:37:49.991-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='scientific method'/><category scheme='http://www.blogger.com/atom/ns#' term='education'/><category scheme='http://www.blogger.com/atom/ns#' term='policy'/><title type='text'>a jaundiced formula for spinning educational research into something that sounds interesting</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://3.bp.blogspot.com/_zUbnTFdqWyg/TDyULKk8QTI/AAAAAAAAAEg/vcZ3vyEkWBY/s1600/debate.png"&gt;&lt;img style="display: block; margin: 0px auto 10px; text-align: center; cursor: pointer; width: 200px; height: 86px;" src="http://3.bp.blogspot.com/_zUbnTFdqWyg/TDyULKk8QTI/AAAAAAAAAEg/vcZ3vyEkWBY/s200/debate.png" alt="" id="BLOGGER_PHOTO_ID_5493428565023932722" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Here's a jaundiced formula for spinning educational research into something that sounds interesting.  Most researchers and reporters seem to follow this formula pretty closely*.&lt;br /&gt;&lt;br /&gt;1. Sample a bunch of kids in category A, and a bunch of kids in category B.&lt;br /&gt;&lt;br /&gt;Ex: A kids have computers in the home; B kids don't&lt;br /&gt;Ex: A kids are white; B kids are nonwhite&lt;br /&gt;Ex: A kids go to charter schools; B kids don't&lt;br /&gt;&lt;br /&gt;2. For each group, measure some dependent variable, Y, that we care about.&lt;br /&gt;&lt;br /&gt;Ex: grades, SAT scores, dropout rates, college attendance, college completion, long term impacts on wages, quality of life, etc.&lt;br /&gt;&lt;br /&gt;3. Compare Y means for group A and group B.&lt;br /&gt;3a. If the means differ and the A versus B debate is contested, take a side with the group A.&lt;br /&gt;3b. If the means &lt;span style="font-weight: bold;"&gt;don't&lt;/span&gt; differ and many people support one option, take the opposite stance.  (Ex: "Charter schools don't outperform non-charter schools")&lt;br /&gt;3c.  If neither of those options works, continue on to step 4.&lt;br /&gt;&lt;br /&gt;4. Introduce a demographic variable X, (probably gender or SES) as a control or interaction term in your regression analysis.  It will probably be significant.  Claim that A or B is "widening the racial achievement gap," or "narrowing the gender gap," etc. as appropriate.&lt;br /&gt;&lt;br /&gt;Papers following this formula will frequently be publishable and newsworthy.  (You can verify this, case by case, with the studies cited in &lt;a href="http://nyti.ms/aOiiLl"&gt;that NYTimes article&lt;/a&gt;.)  They will rarely make a substantive contribution to the science and policy of education.  Awful.  Awful.  Awful.&lt;br /&gt;&lt;br /&gt;Why?  Because this approach is superficial.  The scientific method is supposed to help us understand root causes, with an eye to making people better off.  But that depends on starting with categorizations that are meaningfully tied to causal pathways.  The distinctions we make have to matter.&lt;br /&gt;&lt;br /&gt;In a great many educational studies,  the categories used to split kids are  cheap and easy to observe.  Therefore, they make for easy studies and quick  stereotypes.  They feed political conflict about how to divide pies.  But they don't matter in any deep, structural way.&lt;br /&gt;&lt;br /&gt;Example: Does having a computer in the house makes a kid smarter or dumber?  It depends on how the computer is used.  If the computer is in the attic, wrapped in plastic, the effect of computer ownership on grades, SAT scores, or whatever will be pretty close to zero.  If the computer is only used to play games, the effect probably won't be positive; and if games crowd out homework, the effect will be negative.  No real surprises there.  And that's about as far as these studies usually go.  "Computers not a magic bullet.  Next!"&lt;br /&gt;&lt;br /&gt;This is more or less the state of knowledge with respect to school funding, busing, charter schools, etc.  We know that one blunt policy intervention after another does not work miracles.  We haven't really gotten under the hood of what makes the complex social system of education work.  It's like coming up with a theory of how airplanes fly based on the colors they're painted.   ("White airplanes travel slower than airplanes painted camouflage colors, but tail markings have little effect on air speed.)  You may be able to explain more than nothing, but you certainly haven't grasped the forces that make the system work.&lt;br /&gt;&lt;br /&gt;To say the same thing in different words, scientists are supposed to ask "why?"  Studies that say "kids in group A are more Y than kids in group B" doesn't answer the why question.   They are descriptive, not causal.  Without a deeper causal understanding of why schools work or don't work, I don't think we're ever going to stop chasing fads and really make things better.&lt;br /&gt;&lt;br /&gt;&lt;br /&gt;*This is an epistemological critique of just about every quantitative  article on education.  In general, I'm supportive of the increasing  influence  of economic/econometric analysis in education policy, but  this is one  area where we quants may be making things worse, not  better.  Hat tip to Matt for sending &lt;a href="http://nyti.ms/aOiiLl"&gt;the  article&lt;/a&gt; that reminded me how much the failings of this literature  frustrate me.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-7852034759250570486?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/7852034759250570486/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=7852034759250570486' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7852034759250570486'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/7852034759250570486'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/07/jaundiced-formula-for-spinning.html' title='a jaundiced formula for spinning educational research into something that sounds interesting'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><media:thumbnail xmlns:media='http://search.yahoo.com/mrss/' url='http://3.bp.blogspot.com/_zUbnTFdqWyg/TDyULKk8QTI/AAAAAAAAAEg/vcZ3vyEkWBY/s72-c/debate.png' height='72' width='72'/><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-1017437579893019796</id><published>2010-07-02T05:56:00.000-07:00</published><updated>2010-07-02T06:00:12.483-07:00</updated><title type='text'>Google code -- pros and cons?</title><content type='html'>Anybody have thoughts/experience on the pros and cons of &lt;a href="http://code.google.com/p/support/wiki/GettingStarted"&gt;Google code&lt;/a&gt;?&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-1017437579893019796?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/1017437579893019796/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=1017437579893019796' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1017437579893019796'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/1017437579893019796'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/07/google-code-pros-and-cons.html' title='Google code -- pros and cons?'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-6926602341388155079</id><published>2010-07-01T08:04:00.000-07:00</published><updated>2010-07-02T05:48:07.090-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='software'/><category scheme='http://www.blogger.com/atom/ns#' term='content analysis'/><title type='text'>Reliability</title><content type='html'>I'm sold on the argument that &lt;a href="http://en.wikipedia.org/wiki/Krippendorff%27s_Alpha"&gt;Krippendorff's alpha&lt;/a&gt; is the way to go for most reliability calculation.  Unfortunately, it's been hard to find code to calculate it.  Here are some of the best links I've run across.&lt;br /&gt;&lt;br /&gt;The NLTK package for python has &lt;a href="http://nltk.googlecode.com/svn/trunk/doc/api/nltk.metrics.agreement-pysrc.html#AnnotationTask.alpha"&gt;code for computing alpha&lt;/a&gt;.  It looks like this does basic nominal calculation; I don't know if/how it copes with missing data.&lt;br /&gt;&lt;br /&gt;The &lt;a href="http://cran.r-project.org/web/packages/concord/concord.pdf"&gt;concord package in R&lt;/a&gt; does nominal, ordinal, interval, and ratio versions of alpha.  It looks like this might not be maintained anymore, but it works.&lt;br /&gt;&lt;br /&gt;Here's a nice &lt;a href="http://cswww.essex.ac.uk/Research/nle/arrau/alpha.html"&gt;page of resources&lt;/a&gt; by computational linguists Artstein and Poesio.  Unfortunately, what they show is mainly that there aren't very good resources out there.  Their &lt;a href="http://portal.acm.org/citation.cfm?id=1479202.1479206"&gt;review article&lt;/a&gt; is very good -- the best treatment of reliability I've seen in the NLP community so far.&lt;br /&gt;&lt;br /&gt;Deen Freelon has some links to &lt;a href="http://dfreelon.org/external-resources/"&gt;reliability calculators and resources&lt;/a&gt;, including two nice online reliability calculators: &lt;a href="http://dfreelon.org/utils/recalfront/recal-oir/"&gt;Recal-OIR&lt;/a&gt;, and &lt;a href="http://dfreelon.org/utils/recalfront/recal3/"&gt;Recal-3&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;Krippendorff's oddly-formatted, information-sparse &lt;a href="http://www.asc.upenn.edu/usr/krippendorff/dogs.html"&gt;web page&lt;/a&gt;.  He invents the best measure for calculating reliability, then keeps a lid on it.  Less animated bowtie dogs, and more software, please!&lt;br /&gt;&lt;br /&gt;Matthew Lombard has a nice &lt;a href="http://astro.temple.edu/%7Elombard/reliability/"&gt;page&lt;/a&gt; on reliability statistics and the importance of reliability in content analysis in general.&lt;br /&gt;&lt;br /&gt;Beg:  Does anybody know how to compute K's alpha for a single coder?&lt;br /&gt;&lt;br /&gt;I have data coded by several coders and need to know who's doing a good job, reliability-wise, and who's not.  At a pinch, I'd be willing to use a different reliability statistic, or even an IRT model.  It just needs to be statistically defensible and reasonably easy to code.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-6926602341388155079?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/6926602341388155079/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=6926602341388155079' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6926602341388155079'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/6926602341388155079'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/07/reliability.html' title='Reliability'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-5396409699013588506</id><published>2010-06-29T13:24:00.000-07:00</published><updated>2010-06-29T13:55:48.178-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='statistics'/><category scheme='http://www.blogger.com/atom/ns#' term='surveys'/><category scheme='http://www.blogger.com/atom/ns#' term='polling'/><category scheme='http://www.blogger.com/atom/ns#' term='fraud'/><title type='text'>How to catch a dirty pollster</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://www.bitterwallet.com/wp-content/uploads/2009/04/burglar.jpg"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 174px; height: 210px;" src="http://www.bitterwallet.com/wp-content/uploads/2009/04/burglar.jpg" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;Just a couple hours ago, Markos Moulitsas (you know, the Daily Kos) &lt;a href="http://www.dailykos.com/storyonly/2010/6/29/880185/-More-on-Research-2000"&gt;announced&lt;/a&gt; that he's suing his former polling group, Research 2000.  Evidently, they faked a lot of their data in the last 19 months, and didn't even have the decency to fake it well.&lt;br /&gt;&lt;br /&gt;The statistical case against R2000 is made &lt;a href="http://www.dailykos.com/story/2010/6/29/880179/-Research-2000:-Problems-in-plain-sight"&gt;here&lt;/a&gt;.  Nate Silver chimes in &lt;a href="http://www.fivethirtyeight.com/2010/06/breaking-daily-kos-to-sue-research-2000.html"&gt;here&lt;/a&gt;.&lt;br /&gt;&lt;br /&gt;The punchline:&lt;br /&gt;&lt;blockquote&gt;"We do not know exactly how the weekly R2K results were created, but we are confident they could not accurately describe random polls."&lt;br /&gt;&lt;/blockquote&gt;What's interesting to me about the situation is how clear-cut the evidence is. Mark Grebner, Michael Weissman, and Jonathan Weissman (GWW), conduct three tests to show that the numbers are cooked.  Each statistical test gives results that are devastatingly improbable.&lt;br /&gt;&lt;br /&gt;&lt;ul&gt;&lt;li&gt;Probability of odd-even pairs between men and women: 776 heads on 778 tosses of a fair coin, or about 1 in 10&lt;sup&gt;231&lt;/sup&gt;.&lt;/li&gt;&lt;li&gt;Probability of consistency of results in small samples: ~1 in 10&lt;sup&gt;16&lt;/sup&gt;.&lt;/li&gt;&lt;li&gt;Probability of missing zeroes in a time trend: ~1 in 1016.&lt;/li&gt;&lt;/ul&gt;&lt;br /&gt;Statistically, there's almost no wiggle room here.  Until the R2000 folks come forward with some really compelling counterevidence, I'm convinced they're frauds.&lt;br /&gt;&lt;br /&gt;An interesting feature of the last two patterns is that they hint at overeagerness to give results.  Patterns in the data are *too* clear, and the detailed crosstabs supplied by R2000 made it easier to catch them.  If they had been willing to dump some random noise into the equations -- blurring the results -- it would have been much harder to build a conclusive case against them.&lt;br /&gt;&lt;br /&gt;This jives with my experience doing survey research for clients: nobody wants to pay for null results.  Even when your data collection is legitimate, there's always pressure to twist and squeeze the data to wring out "findings" that aren't really there.  Three cheers for transparency and the scientific method.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-5396409699013588506?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/5396409699013588506/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=5396409699013588506' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5396409699013588506'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/5396409699013588506'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/06/how-to-catch-dirty-pollster.html' title='How to catch a dirty pollster'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-7374448075509148245.post-3918248483802484719</id><published>2010-06-28T19:51:00.000-07:00</published><updated>2010-06-29T13:20:39.422-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dissertation'/><category scheme='http://www.blogger.com/atom/ns#' term='blogs'/><category scheme='http://www.blogger.com/atom/ns#' term='politics'/><title type='text'>Reasons to study political blogging</title><content type='html'>&lt;a onblur="try {parent.deselectBloggerImageGracefully();} catch(e) {}" href="http://wac.osu.edu/tutorials/images/calvin-writing.gif"&gt;&lt;img style="margin: 0pt 0pt 10px 10px; float: right; cursor: pointer; width: 300px; height: 375px;" src="http://wac.osu.edu/tutorials/images/calvin-writing.gif" alt="" border="0" /&gt;&lt;/a&gt;&lt;br /&gt;I'm working like crazy on my dissertation prospectus.  Data work, lit reviews, etc.  To escape from early research purgatory, I plan to blog parts of the prospectus as I write them.&lt;br /&gt;&lt;br /&gt;I'll kickoff today with introductory definitions and motivation. Feedback is much appreciated.  Beware of dry, academic writing!&lt;br /&gt;&lt;br /&gt;&lt;b&gt;What is a blog?&lt;/b&gt;&lt;br /&gt;Paraphrasing &lt;a href="http://en.wikipedia.org/wiki/Blog"&gt;wikipedia&lt;/a&gt;, a blog is a website containing regular entries ("posts") of commentary, links, or other material such as photos or video.  On most blogs, posts are displayed in reverse-chronological order -- the most recent post appears first.  Although most blogs are maintained by individuals, some are run by small groups, and blogs speaking on behalf of corporations, churches, newspapers, political campaigns, etc. are increasingly common. Many blogs focus on a specific topic, ranging from broad to narrow: entertainment, cooking, astronomy,the Detroit Tigers, to cold fusion.  For my dissertation, I plan to focus on political blogs.&lt;br /&gt;&lt;br /&gt;&lt;b&gt;Why study political blogs?&lt;/b&gt;&lt;br /&gt;Here are five reasons to study political blogs.&lt;br /&gt;&lt;ol&gt;&lt;li&gt;Blogs are &lt;span style="font-weight: bold;"&gt;public facing&lt;/span&gt;.  Lots of people read them, including politicians and journalists.  The extent to which blogs are replacing mainstream media is an open question, but it's certain that blogs have come to play an important role in public discourse, with real impact on politics.&lt;/li&gt;&lt;li&gt;Bloggers span a &lt;span style="font-weight: bold;"&gt;wide variety of  opinions&lt;/span&gt;.  The blogosphere embraces everyone from conservative wingnuts to liberal moonbats to political moderates.  Some political bloggers are politically omnivorous, writing about anything political.  Others focus on specific issues and topics: foreign policy, Congress,  feminism, etc.&lt;/li&gt;&lt;li&gt;Bloggers include both &lt;span style="font-weight: bold;"&gt;experts and amateurs&lt;/span&gt;.  Dividing the same pie in a different direction, many A-list bloggers (e.g. Andrew Sullivan, Ariana Huffington, Glenn Reynolds, Michelle Malkin) clearly qualify as political elites: they are experts, immersed in politics, well-informed and well-connected.  Other political bloggers are more obscure, casual -- closer to the average Joes who make up the "mass public."&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Blogs are &lt;span style="font-weight: bold;"&gt;updated frequently&lt;/span&gt;.  This has two nice consequences.  First, frequent posts allow us to replay bloggers' reactions to events as they unfold.  Second, frequent posts mean we have a lot of posts to work with.&lt;br /&gt;&lt;/li&gt;&lt;li&gt;Blogs are &lt;span style="font-weight: bold;"&gt;archived publicly&lt;/span&gt;. Unlike most forms of political speech and action, blogging leaves a permanent data trail.&lt;br /&gt;&lt;/li&gt;&lt;/ol&gt;The combination of these attributes creates a kind of perfect storm for social science.  Understanding the flow of opinions and ideas has always been difficult for social scientists, because most of our data have come from surveys.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/7374448075509148245-3918248483802484719?l=lowlywonk.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://lowlywonk.blogspot.com/feeds/3918248483802484719/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=7374448075509148245&amp;postID=3918248483802484719' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3918248483802484719'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/7374448075509148245/posts/default/3918248483802484719'/><link rel='alternate' type='text/html' href='http://lowlywonk.blogspot.com/2010/06/reasons-to-study-political-blogging.html' title='Reasons to study political blogging'/><author><name>Abe</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry></feed>
