How text analysis and natural language processing is being used in journalism, open government, and transparency generally. A survey of existing public projects, and the algorithms behind them. Then a demonstration of the Overview Project (overviewproject.org), a tool for automatically visualizing the topics in a large document set, designed for investigative journalists. Then, a discussion of where data-driven transparency is going now -- or, what should we work on next?
A talk given by Jonathan Stray at Sunlight Labs at the Sunlight Foundation, Washington DC.