Due to the increasing importance of the web and social web, user generated content will become another important information source for journalists. Also events should be documented and their impact should be analyzed. It is important that user generated content stays accessible even if the original source disappears.
In addition journalists need support with verification of the information. Therefore context information such as the user, the event, related links or entities mentioned on the Web pages need to be preserved as well.
After thorough cleaning and enrichment of data the final Web archive will allow an effective use of user generated content for decades to come.