Here’s how I’m archiving Web pages in our DC-X Digital Asset Management software:
A tiny Firefox extension takes a screenshot of the currently displayed page and posts it, along with the HTML source code, to the DAM in a new browser tab.
The DC-X DAM asks me to log in (if necessary), creates an import job and waits for its completion. Then I’m redirected to the details page of the “archived Web page” document that was just created.
I can do a fulltext search on my Web page archive (of course)…