Could MarkLogic be used for blog search and aggregation?
Yes. Most blogs are published as XML feeds and MarkLogic is very good at XML content integration and XML element-level search. One simple granularity distinction you find with blogs is post vs. page. The page at my blog’s URL (marklogic.blogspot.com) in fact has many posts on it. Typically when you search blogs, you are searching for posts — not pages — and you want to sort by recency first and relevancy second. It’s a good, if simple, example of why just searching pages based on relevancy (with a three-week average crawl recycle) doesn’t meet all search application requirements. Hence the rise of blog-specific search engines like Technorati or Google Blog Search.