Answer by MrAnonymous for Aggregating from various sources
Duplication is a nasty issue. What I eventually ended up doing: 1. Strip out all HTML tags except for links (I started out using regex and got burned, so I eventually moved to custom parsing to remove...
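As a rough illustration of the "strip every tag except links" step using a real parser instead of regex, here is a minimal sketch built on Python's standard `html.parser`. The answer does not show its own parser, so the names `LinkPreservingStripper` and `strip_tags_keep_links` are hypothetical, and keeping only the `href` attribute is an assumption.

```python
from html import escape
from html.parser import HTMLParser


class LinkPreservingStripper(HTMLParser):
    """Hypothetical helper: drop every tag except <a>, keep the text."""

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; before
        # they reach handle_data.
        super().__init__(convert_charrefs=True)
        self.parts = []

    def handle_starttag(self, tag, attrs):
        # Only anchors survive; every other opening tag is ignored.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Assumption: only href is worth keeping on a link.
                self.parts.append(f'<a href="{escape(href, quote=True)}">')
            else:
                self.parts.append("<a>")

    def handle_endtag(self, tag):
        if tag == "a":
            self.parts.append("</a>")

    def handle_data(self, data):
        # Re-escape text so the output stays valid HTML.
        self.parts.append(escape(data, quote=False))


def strip_tags_keep_links(html_text):
    parser = LinkPreservingStripper()
    parser.feed(html_text)
    parser.close()  # flush any text still buffered by the parser
    return "".join(parser.parts)


if __name__ == "__main__":
    sample = '<p>Read <b>more</b> at <a href="http://example.com/x" class="c">this page</a>.</p>'
    print(strip_tags_keep_links(sample))
```

Note that this sketch passes the contents of `<script>` and `<style>` elements through as plain text; a real aggregator would probably want to drop those as well.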
Answer by Mauricio Scheffer for Aggregating from various sources
You might want to try the YQL module to scrape a webpage that doesn't provide RSS. Here's a sample of a YQL statement to scrape HTML. About duplicates, take a look at this pipe. Customized...
Aggregating from various sources
This could be a project well beyond my skills right now, but I've got around one full month to spend on it, so I think I can do it. What I want to build is this: Gather news about a specific subject from...