|Previous | Next|
WireHose Developers Guide
Next you'll tell WireHose which attributes should be used to distinguish one item from another during importing. This is important since new items will typically be added to the top of the feed, pushing older items off the bottom. If Hello World's crawler didn't have a method for determining if it had already seen an item, the database would be cluttered with duplicates.
In this case, duplicate items have either the same name or link.
Note: The RSS 2.0 specification allows feed providers to provide an optional attribute for each item, called "guid". This globally unique identifier is specifically intended help aggregators determine if an item has been seen previously, but not all feeds use it. For simplicity, we are ignoring this attribute in this tutorial.
Copyright ©2000-2003 Gary Teter. All rights reserved. WireHose is a trademark of Gary Teter.