A B C D E F G H I J L M N O P Q R S T U V W X

I

ignoresPolicy() - Method in class xsmeral.semnet.crawler.HTMLCrawler
 
initialize() - Method in class xsmeral.semnet.sink.NativeStoreFactory
 
initialize() - Method in class xsmeral.semnet.sink.RdbmsStoreFactory
 
initialize(Properties) - Method in class xsmeral.semnet.sink.RepositoryFactory
Sets the properties and calls RepositoryFactory.initialize()
initialize() - Method in class xsmeral.semnet.sink.RepositoryFactory
Instantiates and initializes the Repository.
initPostContext() - Method in class xsmeral.semnet.crawler.HTMLCrawler
 
initPostContext() - Method in class xsmeral.semnet.mapper.StatementMapper
 
initPostContext() - Method in class xsmeral.semnet.scraper.AbstractScraper
Initializes the stats.
initPostContext() - Method in class xsmeral.semnet.scraper.ScraperWrapper
Instantiates scrapers.
initPostContext() - Method in class xsmeral.semnet.sink.SesameWriter
Reads the supplied configuration (Properties) file, sets working directory, initializes repository factory.
initPostContext() - Method in class xsmeral.semnet.util.StdErrWriter
 
initPostContext() - Method in class xsmeral.semnet.util.StdOutWriter
 
initWithContext() - Method in class xsmeral.semnet.crawler.HTMLCrawler
Deserializes crawler configuration from XML and initializes crawler state
isAutoCommit() - Method in class xsmeral.semnet.crawler.RDBLayer
 
isEntity(int, Pattern) - Method in class xsmeral.semnet.crawler.HostManager
Indicates, whether the specified pattern represents an entity in the given host.
isEntity() - Method in class xsmeral.semnet.crawler.model.URLEntry
Indicates whether this URL represents an entity or a source URL.
isFakeReferrer() - Method in class xsmeral.semnet.crawler.HTMLCrawler
 
isFakeReferrer() - Method in class xsmeral.semnet.crawler.model.CrawlerConfiguration
Indication, whether the HTTP Referer header should be set to the base URL of the host
isFollowRedirects() - Static method in class xsmeral.semnet.crawler.util.ConnectionManager
Corresponds to HttpURLConnection.getFollowRedirects()
isPolicyIgnored() - Method in class xsmeral.semnet.crawler.model.CrawlerConfiguration
Indication of adherence to the Robots Exclusion Protocol
isSource(int, Pattern) - Method in class xsmeral.semnet.crawler.HostManager
Indicates, whether the specified pattern represents a source URL in the given host.
isSourceFirst() - Method in class xsmeral.semnet.crawler.model.HostDescriptor
Indicates whether source URLs should be crawled first
isWorking() - Method in class xsmeral.semnet.crawler.model.URLEntry
Indicates whether this URL is working (whether there were any errors during last visit by crawler).

A B C D E F G H I J L M N O P Q R S T U V W X