Grooper Help - Version 25.0
25.0.0017 2,127

Hyperlink Selector

Embedded Object Grooper.Messaging

Defines a link to be followed on an HTML page when using HTTP Import.

Remarks

A HyperlinkSelector controls which hyperlinks are followed during an HTTP import crawl.

How it works

  • Selector: Specify a CSS selector to match the hyperlinks you want to follow on each page.
  • ExclusionSelector: Optionally exclude certain links, even if they match the main selector.
  • Recursive: Enable to follow links recursively, crawling through multiple levels of pages.
  • LinkSelectors: Add child selectors to define different crawling rules for different levels or sections.
  • IncludedUrlPattern / ExcludedUrlPattern: Use regular expressions to include or exclude URLs based on patterns.
  • LeafUrlPattern: Mark certain URLs as "leaves" to stop further crawling from those pages.
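The interaction between the URL patterns, leaf pages, and the Recursive setting can be illustrated with a small Python sketch. This is not Grooper code: `SITE`, `should_follow`, and `crawl` are hypothetical names, the site is an in-memory dictionary standing in for pages whose links have already been matched by Selector, and the precedence rules (exclusion wins over inclusion, leaf pages are kept but not crawled further, patterns are tested with a substring-style regex search) are assumptions made for illustration.

```python
import re
from collections import deque

# Hypothetical in-memory "site": each URL maps to the links that the
# Selector would have matched on that page (no network access needed).
SITE = {
    "https://example.com/": [
        "https://example.com/products/1",
        "https://example.com/products/2",
        "https://example.com/login",
    ],
    "https://example.com/products/1": ["https://example.com/products/1/spec.pdf"],
    "https://example.com/products/2": ["https://example.com/products/1"],
    "https://example.com/products/1/spec.pdf": ["https://example.com/"],
    "https://example.com/login": [],
}


def should_follow(url, included=None, excluded=None):
    """Assumed rule: match `included` (if set) and do not match `excluded`."""
    if included and not re.search(included, url):
        return False
    if excluded and re.search(excluded, url):
        return False
    return True


def crawl(start, included=None, excluded=None, leaf=None, recursive=True):
    """Breadth-first crawl; returns the followed URLs in visit order."""
    visited = []
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth > 0:
            visited.append(url)
        # Assumed leaf behavior: the page is kept, but its links are not followed.
        if leaf and depth > 0 and re.search(leaf, url):
            continue
        # Without recursion, only links found on the start page are followed.
        if not recursive and depth >= 1:
            continue
        for link in SITE.get(url, []):
            if link not in seen and should_follow(link, included, excluded):
                seen.add(link)
                queue.append((link, depth + 1))
    return visited
```

With `included=r"/products/"` and `leaf=r"\.pdf$"`, the sketch follows both product pages and the PDF linked from the first one, then stops at the PDF. With `recursive=False`, only the links on the start page are followed.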

Example

To crawl all product pages but skip links in navigation bars:

  • Set Selector to a.product-link
  • Set ExclusionSelector to .navbar a
  • Enable Recursive to follow links on each product page
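To see which links this combination would select, the following Python sketch applies the same two rules to a sample page using only the standard library. It is an illustration, not Grooper's implementation: `ProductLinkCollector`, `select_links`, and the `PAGE` markup are hypothetical, and the matcher is hard-coded for the selectors `a.product-link` and `.navbar a` rather than being a general CSS engine.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they are not tracked on the stack.
VOID_TAGS = {"br", "img", "hr", "input", "meta", "link"}


class ProductLinkCollector(HTMLParser):
    """Collects hrefs of <a class="product-link"> that are not inside a .navbar element."""

    def __init__(self):
        super().__init__()
        self.stack = []  # one bool per open element: does it carry class "navbar"?
        self.links = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        # ".navbar a" means: an <a> with some ancestor that has class "navbar".
        inside_navbar = any(self.stack)
        if tag == "a" and "product-link" in classes and not inside_navbar:
            if attrs.get("href"):
                self.links.append(attrs["href"])
        if tag not in VOID_TAGS:
            self.stack.append("navbar" in classes)

    def handle_endtag(self, tag):
        # Assumes reasonably well-formed markup (every open tag is closed).
        if tag not in VOID_TAGS and self.stack:
            self.stack.pop()


# Hypothetical sample page used only for this illustration.
PAGE = """
<nav class="navbar">
  <a class="product-link" href="/products/featured">Featured</a>
  <a href="/about">About</a>
</nav>
<main>
  <a class="product-link" href="/products/1">Widget</a>
  <a class="product-link highlighted" href="/products/2">Gadget</a>
  <a href="/contact">Contact</a>
</main>
"""


def select_links(html):
    parser = ProductLinkCollector()
    parser.feed(html)
    return parser.links
```

Calling `select_links(PAGE)` yields the two product links in the main content and skips the "Featured" link, which matches `a.product-link` but sits inside the navigation bar.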

Notes

  • Combine multiple selectors and patterns for advanced crawling strategies.
  • Use child LinkSelectors for multi-level or section-specific crawling logic.

Properties

Name    Type    Description
General
URL Filtering

See Also

Used By

Notification