!--a11y-->
Resource Filters 
You use resource filters to influence the scope or results of crawling processes.
You specify resource filters as result or scope filters in the configuration of crawler parameter sets (see Crawlers and Crawler Parameters).
Resource filters that are used as scope filters can be defined using the following parameters.
Parameters of a scope filter
Parameter |
Required |
Description |
Name |
Yes |
Name of the resource filter. |
Case Sensitive |
No |
Specifies whether the system differentiates between uppercase and lowercase. This parameter affects documents, folders, and HTML pages. |
Access Path Mode |
Yes |
Specifies whether the specifications in the parameter Access Path Patterns are included in the results or filtered out. exclude: The specified access path patterns are not included in the results. include: Only the specified access path patterns are included in the results. This parameter affects documents, folders, and HTML pages. |
Access Path Patterns |
No |
Comma-separated list of access path patterns. You can use placeholders for this specification (see Using Placeholders). For example, /documents/** |
URL (Content Link) Mode |
Yes |
Specifies whether the specifications in the parameter URL Regular Expression are included in the results or filtered out. This parameter is valid for the URL stored in the property Content Link for a document. The property Content Link describes the URL from which the crawler calls up the document. exclude: The specified URL expressions are not included in the results. include: Only the specified URL expressions are included in the results. These specifications are applied to documents and HTML pages. |
URL Regular Expression |
No |
Regular expression to be compared with the URL stored in the property Content Link for a document. The regular expression must contain the complete URL. |
Resource filters that are used as result filters can be defined using the following parameters:
Parameters of a result filter
Parameter |
Required |
Description |
Name |
Yes |
Name of the resource filter. |
Include Documents/Web-Pages |
No |
Specifies whether documents or HTML pages are crawled. |
Include Folders |
No |
Specifies whether folders are crawled. |
Include Links |
No |
Specifies whether links in a hierarchical structure are crawled. Note that this parameter cannot be used for crawling Web repositories. |
Case Sensitive |
No |
Specifies whether the system differentiates between uppercase and lowercase. This parameter affects documents, folders, and HTML pages. |
Item ID Mode |
Yes |
Specifies whether the specifications in the parameter Item ID Patterns are included in the results or filtered out. exclude: The specified item ID patterns are not included in the results. include: Only the specified item ID patterns are included in the results. This parameter is only applied to documents and HTML pages. |
Item ID Patterns |
No |
Comma-separated list of item ID patterns. You can use placeholders for this specification (see Using Placeholders). For example: *.zip, *.xml |
Mime Type Mode |
Yes |
Specifies whether the specifications in the parameter Mime Type Patterns are included in the results or filtered out. exclude: The specified MIME type patterns are not included in the results. include: Only the specified MIME type patterns are included in the results. This parameter is only applied to documents and HTML pages. |
Mime Type Patterns |
No |
Comma-separated list of MIME type patterns. You can use placeholders for this specification (see Using Placeholders). For example, text/* |
Minimum Content Size |
No |
Minimum size of documents to be crawled. Note that it takes a certain amount of time to determine the file size of each document. This parameter is only applied to documents and HTML pages. |
Maximum Content Size |
No |
Maximum size of documents to be crawled. Note that it takes a certain amount of time to determine the file size of each document. Enter 0 for unlimited size. This parameter is only applied to documents and HTML pages. |
Maximum Age of Last Modification |
No |
Time in days within which the last change to a document must have taken place in order for that document to be crawled. Enter 0 for unlimited size. This parameter is only applied to documents and HTML pages. |
To create a resource filter, choose Content Management ® Global Services ® Resource Filters. Then select the resource filter in a set of crawler parameters.