Resource Filters

Use

You use resource filters to influence the scope or results of crawling processes.

 

Integration

You specify resource filters as result or scope filters in the configuration of crawler parameter sets (see Crawlers and Crawler Parameters).

 

Features

Resource filters that are used as scope filters can be defined using the following parameters:

Parameters of a scope filter

| Parameter | Required | Description |
| --- | --- | --- |
| Name | Yes | Name of the resource filter. |
| Case Sensitive | No | Specifies whether the system differentiates between uppercase and lowercase. This parameter affects documents, folders, and HTML pages. |
| Access Path Mode | Yes | Specifies whether the specifications in the parameter Access Path Patterns are included in the results or filtered out. exclude: The specified access path patterns are not included in the results. include: Only the specified access path patterns are included in the results. This parameter affects documents, folders, and HTML pages. |
| Access Path Patterns | No | Comma-separated list of access path patterns. You can use placeholders for this specification (see Using Placeholders). For example: /documents/** |
| URL (Content Link) Mode | Yes | Specifies whether the specifications in the parameter URL Regular Expression are included in the results or filtered out. This parameter is valid for the URL stored in the property Content Link for a document. The property Content Link describes the URL from which the crawler calls up the document. exclude: The specified URL expressions are not included in the results. include: Only the specified URL expressions are included in the results. These specifications are applied to documents and HTML pages. |
| URL Regular Expression | No | Regular expression to be compared with the URL stored in the property Content Link for a document. The regular expression must contain the complete URL. |

 

Resource filters that are used as result filters can be defined using the following parameters:

Parameters of a result filter

| Parameter | Required | Description |
| --- | --- | --- |
| Name | Yes | Name of the resource filter. |
| Include Documents/Web-Pages | No | Specifies whether documents or HTML pages are crawled. |
| Include Folders | No | Specifies whether folders are crawled. |
| Include Links | No | Specifies whether links in a hierarchical structure are crawled. Note that this parameter cannot be used for crawling Web repositories. |
| Case Sensitive | No | Specifies whether the system differentiates between uppercase and lowercase. This parameter affects documents, folders, and HTML pages. |
| Item ID Mode | Yes | Specifies whether the specifications in the parameter Item ID Patterns are included in the results or filtered out. exclude: The specified item ID patterns are not included in the results. include: Only the specified item ID patterns are included in the results. This parameter is only applied to documents and HTML pages. |
| Item ID Patterns | No | Comma-separated list of item ID patterns. You can use placeholders for this specification (see Using Placeholders). For example: *.zip, *.xml |
| Mime Type Mode | Yes | Specifies whether the specifications in the parameter Mime Type Patterns are included in the results or filtered out. exclude: The specified MIME type patterns are not included in the results. include: Only the specified MIME type patterns are included in the results. This parameter is only applied to documents and HTML pages. |
| Mime Type Patterns | No | Comma-separated list of MIME type patterns. You can use placeholders for this specification (see Using Placeholders). For example: text/* |
| Minimum Content Size | No | Minimum size of documents to be crawled. Note that it takes a certain amount of time to determine the file size of each document. This parameter is only applied to documents and HTML pages. |
| Maximum Content Size | No | Maximum size of documents to be crawled. Note that it takes a certain amount of time to determine the file size of each document. Enter 0 for unlimited size. This parameter is only applied to documents and HTML pages. |
| Maximum Age of Last Modification | No | Time in days within which the last change to a document must have taken place in order for that document to be crawled. Enter 0 for no age limit. This parameter is only applied to documents and HTML pages. |
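
As an illustration of how the result filter parameters could combine, the following Python sketch checks a single document against item ID patterns, MIME type patterns, size limits, and the maximum age. It is not SAP code. The classes `Resource` and `ResultFilter` are hypothetical, shell-style wildcard matching via `fnmatchcase` stands in for the placeholder rules described in Using Placeholders, and the size unit (bytes) is an assumption. The Include Documents/Web-Pages, Include Folders, and Include Links switches are left out to keep the sketch short.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from fnmatch import fnmatchcase


@dataclass
class Resource:
    """Hypothetical description of one crawled document."""
    item_id: str             # for example "report.zip"
    mime_type: str           # for example "text/html"
    size: int                # content size; bytes are an assumption
    last_modified: datetime


@dataclass
class ResultFilter:
    """Hypothetical model of the result filter parameters."""
    name: str
    item_id_mode: str = "exclude"      # "include" or "exclude"
    item_id_patterns: str = ""         # comma-separated list
    mime_type_mode: str = "exclude"    # "include" or "exclude"
    mime_type_patterns: str = ""       # comma-separated list
    min_content_size: int = 0
    max_content_size: int = 0          # 0 means unlimited size
    max_age_days: int = 0              # 0 means no age limit
    case_sensitive: bool = False

    def _matches_any(self, value: str, patterns: str) -> bool:
        items = [p.strip() for p in patterns.split(",") if p.strip()]
        if not self.case_sensitive:
            value = value.lower()
            items = [p.lower() for p in items]
        return any(fnmatchcase(value, p) for p in items)

    def accepts(self, res: Resource, now: datetime) -> bool:
        if self.item_id_patterns:
            matched = self._matches_any(res.item_id, self.item_id_patterns)
            # include keeps only matches; exclude drops matches
            if matched != (self.item_id_mode == "include"):
                return False
        if self.mime_type_patterns:
            matched = self._matches_any(res.mime_type, self.mime_type_patterns)
            if matched != (self.mime_type_mode == "include"):
                return False
        if res.size < self.min_content_size:
            return False
        if self.max_content_size > 0 and res.size > self.max_content_size:
            return False
        if self.max_age_days > 0:
            if now - res.last_modified > timedelta(days=self.max_age_days):
                return False
        return True


recent_text = ResultFilter(
    name="recent-text",
    item_id_mode="exclude",
    item_id_patterns="*.zip, *.xml",
    mime_type_mode="include",
    mime_type_patterns="text/*",
    max_age_days=30,
)
page = Resource("index.html", "text/html", 2048, datetime(2024, 1, 20))
print(recent_text.accepts(page, now=datetime(2024, 1, 31)))
```

The cheap pattern checks run before the size and age checks. This ordering is a design choice of the sketch, motivated by the note that determining the file size of each document takes a certain amount of time.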

 

Activities

To create a resource filter, choose Content Management → Global Services → Resource Filters. Then select the resource filter in a set of crawler parameters.

 
