The TREX preprocessor prepares documents for indexing by the TREX engines. The application using TREX (for example, Content Management in SAP Enterprise Portal) transmits the documents to be indexed to the preprocessor in the form of URIs that reference the storage location of the documents in question. The preprocessor resolves these URIs and collects the actual documents from a Web server using HTTP.
Access to Web pages can take place using a Proxy server regardless of whether the pages are in the Internet or in an Intranet. If you want to index documents that can only be accessed using a proxy server, you have to register the proxy server with the TREX preprocessor.
There might also be documents in your environment that can be accessed without a proxy server, for example, documents on local servers or your enterprise's external homepage. You can inform the preprocessor of the servers it can access without a proxy server. This speeds up the processing of documents on these servers.
You specified settings for the proxy server when you installed TREX. If you want to change this later on, modify the TREXPreprocessor.iniconfiguration file on the server on which the TREX preprocessor is running.
The graphic below shows a portal scenario. Some of the documents to be indexed are located on servers on the intranet, others on servers on the Internet. The documents on the Internet can only be reached using a proxy server. The proxy server is not needed for documents on the intranet.
Enter the proxy server into the section [httpclient] in the configuration file TREXPreprocessor.ini so that TREX can load external documents. Enter exclusion rules for internal documents into the section [proxyrules].
proxyhost=<name_of_proxy> (hostname and domain of the proxy server)
You only need to enhance the line proxyuser if a user ID is needed to access the proxy server.
You only need to enhance the line 'proxypassword=' if a password is also needed for the user ID.
You can specify the password for the proxy user during the installation of TREX. You can use a script to change this password later on or to define a password if you did not enter one when installing TREX. For more information, seeConfiguring TREX Security Settings →Specifying the Password for the Proxy Server.
The listing of the parameters cannot contain empty lines. Keep to the format outlined above. The system distinguishes between lowercase and uppercase.
Specify the addresses for which the proxy server is not to be used. You normally enter one or more character strings in which the addresses in your intranet end.
mycompany.com or mylocation.mycompany.com
Do not use the asterisk (*) as a placeholder. Lines that begin with # or ! are treated as comments and are therefore ignored. This is also true for IP addresses. To exclude the IP address space 10.10.0.0-10.10.255.255, add the line 10.10. [proxyrules] to the section. This ensures that no proxy is used for URLs that contain IP addresses in this space.