Entering content frame

Background documentation Indexing and Optimization Locate the document in its SAP Library structure

The queue parameters in the table below define the following:

·        What quantity of data is indexed at one time

·        When the queue server triggers optimization

When changing the parameters, remember that if the queue has the status Indexing or Optimizing, the changes do not affect actions that are currently taking place. The changes only take effect when the queue server has completed the actions.

 

Example

Initial indexing of large database tables

You want to index a large database table with around 200 million data records. TREX treats each data set as a document that consists only of attributes.

The queue server and index server should process the documents as efficiently as possible. Both servers should have a reasonable load. To avoid idle time and reduce administration overheads, the document sets should not be too small. However, the document sets should also not be so large that they overload both servers.

The application sends the table content to TREX in packages that each contain 25,000 documents. The queue server should always collect 100,000 documents before triggering indexing. You should only index a maximum of 50,000 documents at a time. The index server should only carry out optimization once it has indexed 20 million documents. This means that there must be 400 indexing runs, for 50,000 documents each, before the queue server triggers optimization.

In this case, you set the queue parameters as follows:

Note

The setting Initial Indexing Mode = On ensures that optimization first takes place after the index server has indexed 20 million documents.

If you set the Initial Indexing Mode to Off and keep the remaining configuration, the system would trigger indexing as soon as it has collected 100,000 documents. When indexing is complete (that is, 2 * 50,000 documents have been transmitted) the queue server triggers optimization. This means that optimization takes place after 100,000 documents have been indexed rather than 20 million documents.

Initial indexing of large document collections

You want to index 200 million documents (Word files, PDF files, and so on). Indexing should be as efficient as possible.

Processing documents with text content takes much more effort than processing documents that consist only of attributes. Therefore, the queue server should always collect only 10,000 documents before triggering indexing. You should only index a maximum of 10,000 documents at a time, with a maximum of 100 MB. The index server should trigger optimization once 100,000 documents have been indexed.

In this case, you set the queue parameters as follows:

Daily update of index

You should update a large index every day. To avoid creating a heavy load during the day, you set the update for 2am.

In this case, you set the queue parameters as follows:

The queue server triggers indexing at 2am. If there are 10,000 documents or less, and the quantity of data does not exceed 100 MB, the documents are indexed in one run and then optimized.

If there are more than 10,000 documents, the queue server triggers indexing and optimizing for the first 10,000 documents. The queue server waits until the optimization of these documents is complete. It then processes the next lot of documents.

Hourly update of index

You want to index a smaller document set that changes continually. This index should be updated hourly.

Because the document set to be indexed initially is not large, set the queue parameters as follows straight away:

 

 

 

 

Leaving content frame