InfoCube documentation: Standard Web Log File

Technical name: 0WEB_C01

Use

The InfoCube 0WEB_C01 contains the data from Web log files, which Web servers create to record a user's interactive actions on a Web site.

The most frequently used Web servers produce the following Web log file formats:

  1. W3C Extended Log File Format

    The W3C extended format is a customizable ASCII format with a range of different parameters. The parameters are separated by blank spaces, and the time is shown in UTC (Greenwich Mean Time).

    Example

    212.70.97.39 - - [27/Dec/1999:00:18:01 -0800] "GET /accessarchive/ HTTP/1.1" 200 1552
    212.70.97.39 - - [27/Dec/1999:00:18:02 -0800] "GET /accessarchive/ContentsAA.htm HTTP/1.1" 200 1281
    212.70.97.39 - - [27/Dec/1999:00:18:03 -0800] "GET /accessarchive/Welcome.htm HTTP/1.1" 200 3051
    212.70.97.39 - - [27/Dec/1999:00:18:03 -0800] "GET /accessarchive/_themes/expeditn/exptextb.jpg HTTP/1.1" 304 -
    212.70.97.39 - - [27/Dec/1999:00:18:04 -0800] "GET /accessarchive/images/e-mail.gif HTTP/1.1" 304 -
    212.70.97.39 - - [27/Dec/1999:00:18:04 -0800] "GET /accessarchive/_vti_bin/fpcount.exe/accessarchive/?Page=Welcome.htm|Image=2 HTTP/1.1" 200 1098
    212.70.97.39 - - [27/Dec/1999:00:18:04 -0800] "GET /accessarchive/images/Books/bk20.gif HTTP/1.1" 304 -

  2. Microsoft IIS Log Format

    Microsoft IIS creates a fixed (non-customizable) ASCII format. The parameters are separated by blank spaces, and the local time (of the Web server) is shown.

    Example

    #Software: Microsoft Internet Information Server 4.0
    #Version: 1.0
    #Date: 2000-03-01 02:30:05
    #Fields: date time c-ip cs-username s-ip cs-method cs-uri-stem cs-uri-query sc-status sc-bytes cs-bytes time-taken cs-version cs(User-Agent) cs(Cookie) cs(Referer)
    2000-03-01 02:30:04 62.157.46.58 - 168.144.96.104 GET /index.htm - 200 1823 251 844 HTTP/1.1 Mozilla/4.0+(compatible;+MSIE+5.0;+Windows+NT;+DigExt) - http://microsys-kramer.de/Links.htm
    2000-03-01 02:30:04 62.157.46.58 - 168.144.96.104 GET /ContentsAA.htm - 200 1653 259 1406 HTTP/1.1 Mozilla/4.0+(compatible;+MSIE+5.0;+Windows+NT;+DigExt) - http://www.accessarchive.com/
    2000-03-01 02:30:07 62.157.46.58 - 168.144.96.104 GET /Welcome.htm - 200 6305 256 63 HTTP/1.1 Mozilla/4.0+(compatible;+MSIE+5.0;+Windows+NT;+DigExt) - http://www.accessarchive.com/
    2000-03-01 02:30:10 62.157.46.58 - 168.144.96.104 GET /index.htm - 200 1823 184 3937 HTTP/1.1 Mozilla/4.0+(compatible;+MSIE+5.0;+Windows+NT;+DigExt) - -

  3. NCSA Common Log File Format

    The NCSA common format is a fixed (non-customizable) format, which is available for Web sites but not for FTP sites. The parameters are separated by blank spaces, and the local time (of the Web server) is shown.

  4. ODBC Logging

    The ODBC logging format is a fixed (non-customizable) format. It writes data to an ODBC-compatible database. The local time (of the Web server) is shown.
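A log file that carries a #Fields directive, like the second example above, describes its own column layout, so a reader can map each record to named parameters without hard-coding positions. The following Python sketch illustrates that idea; the function name parse_extended_log is hypothetical, and the sketch assumes that values contain no embedded blanks (as in the example, where blanks in the user agent are written as +). It is not the extraction logic used by the InfoSources.

```python
def parse_extended_log(lines):
    """Yield one dict per record, keyed by the names in the #Fields directive."""
    fields = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue
        if line.startswith("#"):
            # Directive line, for example "#Fields: date time c-ip ..."
            name, _, value = line[1:].partition(":")
            if name.strip() == "Fields":
                fields = value.split()
            continue
        values = line.split()
        if len(values) != len(fields):
            continue  # skip records that do not match the declared fields
        yield dict(zip(fields, values))


sample = [
    "#Software: Microsoft Internet Information Server 4.0",
    "#Version: 1.0",
    "#Date: 2000-03-01 02:30:05",
    "#Fields: date time c-ip cs-username s-ip cs-method cs-uri-stem "
    "cs-uri-query sc-status sc-bytes cs-bytes time-taken cs-version "
    "cs(User-Agent) cs(Cookie) cs(Referer)",
    "2000-03-01 02:30:04 62.157.46.58 - 168.144.96.104 GET /index.htm - "
    "200 1823 251 844 HTTP/1.1 "
    "Mozilla/4.0+(compatible;+MSIE+5.0;+Windows+NT;+DigExt) - "
    "http://microsys-kramer.de/Links.htm",
]

records = list(parse_extended_log(sample))
print(records[0]["cs-uri-stem"], records[0]["sc-status"])
```

Records whose number of values differs from the declared field list are skipped here; a production loader would more likely log or reject them.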

The following parameter prefixes and parameters are usually used:

Prefix  Meaning
s-      Server actions
c-      Client actions
r-      Remote actions
cs-     Client-to-server actions
sc-     Server-to-client actions
rs-     Remote-to-server actions (this prefix is used by proxies)
sr-     Server-to-remote actions (this prefix is used by proxies)
x-      Application-specific identifier
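A parameter name such as cs-uri-stem or cs(User-Agent) combines one of these prefixes with an identifier. As a minimal sketch, the following Python function separates the two parts; split_field and PREFIXES are hypothetical names chosen for illustration, and names without a known prefix (for example date or time-taken) are returned unchanged.

```python
import re

# Direction of each prefix; sc- denotes server-to-client in the W3C format.
PREFIXES = {
    "s": "server",
    "c": "client",
    "r": "remote",
    "cs": "client to server",
    "sc": "server to client",
    "rs": "remote to server",
    "sr": "server to remote",
    "x": "application-specific",
}

# Header fields are written as prefix(Header), for example cs(User-Agent).
_HEADER = re.compile(r"^(?P<prefix>[a-z]+)\((?P<name>[^)]+)\)$")


def split_field(field):
    """Return (prefix, identifier); prefix is None if no known prefix is present."""
    match = _HEADER.match(field)
    if match and match.group("prefix") in PREFIXES:
        return match.group("prefix"), match.group("name")
    prefix, sep, name = field.partition("-")
    if sep and prefix in PREFIXES:
        return prefix, name
    return None, field


for field in ("cs-uri-stem", "sc-status", "cs(User-Agent)", "time-taken"):
    print(field, "->", split_field(field))
```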

 

 

Parameter name                    Parameter        Description
Date                              date             The date on which the activity took place; field type <date>.
Time                              time             The time at which the activity took place; field type <time>.
Client IP address                 c-ip             The IP address of the client that accessed your server; field type <address>.
User name                         cs-username      The name of the authenticated user who accessed your server; field type <name>. Anonymous users are indicated by a hyphen.
Service name and instance number  s-sitename       The Internet service and the instance number that ran for the client.
Server name                       s-computername   The name of the server on which the log entry was generated.
Server IP                         s-ip             The IP address of the server on which the log entry was generated.
Method                            cs-method        The action that the client tried to carry out, for example, a GET method; field type <name>.
URI                               uri              The URI; field type <uri>.
URI stem                          cs-uri-stem      The resource that was accessed, for example, default.htm; field type <uri>.
URI query                         cs-uri-query     The query, if any, that the client tried to perform; field type <uri>.
HTTP status                       sc-status        The status of the action, as an HTTP status code; field type <integer>.
Win32 status                      sc-win32-status  The status of the action, as a status code used by Windows 2000.
Comment                           comment          The comment returned with the status code; field type <text>.
Bytes sent                        sc-bytes         The number of bytes sent by the server; field type <integer>.
Bytes received                    cs-bytes         The number of bytes received by the server.
Server port                       s-port           The port number that the client is connected to.
Time taken                        time-taken       The length of time taken for the action; field type <fixed>.
Protocol version                  cs-version       The protocol (HTTP, FTP) version used by the client. For HTTP, this is either HTTP 1.0 or HTTP 1.1.
User agent                        cs(User-Agent)   The browser used by the client.
Cookie                            cs(Cookie)       The content of the cookie that was sent or received, if available.
Referrer                          cs(Referer)      The previous page visited by the client, which provided a link to the current page.
Cached                            cached           Indicates whether a cache hit occurred; field type <integer>. 0 indicates no access to a cache.
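Because every value in a log record arrives as text, a loader typically converts each parameter to the type named in its description before further processing. The Python sketch below shows one way to do this for a few parameters; the CONVERTERS mapping and the function name convert_record are assumptions made for illustration, and a hyphen is treated as a missing value.

```python
from datetime import date, time

# Hypothetical mapping from parameter name to a converter for its field type.
CONVERTERS = {
    "date": date.fromisoformat,   # <date>, for example 2000-03-01
    "time": time.fromisoformat,   # <time>, for example 02:30:04
    "sc-status": int,             # <integer>
    "sc-bytes": int,              # <integer>
    "cs-bytes": int,
}


def convert_record(record):
    """Return a copy of record with typed values; '-' becomes None."""
    typed = {}
    for name, value in record.items():
        if value == "-":
            typed[name] = None
        else:
            # Parameters without a registered converter stay as strings.
            typed[name] = CONVERTERS.get(name, str)(value)
    return typed


raw = {
    "date": "2000-03-01",
    "time": "02:30:04",
    "cs-username": "-",
    "cs-uri-stem": "/index.htm",
    "sc-status": "200",
    "sc-bytes": "1823",
}
print(convert_record(raw))
```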

 

You can find more information about the standards for Web log files on the homepage of the World Wide Web Consortium (http://www.w3c.org).

InfoSources

0WEB_SWLF_1: Web Log File - SAP ITS Log Format

0WEB_SWLF_2: Web Log File - W3C Extended Format

0WEB_SWLF_3: Web Log File - MS Internet Information Server Log Format

0WEB_SWLF_4: Web Log File - Tealeaf Technology

The first three InfoSources load the data from the Web log files created by the Web server. Because the formats of these Web log files differ slightly, each format needs its own InfoSource. InfoSource 0WEB_SWLF_4 is filled through a BAPI interface from the database of Tealeaf Technology.

Characteristics

InfoObject   Description
0WEB_COOKIE  Cookie
0WEB_CS_MET  Method (client -> server)
0WEB_CS_PRO  Protocol (client -> server)
0WEB_C_IP    IP address (client)
0WEB_OPSYST  Operating system
0WEB_SERVER  Server
0WEB_SESSN   Session ID
0WEB_STATUS  Status
0WEB_S_IP    IP address (server)
0WEB_S_PORT  Server port
0WEB_USERAG  Browser
0TIME        Time (clock time)
0WEB_FILE    File
0WEB_REF     Origin of the hit
0WEB_GMT     Deviation from Greenwich Mean Time

Time Characteristics

InfoObject   Description
0CALDAY      Calendar day
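A timestamp such as [27/Dec/1999:00:18:01 -0800] from the first log example carries a calendar day, a clock time, and an offset from Greenwich Mean Time. The Python sketch below splits one log line into these three parts. Relating them to 0CALDAY, 0TIME, and 0WEB_GMT is an assumption made for illustration, not the documented transfer logic; LINE and split_timestamp are hypothetical names.

```python
import re
from datetime import datetime

# Pattern for lines shaped like the first example:
# host ident user [timestamp] "method uri protocol" status bytes
LINE = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<ts>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<uri>\S+) (?P<protocol>[^"]+)" '
    r'(?P<status>\d{3}) (?P<bytes>\d+|-)$'
)


def split_timestamp(line):
    """Return (calendar day, clock time, offset from GMT in hours)."""
    match = LINE.match(line)
    if match is None:
        raise ValueError("unrecognized log line")
    # %b expects English month abbreviations (C or English locale).
    stamp = datetime.strptime(match.group("ts"), "%d/%b/%Y:%H:%M:%S %z")
    offset_hours = stamp.utcoffset().total_seconds() / 3600
    return stamp.date(), stamp.time(), offset_hours


line = ('212.70.97.39 - - [27/Dec/1999:00:18:01 -0800] '
        '"GET /accessarchive/ HTTP/1.1" 200 1552')
day, clock, gmt_offset = split_timestamp(line)
print(day, clock, gmt_offset)
```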

Key Figures

InfoObject   Description
0WEB_CSBYTE  File volume (client -> server)
0WEB_DWELL   Dwell time
0WEB_HIT     Number of hits
0WEB_SCBYTE  File volume sent (server -> client)
0WEB_TIME    Time taken

Navigation Attributes

InfoObject   Description
0WEB_FTYPE   File type
0WEB_FTIME   File with retention period calculation
0WEB_OPSYST  Operating system
0WEB_REFTYP  Referrer type
0WEBREFBAS   Referrer basis
0WEB_SITSEC  Web site area
0WEB_STATYP  Status type
0WEB_U_TYPE  User agent type
0WEB_U_NAME  Browser name

 

 
