com.sapportals.wcm.util.html

Class HtmlTokenizer

java.lang.Object
  extended by com.sapportals.wcm.WcmObject
      extended by com.sapportals.wcm.util.html.HtmlTokenizer

public class HtmlTokenizer
extends WcmObject

Tokenizer that splits an HTML input stream into text, tag, and comment tokens.

Copyright (c) SAP AG 2001-2003


Field Summary
static int TOKEN_COMMENT
          comment token
static int TOKEN_EOF
          end of file
static int TOKEN_TAG
          tag token
static int TOKEN_TEXT
          text token
 
Fields inherited from class com.sapportals.wcm.WcmObject
ORDER_TYPE_MANUAL, ORDER_TYPE_NONE
 
Constructor Summary
HtmlTokenizer(InputStream in)
          Creates a new HtmlTokenizer that reads from the given input stream.
 
Method Summary
 String getEncoding()
          Returns the encoding of the HTML page.
 String getToken()
          Returns the current token as a String.
 String getTokenContent()
          Returns the raw content of the last token, e.g. a tag without "<" and ">".
 int getTokenType()
          Returns the type of the last token (one of the TOKEN_ constants).
 int next()
          Parses the input stream and returns the type of the next token, which is one of the TOKEN_ constants.
 void writeToStream(Writer pw)
          Writes the input stream from the current position to the given Writer without any further parsing.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

TOKEN_EOF

public static final int TOKEN_EOF
end of file

See Also:
Constant Field Values

TOKEN_TEXT

public static final int TOKEN_TEXT
text token

See Also:
Constant Field Values

TOKEN_TAG

public static final int TOKEN_TAG
tag token

See Also:
Constant Field Values

TOKEN_COMMENT

public static final int TOKEN_COMMENT
comment token

See Also:
Constant Field Values

Constructor Detail

HtmlTokenizer

public HtmlTokenizer(InputStream in)
Creates a new HtmlTokenizer that reads from the given input stream. Example:

 InputStream is = ...; // supply the HTML input stream here
 HtmlTokenizer htmlTokenizer = new HtmlTokenizer(is);
 // next() advances to the next token and returns its type
 while (htmlTokenizer.next() != HtmlTokenizer.TOKEN_EOF) {
   int type = htmlTokenizer.getTokenType();
   if (type == HtmlTokenizer.TOKEN_TAG) {
     System.out.println("tag: " + htmlTokenizer.getToken());
   }
   else if (type == HtmlTokenizer.TOKEN_TEXT) {
     System.out.println("text: " + htmlTokenizer.getToken());
   }
   else if (type == HtmlTokenizer.TOKEN_COMMENT) {
     System.out.println("comment: " + htmlTokenizer.getToken());
   }
 }

Parameters:
in - input stream

Method Detail

getEncoding

public String getEncoding()
Returns:
the encoding of the HTML page

getTokenType

public int getTokenType()
Returns:
the type of the last token; one of the TOKEN_ constants

getToken

public String getToken()
Returns the current token as a String.

Returns:
the current token

getTokenContent

public String getTokenContent()
Returns the raw content of the last token, e.g. a tag without "<" and ">".

Returns:
the raw content of the last token
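
The difference between getToken() and getTokenContent() for a tag can be illustrated with a minimal sketch. It assumes that getToken() returns a tag including its angle brackets, which this page implies but does not state. The class TokenContentSketch and the helper stripTagDelimiters are hypothetical names used only for illustration; they are not part of the SAP API and do not reproduce its implementation.

```java
final class TokenContentSketch {
    /**
     * Hypothetical helper (not part of the SAP API): removes one leading "<"
     * and one trailing ">" when both are present, mimicking the documented
     * "tag without < and >" behaviour of getTokenContent().
     */
    static String stripTagDelimiters(String token) {
        if (token.length() >= 2 && token.startsWith("<") && token.endsWith(">")) {
            return token.substring(1, token.length() - 1);
        }
        // Anything that is not delimited by angle brackets passes through unchanged.
        return token;
    }
}
```

Under these assumptions, a tag token such as `<a href="x">` would have the content `a href="x"`, while a text token would be returned as is.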

next

public int next()
         throws IOException
Parses the input stream and returns the type of the next token. The type is one of the TOKEN_ constants: TOKEN_EOF, TOKEN_TEXT, TOKEN_TAG, or TOKEN_COMMENT.

Returns:
the type of the next token
Throws:
IOException - if reading from the input stream fails
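
To make the next() contract concrete, the following is a simplified stand-in that classifies a String into text, tag, and comment tokens. It is a sketch only and not the SAP implementation: the class name MiniHtmlTokenizer, the numeric constant values, the String-based input, and the handling of unterminated tags are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative stand-in, NOT the SAP HtmlTokenizer. */
final class MiniHtmlTokenizer {
    // Assumed values; the real ones are listed under "Constant Field Values".
    static final int TOKEN_EOF = -1;
    static final int TOKEN_TEXT = 0;
    static final int TOKEN_TAG = 1;
    static final int TOKEN_COMMENT = 2;

    private final String html;
    private int pos = 0;
    private int type = TOKEN_EOF;
    private String token = "";

    MiniHtmlTokenizer(String html) { this.html = html; }

    /** Advances to the next token and returns its type. */
    int next() {
        if (pos >= html.length()) {
            token = "";
            type = TOKEN_EOF;
            return type;
        }
        int end;
        if (html.startsWith("<!--", pos)) {
            // Comment: runs up to and including the closing "-->".
            int close = html.indexOf("-->", pos + 4);
            end = (close < 0) ? html.length() : close + 3;
            type = TOKEN_COMMENT;
        } else if (html.charAt(pos) == '<') {
            // Tag: runs up to and including the next ">".
            int close = html.indexOf('>', pos);
            end = (close < 0) ? html.length() : close + 1;
            type = TOKEN_TAG;
        } else {
            // Text: runs up to the next "<" or the end of input.
            int open = html.indexOf('<', pos);
            end = (open < 0) ? html.length() : open;
            type = TOKEN_TEXT;
        }
        token = html.substring(pos, end);
        pos = end;
        return type;
    }

    int getTokenType() { return type; }

    String getToken() { return token; }

    /** Runs the documented loop pattern and collects one line per token. */
    static List<String> describe(String html) {
        MiniHtmlTokenizer t = new MiniHtmlTokenizer(html);
        List<String> out = new ArrayList<>();
        while (t.next() != TOKEN_EOF) {
            int type = t.getTokenType();
            if (type == TOKEN_TAG) {
                out.add("tag: " + t.getToken());
            } else if (type == TOKEN_TEXT) {
                out.add("text: " + t.getToken());
            } else if (type == TOKEN_COMMENT) {
                out.add("comment: " + t.getToken());
            }
        }
        return out;
    }
}
```

The comment check comes before the tag check because a comment also starts with "<". A real tokenizer additionally has to deal with character encodings, quoted attribute values containing ">", and script content, none of which this sketch attempts.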

writeToStream

public void writeToStream(Writer pw)
                   throws IOException
Writes the input stream from the current position to the given Writer without any further parsing.

Parameters:
pw - the Writer that receives the remaining input
Throws:
IOException - if reading the input or writing to pw fails
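
The idea of passing the rest of the input through unparsed can be sketched with standard java.io classes alone. This is an analogy under stated assumptions and not the SAP implementation: CopyRemainderSketch, copyRemaining, and demo are hypothetical names, and a Reader stands in for the tokenizer's internal input position.

```java
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.io.StringWriter;
import java.io.UncheckedIOException;
import java.io.Writer;

final class CopyRemainderSketch {
    /** Copies everything from the reader's current position to the writer, unparsed. */
    static void copyRemaining(Reader in, Writer out) throws IOException {
        char[] buf = new char[4096];
        int n;
        while ((n = in.read(buf)) != -1) {
            out.write(buf, 0, n);
        }
        out.flush();
    }

    /** Skips an already "consumed" prefix, then copies the remainder verbatim. */
    static String demo() {
        try {
            Reader in = new StringReader("<head></head><body>raw</body>");
            // Pretend the head section has already been tokenized.
            in.skip("<head></head>".length());
            StringWriter out = new StringWriter();
            copyRemaining(in, out);
            return out.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

A typical use of such a pattern is to tokenize only as far as needed, for example up to a particular tag, and then hand the remainder of the page to an output writer unchanged.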

Access Rights

This class can be accessed from:

SC                 DC                              Public Part  ACH
[sap.com] KMC-CM   [sap.com] tc/km/frwk            api          EP-KM-CM
[sap.com] KMC-WPC  [sap.com] tc/kmc/wpc/wpcfacade  api          EP-PIN-WPC-WCM


Copyright 2014 SAP AG