StreamTokenizer
The Java StreamTokenizer
class (java.io.StreamTokenizer
) can tokenize the characters read from a Reader
into tokens. For instance, in the string "Mary had a little lamb" each word is a separate token.
When you are parsing files or computer languages it is normal to break the input into tokens, before further processing them. This process is also called "lexing" or "tokenizing".
Using a Java StreamTokenizer
you can move through the tokens in the underlying Reader
. You do so by calling the nextToken()
method of the StreamTokenizer
inside a loop. After each call to nextToken()
the StreamTokenizer
has several fields you can read to see what kind of token was read, it's value etc. These fields are:
sval
The string value of the token, if the token was a string (word)
nval
The number value of the token, if the token was a number.
StreamTokenizer Example
Here is a simple Java StreamTokenizer
example:
The Java StreamTokenizer
is capable of recognizing identifiers, numbers, quoted strings, and various comment styles. You can also specify what characters are to be interpreted as white space, comment begin, end etc. All these things are configured on the StreamTokenizer
before you start parsing its contents. See the JavaDoc for more information about that.
Closing a StreamTokenizer
When you are finished reading tokens from the StreamTokenizer
you should remember to close it. Closing a StreamTokenizer
will also close the Reader
instance from which the StreamTokenizer
is reading.
Closing a StreamTokenizer
is done by calling its close()
method. Here is how closing a StreamTokenizer
looks:
Unfortunately the Java StreamTokenizer does not implement the AutoCloseable interface, so you cannot close it with a Java try-with-resources construct. Instead, look to the Java IO Exception Handling Tutorial for more information about how to handle exceptions and close a StreamTokenizer correctly using a try-finally block.
Last updated
Was this helpful?