|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectnu.xom.NodeFactory
nu.xom.samples.StreamingXHTMLPurifier
public class StreamingXHTMLPurifier
Demonstrates a custom NodeFactory
that strips out
all non-XHTML elements. It���s easy enough to drop out any elements
that are not in the XHTML namespace. However, in the case of SVG,
MathML and most other applications you���ll want to remove the
content of these elements as well. I���ll assume that the namespace
for text is the same as the namespace of the parent element.
(This is not at all clear from the namespaces specification,
but it makes sense in many cases.) To track the nearest namespace
for non-elements, makeElement() will push the element���s namespace
onto a stack and endElement() will pop it off. Peeking at the top
of the stack tells you what namespace the nearest element uses.
This is modeled after Example 8-9 in
Processing XML with Java.
Field Summary | |
---|---|
static java.lang.String |
XHTML_NAMESPACE
|
Constructor Summary | |
---|---|
StreamingXHTMLPurifier()
|
Method Summary | |
---|---|
nu.xom.Nodes |
finishMakingElement(nu.xom.Element element)
Signals the end of an element. |
static void |
main(java.lang.String[] args)
|
nu.xom.Nodes |
makeAttribute(java.lang.String name,
java.lang.String URI,
java.lang.String value,
nu.xom.Attribute.Type type)
Returns a new Nodes object containing an
attribute in the specified namespace
with the specified name and type. |
nu.xom.Nodes |
makeComment(java.lang.String data)
Returns a new Nodes object containing a
comment with the specified text. |
nu.xom.Nodes |
makeDocType(java.lang.String rootElementName,
java.lang.String publicID,
java.lang.String systemID)
Returns a new Nodes object containing a
DocType object with the specified root element
name, system ID, and public ID. |
nu.xom.Nodes |
makeProcessingInstruction(java.lang.String target,
java.lang.String data)
Returns a new Nodes object containing a
new ProcessingInstruction object with
the specified target and data. |
nu.xom.Nodes |
makeText(java.lang.String data)
Returns a new Nodes object containing a
text node with the specified content. |
nu.xom.Element |
startMakingElement(java.lang.String name,
java.lang.String namespace)
Creates a new Element in the specified namespace
with the specified name. |
Methods inherited from class nu.xom.NodeFactory |
---|
finishMakingDocument, makeRootElement, startMakingDocument |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final java.lang.String XHTML_NAMESPACE
Constructor Detail |
---|
public StreamingXHTMLPurifier()
Method Detail |
---|
public nu.xom.Nodes makeText(java.lang.String data)
nu.xom.NodeFactory
Returns a new Nodes
object containing a
text node with the specified content.
Subclasses may change the content or other characteristics of
the text returned. Subclasses may also change the nodes
returned from this method. They may return a Nodes
object containing any number of nodes which are added or
appended to the current parent node. This Nodes
object must not contain any Document
nodes. All of
the nodes returned must be parentless. Subclasses may return an
empty Nodes
to indicate the text should not be
included in the finished document.
makeText
in class nu.xom.NodeFactory
data
- the complete text content of the node
public nu.xom.Nodes makeComment(java.lang.String data)
nu.xom.NodeFactory
Returns a new Nodes
object containing a
comment with the specified text.
Subclasses may change the content or other
characteristics of the comment returned.
Subclasses may change the nodes returned from this method.
They may return a Nodes
object containing any
number of children and attributes which are appended and
added to the current parent element. This Nodes
object should not contain any Document
objects.
All of the nodes returned must be parentless.
Subclasses may return an empty Nodes
to indicate
the comment should not be included in the
finished document.
makeComment
in class nu.xom.NodeFactory
data
- the complete text content of the comment
public nu.xom.Element startMakingElement(java.lang.String name, java.lang.String namespace)
nu.xom.NodeFactory
Creates a new Element
in the specified namespace
with the specified name.
Subclasses may change the name, namespace, content, or other
characteristics of the Element
returned.
Subclasses may return null to indicate the
Element
should not be created.
However, doing so will only remove the element's start-tag and
end-tag from the result tree. Any content inside the element
will be attached to the element's parent by default, unless it
too is filtered. To remove an entire element, return an empty
Nodes
object from the
finishMakingElement()
method.
startMakingElement
in class nu.xom.NodeFactory
name
- the qualified name of the elementnamespace
- the namespace URI of the element
public nu.xom.Nodes finishMakingElement(nu.xom.Element element)
nu.xom.NodeFactory
Signals the end of an element. This method should return
the Nodes
to be added to the tree.
They need not contain the Element
that
was passed to this method, though most often they will.
By default the Nodes
returned contain
only the built element. However, subclasses may return
a list containing any number of nodes, all of which will be
added to the tree at the current position in the order given by
the list (subject to the usual well-formedness constraints, of
course. For instance, the list should not contain a
DocType
object unless the element is the root
element, and the document does not already have a
DocType
). All of the nodes returned must be
parentless. If this method returns an empty list,
then the element (including all its contents) is not included
in the finished document.
To process an element at a time, override this method in a
subclass so that it functions as a callback. When you're done
processing the Element
, return an empty list so
that it will be removed from the tree and garbage collected.
Be careful not to return an empty list for the root element
though. That is, when the element passed to this method is the
root element, the list returned must contain exactly one
Element
object. The simplest way to check this
is testing if element.getParent() instanceof
Document
.
Do not detach element
or any of its ancestors
while inside this method. Doing so can royally muck up the
build.
finishMakingElement
in class nu.xom.NodeFactory
element
- the finished Element
public nu.xom.Nodes makeDocType(java.lang.String rootElementName, java.lang.String publicID, java.lang.String systemID)
nu.xom.NodeFactory
Returns a new Nodes
object containing a
DocType
object with the specified root element
name, system ID, and public ID.
Subclasses may change the root element name, public ID,
system ID, or other characteristics of the DocType
returned. Subclasses may change the nodes returned from this
method. They may return a Nodes
object containing
any number of comments and processing instructions which are
appended to the current parent node. This Nodes
object may not contain any Document
,
Element
, Attribute
, or
Text
objects. All of the nodes returned must be
parentless. Subclasses may return an empty Nodes
to
indicate the DocType
should not be included in the
finished document.
makeDocType
in class nu.xom.NodeFactory
rootElementName
- the declared, qualified name
for the root elementpublicID
- the public ID of the external DTD subsetsystemID
- the URL of the external DTD subset
public nu.xom.Nodes makeProcessingInstruction(java.lang.String target, java.lang.String data)
nu.xom.NodeFactory
Returns a new Nodes
object containing a
new ProcessingInstruction
object with
the specified target and data.
Subclasses may change the target, data, or other
characteristics of the ProcessingInstruction
returned. Subclasses may change the nodes returned from this
method. They may return a Nodes
object containing
any number of nodes which are added or
appended to the current parent node. This Nodes
object must not contain any Document
nodes.
If the processing instruction appears in the prolog or epilog
of the document, then it must also not contain any
Element
, Attribute
, or
Text
objects.
All of the nodes returned must be parentless. Subclasses
may return an empty Nodes
to indicate the
processing instruction should not be included in the
finished document.
makeProcessingInstruction
in class nu.xom.NodeFactory
target
- the target of the processing instructiondata
- the data of the processing instruction
public nu.xom.Nodes makeAttribute(java.lang.String name, java.lang.String URI, java.lang.String value, nu.xom.Attribute.Type type)
nu.xom.NodeFactory
Returns a new Nodes
object containing an
attribute in the specified namespace
with the specified name and type.
Subclasses may change the nodes returned from this method.
They may return a Nodes
object containing any
number of children and attributes which are appended and
added to the current parent element. This Nodes
object may not contain any Document
objects.
All of the nodes returned must be parentless.
Subclasses may return an empty Nodes
to indicate
the attribute should not be created.
makeAttribute
in class nu.xom.NodeFactory
name
- the prefixed name of the attributeURI
- the namespace URIvalue
- the attribute valuetype
- the attribute type
public static void main(java.lang.String[] args)
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |