1 | /****************************************************************************
|
---|
2 | **
|
---|
3 | ** Copyright (C) 2009 Nokia Corporation and/or its subsidiary(-ies).
|
---|
4 | ** All rights reserved.
|
---|
5 | ** Contact: Nokia Corporation ([email protected])
|
---|
6 | **
|
---|
7 | ** This file is part of the documentation of the Qt Toolkit.
|
---|
8 | **
|
---|
9 | ** $QT_BEGIN_LICENSE:LGPL$
|
---|
10 | ** Commercial Usage
|
---|
11 | ** Licensees holding valid Qt Commercial licenses may use this file in
|
---|
12 | ** accordance with the Qt Commercial License Agreement provided with the
|
---|
13 | ** Software or, alternatively, in accordance with the terms contained in
|
---|
14 | ** a written agreement between you and Nokia.
|
---|
15 | **
|
---|
16 | ** GNU Lesser General Public License Usage
|
---|
17 | ** Alternatively, this file may be used under the terms of the GNU Lesser
|
---|
18 | ** General Public License version 2.1 as published by the Free Software
|
---|
19 | ** Foundation and appearing in the file LICENSE.LGPL included in the
|
---|
20 | ** packaging of this file. Please review the following information to
|
---|
21 | ** ensure the GNU Lesser General Public License version 2.1 requirements
|
---|
22 | ** will be met: http://www.gnu.org/licenses/old-licenses/lgpl-2.1.html.
|
---|
23 | **
|
---|
24 | ** In addition, as a special exception, Nokia gives you certain additional
|
---|
25 | ** rights. These rights are described in the Nokia Qt LGPL Exception
|
---|
26 | ** version 1.1, included in the file LGPL_EXCEPTION.txt in this package.
|
---|
27 | **
|
---|
28 | ** GNU General Public License Usage
|
---|
29 | ** Alternatively, this file may be used under the terms of the GNU
|
---|
30 | ** General Public License version 3.0 as published by the Free Software
|
---|
31 | ** Foundation and appearing in the file LICENSE.GPL included in the
|
---|
32 | ** packaging of this file. Please review the following information to
|
---|
33 | ** ensure the GNU General Public License version 3.0 requirements will be
|
---|
34 | ** met: http://www.gnu.org/copyleft/gpl.html.
|
---|
35 | **
|
---|
36 | ** If you have questions regarding the use of this file, please contact
|
---|
37 | ** Nokia at [email protected].
|
---|
38 | ** $QT_END_LICENSE$
|
---|
39 | **
|
---|
40 | ****************************************************************************/
|
---|
41 |
|
---|
42 | /*!
|
---|
43 | \group xml-tools
|
---|
44 | \title XML Classes
|
---|
45 |
|
---|
46 | \brief Classes that support XML, via, for example DOM and SAX.
|
---|
47 |
|
---|
48 | These classes are relevant to XML users.
|
---|
49 |
|
---|
50 | \generatelist{related}
|
---|
51 | */
|
---|
52 |
|
---|
53 | /*!
|
---|
54 | \page xml-processing.html
|
---|
55 | \title XML Processing
|
---|
56 | \brief An Overview of the XML processing facilities in Qt.
|
---|
57 |
|
---|
58 | In addition to core XML support, classes for higher level querying
|
---|
59 | and manipulation of XML data are provided by the QtXmlPatterns
|
---|
60 | module. In the QtSvg module, the QSvgRenderer and QSvgGenerator
|
---|
61 | classes can read and write a subset of SVG, an XML-based file
|
---|
62 | format. Qt also provides helper functions that may be useful to
|
---|
63 | those working with XML and XHTML: see Qt::escape() and
|
---|
64 | Qt::convertFromPlainText().
|
---|
65 |
|
---|
66 | \section1 Topics:
|
---|
67 |
|
---|
68 | \list
|
---|
69 | \o \l {Classes for XML Processing}
|
---|
70 | \o \l {An Introduction to Namespaces}
|
---|
71 | \o \l {XML Streaming}
|
---|
72 | \o \l {The SAX Interface}
|
---|
73 | \o \l {Working with the DOM Tree}
|
---|
74 | \o \l {Using XML Technologies}{XQuery/XPath and XML Schema}
|
---|
75 | \list
|
---|
76 | \o \l{A Short Path to XQuery}
|
---|
77 | \endlist
|
---|
78 | \endlist
|
---|
79 |
|
---|
80 | \section1 Classes for XML Processing
|
---|
81 |
|
---|
82 | These classes are relevant to XML users.
|
---|
83 |
|
---|
84 | \annotatedlist xml-tools
|
---|
85 | */
|
---|
86 |
|
---|
87 | /*!
|
---|
88 | \page xml-namespaces.html
|
---|
89 | \title An Introduction to Namespaces
|
---|
90 | \target namespaces
|
---|
91 |
|
---|
92 | \contentspage XML Processing
|
---|
93 | \nextpage XML Streaming
|
---|
94 |
|
---|
95 | Parts of the Qt XML module documentation assume that you are familiar
|
---|
96 | with XML namespaces. Here we present a brief introduction; skip to
|
---|
97 | \link #namespacesConventions Qt XML documentation conventions \endlink
|
---|
98 | if you already know this material.
|
---|
99 |
|
---|
100 | Namespaces are a concept introduced into XML to allow a more modular
|
---|
101 | design. With their help data processing software can easily resolve
|
---|
102 | naming conflicts in XML documents.
|
---|
103 |
|
---|
104 | Consider the following example:
|
---|
105 |
|
---|
106 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 6
|
---|
107 |
|
---|
108 | Here we find three different uses of the name \e title. If you wish to
|
---|
109 | process this document you will encounter problems because each of the
|
---|
110 | \e titles should be displayed in a different manner -- even though
|
---|
111 | they have the same name.
|
---|
112 |
|
---|
113 | The solution would be to have some means of identifying the first
|
---|
114 | occurrence of \e title as the title of a book, i.e. to use the \e
|
---|
115 | title element of a book namespace to distinguish it from, for example,
|
---|
116 | the chapter title, e.g.:
|
---|
117 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 7
|
---|
118 |
|
---|
119 | \e book in this case is a \e prefix denoting the namespace.
|
---|
120 |
|
---|
121 | Before we can apply a namespace to element or attribute names we must
|
---|
122 | declare it.
|
---|
123 |
|
---|
124 | Namespaces are URIs like \e http://www.example.com/fnord/book/. This
|
---|
125 | does not mean that data must be available at this address; the URI is
|
---|
126 | simply used to provide a unique name.
|
---|
127 |
|
---|
128 | We declare namespaces in the same way as attributes; strictly speaking
|
---|
129 | they \e are attributes. To make for example \e
|
---|
130 | http://www.example.com/fnord/ the document's default XML namespace \e
|
---|
131 | xmlns we write
|
---|
132 |
|
---|
133 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 8
|
---|
134 |
|
---|
135 | To distinguish the \e http://www.example.com/fnord/book/ namespace from
|
---|
136 | the default, we must supply it with a prefix:
|
---|
137 |
|
---|
138 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 9
|
---|
139 |
|
---|
140 | A namespace that is declared like this can be applied to element and
|
---|
141 | attribute names by prepending the appropriate prefix and a ":"
|
---|
142 | delimiter. We have already seen this with the \e book:title element.
|
---|
143 |
|
---|
144 | Element names without a prefix belong to the default namespace. This
|
---|
145 | rule does not apply to attributes: an attribute without a prefix does
|
---|
146 | not belong to any of the declared XML namespaces at all. Attributes
|
---|
147 | always belong to the "traditional" namespace of the element in which
|
---|
148 | they appear. A "traditional" namespace is not an XML namespace, it
|
---|
149 | simply means that all attribute names belonging to one element must be
|
---|
150 | different. Later we will see how to assign an XML namespace to an
|
---|
151 | attribute.
|
---|
152 |
|
---|
153 | Due to the fact that attributes without prefixes are not in any XML
|
---|
154 | namespace there is no collision between the attribute \e title (that
|
---|
155 | belongs to the \e author element) and for example the \e title element
|
---|
156 | within a \e chapter.
|
---|
157 |
|
---|
158 | Let's clarify this with an example:
|
---|
159 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 10
|
---|
160 |
|
---|
161 | Within the \e document element we have two namespaces declared. The
|
---|
162 | default namespace \e http://www.example.com/fnord/ applies to the \e
|
---|
163 | book element, the \e chapter element, the appropriate \e title element
|
---|
164 | and of course to \e document itself.
|
---|
165 |
|
---|
166 | The \e book:author and \e book:title elements belong to the namespace
|
---|
167 | with the URI \e http://www.example.com/fnord/book/.
|
---|
168 |
|
---|
169 | The two \e book:author attributes \e title and \e name have no XML
|
---|
170 | namespace assigned. They are only members of the "traditional"
|
---|
171 | namespace of the element \e book:author, meaning that for example two
|
---|
172 | \e title attributes in \e book:author are forbidden.
|
---|
173 |
|
---|
174 | In the above example we circumvent the last rule by adding a \e title
|
---|
175 | attribute from the \e http://www.example.com/fnord/ namespace to \e
|
---|
176 | book:author: the \e fnord:title comes from the namespace with the
|
---|
177 | prefix \e fnord that is declared in the \e book:author element.
|
---|
178 |
|
---|
179 | Clearly the \e fnord namespace has the same namespace URI as the
|
---|
180 | default namespace. So why didn't we simply use the default namespace
|
---|
181 | we'd already declared? The answer is quite complex:
|
---|
182 | \list
|
---|
183 | \o attributes without a prefix don't belong to any XML namespace at
|
---|
184 | all, not even to the default namespace;
|
---|
185 | \o additionally omitting the prefix would lead to a \e title-title clash;
|
---|
186 | \o writing it as \e xmlns:title would declare a new namespace with the
|
---|
187 | prefix \e title instead of applying the default \e xmlns namespace.
|
---|
188 | \endlist
|
---|
189 |
|
---|
190 | With the Qt XML classes elements and attributes can be accessed in two
|
---|
191 | ways: either by refering to their qualified names consisting of the
|
---|
192 | namespace prefix and the "real" name (or \e local name) or by the
|
---|
193 | combination of local name and namespace URI.
|
---|
194 |
|
---|
195 | More information on XML namespaces can be found at
|
---|
196 | \l http://www.w3.org/TR/REC-xml-names/.
|
---|
197 |
|
---|
198 | \target namespacesConventions
|
---|
199 | \section1 Conventions Used in the Qt XML Documentation
|
---|
200 |
|
---|
201 | The following terms are used to distinguish the parts of names within
|
---|
202 | the context of namespaces:
|
---|
203 | \list
|
---|
204 | \o The \e {qualified name}
|
---|
205 | is the name as it appears in the document. (In the above example \e
|
---|
206 | book:title is a qualified name.)
|
---|
207 | \o A \e {namespace prefix} in a qualified name
|
---|
208 | is the part to the left of the ":". (\e book is the namespace prefix in
|
---|
209 | \e book:title.)
|
---|
210 | \o The \e {local part} of a name (also refered to as the \e {local
|
---|
211 | name}) appears to the right of the ":". (Thus \e title is the
|
---|
212 | local part of \e book:title.)
|
---|
213 | \o The \e {namespace URI} ("Uniform Resource Identifier") is a unique
|
---|
214 | identifier for a namespace. It looks like a URL
|
---|
215 | (e.g. \e http://www.example.com/fnord/ ) but does not require
|
---|
216 | data to be accessible by the given protocol at the named address.
|
---|
217 | \endlist
|
---|
218 |
|
---|
219 | Elements without a ":" (like \e chapter in the example) do not have a
|
---|
220 | namespace prefix. In this case the local part and the qualified name
|
---|
221 | are identical (i.e. \e chapter).
|
---|
222 |
|
---|
223 | \sa {DOM Bookmarks Example}, {SAX Bookmarks Example}
|
---|
224 | */
|
---|
225 |
|
---|
226 | /*!
|
---|
227 | \page xml-streaming.html
|
---|
228 | \title XML Streaming
|
---|
229 |
|
---|
230 | \previouspage An Introduction to Namespaces
|
---|
231 | \contentspage XML Processing
|
---|
232 | \nextpage The SAX Interface
|
---|
233 |
|
---|
234 | Since version 4.3, Qt provides two new classes for reading and
|
---|
235 | writing XML: QXmlStreamReader and QXmlStreamWriter.
|
---|
236 |
|
---|
237 | The QXmlStreamReader and QXmlStreamWriter are two new classes provided
|
---|
238 | in Qt 4.3 and later. A stream reader reports an XML document as a stream
|
---|
239 | of tokens. This differs from SAX as SAX applications provide handlers to
|
---|
240 | receive XML events from the parser whereas the QXmlStreamReader drives the
|
---|
241 | loop, pulling tokens from the reader when they are needed.
|
---|
242 | This pulling approach makes it possible to build recursive descent parsers,
|
---|
243 | allowing XML parsing code to be split into different methods or classes.
|
---|
244 |
|
---|
245 | QXmlStreamReader is a well-formed XML 1.0 parser that excludes external
|
---|
246 | parsed entities. Hence, data provided by the stream reader adheres to the
|
---|
247 | W3C's criteria for well-formed XML, as long as no error occurs. Otherwise,
|
---|
248 | functions such as \l{QXmlStreamReader::atEnd()}{atEnd()},
|
---|
249 | \l{QXmlStreamReader::error()}{error()} and \l{QXmlStreamReader::hasError()}
|
---|
250 | {hasError()} can be used to check and view the errors.
|
---|
251 |
|
---|
252 | An example of QXmlStreamReader implementation would be the \c XbelReader in
|
---|
253 | \l{QXmlStream Bookmarks Example}, which is a subclass of QXmlStreamReader.
|
---|
254 | The constructor takes \a treeWidget as a parameter and the class has Xbel
|
---|
255 | specific functions:
|
---|
256 |
|
---|
257 | \snippet examples/xml/streambookmarks/xbelreader.h 1
|
---|
258 |
|
---|
259 | \dots
|
---|
260 | \snippet examples/xml/streambookmarks/xbelreader.h 2
|
---|
261 | \dots
|
---|
262 |
|
---|
263 | The \c read() function accepts a QIODevice and sets it with
|
---|
264 | \l{QXmlStreamReader::setDevice()}{setDevice()}. The
|
---|
265 | \l{QXmlStreamReader::raiseError()}{raiseError()} function is used to
|
---|
266 | display a custom error message, inidicating that the file's version
|
---|
267 | is incorrect.
|
---|
268 |
|
---|
269 | \snippet examples/xml/streambookmarks/xbelreader.cpp 1
|
---|
270 |
|
---|
271 | The pendent to QXmlStreamReader is QXmlStreamWriter, which provides an XML
|
---|
272 | writer with a simple streaming API. QXmlStreamWriter operates on a
|
---|
273 | QIODevice and has specialised functions for all XML tokens or events you
|
---|
274 | want to write, such as \l{QXmlStreamWriter::writeDTD()}{writeDTD()},
|
---|
275 | \l{QXmlStreamWriter::writeCharacters()}{writeCharacters()},
|
---|
276 | \l{QXmlStreamWriter::writeComment()}{writeComment()} and so on.
|
---|
277 |
|
---|
278 | To write XML document with QXmlStreamWriter, you start a document with the
|
---|
279 | \l{QXmlStreamWriter::writeStartDocument()}{writeStartDocument()} function
|
---|
280 | and end it with \l{QXmlStreamWriter::writeEndDocument()}
|
---|
281 | {writeEndDocument()}, which implicitly closes all remaining open tags.
|
---|
282 | Element tags are opened with \l{QXmlStreamWriter::writeStartDocument()}
|
---|
283 | {writeStartDocument()} and followed by
|
---|
284 | \l{QXmlStreamWriter::writeAttribute()}{writeAttribute()} or
|
---|
285 | \l{QXmlStreamWriter::writeAttributes()}{writeAttributes()},
|
---|
286 | element content, and then \l{QXmlStreamWriter::writeEndDocument()}
|
---|
287 | {writeEndDocument()}. Also, \l{QXmlStreamWriter::writeEmptyElement()}
|
---|
288 | {writeEmptyElement()} can be used to write empty elements.
|
---|
289 |
|
---|
290 | Element content comprises characters, entity references or nested elements.
|
---|
291 | Content can be written with \l{QXmlStreamWriter::writeCharacters()}
|
---|
292 | {writeCharacters()}, a function that also takes care of escaping all
|
---|
293 | forbidden characters and character sequences,
|
---|
294 | \l{QXmlStreamWriter::writeEntityReference()}{writeEntityReference()},
|
---|
295 | or subsequent calls to \l{QXmlStreamWriter::writeStartElement()}
|
---|
296 | {writeStartElement()}.
|
---|
297 |
|
---|
298 | The \c XbelWriter class from \l{QXmlStream Bookmarks Example} is a subclass
|
---|
299 | of QXmlStreamWriter. Its \c writeFile() function illustrates the core
|
---|
300 | functions of QXmlStreamWriter mentioned above:
|
---|
301 |
|
---|
302 | \snippet examples/xml/streambookmarks/xbelwriter.cpp 1
|
---|
303 | */
|
---|
304 |
|
---|
305 | /*!
|
---|
306 | \page xml-sax.html
|
---|
307 | \title The SAX interface
|
---|
308 |
|
---|
309 | \previouspage XML Streaming
|
---|
310 | \contentspage XML Processing
|
---|
311 | \nextpage Working with the DOM Tree
|
---|
312 |
|
---|
313 | SAX is an event-based standard interface for XML parsers.
|
---|
314 | The Qt interface follows the design of the SAX2 Java implementation.
|
---|
315 | Its naming scheme was adapted to fit the Qt naming conventions.
|
---|
316 | Details on SAX2 can be found at \l{http://www.saxproject.org}.
|
---|
317 |
|
---|
318 | Support for SAX2 filters and the reader factory are under
|
---|
319 | development. The Qt implementation does not include the SAX1
|
---|
320 | compatibility classes present in the Java interface.
|
---|
321 |
|
---|
322 | \section1 Introduction to SAX2
|
---|
323 |
|
---|
324 | The SAX2 interface is an event-driven mechanism to provide the user with
|
---|
325 | document information. An "event" in this context means something
|
---|
326 | reported by the parser, for example, it has encountered a start tag,
|
---|
327 | or an end tag, etc.
|
---|
328 |
|
---|
329 | To make it less abstract consider the following example:
|
---|
330 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 3
|
---|
331 |
|
---|
332 | Whilst reading (a SAX2 parser is usually referred to as "reader")
|
---|
333 | the above document three events would be triggered:
|
---|
334 | \list 1
|
---|
335 | \o A start tag occurs (\c{<quote>}).
|
---|
336 | \o Character data (i.e. text) is found, "A quotation.".
|
---|
337 | \o An end tag is parsed (\c{</quote>}).
|
---|
338 | \endlist
|
---|
339 |
|
---|
340 | Each time such an event occurs the parser reports it; you can set up
|
---|
341 | event handlers to respond to these events.
|
---|
342 |
|
---|
343 | Whilst this is a fast and simple approach to read XML documents,
|
---|
344 | manipulation is difficult because data is not stored, simply handled
|
---|
345 | and discarded serially. The \l{Working with the DOM Tree}{DOM interface}
|
---|
346 | reads in and stores the whole document in a tree structure;
|
---|
347 | this takes more memory, but makes it easier to manipulate the
|
---|
348 | document's structure.
|
---|
349 |
|
---|
350 | The Qt XML module provides an abstract class, \l QXmlReader, that
|
---|
351 | defines the interface for potential SAX2 readers. Qt includes a reader
|
---|
352 | implementation, \l QXmlSimpleReader, that is easy to adapt through
|
---|
353 | subclassing.
|
---|
354 |
|
---|
355 | The reader reports parsing events through special handler classes:
|
---|
356 | \table
|
---|
357 | \header \o Handler class \o Description
|
---|
358 | \row \o \l QXmlContentHandler
|
---|
359 | \o Reports events related to the content of a document (e.g. the start tag
|
---|
360 | or characters).
|
---|
361 | \row \o \l QXmlDTDHandler
|
---|
362 | \o Reports events related to the DTD (e.g. notation declarations).
|
---|
363 | \row \o \l QXmlErrorHandler
|
---|
364 | \o Reports errors or warnings that occurred during parsing.
|
---|
365 | \row \o \l QXmlEntityResolver
|
---|
366 | \o Reports external entities during parsing and allows users to resolve
|
---|
367 | external entities themselves instead of leaving it to the reader.
|
---|
368 | \row \o \l QXmlDeclHandler
|
---|
369 | \o Reports further DTD related events (e.g. attribute declarations).
|
---|
370 | \row \o \l QXmlLexicalHandler
|
---|
371 | \o Reports events related to the lexical structure of the
|
---|
372 | document (the beginning of the DTD, comments etc.).
|
---|
373 | \endtable
|
---|
374 |
|
---|
375 | These classes are abstract classes describing the interface. The \l
|
---|
376 | QXmlDefaultHandler class provides a "do nothing" default
|
---|
377 | implementation for all of them. Therefore users only need to overload
|
---|
378 | the QXmlDefaultHandler functions they are interested in.
|
---|
379 |
|
---|
380 | To read input XML data a special class \l QXmlInputSource is used.
|
---|
381 |
|
---|
382 | Apart from those already mentioned, the following SAX2 support classes
|
---|
383 | provide additional useful functionality:
|
---|
384 | \table
|
---|
385 | \header \o Class \o Description
|
---|
386 | \row \o \l QXmlAttributes
|
---|
387 | \o Used to pass attributes in a start element event.
|
---|
388 | \row \o \l QXmlLocator
|
---|
389 | \o Used to obtain the actual parsing position of an event.
|
---|
390 | \row \o \l QXmlNamespaceSupport
|
---|
391 | \o Used to implement namespace support for a reader. Note that
|
---|
392 | namespaces do not change the parsing behavior. They are only
|
---|
393 | reported through the handler.
|
---|
394 | \endtable
|
---|
395 |
|
---|
396 | The \l{SAX Bookmarks example} illustrates how to subclass
|
---|
397 | QXmlDefaultHandler to read an XML bookmark file (XBEL) and
|
---|
398 | how to generate XML by hand.
|
---|
399 |
|
---|
400 | \section1 SAX2 Features
|
---|
401 |
|
---|
402 | The behavior of an XML reader depends on its support for certain
|
---|
403 | optional features. For example, a reader may have the feature "report
|
---|
404 | attributes used for namespace declarations and prefixes along with
|
---|
405 | the local name of a tag". Like every other feature this has a unique
|
---|
406 | name represented by a URI: it is called
|
---|
407 | \e http://xml.org/sax/features/namespace-prefixes.
|
---|
408 |
|
---|
409 | The Qt SAX2 implementation can report whether the reader has
|
---|
410 | particular functionality using the QXmlReader::hasFeature()
|
---|
411 | function. Available features can be tested with QXmlReader::feature(),
|
---|
412 | and switched on or off using QXmlReader::setFeature().
|
---|
413 |
|
---|
414 | Consider the example
|
---|
415 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 4
|
---|
416 | A reader that does not support the \e
|
---|
417 | http://xml.org/sax/features/namespace-prefixes feature would report
|
---|
418 | the element name \e document but not its attributes \e xmlns:book and
|
---|
419 | \e xmlns with their values. A reader with the feature \e
|
---|
420 | http://xml.org/sax/features/namespace-prefixes reports the namespace
|
---|
421 | attributes if the \link QXmlReader::feature() feature\endlink is
|
---|
422 | switched on.
|
---|
423 |
|
---|
424 | Other features include \e http://xml.org/sax/features/namespace
|
---|
425 | (namespace processing, implies \e
|
---|
426 | http://xml.org/sax/features/namespace-prefixes) and \e
|
---|
427 | http://xml.org/sax/features/validation (the ability to report
|
---|
428 | validation errors).
|
---|
429 |
|
---|
430 | Whilst SAX2 leaves it to the user to define and implement whatever
|
---|
431 | features are required, support for \e
|
---|
432 | http://xml.org/sax/features/namespace (and thus \e
|
---|
433 | http://xml.org/sax/features/namespace-prefixes) is mandantory.
|
---|
434 | The \l QXmlSimpleReader implementation of \l QXmlReader,
|
---|
435 | supports them, and can do namespace processing.
|
---|
436 |
|
---|
437 | \l QXmlSimpleReader is not validating, so it
|
---|
438 | does not support \e http://xml.org/sax/features/validation.
|
---|
439 |
|
---|
440 | \section1 Namespace Support via Features
|
---|
441 |
|
---|
442 | As we have seen in the previous section, we can configure the
|
---|
443 | behavior of the reader when it comes to namespace
|
---|
444 | processing. This is done by setting and unsetting the
|
---|
445 | \e http://xml.org/sax/features/namespaces and
|
---|
446 | \e http://xml.org/sax/features/namespace-prefixes features.
|
---|
447 |
|
---|
448 | They influence the reporting behavior in the following way:
|
---|
449 | \list 1
|
---|
450 | \o Namespace prefixes and local parts of elements and attributes can
|
---|
451 | be reported.
|
---|
452 | \o The qualified names of elements and attributes are reported.
|
---|
453 | \o \l QXmlContentHandler::startPrefixMapping() and \l
|
---|
454 | QXmlContentHandler::endPrefixMapping() are called by the reader.
|
---|
455 | \o Attributes that declare namespaces (i.e. the attribute \e xmlns and
|
---|
456 | attributes starting with \e{xmlns:}) are reported.
|
---|
457 | \endlist
|
---|
458 |
|
---|
459 | Consider the following element:
|
---|
460 | \snippet doc/src/snippets/code/doc_src_qtxml.qdoc 5
|
---|
461 | With \e http://xml.org/sax/features/namespace-prefixes set to true
|
---|
462 | the reader will report four attributes; but with the \e
|
---|
463 | namespace-prefixes feature set to false only three, with the \e
|
---|
464 | xmlns:fnord attribute defining a namespace being "invisible" to the
|
---|
465 | reader.
|
---|
466 |
|
---|
467 | The \e http://xml.org/sax/features/namespaces feature is responsible
|
---|
468 | for reporting local names, namespace prefixes and URIs. With \e
|
---|
469 | http://xml.org/sax/features/namespaces set to true the parser will
|
---|
470 | report \e title as the local name of the \e fnord:title attribute, \e
|
---|
471 | fnord being the namespace prefix and \e http://example.com/fnord/ as
|
---|
472 | the namespace URI. When \e http://xml.org/sax/features/namespaces is
|
---|
473 | false none of them are reported.
|
---|
474 |
|
---|
475 | In the current implementation the Qt XML classes follow the definition
|
---|
476 | that the prefix \e xmlns itself isn't associated with any namespace at all
|
---|
477 | (see \link http://www.w3.org/TR/1999/REC-xml-names-19990114/#ns-using
|
---|
478 | http://www.w3.org/TR/1999/REC-xml-names-19990114/#ns-using \endlink).
|
---|
479 | Therefore even with \e http://xml.org/sax/features/namespaces and
|
---|
480 | \e http://xml.org/sax/features/namespace-prefixes both set to true
|
---|
481 | the reader won't return either a local name, a namespace prefix or
|
---|
482 | a namespace URI for \e xmlns:fnord.
|
---|
483 |
|
---|
484 | This might be changed in the future following the W3C suggestion
|
---|
485 | \link http://www.w3.org/2000/xmlns/ http://www.w3.org/2000/xmlns/ \endlink
|
---|
486 | to associate \e xmlns with the namespace \e http://www.w3.org/2000/xmlns.
|
---|
487 |
|
---|
488 | As the SAX2 standard suggests, \l QXmlSimpleReader defaults to having
|
---|
489 | \e http://xml.org/sax/features/namespaces set to true and
|
---|
490 | \e http://xml.org/sax/features/namespace-prefixes set to false.
|
---|
491 | When changing this behavior using \l QXmlSimpleReader::setFeature()
|
---|
492 | note that the combination of both features set to
|
---|
493 | false is illegal.
|
---|
494 |
|
---|
495 | \section2 Summary
|
---|
496 |
|
---|
497 | \l QXmlSimpleReader implements the following behavior:
|
---|
498 |
|
---|
499 | \table
|
---|
500 | \header \o (namespaces, namespace-prefixes)
|
---|
501 | \o Namespace prefix and local part
|
---|
502 | \o Qualified names
|
---|
503 | \o Prefix mapping
|
---|
504 | \o xmlns attributes
|
---|
505 | \row \o (true, false) \o Yes \o Yes* \o Yes \o No
|
---|
506 | \row \o (true, true) \o Yes \o Yes \o Yes \o Yes
|
---|
507 | \row \o (false, true) \o No* \o Yes \o No* \o Yes
|
---|
508 | \row \o (false, false) \i41 Illegal
|
---|
509 | \endtable
|
---|
510 |
|
---|
511 | The behavior of the entries marked with an asterisk (*) is not specified by SAX.
|
---|
512 |
|
---|
513 | \section1 Properties
|
---|
514 |
|
---|
515 | Properties are a more general concept. They have a unique name,
|
---|
516 | represented as an URI, but their value is \c void*. Thus nearly
|
---|
517 | anything can be used as a property value. This concept involves some
|
---|
518 | danger, though: there is no means of ensuring type-safety; the user
|
---|
519 | must take care that they pass the right type. Properties are
|
---|
520 | useful if a reader supports special handler classes.
|
---|
521 |
|
---|
522 | The URIs used for features and properties often look like URLs, e.g.
|
---|
523 | \c http://xml.org/sax/features/namespace. This does not mean that the
|
---|
524 | data required is at this address. It is simply a way of defining
|
---|
525 | unique names.
|
---|
526 |
|
---|
527 | Anyone can define and use new SAX2 properties for their readers.
|
---|
528 | Property support is not mandatory.
|
---|
529 |
|
---|
530 | To set or query properties the following functions are provided: \l
|
---|
531 | QXmlReader::setProperty(), \l QXmlReader::property() and \l
|
---|
532 | QXmlReader::hasProperty().
|
---|
533 | */
|
---|
534 |
|
---|
535 | /*!
|
---|
536 | \page xml-dom.tml
|
---|
537 | \title Working with the DOM Tree
|
---|
538 | \target dom
|
---|
539 |
|
---|
540 | \previouspage The SAX Interface
|
---|
541 | \contentspage XML Processing
|
---|
542 | \nextpage {Using XML Technologies}{XQuery/XPath and XML Schema}
|
---|
543 |
|
---|
544 | DOM Level 2 is a W3C Recommendation for XML interfaces that maps the
|
---|
545 | constituents of an XML document to a tree structure. The specification
|
---|
546 | of DOM Level 2 can be found at \l{http://www.w3.org/DOM/}.
|
---|
547 |
|
---|
548 | \target domIntro
|
---|
549 | \section1 Introduction to DOM
|
---|
550 |
|
---|
551 | DOM provides an interface to access and change the content and
|
---|
552 | structure of an XML file. It makes a hierarchical view of the document
|
---|
553 | (a tree view). Thus -- in contrast to the SAX2 interface -- an object
|
---|
554 | model of the document is resident in memory after parsing which makes
|
---|
555 | manipulation easy.
|
---|
556 |
|
---|
557 | All DOM nodes in the document tree are subclasses of \l QDomNode. The
|
---|
558 | document itself is represented as a \l QDomDocument object.
|
---|
559 |
|
---|
560 | Here are the available node classes and their potential child classes:
|
---|
561 |
|
---|
562 | \list
|
---|
563 | \o \l QDomDocument: Possible children are
|
---|
564 | \list
|
---|
565 | \o \l QDomElement (at most one)
|
---|
566 | \o \l QDomProcessingInstruction
|
---|
567 | \o \l QDomComment
|
---|
568 | \o \l QDomDocumentType
|
---|
569 | \endlist
|
---|
570 | \o \l QDomDocumentFragment: Possible children are
|
---|
571 | \list
|
---|
572 | \o \l QDomElement
|
---|
573 | \o \l QDomProcessingInstruction
|
---|
574 | \o \l QDomComment
|
---|
575 | \o \l QDomText
|
---|
576 | \o \l QDomCDATASection
|
---|
577 | \o \l QDomEntityReference
|
---|
578 | \endlist
|
---|
579 | \o \l QDomDocumentType: No children
|
---|
580 | \o \l QDomEntityReference: Possible children are
|
---|
581 | \list
|
---|
582 | \o \l QDomElement
|
---|
583 | \o \l QDomProcessingInstruction
|
---|
584 | \o \l QDomComment
|
---|
585 | \o \l QDomText
|
---|
586 | \o \l QDomCDATASection
|
---|
587 | \o \l QDomEntityReference
|
---|
588 | \endlist
|
---|
589 | \o \l QDomElement: Possible children are
|
---|
590 | \list
|
---|
591 | \o \l QDomElement
|
---|
592 | \o \l QDomText
|
---|
593 | \o \l QDomComment
|
---|
594 | \o \l QDomProcessingInstruction
|
---|
595 | \o \l QDomCDATASection
|
---|
596 | \o \l QDomEntityReference
|
---|
597 | \endlist
|
---|
598 | \o \l QDomAttr: Possible children are
|
---|
599 | \list
|
---|
600 | \o \l QDomText
|
---|
601 | \o \l QDomEntityReference
|
---|
602 | \endlist
|
---|
603 | \o \l QDomProcessingInstruction: No children
|
---|
604 | \o \l QDomComment: No children
|
---|
605 | \o \l QDomText: No children
|
---|
606 | \o \l QDomCDATASection: No children
|
---|
607 | \o \l QDomEntity: Possible children are
|
---|
608 | \list
|
---|
609 | \o \l QDomElement
|
---|
610 | \o \l QDomProcessingInstruction
|
---|
611 | \o \l QDomComment
|
---|
612 | \o \l QDomText
|
---|
613 | \o \l QDomCDATASection
|
---|
614 | \o \l QDomEntityReference
|
---|
615 | \endlist
|
---|
616 | \o \l QDomNotation: No children
|
---|
617 | \endlist
|
---|
618 |
|
---|
619 | With \l QDomNodeList and \l QDomNamedNodeMap two collection classes
|
---|
620 | are provided: \l QDomNodeList is a list of nodes,
|
---|
621 | and \l QDomNamedNodeMap is used to handle unordered sets of nodes
|
---|
622 | (often used for attributes).
|
---|
623 |
|
---|
624 | The \l QDomImplementation class allows the user to query features of the
|
---|
625 | DOM implementation.
|
---|
626 |
|
---|
627 | To get started please refer to the \l QDomDocument documentation.
|
---|
628 | You might also want to take a look at the \l{DOM Bookmarks example},
|
---|
629 | which illustrates how to read and write an XML bookmark file (XBEL)
|
---|
630 | using DOM.
|
---|
631 | */
|
---|