1
0
mirror of https://gitlab.gnome.org/GNOME/libxml2.git synced 2025-01-12 09:17:37 +03:00
XML parser and toolkit
Go to file
Nick Wellnhofer 875bb08489 parser: Implement xmlCtxtSetOptions
Surprisingly, some options can only be enabled with xmlCtxtUseOptions
and it's impossible to unset them. Add a new API function
xmlCtxtSetOptions which sets or clears all options.

Finally document all parser options.

Make sure to synchronize option bits and struct members.
2024-01-02 19:42:06 +01:00
.gitlab-ci gitlab-ci: Revert accidental change to setup_mingw.sh 2022-12-21 22:30:25 +01:00
doc include: Move globals from xmlsave.h to parser.h 2023-12-07 12:31:06 +01:00
example tests: Don't use deprecated symbols 2023-09-20 22:06:49 +02:00
fuzz parser: Improve error handling 2023-12-21 02:46:27 +01:00
include parser: Implement xmlCtxtSetOptions 2024-01-02 19:42:06 +01:00
m4 Link with --undefined-version 2023-04-29 19:12:23 +02:00
os400 fix Schematron spelling 2022-05-06 10:44:03 +02:00
python io: Always forward IO errors to global handler 2023-12-29 01:22:13 +01:00
result parser: Rework parsing of attribute and entity values 2024-01-02 15:42:03 +01:00
test parser: Rework parsing of attribute and entity values 2024-01-02 15:42:03 +01:00
win32 Remove Trio 2023-11-23 15:48:52 +01:00
xstc .gitignore: Split up and rearrange .gitignore files 2023-04-30 16:22:27 +02:00
.editorconfig Fix .editorconfig 2022-11-22 17:01:21 +01:00
.gitattributes Fix .gitattributes 2020-07-23 20:46:42 +02:00
.gitignore tests: Start with testparser.c for extra tests 2023-10-22 14:24:43 +02:00
.gitlab-ci.yml encoding: Suppress -Wcast-align warnings 2023-10-02 15:04:18 +02:00
autogen.sh Build Autotools CI tests out of source tree (VPATH) 2022-04-21 06:59:02 +02:00
buf.c malloc-fail: Don't truncate parser input buffer 2023-12-12 15:22:23 +01:00
c14n.c c14n: Improve error handling 2023-12-21 02:46:27 +01:00
catalog.c parser: Stop using 'directory' struct member 2023-12-25 23:38:40 +01:00
check-relaxng-test-suite2.py Stop calling xmlMemoryDump 2023-04-30 15:48:41 +02:00
check-relaxng-test-suite.py Stop calling xmlMemoryDump 2023-04-30 15:48:41 +02:00
check-xinclude-test-suite.py Use python3 not python 2022-12-07 13:21:12 +00:00
check-xml-test-suite.py Use python3 not python 2022-12-07 13:21:12 +00:00
check-xsddata-test-suite.py Stop calling xmlMemoryDump 2023-04-30 15:48:41 +02:00
chvalid.c libxml.h: Don't include stdio.h 2022-12-08 04:24:57 +01:00
chvalid.def Remove executable bit from non-executable files 2019-09-16 15:48:59 +02:00
CMakeLists.txt dict: Get random seed from system PRNG 2023-12-24 16:24:34 +01:00
config.h.cmake.in dict: Get random seed from system PRNG 2023-12-24 16:24:34 +01:00
configure.ac dict: Get random seed from system PRNG 2023-12-24 16:24:34 +01:00
Copyright Remove Trio 2023-11-23 15:48:52 +01:00
dbgen.pl add a Timingtests target to check bad behaviour from the streaming engine 2003-09-17 22:33:22 +00:00
dbgenattr.pl add a Timingtests target to check bad behaviour from the streaming engine 2003-09-17 22:33:22 +00:00
debugXML.c debug: Improve error handling 2023-12-21 02:46:27 +01:00
dict.c dict: Move local RNG state to global state 2023-12-24 16:24:34 +01:00
encoding.c encoding: Support ASCII in xmlLookupCharEncodingHandler 2023-12-13 23:58:45 +01:00
entities.c parser: Always copy content from entity to target. 2023-12-29 01:22:11 +01:00
error.c parser: Fix XML_ERR_UNSUPPORTED_ENCODING errors 2024-01-02 14:17:27 +01:00
genChRanges.py libxml.h: Don't include stdio.h 2022-12-08 04:24:57 +01:00
gentest.py tests: Remove special handling of const pointers in gentest.py 2023-11-23 15:36:39 +01:00
genUnicode.py Remove XMLCALL and XMLCDECL macros from public headers 2022-12-08 02:48:27 +01:00
globals.c dict: Move local RNG state to global state 2023-12-24 16:24:34 +01:00
hash.c hash: Report malloc failures 2023-12-11 22:05:47 +01:00
HTMLparser.c parser: New input API 2023-12-29 01:22:13 +01:00
HTMLtree.c save: Improve error handling 2023-12-21 15:02:24 +01:00
legacy.c parser: Remove xmlSetEntityReferenceFunc feature 2023-12-29 01:20:51 +01:00
libxml2-config.cmake.cmake.in cmake: Check whether static linking dependencies found in config files 2023-09-23 16:48:54 +01:00
libxml2-config.cmake.in cmake: Check whether static linking dependencies found in config files 2023-09-23 16:48:54 +01:00
libxml2.doap DOAP: Add download and issue tracker entries 2023-11-27 12:53:58 +01:00
libxml2.syms libxml2.syms: Revert removal of version information 2023-04-29 18:59:41 +02:00
libxml-2.0-uninstalled.pc.in autoconf: Don't bake build time CFLAGS into pkg-config file 2023-09-04 22:14:02 +01:00
libxml-2.0.pc.in autoconf: Don't bake build time CFLAGS into pkg-config file 2023-09-04 22:14:02 +01:00
libxml.h Remove Trio 2023-11-23 15:48:52 +01:00
libxml.m4 libxml.m4: Fix "possibly undefined macro" error 2023-04-30 13:18:47 +02:00
list.c misc: Improve error handling 2023-12-21 15:02:24 +01:00
MAINTAINERS.md doc: Add notes about breaking ABI in MAINTAINERS.md 2023-12-12 22:00:20 +01:00
Makefile.am Makefile.am: omit $(top_builddir) from DEPS and LDADDS 2023-12-20 23:43:08 +00:00
nanoftp.c misc: Improve error handling 2023-12-21 15:02:24 +01:00
nanohttp.c misc: Improve error handling 2023-12-21 15:02:24 +01:00
NEWS Update NEWS 2023-12-24 22:11:49 +01:00
parser.c parser: Implement xmlCtxtSetOptions 2024-01-02 19:42:06 +01:00
parserInternals.c parser: Fix XML_ERR_UNSUPPORTED_ENCODING errors 2024-01-02 14:17:27 +01:00
pattern.c misc: Improve error handling 2023-12-21 15:02:24 +01:00
README.md doc: Improve documentation of configuration options 2023-08-21 11:13:33 +02:00
README.zOS Large batch of typo fixes 2019-09-30 18:04:38 +02:00
relaxng.c doc: Improve documentation of error handlers 2023-12-21 17:36:17 +01:00
rngparser.c relaxng: Improve error handling 2023-12-21 15:01:42 +01:00
runsuite.c tests: Fix build with older MSVC 2023-11-18 14:36:41 +01:00
runtest.c Fix filename and URI handling 2023-12-25 23:38:40 +01:00
runxmlconf.c tests: Fix build with older MSVC 2023-11-18 14:36:41 +01:00
SAX2.c parser: Rework parsing of attribute and entity values 2024-01-02 15:42:03 +01:00
SAX.c globals: Stop including globals.h 2023-09-20 22:07:40 +02:00
schematron.c error: Stop printing some errors by default 2023-12-21 15:02:24 +01:00
testapi.c parser: Report malloc failures 2023-12-11 22:13:05 +01:00
testchar.c parser: Remove xmlErrEncoding 2023-12-21 15:02:24 +01:00
testdict.c doc: Minor fixes for apibuild.py 2023-11-04 19:32:48 +01:00
testdso.c warning patches from Peter Breitenlohner serious DTD parsing speedups, 2005-01-25 00:22:52 +00:00
testlimits.c tests: Remove unneeded error formatting code 2023-12-28 16:47:45 +01:00
testModule.c include: Remove more unnecessary includes 2023-09-21 01:50:53 +02:00
testOOM.c include: Remove more unnecessary includes 2023-09-21 01:50:53 +02:00
testOOMlib.c Don't check for standard C89 headers 2022-03-02 00:43:54 +01:00
testOOMlib.h Remove unneeded #includes 2022-03-04 22:11:49 +01:00
testparser.c parser: Fix XML_ERR_UNSUPPORTED_ENCODING errors 2024-01-02 14:17:27 +01:00
testrecurse.c tests: Remove unneeded error formatting code 2023-12-28 16:47:45 +01:00
testThreads.c include: Remove more unnecessary includes 2023-09-21 01:50:53 +02:00
threads.c dict: Move local RNG state to global state 2023-12-24 16:24:34 +01:00
timsort.h Large batch of typo fixes 2019-09-30 18:04:38 +02:00
tree.c parser: Always copy content from entity to target. 2023-12-29 01:22:11 +01:00
uri.c uri: Keep fragment intact when resolving filesystem paths 2023-12-28 17:07:03 +01:00
valid.c parser: Always copy content from entity to target. 2023-12-29 01:22:11 +01:00
xinclude.c xinclude: Only set xml:base if necessary 2023-12-28 16:47:45 +01:00
xlink.c include: Remove more unnecessary includes 2023-09-21 01:50:53 +02:00
xml2-config.in build: Generate better pkg-config files for static-only builds 2023-09-03 08:52:36 +01:00
xmlcatalog.c globals: Stop including globals.h 2023-09-20 22:07:40 +02:00
xmlIO.c io: Always forward IO errors to global handler 2023-12-29 01:22:13 +01:00
xmllint.c parser: New input API 2023-12-29 01:22:13 +01:00
xmlmemory.c misc: Improve error handling 2023-12-21 15:02:24 +01:00
xmlmodule.c misc: Improve error handling 2023-12-21 15:02:24 +01:00
xmlreader.c parser: Mark more parser context members as unused 2023-12-29 01:20:08 +01:00
xmlregexp.c regexp: Improve error handling 2023-12-21 15:02:24 +01:00
xmlsave.c save: Output U+FFFD replacement characters 2024-01-02 15:39:11 +01:00
xmlschemas.c doc: Improve documentation of error handlers 2023-12-21 17:36:17 +01:00
xmlschemastypes.c schemas: Improve error handling 2023-12-21 02:46:27 +01:00
xmlstring.c error: Refactor error reporting 2023-12-21 02:46:27 +01:00
xmlunicode.c Mark more static data as const 2022-04-07 12:01:23 -07:00
xmlwriter.c parser: Improve error handling 2023-12-21 02:46:27 +01:00
xpath.c xpath: Rewrite substring-before and substring-after 2023-12-28 16:47:45 +01:00
xpointer.c xpath: Improve error handling 2023-12-21 02:46:27 +01:00
xzlib.c io: More refactoring and unescaping fixes 2023-12-25 23:38:40 +01:00

libxml2

libxml2 is an XML toolkit implemented in C, originally developed for the GNOME Project.

Official releases can be downloaded from https://download.gnome.org/sources/libxml2/

The git repository is hosted on GNOME's GitLab server: https://gitlab.gnome.org/GNOME/libxml2

Bugs should be reported at https://gitlab.gnome.org/GNOME/libxml2/-/issues

Documentation is available at https://gitlab.gnome.org/GNOME/libxml2/-/wikis

License

This code is released under the MIT License, see the Copyright file.

Build instructions

libxml2 can be built with GNU Autotools, CMake, or several other build systems in platform-specific subdirectories.

Autotools (for POSIX systems like Linux, BSD, macOS)

If you build from a Git tree, you have to install Autotools and start by generating the configuration files with:

./autogen.sh [configuration options]

If you build from a source tarball, extract the archive with:

tar xf libxml2-xxx.tar.gz
cd libxml2-xxx

Then you can configure and build the library:

./configure [configuration options]
make

The following options disable or enable code modules and relevant symbols:

--with-c14n             Canonical XML 1.0 support (on)
--with-catalog          XML Catalogs support (on)
--with-debug            debugging module and shell (on)
--with-history          history support for shell (off)
--with-readline[=DIR]   use readline in DIR (for shell history)
--with-html             HTML parser (on)
--with-http             HTTP support (on)
--with-iconv[=DIR]      iconv support (on)
--with-icu              ICU support (off)
--with-iso8859x         ISO-8859-X support if no iconv (on)
--with-lzma[=DIR]       use liblzma in DIR (on)
--with-mem-debug        memory debugging module (off)
--with-modules          dynamic modules support (on)
--with-output           serialization support (on)
--with-pattern          xmlPattern selection interface (on)
--with-push             push parser interfaces (on)
--with-python           Python bindings (on)
--with-reader           xmlReader parsing interface (on)
--with-regexps          regular expressions support (on)
--with-run-debug        runtime debugging module (off)
--with-sax1             older SAX1 interface (on)
--with-schemas          XML Schemas 1.0 and RELAX NG support (on)
--with-schematron       Schematron support (on)
--with-threads          multithreading support (on)
--with-thread-alloc     per-thread malloc hooks (off)
--with-tree             DOM like tree manipulation APIs (on)
--with-valid            DTD validation support (on)
--with-writer           xmlWriter serialization interface (on)
--with-xinclude         XInclude 1.0 support (on)
--with-xpath            XPath 1.0 support (on)
--with-xptr             XPointer support (on)
--with-zlib[=DIR]       use libz in DIR (on)

Other options:

--with-minimum          build a minimally sized library (off)
--with-legacy           maximum ABI compatibility (off)

Note that by default, no optimization options are used. You have to enable them manually, for example with:

CFLAGS='-O2 -fno-semantic-interposition' ./configure

Now you can run the test suite with:

make check

Please report test failures to the mailing list or bug tracker.

Then you can install the library:

make install

At that point you may have to rerun ldconfig or a similar utility to update your list of installed shared libs.

CMake (mainly for Windows)

Another option for compiling libxml is using CMake:

cmake -E tar xf libxml2-xxx.tar.gz
cmake -S libxml2-xxx -B libxml2-xxx-build [possible options]
cmake --build libxml2-xxx-build
cmake --install libxml2-xxx-build

Common CMake options include:

-D BUILD_SHARED_LIBS=OFF            # build static libraries
-D CMAKE_BUILD_TYPE=Release         # specify build type
-D CMAKE_INSTALL_PREFIX=/usr/local  # specify the install path
-D LIBXML2_WITH_ICONV=OFF           # disable iconv
-D LIBXML2_WITH_LZMA=OFF            # disable liblzma
-D LIBXML2_WITH_PYTHON=OFF          # disable Python
-D LIBXML2_WITH_ZLIB=OFF            # disable libz

You can also open the libxml source directory with its CMakeLists.txt directly in various IDEs such as CLion, QtCreator, or Visual Studio.

Dependencies

Libxml does not require any other libraries. A platform with somewhat recent POSIX support should be sufficient (please report any violation to this rule you may find).

However, if found at configuration time, libxml will detect and use the following libraries:

  • libz, a highly portable and widely available compression library.
  • liblzma, another compression library.
  • libiconv, a character encoding conversion library. The iconv function is part of POSIX.1-2001, so libiconv isn't required on modern UNIX-like systems like Linux, BSD or macOS.
  • ICU, a Unicode library. Mainly useful as an alternative to iconv on Windows. Unnecessary on most other systems.

Contributing

The current version of the code can be found in GNOME's GitLab at at https://gitlab.gnome.org/GNOME/libxml2. The best way to get involved is by creating issues and merge requests on GitLab. Alternatively, you can start discussions and send patches to the mailing list. If you want to work with patches, please format them with git-format-patch and use plain text attachments.

All code must conform to C89 and pass the GitLab CI tests. Add regression tests if possible.

Authors

  • Daniel Veillard
  • Bjorn Reese
  • William Brack
  • Igor Zlatkovic for the Windows port
  • Aleksey Sanin
  • Nick Wellnhofer