libxml2

mirror of https://gitlab.gnome.org/GNOME/libxml2.git synced 2025-02-20 13:57:22 +03:00

Author	SHA1	Message	Date
Daniel Veillard	38bbd3412d	Release of libxml2-2.9.0 * libxml.spec.in: update * doc/: updated and regenerated libxml2.syms testapi.c: regenerated v2.9.0	2012-09-11 15:00:08 +08:00
Daniel Veillard	7651606f31	Various cleanups to avoid compiler warnings	2012-09-11 14:02:08 +08:00
Daniel Veillard	742a0bbbcc	Keep libxml2.syms when running "make distclean"	2012-09-11 13:37:30 +08:00
Daniel Veillard	f8e3db0445	Big space and tab cleanup Remove all space before tabs and space and tabs at end of lines.	2012-09-11 13:26:36 +08:00
Csaba Raduly	429d3a0aae	Allow to set the quoting character of an xmlWriter It's otherwise impossible to set the quoting character of attribute values of an xmlWriter.	2012-09-11 11:52:46 +08:00
Daniel Veillard	e00778b492	Followup to LibXML2 docs/examples cleanup patch	2012-09-08 21:09:26 +08:00
Daniel Veillard	f933c89813	Keep non-significant blanks node in HTML parser For https://bugzilla.gnome.org/show_bug.cgi?id=681822 Regardless if the option HTML_PARSE_NOBLANKS is set or not, blank nodes are removed from a HTML document, for example: <html> <head> <title>This is a test.</title> </head> <body> <p>This is a test.</p> </body> </html> is read as: <html><head><title>This is a test.</title></head><body> <p>This is a test.</p> </body></html> This changes the default behaviour but the old behaviour is available as expected when using the parser flag HTML_PARSE_NOBLANKS Based on original patch from Igor Ignatyuk <igor_ignatiouk@hotmail.com> * HTMLparser.c: change various places in the parser where ignorable_space SAX callback was called without checking for the parser flag preference * xmllint.c: make sure we use the new flag even for HTML parsing * result/HTML/*: this modifies the output of a number of tests v2.9.0-rc2	2012-09-07 19:32:12 +08:00
Daniel Richard	878ec9db9d	Second round of cleanups for LibXML2 docs/examples configure.am: * Explicitly disallow --enable-rebuild-docs when builddir != srcdir, per what you said about needing to build docs with an in-source build doc/Makefile.am: * Ensure that xmlversion.h is in the source tree before running apibuild.py, to avoid generating an incomplete libxml2-api.xml * Update the .PHONY target (forgot to do this earlier) doc/devhelp/Makefile.am: * Wrap the doc-generating rule in an "if REBUILD_DOCS" conditional so it doesn't cause trouble for regular users * Added a handy-dandy "rebuild" target doc/examples/index.py: * NOTE: You need to run this script to regenerate the files it creates, and then commit the newly-updated files! The generated files currently in git master (e.g. doc/examples/Makefile.am) are out of date even before this patch! * index.html really needs to be in EXTRA_DIST * Wrap the doc-generating rules in an "if REBUILD_DOCS" conditional, because they shouldn't be active otherwise	2012-09-07 14:52:17 +08:00
Daniel Veillard	47881284ae	Add a forbidden variable error number and message to XPath Related to https://bugzilla.gnome.org/show_bug.cgi?id=680938 When the XML_XPATH_NOVAR flags is being used it means that variables are forbidden, not that they are missing	2012-09-07 14:24:50 +08:00
Michael Stahl	55b899a23a	Support long path names on WNT so we've got this patch to libxml2 2.7.6 in the LibreOffice code base, inherited from OOo. it fixes a definite problem, which is that Windows has a rather low maximum path length restriction, and there is a special trick on NT whereby path names can be prefixed with "\\?\", in which case the maximum length is 32k, which ought to be sufficient even for bloated office suites :) I'll attach the patch to the xmlCanonicPath function. note that i didn't write this and am by no means an expert on either Microsoftean platforms or libxml so maybe it's not the best way to do it.	2012-09-07 12:19:25 +08:00
Daniel Veillard	1bd45d1309	Change the XPath code to percolate allocation errors Looping 1000 times on an error stating that a nodeset has grown out of control is useless; make sure we percolate errors up to the various loops and break when an error occurs	2012-09-05 15:35:19 +08:00
Daniel Veillard	7d4c529a33	Improve HTML escaping of attribute on output Handle special cases of &{...} constructs as hinted in the spec http://www.w3.org/TR/html401/appendix/notes.html#h-B.7.1 and special values as comment <!-- ... --> used for server side includes This is limited to attribute values in HTML content.	2012-09-05 12:11:43 +08:00
Daniel Veillard	857104cd49	Remove all .cvsignore as they are not used anymore For https://bugzilla.gnome.org/show_bug.cgi?id=682985 suggested by Adrian Bunk <bunk@stusta.de>	2012-09-04 14:25:23 +08:00
Daniel Veillard	7a2215dbcd	Fix reuse of xmlInitParser While xmlCleanupParser() should not be used unless complete control is ensured over the program, making sure libxml2 is not in use anywhere, it should still be usable and allow a sequence of xmlInitParser(); xmlCleanupParser(); calls if needed. The problem is that the thread key wasn't reallocated on subsequent xmlInitParser() calls, leading to corruption of pthread keys used by the program. * threads.c: make sure xmlCleanupParser() resets the pthread_once() global variable driving thread key allocation.	2012-09-04 12:05:17 +08:00
Daniel Veillard	510e7583f4	Fix a Timsort function helper comment	2012-09-04 11:50:36 +08:00
Daniel Veillard	28f5e1a2d6	Fix potential crash on entities errors Related to https://bugs.launchpad.net/lxml/+bug/502959 Basically the core of the issue is that if an entity references another entity, then in case we are replacing entities content, we should always do so by copying the referenced content as long as the reference is done within the entity. Otherwise, if for some reason there is a later parsing error that entity content may be freed. Complex scenario exposed by command: thinkpad:~/XML/diveintopython-5.4/xml -> valgrind --db-attach=yes ../../xmllint --loaddtd --noout --noent diveintopython.xml Document references &a; a references &b; we references b content directly in by linking in the a content a has an error further down we free a, freeing the chunk from b Document references &b; after &a; we try to copy b content, but it was freed already => segfault * parser.c: never reference directly entity content without copying if we aren't in the document main entity	2012-09-04 11:18:39 +08:00
Christian Weisgerber	3b6d7b9aee	xml2-config.1 markup error There is a spurious ".l" in the xml2-config.1 man page. This line can simply be removed. $ mandoc -Tlint -Werror xml2-config.1 xml2-config.1:12:2: ERROR: skipping unknown macro: .l	2012-08-28 23:40:56 +08:00
Arfrever Frehtes Taifersar Arahesis	1f01f49ba6	Handle ICU_LIBS as LIBADD, not LDFLAGS to prevent linking errors For https://bugzilla.gnome.org/show_bug.cgi?id=677606 For https://bugs.gentoo.org/show_bug.cgi?id=417539 If libxml2-2.8.0 is built with --with-icu --with-python on a system that has an older version of libxml2 installed, then during "make install", libxml2mod.so gets relinked to the systemwide version of libxml2.so.2 instead of libxml2.so.2 from the build tree, and fails at runtime if symbol versions from the older libxml2.so.2 are not available. This effectively makes it impossible to build a libxml2-2.8.0 binary package on a system that does not already have libxml2-2.8.0 installed. Investigation by Rafał Mużyło and Arfrever Frehtes Taifersar Arahesis revealed the cause of the problem to be that libxml2's configure was adding ICU_LIBS to LDFLAGS instead of to LIBADD. This resulted in GNU libtool using the wrong argument order in its relinking command that gets run during "make install".	2012-08-28 22:16:50 +08:00
Akira TAGOH	961b535c10	Bug 676544 - fails to build with --without-sax1 Added some ifdef'd LIBXML_SAX1_ENABLED to make it buildable with --without-sax1 configure option.	2012-08-28 22:03:16 +08:00
Rob Richards	236ea1ea90	fix builds not having stdint.h	2012-08-27 11:56:07 -04:00
Rob Richards	8f2d6b575a	initialize var	2012-08-27 05:08:54 -04:00
Daniel Veillard	8880170e21	Fix the XPath arity check to also check the XPath stack limits For example, xmlXPathNormalizeFunction() would do CHECK_ARITY(1) and then expect valuePop(ctxt); to return an object, except now valuePop() looks at the XPath stack frames and fails, returning NULL, and we end up crashing dereferencing the object. The real solution is to extend CHECK_ARITY() and recompile all XPath functions using it.	2012-08-27 16:20:05 +08:00
Pietro Cerutti	890faa546b	Fix problem with specific and generic error handlers It seems that setting up both xmlTextReaderSetStructuredErrorHandler and xmlSetStructuredErrorFunc confuses the code around error.c:592 and following This patch works with any combinations of using xmlSetStructuredErrorFunc, xmlTextReaderSetStructuredErrorHandler, both, or none.	2012-08-27 13:24:08 +08:00
Daniel Veillard	466fcdaa33	Avoid a potential infinite recursion Which can happen when eliminating epsilon transitions, as reported by Pavel Madr <pmadr@opentext.com>	2012-08-27 12:03:40 +08:00
Vojtech Fried	3e031b7dae	Switching XPath node sorting to Timsort I use libxml xpath engine on quite large (and mostly "flat") xml files. It seems that Shellsort, that is used in xmlXPathNodeSetSort is a performance bottleneck for my case. I have read some posts about sorting in libxml in the libxml archive, but I agree that qsort was not the way to go. I experimented with Timsort instead and my results were good for me. For about 10000 nodes, my test was about 5x faster with Timsort, for 1000 nodes about 10% faster, for small data files, the difference was not measurable. * timsort.h: the algorithm, kept in a separate header * xpath.c: plug in the new algorithm in xmlXPathNodeSetSort * Makefile.am: add the header to the EXTRA_DIST * doc/apibuild.py: avoid indexing the new header	2012-08-24 16:52:44 +08:00
Daniel Veillard	73f94c602c	Small cleanup for valgrind target	2012-08-24 16:38:54 +08:00
Nick Wellnhofer	622705398a	Optimizing '//' in XPath expressions When investigating the libxslt performance problem reported in bug #657665, I found that '//' in XPath expressions can be very slow when working on large subtrees. One of the reasons is the seemingly quadratic time complexity of the duplicate checks when merging result nodes. The other is a missed optimization for expressions of the form 'descendant-or-self::node()/axis::test'. Since '//' is expanded to '/descendant-or-self::node()/', this type of expression is quite common. Depending on the axis of the expression following the 'descendant-or-self' step, the following replacements can be made: from descendant-or-self::node()/child::test to descendant::test from descendant-or-self::node()/descendant::test to descendant::test from descendant-or-self::node()/self::test to descendant-or-self::test from descendant-or-self::node()/descendant-or-self::test to descendant-or-self::test 'test' can be any kind of node test. With these replacements the possibly huge result of 'descendant-or-self::node()' doesn't have to be stored temporarily, but can be processed in one pass. If the resulting nodeset is small, the duplicate checks aren't a problem. I found that there already is a function called xmlXPathRewriteDOSExpression which performs this optimization for a very limited set of cases. It employs a complicated iteration scheme for rewritten expressions. AFAICS, this can be avoided by simply changing the axis of the expression as described above. With the attached patch against libxml2 and the files from bug #657665 I got the following results. Before: $ time xsltproc/xsltproc --noout service-names-port-numbers.xsl service-names-port-numbers.xml real 2m56.213s user 2m56.123s sys 0m0.080s After: $ time xsltproc/xsltproc --noout service-names-port-numbers.xsl service-names-port-numbers.xml real 0m3.836s user 0m3.764s sys 0m0.060s I also ran the libxml2 and libxslt test suites with the patch and couldn't detect any breakage. Nick >From e0f5a8261760e4f257b90410be27657e984237c8 Mon Sep 17 00:00:00 2001 From: Nick Wellnhofer <wellnhofer@aevum.de> Date: Sun, 19 Aug 2012 18:20:22 +0200 Subject: [PATCH] Optimizations for descendant-or-self::node() Currently, the function xmlXPathRewriteDOSExpression optimizes expressions of type '//child'. Instead of adding a 'rewriteType' and doing a compound traversal, the same can be achieved simply by setting the axis of the node test from 'child' to 'descendant'. There are also many other cases that can be optimized similarly. This commit augments xmlXPathRewriteDOSExpression to essentially rewrite the following subexpressions: - descendant-or-self::node()/child:: to descendant:: - descendant-or-self::node()/descendant:: to descendant:: - descendant-or-self::node()/self:: to descendant-or-self:: - descendant-or-self::node()/descendant-or-self:: to descendant-or-self:: Since the '//' shortcut in XPath is translated to '/descendant-or-self::node()/', this greatly speeds up expressions using '//' on large subtrees.	2012-08-24 12:04:25 +08:00
Daniel Veillard	c70d185a2a	Fix an XSD error when generating internal automata When generating a sequence add an extra epsilon transition to avoid further constructs from entering via the last state Bug reported by Johan Corveleyn <jcorvel@gmail.com>	2012-08-23 23:28:04 +08:00
Daniel Veillard	82cdfc4eb3	Expose xmlBufShrink in the public tree API As suggested by Andrew W. Nosenko: Proposal: expose the new xmlBufShrink() to the "public" API for compatibility with xmlBufUse(). Reason: the following scenario: 1. Read something into xmlParserInputBuffer (e.g. using xmlParserInputBufferRead()) 2. Extract content through xmlBufContent() 3. Extract content length through xmlBufUse(). Result have type 'size_t'. 4. Use this content 5. Now, you need to shrink the buffer. How to do it? Doing that through legacy xmlBufferShrink() is unsafe because it uses 'unsigned int' and the whole point of introducing the new API was handling the cases, when 'unsigned int' is not enough. Therefore, need to use the new xmlBufShrink(). But it is "private". Therefore, I propose to expose the new xmlBufShrink() in the same way, as xmlBufContent() and xmlBufUse() are exposed.	2012-08-22 11:05:09 +08:00
Daniel Veillard	ff7227f2c1	Patch for portability of latin characters in C files Coming from LibreOffice repository: http://cgit.freedesktop.org/libreoffice/core/plain/libxml2/libxml2-latin.patch	2012-08-20 20:58:24 +08:00
Vitaly Ostanin	dce1c8baae	Patch for xinclude of text using multibyte characters for bug https://bugzilla.gnome.org/show_bug.cgi?id=633166 When you xinclude a text file, reading portions (by 4000 bytes) of the buffer incorrectly handled the situation when the end comes across portions of the bytes in a multibyte character.	2012-08-17 22:59:44 +08:00
Daniel Veillard	40851d0c59	Fix a segfault on XSD validation on pattern error As reported by Sven <sven@e7o.de>: The following pattern will cause a segmentation fault in my Apache (using PHP5 to validate a XML against a XSD): <xs:pattern value="(.*)\|"/> Fix a cascade of error handling failures which led to the crash in that scenario.	2012-08-17 22:59:10 +08:00
Conrad Irwin	b60061a7a5	Visible HTML elements close the head tag In HTML email it's common to find arbitrary fragments of HTML, the one that triggered this change was of the form: <meta><font></font><div>... Before this change the <font> tag was part of the implicit <head> that gets created for the <meta> tag, after this change, it is part of the <body>, which more closely matches the behaviour of modern HTML implementations.	2012-08-17 19:14:29 +08:00
John Bradshaw	c9a575cff5	libxml(3) manpage typo fix	2012-08-17 11:59:01 +08:00
Daniel Veillard	dfc0aa0a6a	GetProcAddressA is available only on WinCE As Roumen pointed out "After recent commits I could not link build for mingw* host as GetProcAddressA is missing." Looking around a bit it seems you are right: http://voidnish.wordpress.com/2005/06/14/getprocaddress-in-unicode-builds/ except it was introduced in Windows CE http://msdn.microsoft.com/en-us/library/ms885634.aspx	2012-08-17 11:04:24 +08:00
Daniel Richard G	ec4fc529b3	More updates and cleanups on autotools and Makefiles Makefile.am, example/Makefile.am: * Replaced the obsolete INCLUDES variable with AM_CPPFLAGS/AM_CFLAGS acinclude.m4: * autoupdate replaced AC_FD_CC with AS_MESSAGE_LOG_FD autogen.sh: * Added -Wall to the autoreconf invocation, which turned up a whole slew of warnings that are fixed by this patch configure.in: * Most of the changes are due to autoupdate, with subsequent manual tidying * Note that autoupdate bumped the AC_PREREQ version from 2.59 to 2.68. If you normally use an older version of Autoconf, and everything works fine if you comment out that directive, feel free to bump down the version accordingly. * Ensure that #include directives in C fragments always have no whitespace to the left of the '#' mark, as some preprocessors need that to be in the first column example/Makefile.am: * Don't need DEPS * Use plain LDADD instead of LDADDS; if all programs in this file need to link against the same set of libraries, then this is all you need	2012-08-17 10:04:30 +08:00
Daniel Richard G	6842ee810e	More cleanups to the documentation part of libxml2 doc/Makefile.am: * Build what's in doc/ before doc/devhelp/, as the dependency graph flows that way * Add "--path $(srcdir)" so that xsltproc can find DTDs in srcdir * Replaced $(top_srcdir)/doc with an equivalent $(srcdir) * Qualified libxml2-api.xml with $(srcdir) as it's always generated there * Rewrote the dependencies for libxml2-api.xml so that xmlversion.h doesn't throw everything off doc/devhelp/Makefile.am: * Use Automake constructs to install the HTML files instead of an install-data-local rule * Reorganized the file a bit (hello whitespace!) * EXTRA_DIST doesn't need to list so many files now that dist_devhelp_DATA is being used * Only print "Rebuilding devhelp files" if rebuilding is actually occurring doc/examples/index.py: * Make the "this file is auto-generated" banner more prominent * Autotools updates: Use AM_CPPFLAGS/AM_CFLAGS instead of INCLUDES * Got rid of DEPS as it's not needed (Automake already sees the dependency on libxml2.la by way of LDADD(S)) * Replaced LDADDS with LDADD, which is applied to all programs listed in the file. Since all the test programs have the same link dependencies, this way is more concise yet equivalent. 
* Remove the .tmp files via "make clean" instead of having the test programs do it themselves (more on this later) Invoke index.py in srcdir, as it pretty much needs to run there * Restructured the index.html rule so that only the xmllint invocation is allowed to fail * Use $(MKDIR_P) instead of $(mkinstalldirs), $(VAR) instead of @VAR@ * Remove symlinks for test?.xml in an out-of-source build * Sort lists for neatness * Better formatting for EXTRA_DIST and noinst_PROGRAMS variables * Simplified the Automake bits printed for each program: _LDFLAGS doesn't need to be specified as it's empty anyway, _DEPENDENCIES is redundant, _LDADD isn't needed due to the global LDADD Added a bit that symlinks in test?.xml from srcdir in out-of-source builds. This allows the reader4 test to read these files in the current directory, which ensures that the output always looks the same (i.e. does not contain references to srcdir) * Don't hide the test program invocation (or else it's hard to tell which test failed), and don't use superfluous parentheses * NOTE: If you check in these changes, be sure to run this script and also check in the updated files that it generates! doc/examples/.c: Updated the test: lines so that + "&&" is used to separate commands instead of ";" so that errors are not masked + reference files are qualified with $(srcdir)/ + no "rm" takes place -- these are a problem because (1) if a test fails, it's useful to have the output file ready for inspection; (2) the "rm" invocation masks a potential non-zero exit status from diff (This is why I added the CLEANFILES line above) doc/examples/io1.res: * Updated this ref file so that the test passes. (This is correct, right?) 
doc/examples/reader4.res: * Changed this back to its original form, as the symlinking of test?.xml means this file no longer has to contain path prefixes on the filenames doc/examples/testWriter.c: * Changed the output filenames to .tmp instead of .res, partly for consistency, partly to not have to add special cases to CLEANFILES doc/examples/xpath1.c: * Removed the "./" prefix on the test invocation, which is redundant as index.py already adds one	2012-08-17 09:58:38 +08:00
Eric Zurcher	e02869807e	More changes for Win32 compilation	2012-08-15 15:42:11 +08:00
Eric Zurcher	414f269a4f	Basic changes for Win32 builds of release 2.9.0: compile buf.c Makes builds on Windows (whether by MSVC, BCB, or MinGW) to compile buf.c	2012-08-15 13:52:09 +08:00
Daniel Veillard	1f972e9f28	Cleanup some of the parser code Prefetching assumptions about the amount of data read in GROW should be backed up with test for 0 termination when at the end of the buffer.	2012-08-15 10:16:37 +08:00
Daniel Veillard	ef4526ad42	Fix a variable name in comment	2012-08-15 09:15:30 +08:00
Daniel Veillard	baaeadcf65	Regenerated testapi.c	2012-08-15 09:15:30 +08:00
Daniel Veillard	64d11249a5	Fix missing xmlsave.h module which was ignored in recent builds due to "save.h" rule discarding it too	2012-08-15 09:15:30 +08:00
Daniel Richard G	d10a72ce8c	Bundles all generated files for python into the distribution So end users building the Python module don't need to regenerate any source files. some clean-up too.	2012-08-14 12:50:25 +08:00
Daniel Veillard	97fa5b3c8f	Fix file and line report for XSD SAX and reader streaming validation Things now work correctly at the xmllint level: thinkpad:~/XML -> xmllint --sax --noout --schema test_schema.xsd test_xml.xml test_xml.xml:72721: Schemas validity error : Element 'level1': Missing child element(s). Expected is ( level2 ). test_xml.xml fails to validate thinkpad:~/XML -> xmllint --stream --schema test_schema.xsd test_xml.xml test_xml.xml:72721: Schemas validity error : Element 'level1': Missing child element(s). Expected is ( level2 ). test_xml.xml fails to validate thinkpad:~/XML -> * error.c: fix a corner case of not reporting lines when we should * include/libxml/xmlschemas.h doc/symbols.xml: had to add new entry points to set the filename on a validation context and a locator callback used to fetch the line and file from the context * xmlschemas.c: add the new entry points xmlSchemaValidateSetFilename() and xmlSchemaValidateSetLocator(), plus make sure the error reporting routine gets the information if available. Add a locator for SAX. * xmlreader.c: add and plug a locator for readers.	2012-08-14 11:01:07 +08:00
Patrick Gansterer	aabc08477c	Fix compiler warnings of wincecompat.c For https://bugzilla.gnome.org/show_bug.cgi?id=681592 *) Add an explicit cast when converting FILE* to int *) Don't assign a c-string to the element of a char-array	2012-08-13 18:00:22 +08:00
Patrick Gansterer	fd4f6fddf5	Fix non __GNUC__ build For https://bugzilla.gnome.org/show_bug.cgi?id=681590 Length member of _xmlDictEntry is called "len" and not "l"	2012-08-13 17:54:20 +08:00
Daniel Veillard	3b66622462	Fix const qualifier to definition of xmlBufferDetach For https://bugzilla.gnome.org/show_bug.cgi?id=676629 As the buffer is being modified by the call the const doesn't make sense.	2012-08-13 17:49:15 +08:00
Patrick Gansterer	5a82e48edc	Fix windows unicode build For https://bugzilla.gnome.org/show_bug.cgi?id=638650 After much discussions in the list: https://mail.gnome.org/archives/xml/2012-May/msg00062.html The simplest at this point is to fallback to only officially supporting ASCII names in those APIs, document it and use the "A" entry points on Windows.	2012-08-13 17:39:06 +08:00
Roumen Petrov	c3b1d09ba2	clean redefinition of {v}snprintf in C-source as those from *config.h are preferable (e.g. win32config.h)	2012-08-13 16:57:30 +08:00

4101 Commits