libxml2

mirror of https://gitlab.gnome.org/GNOME/libxml2.git synced 2025-04-21 06:50:08 +03:00

Author	SHA1	Message	Date
Nick Wellnhofer	106c4cdd4b	testrecurse: Support multiple huge docs	2022-12-21 20:21:51 +01:00
Nick Wellnhofer	079da5b26d	testrecurse: Add external entities to huge test	2022-12-21 20:21:51 +01:00
Nick Wellnhofer	01bcb23de1	testrecurse: Add test cases for external entities Add test cases for external general and parameter entities.	2022-12-21 20:21:51 +01:00
Nick Wellnhofer	046f99c543	testrecurse: Add lol_param.xml Add test case contributed by Sebastian Pipping for CVE-2021-3541.	2022-12-21 20:20:11 +01:00
Nick Wellnhofer	fafa025209	testrecurse: Rename test files	2022-12-21 20:20:11 +01:00
Nick Wellnhofer	69aeff53c1	testrecurse: Also test without entity substitution	2022-12-21 20:20:11 +01:00
Nick Wellnhofer	4c7cb8f4d4	testrecurse: Also test SAX parser	2022-12-21 20:20:11 +01:00
Nick Wellnhofer	583cd2f64b	testrecurse: Start to test entity expansion stats	2022-12-21 20:19:10 +01:00
Nick Wellnhofer	ce76ebfd13	entities: Stop counting entities This was only used in the old version of xmlParserEntityCheck.	2022-12-21 20:19:10 +01:00
Nick Wellnhofer	a3c8b1805e	entities: Add entity flag for loop check	2022-12-21 20:19:10 +01:00
Nick Wellnhofer	463bbeeca1	entities: Rework entity amplification checks This commit implements robust detection of entity amplification attacks, better known as the "billion laughs" attack. We now limit the size of the document after substitution of entities to 10 times the size before expansion. This guarantees linear behavior by definition. There already was a similar check before, but the accounting of "sizeentities" (size of external entities) and "sizeentcopy" (size of all copies created by entity references) wasn't accurate. We also need saturation arithmetic since we're historically limited to "unsigned long" which is 32-bit on many platforms. A maximum of 10 MB of substitutions is always allowed. This should make use cases like DITA work which have caused problems in the past. The old checks based on the number of entities were removed. This is accounted for by adding a fixed cost to each entity reference. Entity amplification checks are now enabled even if XML_PARSE_HUGE is set. This option is mainly used to allow larger text nodes. Most users were unaware that it also disabled entity expansion checks. Some of the limits might be adjusted later. If this change turns out to affect legitimate use cases, we can add a separate parser option to disable the checks. Fixes #294. Fixes #345.	2022-12-21 20:19:10 +01:00
Nick Wellnhofer	7e3f469be9	entities: Use flags to store '<' check results Instead of abusing the LSB of the "checked" member, store the result of testing for occurrence of '<' character in "flags". Also use the flags in xmlParseStringEntityRef instead of rescanning every time.	2022-12-19 15:59:49 +01:00
Nick Wellnhofer	481d79d44c	entities: Add XML_ENT_PARSED flag To check whether an entity was already parsed, the code previously tested whether "checked" was non-zero or "children" was non-null. The "children" check could be unreliable because an empty entity also results in an empty (NULL) node list. Use a separate flag to make this check more reliable.	2022-12-19 15:26:46 +01:00
Nick Wellnhofer	f34f184f8e	entities: Add "flags" member to struct xmlEntity This will hold various flags and eventually replace the "checked" member.	2022-12-19 15:24:53 +01:00
Nick Wellnhofer	f67dc6189f	xmlreader: Try to fix regression when reading from memory This reverts a change from commit 2059df53, see #462.	2022-12-17 00:14:56 +01:00
Nick Wellnhofer	ae0c9cfa05	uri: Fix handling of port numbers Allow port number without host, real fix for #71. Also compare port numbers in xmlBuildRelativeURI. Fix handling of port numbers in xmlUriEscape.	2022-12-13 01:43:49 +01:00
Nick Wellnhofer	8ed40c621b	Revert "uri: Allow port without host" This reverts commit f30adb54f55e4e765d58195163f2a21f7ac759fb. Fixes #460.	2022-12-13 00:51:33 +01:00
Nick Wellnhofer	a77e32736c	xmlmemory.c: Remove xmlMemContentShow This debug function was always unsafe and hard-coded pointer sizes to 32 bits. Instead of attempting a fix, remove it completely. These days, tools like ASan are much better to debug memory issues. Fixes #214.	2022-12-08 19:45:40 +01:00
Nick Wellnhofer	25ea7b6aa0	testapi.c: Initialize catalog early Avoid leak reports when testing --with-mem-debug.	2022-12-08 19:44:09 +01:00
Nick Wellnhofer	eaebf37fb6	gentest.py: Fix memory leak in API tests Regressed in commit ff34ba3e.	2022-12-08 19:18:10 +01:00
Nick Wellnhofer	785cfcff49	doc/libxml2-api.xml: Regenerate	2022-12-08 19:18:09 +01:00
Nick Wellnhofer	0f54af7494	encoding.c: Fix for documentation generator Top-level macro invocations throw off the documentation parser.	2022-12-08 18:40:58 +01:00
Lukáš Tyrychtr	85c6cacd67	catalog.c: Silence a cast warning on VS 2022 Fixes #457.	2022-12-08 13:34:03 +01:00
Nick Wellnhofer	93a01c46f1	libxml.h: Add comments and indentation	2022-12-08 04:39:03 +01:00
Nick Wellnhofer	92b8ffada8	libxml.h: Remove dubious definition of LIBXML_STATIC This macro is supposed to be set by the build system.	2022-12-08 04:24:57 +01:00
Nick Wellnhofer	60d457be30	libxml.h: Don't include stdio.h	2022-12-08 04:24:57 +01:00
Nick Wellnhofer	924ed82735	libxml.h: Remove ancient LynxOS setup	2022-12-08 04:22:11 +01:00
Nick Wellnhofer	a6debffd7f	xmlexports.h: Disable docs for internal macro XMLPUBLIC	2022-12-08 04:22:11 +01:00
Nick Wellnhofer	3b6cc47ab9	xmlexports.h: Remove LIBXML_FASTCALL optimization This was an experimental and undocumented micro-optimization for Windows which apparently required different calling conventions for variable-argument functions, making it impossible to maintain without domain knowledge.	2022-12-08 04:19:02 +01:00
Nick Wellnhofer	ce9baf94d5	Remove XMLCALL and XMLCDECL macros from public headers	2022-12-08 02:48:27 +01:00
Nick Wellnhofer	dd3569eaa5	Remove XMLDECL macro from .c files	2022-12-08 02:43:17 +01:00
Nick Wellnhofer	06b7a7e05b	Update README.md Mention official releases and Git repo prominently. Remove links to old mailing list.	2022-12-08 00:54:13 +01:00
Nick Wellnhofer	b92768cd62	tests: Enable "runsuite" test This enables some tests with testcases in - test/xsdtest - test/relaxng/OASIS/spectest.xml - test/relaxng/testsuite.xml The XML Schema Test Suite will also be run it was downloaded, see xstc/Makefile.am. Gitlab CI should be updated to fetch these files. There are 10 expected errors in the XSD test suite. This seems to be the case since at least version 2.9.0 from 2012.	2022-12-08 00:24:53 +01:00
Ross Burton	4762c85668	Use python3 not python As per https://peps.python.org/pep-0394/, the python binary can be one of the following options: - Python 2 - Python 3 - Not exist All of the scripts in libxml2 use 'python', which may not exist. As Python 2 reached EOL on the 1st January 2020, it's safe to move the scripts to use python3 explicitly.	2022-12-07 13:21:12 +00:00
Ross Burton	ff49041c62	xstc/fixup-tests.py: port to Python 3	2022-12-07 13:21:12 +00:00
Ross Burton	7640362e76	xstc/fixup-tests.py: unify whitespace The source contains a mix of tabs and spaces, so unify on spaces.	2022-12-07 13:20:53 +00:00
Ross Burton	d598d8af09	libxml.m4: deprecate AM_PATH_XML2, wrap PKG_CHECK_MODULES instead pkg-config has been around for a very long time now, so deprecate the hand-written libxml.m4 fragment providing AM_PATH_XML2 and simply change it to a wrapper around PKG_CHECK_MODULES.	2022-12-06 18:17:49 +00:00
Ross Burton	0ac8c15eb4	python/tests/reader2: use absolute paths everywhere The expected errors contain an relative path, but the messages from the parser contain absolute paths. However, due to the tests not actually failing if there was an error this wasn't noticed. Instead of putting relative paths in the expected messages use format() to embed the correct absolute path. Also use os.path.join() consistently when constructing paths to ensure uniformly formatted paths.	2022-12-06 17:27:34 +00:00
Ross Burton	b9ba5e1d90	python/tests/reader2: always exit(1) if a test fails Batch up the errors in the first parse tests and ensure that the last tests exit with an error if they fail. Also remove an unused import.	2022-12-06 17:25:34 +00:00
Ross Burton	21f2ce7112	testModule: exit if the module can't be opened Instead of silently exiting with success when the module cannot be found, emit a message and fail the test.	2022-12-06 17:24:37 +00:00
Ross Burton	b1b0df6e9b	CI: disable modules in gcc:static build When shared libraries are disabled we can't build loadable modules either, so the testModule test can't work as the testdso.la target doesn't build a module.	2022-12-06 17:23:12 +00:00
Ross Burton	3aaaf5cae6	CI: fix CI on MinGW builds The XML test case tarball isn't actually compressed: the published URL is a .tar and fetches of the .tar.gz redirect silently to the .tar, which is then passed to gzip which refuses to decompress uncompressed data. Fetch the .tar as that is the documented URL, and remove the decompression.	2022-12-06 17:16:39 +00:00
Nick Wellnhofer	76c6da4209	error: Make sure that error messages are valid UTF-8 This has caused issues with the Python bindings for a long time. Should fix #64.	2022-12-04 23:34:19 +01:00
Alex Richardson	4b959ee168	Remove hacky heuristic from b2dc5675e94aa6b5557ba63f7d66b0f08dd17e4d Checking whether the context is close to the parent context by hardcoding 250 is not portable (I noticed tests were failing on Morello since the value is 288 there due to pointers being 128 bits). Instead we should ensure that the XML_VCTXT_USE_PCTXT flag is not set in cases where the user data is not actually a parser context (or ideally add a separate field but that would be an ABI break. From what I can see in the source, the XML_VCTXT_USE_PCTXT is only set if the userData field points to a valid context, and if this is not the case the flag should be cleared when changing userData rather than relying on the offset between the two. Looking at the history, I think d7cb33cf44aa688f24215c9cd398c1a26f0d25ff fixed most of the need for this workaround, but it looks like there are a few more locations that need updating; This commit changes two more places to set/clear/copy the XML_VCTXT_USE_PCTXT flag, so this heuristic should not be needed anymore. I've also drop two = NULL assignment in xmllint since this is not needed after a call to memset(). There was also an uninitialized vctxt.flags (and other fields) in `xmlShellValidate()`, which I've fixed by adding a memset() call.	2022-12-01 15:31:25 +00:00
Alex Richardson	c715ded086	Avoid creating an out-of-bounds pointer by rewriting a check Creating more than one-past-the-end pointers is undefined behaviour in C and while this code is unlikely to be miscompiled, I discovered that an out-of-bounds pointer is being created using UBSan on a CHERI-enabled system.	2022-12-01 15:30:12 +00:00
Alex Richardson	c62c0d82cc	Correctly relocate internal pointers after realloc() Adding an offset to a deallocated pointer and assuming that it can be dereferenced is undefined behaviour. When running libxml2 on CHERI-enabled systems such as Arm Morello this results in the creation of an out-of-bounds pointer that cannot be dereferenced and therefore crashes at runtime. The effect of this UB is not just limited to architectures such as CHERI, incorrect relocation of pointers after realloc can in fact cause FORTIFY_SOURCE errors with recent GCC: https://developers.redhat.com/articles/2022/09/17/gccs-new-fortification-level	2022-12-01 15:14:40 +00:00
Nick Wellnhofer	c7a9b85cbb	html: Improve parsing of nested lists Allow ul/ol as immediate children of ul/ol. This is more in line with the HTML5 spec. Fixes #447.	2022-11-30 17:11:33 +01:00
Nick Wellnhofer	ccb6d54409	Hide internal functions These functions were never declared in public headers, so it should be safe to hide them. Fixes #139.	2022-11-27 02:20:53 +01:00
Nick Wellnhofer	82bd2c3736	python: Fix memory leak checks xmlInitParser doesn't allocate memory anymore, so the checks can be simplified.	2022-11-27 02:11:07 +01:00
Nick Wellnhofer	1966382b34	memory: Don't use locks in xmlMemUsed The Python tests call xmlMemUsed after xmlCleanupParser which doesn't work with statically allocated mutexes. This is only used for debugging, so a lock isn't necessary.	2022-11-27 02:11:07 +01:00

... 3 4 5 6 7 ...

5903 Commits