28 Commits

Author SHA1 Message Date
Carlo Marcelo Arenas Belón
1074e9ebe0 maint: build fix for ucptest and related code tyding up (#782)
Since 9a868b06 (Tidy up config.h management (task from README) (#658),
2025-01-11), it fails to build with an "#error" because "config.h" is
no longer included directly.

Move the check and change some types in the utf8 tables it uses through
pcre2_ord2utf() so it better matches its current use, and other changes
in pcre2test.

While at it fix a typo in ManyConfigTests that was introduced in 0d0ac3aa
(Update EBCDIC support to support testing on normal ASCII systems (#656),
2025-02-12)
2025-09-01 09:55:15 +01:00
Nicholas Wilson
9a868b0605 Tidy up config.h management (task from README) (#658)
This fortuitously fixes the 16/32-bit 'unity' build.

Also tidy up the ckd_smul macros, for the sake of the Unity build.
2025-01-11 19:01:08 +00:00
Carlo Marcelo Arenas Belón
1bc34ffa64 pcre2grep: document better possible multiline matching misses (#252)
While at it, remove a misplaced cast that would cause problems for
subjects over 2GB and a few typos.
2023-05-12 15:54:02 +01:00
Carlo Marcelo Arenas Belón
9c905ce0c1 maint: avoid duplicated boolean properties and bad script extensions (#202)
`ucptest` was misbehaving and showing the wrong properties and
finding the wrong characters.
2023-02-03 14:57:32 +00:00
Carlo Marcelo Arenas Belón
af0839f911 maint: honor @missing in DerivedBidiClass and report non values (#201)
Starting with Unicode 15, the provided DerivedBidiClass data file
reports different default values to use for unassigned characters
in different groups.

Process the additional hints for that specific file, and allow
overriding the values later if more specific.

Since that was previously forbidden, change get_other_case() to
report when no valid value could be provided and allow skipping
conflicting rule lines that required that restriction.

While at it, Allow using the long identifiers in `ucptest` with
the `find bidi` command (underscores also allowed).
2023-02-02 17:31:13 +00:00
Carlo Marcelo Arenas Belón
72c9b57695 Ucptest updates (#199)
* ucptest: regenerate testoutput

Last sync with 1a5fcd (Remove unused variables in ucptest.c and update test data
for added properties, 2022-04-25), and showing significant differences.

* fix `findprop +` with UTF-8 characters and duplicated other case
2023-02-01 15:38:58 +00:00
Philip Hazel
c13d54f658 Implement PCRE2_EXTRA_CASELESS_RESTRICT and related features 2023-01-29 16:46:24 +00:00
Philip Hazel
51a5fcdc1f Remove unused variables in ucptest.c and update test data for added properties 2022-04-25 15:19:09 +01:00
Carlo Marcelo Arenas Belón
4279abbd7d pcre2test: allow using readline headers for libedit (#99)
At least in OpenBSD, there is a libedit library in base, but without
public headers.  Public headers for readline are available but since
15db5d36 (pcre2test: avoid using readline headers with libedit,
2022-04-07) won't be picked up automatically.

Allow pointing cmake to those headers by doing (for example):

  $ cmake -DEDITLINE_INCLUDE_DIR=/usr/include/readline

Or using custom CPPFLAGS with configure (for example):

  $ CPPFLAGS=-I/usr/include/readline ./configure --enable-pcre2test-libedit

Since the headers from readline.h would be otherwise incomplete, detect
that case and pull the extra headers that are required automagically and
while at it, cleanup the NCURSES dependency that was unnecessarily copied
from readline.
2022-04-14 16:51:51 +01:00
Philip Hazel
636569a957 Initial code for Boolean property support 2022-01-09 14:46:43 +00:00
Philip Hazel
d888d36013 Update script run code to work with new script extensions coding 2021-12-31 16:06:05 +00:00
Philip Hazel
7713f33e46 Add support for 4-character script abbreviations 2021-12-28 15:10:12 +00:00
Philip Hazel
98e7d70bc6 Refactor Python scripts for generating Unicode property data 2021-12-26 17:49:58 +00:00
Philip Hazel
b29732063b Revised script handling (see ChangeLog) 2021-12-21 16:11:30 +00:00
Philip Hazel
0246c6bf64 Add support for Bidi_Control and Bidi_Class properties 2021-12-08 15:34:27 +00:00
Philip Hazel
823d4ac956 Add bidi class and control information to Unicode property data 2021-12-05 18:00:10 +00:00
Philip.Hazel
9cebee7e75 Tidies and updates to maintenance programs utf8 and ucptest. 2020-04-01 17:00:33 +00:00
Philip.Hazel
c472f3f91a Update to Unicode 13.0.0. 2020-03-25 17:18:33 +00:00
Philip.Hazel
c0ed5a3ab3 Minor upgrade to pcre2test and comment in ucptest. 2019-07-30 17:59:42 +00:00
Philip.Hazel
aff5a78056 Upgrade to Unicode 12.1.0 2019-07-29 15:32:36 +00:00
Philip.Hazel
1c4dc562e4 Upgrade the ucptest program (used only by maintainer) and script run tests. 2018-10-14 14:27:16 +00:00
Philip.Hazel
04ba4bce0f Unicode properties data records extended to 12-bytes to include a
ScriptExtensions property.
2018-10-06 17:39:52 +00:00
Philip.Hazel
866750fd53 Basic "script run" implementation. Not yet complete, and not yet documented. 2018-10-02 15:25:58 +00:00
Philip.Hazel
937617f343 Update to Unicode 11.0.0 2018-07-07 16:10:29 +00:00
Philip.Hazel
d702527628 Update Unicode tables to 8.0.0. 2015-07-17 15:44:51 +00:00
Philip.Hazel
eee8530add API documentation and a lot of little related changes to the code. 2014-09-19 07:43:39 +00:00
Philip.Hazel
bf2bc83ed8 Update for Unicode 7.0.0 2014-06-20 12:40:32 +00:00
Philip.Hazel
225992aa3a Further work on pcre2test (can now display compiled code). 2014-05-13 11:20:03 +00:00