Author :
Bruno Haible
Date :
2025-09-24 23:28:14
Hash :
5cb21152
Message :
unilbrk: Fix handling of future emojis in rule LB30b.
* lib/unilbrk/lbrktables.h (LBP_ID): Renamed from LBP_ID1.
(LBP_EBF): Renamed from LBP_ID2.
* lib/gen-uni-tables.c (LBP_ID): Renamed from LBP_ID1.
(LBP_EBF): Renamed from LBP_ID2.
(get_lbp): Update. Move is_property_extended_pictographic invocation.
(debug_output_org_lbp): Augment the information from the LineBreak.txt
file with the Extended_Pictographic property. Print the LBP_ID and
LBP_EBF values distinctly.
(fill_org_lbp): Assign value LBP_EBF to all unassigned
Extended_Pictographic characters.
(debug_output_org_lbp): Handle LBP_EBF.
(lbp_value_to_string): Update.
(output_lbrk_rules_as_tables): Treat LBP_EBF like LBP_EB instead of like
LBP_ID.
* lib/unilbrk/u8-possible-linebreaks.c (u8_possible_linebreaks_loop):
Update.
* lib/unilbrk/u16-possible-linebreaks.c (u16_possible_linebreaks_loop):
Likewise.
* lib/unilbrk/u32-possible-linebreaks.c (u32_possible_linebreaks_loop):
Likewise.
* lib/unilbrk/lbrkprop2.h: Regenerated.
* lib/unilbrk/lbrktables.c: Regenerated.
* tests/unilbrk/test-uc-possible-linebreaks.c (main): Skip test cases
that contain the U+1F8FF character.
* modules/unilbrk/*-linebreaks: Bump required libunistring version.
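
The effect of treating LBP_EBF like LBP_EB can be illustrated with a minimal sketch of rule LB30b, which prohibits a break before an emoji modifier when the preceding character is an emoji base or an unassigned Extended_Pictographic character. The enum values, their numbering, and the function names below are hypothetical and chosen for illustration only; they are not the definitions in lib/unilbrk/lbrktables.h, and the real implementation drives this through generated tables rather than a hand-written predicate.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical, simplified line breaking classes.  The real set of
   classes in lib/unilbrk/lbrktables.h is much larger and is numbered
   differently.  */
enum lbp_sketch
{
  SK_AL,   /* ordinary alphabetic character */
  SK_ID,   /* ideographic character */
  SK_EB,   /* emoji base */
  SK_EBF,  /* unassigned Extended_Pictographic ("future emoji") */
  SK_EM    /* emoji modifier */
};

/* Return true if rule LB30b prohibits a line break between a character
   of class PREV and a following character of class NEXT.
   Assumption: LB30b is modeled here in isolation, ignoring all other
   rules of the line breaking algorithm.  */
bool
lb30b_prohibits_break (enum lbp_sketch prev, enum lbp_sketch next)
{
  if (next != SK_EM)
    return false;
  /* SK_EBF is grouped with SK_EB, not with SK_ID.  */
  return prev == SK_EB || prev == SK_EBF;
}

/* Small self-check of the sketch.  */
void
lb30b_sketch_selfcheck (void)
{
  assert (lb30b_prohibits_break (SK_EB, SK_EM));
  assert (lb30b_prohibits_break (SK_EBF, SK_EM));
  assert (!lb30b_prohibits_break (SK_ID, SK_EM));
}
```

In this sketch, a future emoji followed by an emoji modifier stays on one line, while an ordinary ideograph followed by an emoji modifier is not covered by LB30b.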
modules/unilbrk/u8-possible-linebreaks
Description:
Line breaking of UTF-8 strings.
Files:
lib/unilbrk/internal.h
lib/unilbrk/u8-possible-linebreaks.c
lib/uniwidth/cjk.h
Depends-on:
unilbrk/base
unilbrk/tables
unistr/u8-mbtouc-unsafe
streq
configure.ac:
gl_LIBUNISTRING_MODULE([1.4], [unilbrk/u8-possible-linebreaks])
Makefile.am:
if LIBUNISTRING_COMPILE_UNILBRK_U8_POSSIBLE_LINEBREAKS
lib_SOURCES += unilbrk/u8-possible-linebreaks.c
endif
Include:
"unilbrk.h"
License:
LGPLv3+ or GPLv2+
Maintainer:
all