FreeBSD Manual Pages
UNICODE_CATEGORY_LOO(3) Courier Unicode Library UNICODE_CATEGORY_LOO(3) NAME unicode_category_lookup, unicode_isalnum, unicode_isalpha, unicode_isblank, unicode_isdigit, unicode_isgraph, unicode_islower, unicode_ispunct, unicode_isspace, unicode_isupper - unicode character categorization SYNOPSIS #include <courier-unicode.h> uint32_t unicode_category_lookup(char32_t c); int unicode_isalnum(char32_t c); int unicode_isalpha(char32_t c); int unicode_isblank(char32_t c); int unicode_isdigit(char32_t c); int unicode_isgraph(char32_t c); int unicode_islower(char32_t c); int unicode_ispunct(char32_t c); int unicode_isspace(char32_t c); int unicode_isupper(char32_t c); DESCRIPTION unicode_category_lookup() looks up the unicode character's categorization[1]. unicode_category_lookup() returns a 32 bit value. The value's UNICODE_CATEGORY_1 bits specify the first level of the unicode character's category, with UNICODE_CATEGORY_2, UNICODE_CATEGORY_3, and UNICODE_CATEGORY_4 bits specifying the 2nd, 3rd, and 4th level, if given. A value of 0 for each corresponding bit set indicates that no category is specified for this level, for this character; otherwise the possible values are defined in <courier-unicode.h>. The remaining functions implement comparable equivalents of their non-unicode versions in the standard C library, as follows: unicode_isalnum() Returns non-0 for all unicode_isalpha() or unicode_isdigit(). unicode_isalpha() Returns non-0 for all UNICODE_CATEGORY_1_LETTER. unicode_isblank() Return non-0 for TAB, and all UNICODE_CATEGORY_2_SPACE. unicode_isdigit() Returns non-0 for all UNICODE_CATEGORY_1_NUMBER | UNICODE_CATEGORY_2_DIGIT, only (no third categories). unicode_isgraph() Returns non-0 for all codepoints above SPACE which are not unicode_isspace(). unicode_islower() Returns non-0 for all unicode_isalpha() for which the character is equal to unicode_lc(3) of itself. unicode_ispunct() Returns non-0 for all UNICODE_CATEGORY_1_PUNCTUATION. unicode_isspace() Returns non-0 for unicode_isblank() or for unicode characters with linebreaking properties of BK, CR, LF, NL, and SP. unicode_isupper() Returns non-0 for all unicode_isalpha() for which the character is equal to unicode_uc(3) of itself. SEE ALSO courier-unicode(7), unicode_convert_tocase(3). AUTHOR Sam Varshavchik Author NOTES 1. unicode character's categorization https://unicode.org/notes/tn36/ Courier Unicode Library 05/18/2024 UNICODE_CATEGORY_LOO(3)
NAME | SYNOPSIS | DESCRIPTION | SEE ALSO | AUTHOR | NOTES
Want to link to this manual page? Use this URL:
<https://man.freebsd.org/cgi/man.cgi?query=unicode_ispunct&sektion=3&manpath=FreeBSD+Ports+14.3.quarterly>