| Nom | Exemple | Fonction ICU |
|---|---|---|
| union | union([a-z], [A-Z]) = [a-zA-Z] | uset_addAll |
| intersection | intersection([0-9], [4-5]) = [4-5] | uset_retainAll |
| difference | difference([0-9], [4-5]) = [0-36-9] | uset_removeAll |
| inverse | inverse([\x40-\5A]) = [\x00-\x3F\x5B-\U12FFFF] | uset_complement |
| Représentation | Signification | Équivalent |
|---|---|---|
| [:alnum:] | caractères alphanumériques | [A-Za-z0-9] |
| [:alpha:] | caractères alphabétiques | [A-Za-z] |
| [:blank:] | espace et tabulation | [ \t] |
| [:cntrl:] | caractères de contrôle | [\x00-\x1F\x7F] |
| [:digit:] | chiffres décimaux | [0-9] |
| [:graph:] | caractères visibles | [\x21-\x7E] |
| [:lower:] | lettres minuscules | [a-z] |
| [:print:] | caractères visibles et espace | [\x20-\x7E] |
| [:punct:] | ponctuations | [!"#$%&'()*+,-./:;?@[\\\]_`{|}~] |
| [:space:] | caractères blancs | [ \t\r\n\v\f] |
| [:upper:] | lettres majuscules | [A-Z] |
| [:xdigit:] | chiffres hexadécimaux | [A-Fa-f0-9] |
Quex :
ICU :
Equivalents PCRE :
UChar32 c; UErrorCode status; status = U_ZERO_ERROR; c = u_charFromName(U_UNICODE_CHAR_NAME, "LATIN SMALL LETTER N WITH CEDILLA", &status); // U_EXTENDED_CHAR_NAME, U_CHAR_NAME_ALIAS if (U_FAILURE(status)) { // ... } u_fprintf(ustdout, "%04X %C\n", c, c);
TODO (yylex, yyless, yymore, etc)
TODO