Nom | Exemple | Fonction ICU |
---|---|---|
union | union([a-z], [A-Z]) = [a-zA-Z] | uset_addAll |
intersection | intersection([0-9], [4-5]) = [4-5] | uset_retainAll |
difference | difference([0-9], [4-5]) = [0-36-9] | uset_removeAll |
inverse | inverse([\x40-\5A]) = [\x00-\x3F\x5B-\U12FFFF] | uset_complement |
Représentation | Signification | Équivalent |
---|---|---|
[:alnum:] | caractères alphanumériques | [A-Za-z0-9] |
[:alpha:] | caractères alphabétiques | [A-Za-z] |
[:blank:] | espace et tabulation | [ \t] |
[:cntrl:] | caractères de contrôle | [\x00-\x1F\x7F] |
[:digit:] | chiffres décimaux | [0-9] |
[:graph:] | caractères visibles | [\x21-\x7E] |
[:lower:] | lettres minuscules | [a-z] |
[:print:] | caractères visibles et espace | [\x20-\x7E] |
[:punct:] | ponctuations | [!"#$%&'()*+,-./:;?@[\\\]_`{|}~] |
[:space:] | caractères blancs | [ \t\r\n\v\f] |
[:upper:] | lettres majuscules | [A-Z] |
[:xdigit:] | chiffres hexadécimaux | [A-Fa-f0-9] |
Quex :
ICU :
Equivalents PCRE :
UChar32 c; UErrorCode status; status = U_ZERO_ERROR; c = u_charFromName(U_UNICODE_CHAR_NAME, "LATIN SMALL LETTER N WITH CEDILLA", &status); // U_EXTENDED_CHAR_NAME, U_CHAR_NAME_ALIAS if (U_FAILURE(status)) { // ... } u_fprintf(ustdout, "%04X %C\n", c, c);
TODO (yylex, yyless, yymore, etc)
TODO