SDW Software

SDW Software - Home

FileMove Standard Edition 2.5

overview

 

Find files and folders

The power of FileMove SE are the search options. Beside the definition of file properties the use of pattern matching for file and folder names is supported in two modes. One of this mode is called regular expressions (RegEx) the other one simple search. The pattern matching can be done as included and excluded. 

Simple SearchNach oben

The simple search mode is similar then the Windows search. Substitutions for characters are * and ?. They do the following:

*  Finds a character zero or more times at this place. 'Letter*Business' finds 'LetterMoneyBusiness' but 'Letter1Business' or  'LetterBusiness' also.

?  Searches only for a character one or zero time in the filename. 'Letter?Business' finds 'Letter1Business' or 'LetterBusiness'  but not 'LetterForBusiness'. The substitutions can be used more then once.

In combination with the file extensions a single file can be monitored that way. Allowed signs are the same as Windows allows to create file names. This signs can not be used for a file name:
\ : / " < > |
A filter checks for wrong input and shows a message in the edit window.

In RegEx terms, simple search would find the following for the placeholder *

Search pattern in the folder name field:   Karl*Maij
RegEx:   ^Karl(.?)*Maij$
FileMove SE searches the entire path for:   \Karl(.?)*Maij\

Search pattern in the file name field:   Karl*Maij
RegEx:   ^Karl(.?)*Maij$
FileMove SE searches the file name for:   \Karl(.?)*Maij\

In RegEx terms, simple search would find the following for the placeholder ?

Search pattern in the folder name field:   Karl?Maij
RegEx:   ^Karl(.?)Maij$
FileMove SE searches the entire path for:   \Karl(.?)Maij\

Search pattern in the file name:   Karl?Maij
RegEx:   ^Karl(.?)Maij$
FileMove SE searches the file name for:   ^Karl(.?)Maij$

This looks more dangerous as it really is. Use the preview mode to test your patterns.

Regular expressions Nach oben

Regular expressions are probably more for advanced users. Information about this technology is easy available in the web. In FileMove SE the RegEx on files is applied to the file name and the RegEx on folders is applied to the whole path. The characters ^and $ will be replaced internally to a \ (backslash).
For example if your pattern is Maij$ then the internal pattern would be Maij\ or if you enter ^Karl then \Karl would be the used pattern for the match. An expression like (^Karl|^John) would be replaced to (\Karl|\John) and an expression like (Maij$|Sinclair$) would be replaced to (Maij\|Sinclair\).

In the editor window the option 'include' does the search if the RegEx matches and the option 'exclude' does the search if the RegEx does not match.

Use the preview mode to check your search patterns!Nach oben

The following list shows the use of regular expressions for the file and folder search. The different combinations of this examples makes this a powerful tool.

Character Description
\ Marks the next character as either a special character, a literal, a backreference, or an octal escape. For example, 'n' matches the character "n". '\n' matches a newline character. The sequence '\\' matches "\" and "\(" matches "(".
^ Described as above
$ Described as above
* Matches the preceding character or subexpression zero or more times. For example, zo* matches "z" and "zoo". * is equivalent to {0,}.
+ Matches the preceding character or subexpression one or more times. For example, 'zo+' matches "zo" and "zoo", but not "z". + is equivalent to {1,}.
? Matches the preceding character or subexpression zero or one time. For example, "do(es)?" matches the "do" in "do" or "does". ? is equivalent to {0,1}
{n} n is a nonnegative integer. Matches exactly n times. For example, 'o{2}' does not match the 'o' in "Bob," but matches the two o's in "food".
{n,} n is a nonnegative integer. Matches at least n times. For example, 'o{2,}' does not match the "o" in "Bob" and matches all the o's in "foooood". 'o{1,}' is equivalent to 'o+'. 'o{0,}' is equivalent to 'o*'.
{n,m} m and n are nonnegative integers, where n <= m. Matches at least n and at most m times. For example, "o{1,3}" matches the first three o's in "fooooood". 'o{0,1}' is equivalent to 'o?'. Note that you cannot put a space between the comma and the numbers.
? When this character immediately follows any of the other quantifiers (*, +, ?, {n}, {n,}, {n,m}), the matching pattern is non-greedy. A non-greedy pattern matches as little of the searched string as possible, whereas the default greedy pattern matches as much of the searched string as possible. For example, in the string "oooo", 'o+?' matches a single "o", while 'o+' matches all 'o's.
. Matches any single character except "\n". To match any character including the '\n', use a pattern such as '[\s\S]'.
(?:pattern) Matches pattern but does not capture the match, that is, it is a non-capturing match that is not stored for possible later use. This is useful for combining parts of a pattern with the "or" character (|). For example, 'industr(?:y|ies) is a more economical expression than 'industry|industries'.
(?=pattern) Positive lookahead matches the search string at any point where a string matching pattern begins. This is a non-capturing match, that is, the match is not captured for possible later use. For example 'Windows (?=95|98|NT|2000)' matches "Windows" in "Windows 2000" but not "Windows" in "Windows 3.1". Lookaheads do not consume characters, that is, after a match occurs, the search for the next match begins immediately following the last match, not after the characters that comprised the lookahead.
(?!pattern) Negative lookahead matches the search string at any point where a string not matching pattern begins. This is a non-capturing match, that is, the match is not captured for possible later use. For example 'Windows (?!95|98|NT|2000)' matches "Windows" in "Windows 3.1" but does not match "Windows" in "Windows 2000". Lookaheads do not consume characters, that is, after a match occurs, the search for the next match begins immediately following the last match, not after the characters that comprised the lookahead.
x|y Matches either x or y. For example, 'z|food' matches "z" or "food". '(z|f)ood' matches "zood" or "food".
[xyz] A character set. Matches any one of the enclosed characters. For example, '[abc]' matches the 'a' in "plain".
[^xyz] A negative character set. Matches any character not enclosed. For example, '[^abc]' matches the 'p' in "plain".
[a-z] A range of characters. Matches any character in the specified range. For example, '[a-z]' matches any lowercase alphabetic character in the range 'a' through 'z'.
[^a-z] A negative range characters. Matches any character not in the specified range. For example, '[^a-z]' matches any character not in the range 'a' through 'z'.
\b Matches a word boundary, that is, the position between a word and a space. For example, 'er\b' matches the 'er' in "never" but not the 'er' in "verb".
\B Matches a nonword boundary. 'er\B' matches the 'er' in "verb" but not the 'er' in "never".
\cx Matches the control character indicated by x. For example, \cM matches a Control-M or carriage return character. The value of x must be in the range of A-Z or a-z. If not, c is assumed to be a literal 'c' character.
\d Matches a digit character. Equivalent to [0-9].
\D Matches a nondigit character. Equivalent to [^0-9].
\f Matches a form-feed character. Equivalent to \x0c and \cL.
\n Matches a newline character. Equivalent to \x0a and \cJ.
\r Matches a carriage return character. Equivalent to \x0d and \cM.
\s Matches any whitespace character including space, tab, form-feed, etc. Equivalent to [ \f\n\r\t\v].
\S Matches any non-white space character. Equivalent to [^ \f\n\r\t\v].
\t Matches a tab character. Equivalent to \x09 and \cI.
\v Matches a vertical tab character. Equivalent to \x0b and \cK.
\w Matches any word character including underscore. Equivalent to '[A-Za-z0-9_]'.
\W Matches any nonword character. Equivalent to '[^A-Za-z0-9_]'.
\xn Matches n, where n is a hexadecimal escape value. Hexadecimal escape values must be exactly two digits long. For example, '\x41' matches "A". '\x041' is equivalent to '\x04' & "1". Allows ASCII codes to be used in regular expressions.
\num Matches num, where num is a positive integer. A reference back to captured matches. For example, '(.)\1' matches two consecutive identical characters.
\n Identifies either an octal escape value or a backreference. If \n is preceded by at least n captured subexpressions, n is a backreference. Otherwise, n is an octal escape value if n is an octal digit (0-7).
\nm Identifies either an octal escape value or a backreference. If \nm is preceded by at least nm captured subexpressions, nm is a backreference. If \nm is preceded by at least n captures, n is a backreference followed by literal m. If neither of the preceding conditions exists, \nm matches octal escape value nm when n and m are octal digits (0-7).
\nml Matches octal escape value nml when n is an octal digit (0-3) and m and l are octal digits (0-7).
\un Matches n, where n is a Unicode character expressed as four hexadecimal digits. For example, \u00A9 matches the copyright symbol (©).

Recommended book: Mastering Regular Expressions by Jeffrey E. F. Friedl

SDW Software • Internet and Desktop Applications
Gattikonerstrasse 127 • 8136 Gattikon • Switzerland
kem@sitedesign.chwww.sitedesign.ch