Skip options

Top  Previous  Next

Page and folder skip list

This is a list of pages and folders that will not be scanned during the indexing process. Note that filenames and paths are case sensitive. Typically you would want to filter pages that the user should never be able to get to directly via the search function. Note that if the path to a page partially or fully matches any entry in this list it will be filtered. For example, an entry of “\private\” will filter “\private\file1.htm”, “\private\file2.htm” and “photos\private\athome.htm”.

lightbulb

Tip: You can also skip pages based on whether certain keywords are found or not found within the page content. See "Content filtering" for more information.

As of V6, you can now also use the asterisk wildcard (*) to match a sequence of characters. For example, an entry of "\private\*\index.html" will skip all files named "index.html" in a sub-folder within "\private\".

Skip files or directories that begin with an underscore

When this is enabled, files or directories beginning with an underscore (such as those kept by Dreamweaver and FrontPage) will be ignored and skipped during Offline Mode indexing.

Word skip list

This is a list of words that will be filtered during the indexing process. Typically you would want to filter some small words that appear on every page such as, “and”, “or”, “the”, etc. Depending on your site you may also want to skip other words. For example, if your site had a thousand pages about different aircrafts, you might want to remove the word “aircraft” from the index, as it would match every page on your site.

lightbulb

Tip: If a skip word begins with a ‘*’ character then it will match and filter any word that contains this keyword. For example, a skip word entry of “*fun” would filter out the following words: “fun”, “funny” and “nofun”. Note that the ‘*’ character must be at the beginning of the word for this to take effect.

Skip words less than x characters

You can now specify the minimum number of characters that a word must be before it is indexed. It is defaulted to two characters so all single-character words are skipped.

Skipping sections of a page from being indexed

To exclude sections of a page (such as headers, footers, and navigation menus), please see "Skipping sections of a page".