416-993-4953

32 Grenville Street M4Y 1A3 CGreaves@ChrisGreaves.com

The Rules Table

Free Download of the 1-Click Indexer

Indxr contains an engine, a piece of programming code, that enables it to determine which words are Interesting and which are not.

For your ultimate control, the engine is driven by a table of rules. Once you have run Indxr for the first time, you will find an initial table of rules, “IndxrRules01.doc”, on your hard drive in one of the following folders (the first form is used by Windows XP, the second by Windows Vista and later; your own user name appears in the path):

C:\Documents and Settings\ChrisL\Application Data\Greaves\Indxr

C:\Users\ChrisC\AppData\Roaming\Greaves\Indxr

Rule | Active | Code       | Negate | P1                         | P2 | Comment
1    |        | IN         |        | 4                          | 24 | Ignore long and short strings
3    |        | StartsWith |        | ABCDEFGHIJKLMNOPQRSTUVWXYZ |    | Initial capital
6    |        | InFile     | Y      | Noises1.txt                |    |
7    |        | InFile     | Y      | Months1.txt                |    |
8    |        | InFile     | Y      | Places1.txt                |    |
9    |        | Contains   | Y      | 0123456789                 |    | Part numbers

We will describe the rules in greater detail later on.

For the time being, inspect the table reproduced above and note that a word will be declared “interesting” (and hence be a candidate for the Index, Keywords or Title) if it

is between 4 and 24 characters in length,

starts with an upper-case letter,

does NOT contain a digit, and

does NOT appear in any of the text files “Noises1.txt”, “Months1.txt” or “Places1.txt”.

Those are fairly elementary rules, but they will suffice for now.
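The four conditions above can be written as a single test. The following is only a sketch in Python, not Indxr's own code: the function name is_interesting and the three small word sets are hypothetical stand-ins for the contents of “Noises1.txt”, “Months1.txt” and “Places1.txt”, and exact, case-sensitive matching against those lists is an assumption.

```python
import string

# Hypothetical stand-ins for the three text files; the real files that
# ship with Indxr are not reproduced here.
NOISES = {"There", "These", "Which"}
MONTHS = {"January", "February", "March"}
PLACES = {"Toronto", "Mississauga"}


def is_interesting(word: str) -> bool:
    """Apply the four default conditions to one word."""
    if not 4 <= len(word) <= 24:               # rule 1: length 4 to 24
        return False
    if word[0] not in string.ascii_uppercase:  # rule 3: initial capital
        return False
    if any(ch in string.digits for ch in word):  # rule 9: no digits
        return False
    if word in NOISES or word in MONTHS or word in PLACES:  # rules 6 to 8
        return False
    return True
```

With these sample lists, "Indexer" passes all four conditions, while "index" (no initial capital), "Part42" (contains a digit) and "Toronto" (listed as a place) are rejected.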

You can maintain as many tables of rules as you want, and switch between them – a table of rules for medical documents, a table of rules for legal documents, and so on.

The Rules table is interpreted serially from top to bottom.

A string remains a candidate until it fails a rule. That is, the results of each row are "AND"ed.

Disable

A rule can be disabled during experimental runs by placing any character in its Active cell.

Negate

A rule can be negated by setting its Negate cell to the value "Y".
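To show how Active and Negate combine with the top-to-bottom, "AND"ed evaluation, here is a minimal sketch in Python. It is not Indxr's implementation: the Row class, the remains_candidate function and the EXAMPLE_ROWS list are hypothetical names, and treating only the exact value "Y" as a negation is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Row:
    """One row of a rules table, reduced to what this sketch needs."""
    test: Callable[[str], bool]  # the check named in the Code cell
    active: str = ""             # any character here disables the row
    negate: str = ""             # "Y" inverts the result of the check


def remains_candidate(word: str, rows: List[Row]) -> bool:
    """Run the rows from top to bottom; the word must pass every active row."""
    for row in rows:
        if row.active:           # a disabled row is skipped
            continue
        passed = row.test(word)
        if row.negate == "Y":
            passed = not passed
        if not passed:
            return False         # the first failure ends the run (logical AND)
    return True


# Three sample rows: a length check, a suspended initial-capital check,
# and a negated digit check.
EXAMPLE_ROWS = [
    Row(test=lambda w: 4 <= len(w) <= 24),
    Row(test=lambda w: w[:1].isupper(), active="N"),
    Row(test=lambda w: any(c.isdigit() for c in w), negate="Y"),
]
```

Because the second row is suspended, remains_candidate("index", EXAMPLE_ROWS) is true even though "index" has no initial capital; "part42" still fails on the negated digit check.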

IN

If the length of the character string lies at or between the lower ("P1") and upper ("P2") limits, the string remains a candidate. In the default table, strings of 4, 5, 6, … 23 or 24 characters remain candidates for Interesting Words.
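As a sketch of the IN test in Python (the function name rule_in is hypothetical; the inclusive limits follow the description above):

```python
def rule_in(word: str, p1: int, p2: int) -> bool:
    """IN: keep the word if its length lies at or between p1 and p2."""
    return p1 <= len(word) <= p2
```

With the default limits of 4 and 24, a four-character word passes and a three-character word does not.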

StartsWith

If the character string starts with any of the characters in the given string, the string remains a candidate.

EndsWith

If the character string ends with any of the characters in the given string, the string remains a candidate.
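StartsWith and EndsWith differ only in which end of the string they inspect. A minimal Python sketch, with hypothetical function names and the assumption that an empty string never passes:

```python
def starts_with(word: str, chars: str) -> bool:
    """StartsWith: the first character must be one of chars."""
    return bool(word) and word[0] in chars


def ends_with(word: str, chars: str) -> bool:
    """EndsWith: the last character must be one of chars."""
    return bool(word) and word[-1] in chars
```

For example, starts_with("Toronto", "ABCDEFGHIJKLMNOPQRSTUVWXYZ") is true, which is the test used by rule 3 of the default table.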

InFile

If the character string matches any of the strings in the named file, the string remains a candidate.
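A possible reading of InFile, sketched in Python. The layout of files such as “Noises1.txt” is not described on this page, so one entry per line, UTF-8 encoding and exact case-sensitive matching are all assumptions, as are the function names.

```python
from pathlib import Path
from typing import Set


def load_word_list(path: Path) -> Set[str]:
    """Read one entry per line, ignoring blank lines and surrounding spaces."""
    text = path.read_text(encoding="utf-8")
    return {line.strip() for line in text.splitlines() if line.strip()}


def in_file(word: str, entries: Set[str]) -> bool:
    """InFile: the word must match one of the entries exactly."""
    return word in entries
```

In the default table the three InFile rows are negated, so a word found in any of the files stops being a candidate.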

Squeeze

Squeezes all characters in the given string ("P1") from the character string.

Replace

Replaces all occurrences of the given string ("P1") in the character string with the string given as P2.
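Unlike the other codes, Squeeze and Replace change the string instead of testing it. The Python sketch below uses hypothetical function names. Reading Squeeze as "remove every character listed in P1" is an assumption, since the description above could also be read differently, and leaving the string untouched when P1 is empty is likewise an assumption.

```python
def squeeze(word: str, p1: str) -> str:
    """Squeeze (as assumed here): drop every character that appears in p1."""
    return "".join(ch for ch in word if ch not in p1)


def replace(word: str, p1: str, p2: str) -> str:
    """Replace: swap every occurrence of the substring p1 for p2."""
    if not p1:  # an empty search string is treated as "nothing to do"
        return word
    return word.replace(p1, p2)
```

Under these assumptions, squeeze("part-no.42", "-.") gives "partno42" and replace("co-operate", "-", "") gives "cooperate".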

Contains

If the character string contains any of the characters in the given string, the string remains a candidate.

Only

If the character string consists only of the characters in the given string, the string remains a candidate.
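Contains asks whether any character of the string is in the given set; Only asks whether every character is. A short Python sketch with hypothetical function names, assuming an empty string fails both tests:

```python
def contains(word: str, chars: str) -> bool:
    """Contains: at least one character of word appears in chars."""
    return any(ch in chars for ch in word)


def only(word: str, chars: str) -> bool:
    """Only: word is non-empty and every character appears in chars."""
    return bool(word) and all(ch in chars for ch in word)
```

Rule 9 of the default table is the negated form of contains(word, "0123456789"), which is how strings that look like part numbers are rejected.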

Mask

If the character string matches the mask given by the given string, the string remains a candidate.

Syllable

If the count of syllables in the character string exceeds the value given in the first parameter ("P1"), the string remains a candidate.
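This page does not say how Indxr counts syllables. Purely as an illustration, the Python sketch below counts runs of vowels (including "y"), a rough heuristic that miscounts many English words; both function names and the counting method are assumptions.

```python
import re


def count_syllables(word: str) -> int:
    """Rough estimate: each run of consecutive vowels counts as one syllable."""
    return len(re.findall(r"[aeiouy]+", word.lower()))


def syllable(word: str, p1: int) -> bool:
    """Syllable: the estimated count must exceed p1."""
    return count_syllables(word) > p1
```

With this heuristic "index" counts as two syllables, so it passes with P1 set to 1 and fails with P1 set to 2.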

The Default Table

This has been shown above.

If you are tweaking the table and make a disastrous mistake, just delete the document from the hard drive; Indxr will automatically create a new table on the next run.

Indxr will use the first table found in the specified document, so you can park spare copies of the rules table as the second, third and later tables in any rules table document.

If you would like to experiment with variations on the rules table, you can make a copy, adjust the rules, and then tell Indxr to use your copy.

Here's how you do it:

(1) Locate your Indxr settings folder. It will have a name like "C:\Users\ChrisC\AppData\Roaming\Greaves\Indxr\", although your user name will appear in place of "ChrisC".

(2) Copy "IndxrRules01.doc" and paste it with a new name, for example "IndxrRules02.doc".

(3) As a simple experiment, place a character (for example "N") in the Active cell of the StartsWith row of your copy. This suspends that rule. (Reinstate it by removing the character from the cell.)

Rule | Active | Code       | Negate | P1                         | P2 | Comment
1    |        | IN         |        | 4                          | 24 | Ignore long and short strings
3    | N      | StartsWith |        | ABCDEFGHIJKLMNOPQRSTUVWXYZ |    | Initial capital
6    |        | InFile     | Y      | Noises1.txt                |    |
7    |        | InFile     | Y      | Months1.txt                |    |
8    |        | InFile     | Y      | Places1.txt                |    |
9    |        | Contains   | Y      | 0123456789                 |    | Part numbers

(4) Save this updated file.

(5) Run the macro GUIEnvironment and select your new rules table from the Rules table drop-down list.

(6) Choose OK.

Your new rules table should take effect the next time you index a document.
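Steps (1) and (2) can also be done from a script instead of by hand. The Python sketch below is a convenience under stated assumptions, not part of Indxr: the function name copy_rules_table is hypothetical, and refusing to overwrite an existing copy is a design choice made here to protect earlier experiments.

```python
import shutil
from pathlib import Path


def copy_rules_table(folder: Path,
                     source: str = "IndxrRules01.doc",
                     target: str = "IndxrRules02.doc") -> Path:
    """Copy the rules document inside folder and return the path of the copy."""
    src = folder / source
    dst = folder / target
    if dst.exists():
        # Do not silently overwrite a table you may already have edited.
        raise FileExistsError(f"{dst} already exists")
    shutil.copyfile(src, dst)
    return dst
```

On Windows the folder argument would typically be the "Greaves\Indxr" folder under the location named by the APPDATA environment variable, which corresponds to the paths shown near the top of this page. Steps (3) to (6) still have to be carried out in Word.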

Rule | Active | Code       | Negate | P1                         | P2 | Comment
1    |        | IN         |        | 4                          | 24 | Ignore long and short strings
3    |        | StartsWith | Y      | ABCDEFGHIJKLMNOPQRSTUVWXYZ |    | Initial capital
6    |        | InFile     | Y      | Noises1.txt                |    |
7    |        | InFile     | Y      | Months1.txt                |    |
8    |        | InFile     | Y      | Places1.txt                |    |
9    |        | Contains   | Y      | 0123456789                 |    | Part numbers

For a further experiment, the table above shows the result of these steps:

(1) Create a third rules table, "IndxrRules03.doc".

(2) Reinstate the StartsWith rule by clearing its Active cell, then negate it by placing "Y" in the Negate cell.


416-621-9348 CGreaves@ChrisGreaves.com

Toronto and Mississauga, Thursday, February 28, 2013 6:03 PM

Copyright © 1996-2013 Chris Greaves. All Rights Reserved.