32 Grenville Street M4Y 1A3 CGreaves@ChrisGreaves.com
The Interesting Words Engine
The Interesting Words engine is at the core of the Indxr tool.
Just as you could unbolt a car engine and mount that engine in a boat, or on a grain-auger, so too the Interesting Words engine can be used to power many different applications.
The Interesting Words engine contains the program code necessary to detect Interesting Words in any chunk of text, (or "range" as VBA programmers are wont to say!).
You can extract the Interesting Words from the first five paragraphs of your web page and use those Interesting Words as keywords in your HTML web page.
You can locate the sentence that scores the highest number of UNIQUE Interesting Words and use that sentence as the Title in your HTML web page.
See also Case Studies for more example.
The Interesting Words engine is Rules-Based , and so it operates according to the specifications laid down in a simple table in a Word document – a set of rules that you are encouraged to tweak to optimize your application.
To get a concept of how it might wok, inspect the index in the back of any text book and ask yourself "What makes THESE words interesting?".
In many cases they will start with a capital letter.
They will be longer than 3 characters.
They won't contain digits of weird punctuation symbols, although they may contain a hyphen.
And so on.
A simple set of rules can suffice to build an algorithm that is pretty good at filtering out UN-Interesting Words, and by definition you will be left with only Interesting Words!
Rules-based applications are computer programs – applications – that run according to a set of rules; specifically, rules that can be changed by you – the end-user.
Indexer is such an application; you can change the contents of the Rules Table and thus change the way that Indxr detects Interesting Words.
I like rules-based applications because they are versatile; I can develop a new process for a client without writing a new program or customizing an existing program.
My clients can operate independently from me, which is good for their peace of mind.
Some of my other publicly-available rules-based applications include Précis, Weather-vain, and the Document Cleaners.
I have Excel tools, such as automatic formatting and automatic style generation tools that are rules-driven.
My clients enjoy document generators that are rules-based and, in most cases, table-driven.
Toronto and Mississauga, Thursday, February 28, 2013 6:02 PM
Copyright © 1996-2013 Chris Greaves. All Rights Reserved.