Rules for Phrase Selection

The selection of which phrases appear in a glossary depends on the value of the Minimum Word Frequency given in the Compilation Rules in two ways:

1. First, it ensures that every word retained in a glossary phrase occurs at least as often as the Minimum Word Frequency. For example, if you select a value of 8, every word of a glossary phrase will have a frequency of at least 8.

2. Second, the Minimum Word Frequency is also used as a minimum phrase frequency:

  • All phrases that have at least this frequency are retained.

  • In addition, phrases whose frequency is at least half the Minimum Word Frequency are retained, unless their average number of letters per word is less than 5.

Note on frequencies:

Assume that the phrase "without further notice" appears 6 times. With a Minimum Word Frequency of 5, the phrase will be retained because its frequency (6) is at least 5.

If we set a Minimum Word Frequency of 8, the phrase will still be retained, because its frequency of 6 is at least half that value (4) and its average number of letters per word (6.7) is not less than 5. On the other hand, the phrase will not be retained if we set a Minimum Word Frequency of 15, because its frequency of 6 is below half that value (7.5).
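
The phrase-frequency rule and the worked example above can be sketched in a few lines of Python. This is only an illustration of the rule as described here, not the program's actual implementation: the function names are hypothetical, words are assumed to be separated by spaces, and the word-level check (rule 1) is not modelled.

```python
def average_letters_per_word(phrase: str) -> float:
    """Mean word length of a phrase; words are assumed to be space-separated."""
    words = phrase.split()
    if not words:
        raise ValueError("phrase must contain at least one word")
    return sum(len(word) for word in words) / len(words)


def is_phrase_retained(phrase: str, phrase_frequency: int,
                       minimum_word_frequency: int) -> bool:
    """Apply the phrase-frequency rule (hypothetical helper, rule 2 only)."""
    # Phrases that reach the full threshold are always retained.
    if phrase_frequency >= minimum_word_frequency:
        return True
    # Phrases that reach half the threshold are retained unless
    # their words average fewer than 5 letters.
    if phrase_frequency >= minimum_word_frequency / 2:
        return average_letters_per_word(phrase) >= 5
    return False


# Worked example: "without further notice" appears 6 times.
for threshold in (5, 8, 15):
    print(threshold, is_phrase_retained("without further notice", 6, threshold))
# prints:
# 5 True
# 8 True
# 15 False
```

With a threshold of 8, a phrase made of short words such as "in the end" (average of about 2.7 letters per word) would be dropped at the same frequency of 6, which is the case the letter-count condition is meant to filter out.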