As part of transcribing recordings, Conversation Analyzer categorizes the textual contents of the transcript, by identifying key phrases based on the defined rules, and recording the subcategory or category those rules belong to. A category is a collection of subcategories, which in turn contain a series of rules. Each rule consists of a word or phrase and the party who said that word or phrase. If the transcript contains the word or phrase and was spoken by the specified party, Conversation Analyzer matches it against the category.
...
Valid Expression and Find field values contain only alphanumeric, apostrophe and space characters; that is, values can contain spaces (U+0020), apostrophes (U+0027), and characters from the following Unicode categories:
Unicode Category Name | Description |
---|---|
Ll | Letter, Lowercase. For example, a-z, ᵯ, ḅ, ṥ, ở, ﬓ |
Lu | Letter, Uppercase. For example, A-Z, Ý, Ŧ, Ǣ, Щ, 𝕐 |
Lt | Letter, Titlecase. For example, Dž, ᾎ, ᾟ, ᾭ |
Lo | Letter, Other (e.g. ª, ܗ, 爨) The Mongolian Letter "Manchu Ali Gali Lha" (U+18AA,) is not allowed within expression and find values. This character is used internally within the categorisation engine. If the character appears within spoken text, Conversation Analyzer treats the character as an apostrophe. |
Lm | Letter, Modifier. For example, ʰ, ᵓ, 〲, ꟹ |
Mn | Mark, Nonspacing. For example, ុ, ᜴ |
Nd | Number, Decimal Digit. For example, 0-9, ۳, ૮, ๗ |
Pc | Punctuation, Connector. For example, _, ‿, ⁀, ⁔, ︳, ︴, ﹍, ﹎, ﹏, _ This category includes ten characters; the most commonly used is the LOW LINE character (_), u+005F. |
...
The ??
in the expression represents two characters must appear after "wh" and before "cat" in matching text.
Text | Does it match? | Why |
---|---|---|
what cat | Yes | The |
when cat | Yes | The ?? in the expression represents the "en" in the text. |
who cat | No | The ?? in the expression requires two characters after "wh" not one. |
which cat | No | The |
Example 6. Expression using the *
word wildcard
Expression: the cat sits * on the mat
The text must contain the phrase "the cat sits on the mat" with zero to many words between "sits" and "on".
Text | Does it match? | Why |
---|---|---|
the cat sits on the mat | Yes | The |
the cat sits happily on the mat | Yes | The |
the cat always sits on the mat | No | The |
Example 7. Expression using the Words between
field field
Expression: cat mat
Words between: 3
...
Text | Does it match? | Why |
---|---|---|
the cat eagerly sat on the mat | Yes | The text contains one word between "cat" and "sat", and two words between "sat" and "mat"; the expression allows up to three. |
the cat eagerly and promptly sat on the green mat | Yes | The text contains three words between "cat" and "sat", and three words between "sat" and "mat"; the expression allows up to three. |
the cat sat on the green and blue mat | No | The text contains too many words (five) between "sat" and "mat". |
Example 9. Expression using the Words between
field and the * word wildcard
Expression: cat * sat mat
...
between |
...
Even when used with a Words between value in an expression, a *
word wildcard can represent any number of words. In this example, matching text can contain any number of words between "cat" and "sat", but a maximum of two words between "sat" and "mat".
Text | Does it match? | Why |
---|---|---|
the cat sat on the mat | Yes | The text contains no words between "cat" and "sat", and two words between "sat" and "mat". |
the cat waited calmly whilst the mouse ran around and then sat on the mat | Yes | The text contains nine words between "cat" and "sat", and two words between "sat" and "mat". |
the cat always sat on the green mat | No | The text contains too many words (three) between"sat" and "mat". |
Panel | ||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ||||||||||||||
|
...