Page Comparison

Table of Contents

minLevel	1
maxLevel	6
outline	false
type	flat
printable	false

There are many cases where we have a handful (10 - 2000) past examples of text data and we want to see if new text is close to these saved examples. Machine learning techniques like classification are not appropriate because we don’t have enough data to train an accurate model.

...

Example

Input
table = github_logs

corpus	label	domain
a b c, d e f, g, h, i, j	x	google
aa b, c, d ee, ff, gg, hh, i, jj	y	facebook
k, l, m, n, o, p, q	z	apple

LQL command

Code Block

buildModelFromCorpus(inputTable, "corpusModel", "corpus", ["label", "domain"])
// table = inputTable
// text to train model = corpus
// columns to keep so they will be added after match is found = label and domain
// minDF and minTF are default

Output

RESULT
'Successfully created model and stored into <> file'

Versions Compared

Old Version 1

New Version Current

Key

Example