Save dictionary¶
VGram builders allow save dictionary to file. It’s a good way to work with v-grams because dictionary is built for a long time.
Save dictionary by save method and load by static methods load[ClassName].
Format¶
Dictionary saved as json-formatted file:
{
"alphabet": [
{
"freq": 1188,
"text": "fromthe",
"vec": [
0, 1, 2, 3, 15, 8, 6
]
},
..
],
"coder": [0, 1, 2, 3, 18, 12, ..],
"size": 1000,
"min_prob": 3.7657904299967802e-06,
"fitted": true,
"freqs_computed": true,
}
Field text is not nessesary and provided only when you work with tests. It contains the text of v-gram in alphabet items.
In Int-versions the text field will not be in the file.
After v-grams construction, you can analyze the resulting dictionary.
alphabet is a list of v-gram objectsfreq is a frequency of v-gram occurrence in data.vec is a vector of language alphabet symbols for v-gram presentation.coder is a sequence of symbols as they occur in the data.size is a size of a dictionarymin_prob, fitted and freqs_computed are inner model information.fitted and freqs_computed provided only for (Int)VGram class.