L2-normalization (scaling to unit euclidean length): the norm of each vector in the vector space will be normalized to 1. It is necessary for any linear operation of word vectors.
R code:
Vector:
vec / sqrt(sum(vec^2))
Matrix:
mat / sqrt(rowSums(mat^2))
Arguments
- x
A
wordvec
(data.table) orembed
(matrix), seedata_wordvec_load
.
Download
Download pre-trained word vectors data (.RData
):
https://psychbruce.github.io/WordVector_RData.pdf