Chars2vec: character-based language model for handling real world texts with spelling errors and…
▻https://hackernoon.com/chars2vec-character-based-language-model-for-handling-real-world-texts-w
Chars2vec: character-based language model for handling real world texts with spelling errors and human slang

This paper describes our open source character-based language model chars2vec. The model was developed with the Keras library (TensorFlow backend) and is now available for Python 2.7 and 3.0+.

Introduction

Creating and using word embeddings is the mainstream approach for handling most #nlp tasks. Each word is mapped to a numeric vector, which is then used whenever the word appears in a text. Some simple models use one-hot word embeddings, or initialise words with random vectors or with integer numbers. The drawback of such models is obvious – these word vectorisation methods do not represent any semantic connections between words.

There are other language models, called (...)
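
The drawback of one-hot embeddings mentioned in the excerpt can be illustrated with a minimal sketch. It does not use chars2vec or Keras; the toy vocabulary and the helper names `one_hot` and `cosine` are assumptions made purely for illustration. It shows that any two distinct one-hot vectors have a cosine similarity of zero, so a related pair of words looks exactly as dissimilar as an unrelated pair.

```python
import math


def one_hot(index, size):
    """Return a one-hot vector of length `size` with 1.0 at position `index`."""
    vec = [0.0] * size
    vec[index] = 1.0
    return vec


def cosine(u, v):
    """Cosine similarity between two non-zero vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


# Hypothetical toy vocabulary, chosen only for this illustration.
vocab = ["cat", "kitten", "car"]
vectors = {word: one_hot(i, len(vocab)) for i, word in enumerate(vocab)}

# A semantically related pair and an unrelated pair get the same score.
sim_related = cosine(vectors["cat"], vectors["kitten"])
sim_unrelated = cosine(vectors["cat"], vectors["car"])
sim_self = cosine(vectors["cat"], vectors["cat"])

print(sim_related, sim_unrelated, sim_self)
```

Because every one-hot vector is orthogonal to every other, the only similarity such a representation can express is identity, which is the limitation the excerpt points to.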
#machine-learning #artificial-intelligence #deep-learning #neural-networks