Word Sense Disambiguation (WSD)Series-1

3 min readSep 25, 2020

Natural Language Problems & Solutions …

There are words in Natural languages which have different meaning for different context but they are spelled same. Those words are called polysemous words. Word sense disambiguation (WSD) is the solution to the problem.

Word Sense Disambiguation is a task of finding the correct sense of the words and automatically assigning its correct sense to the words which are polysemous in a particular context.

Word Sense Disambiguation (WSD) is an important but challenging technique in the area of natural language processing (NLP).

Human beings are blessed with the learning power. They can easily find out what is the correct meaning of a word in a context. But for computer it is a difficult task. So, we need to develop an automatic system which can perform like humans do i.e. the system which can find out the correct meaning of the word in particular context and automatically assign the optimal sense to the target word.

conceptual Model for word sense disambiguation

Common approaches to solve WSD

Now we will see various approaches used for WSD and classifies existing WSD algorithms according to their techniques.

Machine learning techniques

A classifier is used to learn features and assign senses to unseen examples

In this approach initial input is the word to be disambiguated called target word and the text in which is embedded called as context.

In this approach features are themselves served by the words

Supervised Techniques

It uses ML techniques for inducing a classifier from manually sense-annotated data sets. Usually, the classifier is concerned with a single word and performs a classification task in order to assign the appropriate sense to each instance of that word

Examples of supervised techniques are decision lists, Decision Tree, Naïve Bayes , Nural Networks

Un-Supervised Techniques

It don’t need hand labeled knowledge of sense information in large scale resources for the disambiguation

Examples are context clustering and co-occurrence graphs

semi-supervised techniques

Here, the information is present like in supervised but might be less information is given.Here only critic information is available,not the exact information.

For example, School-Uniform, School-shoes are the words with semantic relationship. In this approach improper word senses are omitted and only those senses are selected which have harmony with common sense rules. The basic idea behind this approach is to count how many times this kind of word pair occurs in the corpus with syntactic relation. From this count, senses of words will be identified. There are other methods, which can find this kind of relation among words using conditional probability.

Dictionary Based approaches

Knowledge based approach based on knowledge resources of machine readable dictionaries in form of corpus, WorldNet etc

Overlap Based Approaches

This approach calls for the requirement of machine readable dictionary (MDR). It includes determination of the different fea-tures of the senses of words which are ambiguous along with features of the words in the context.

Conclusion

Different approaches available in word sense disambiguation with primarily focusing on the Machine Learning Approaches and Dictionary based approaches,knowledge based. with different experiments I observed supervised approach is found to perform better but one of its dis-advantage is the requirement of a large corpora with-out which training is impossible which can be over-came in unsupervised approach as it does not rely on any such large scale resource for the disambiguation.

Knowledge based approach on the other hand makes use of knowledge sources to decide upon the senses of words in a particular context provided machine readable knowledge base is available to apply.

will meet with next blog with solution…

Thanks for the reading and Besides, leave a few claps if you found this text helpful!!!

Word Sense Disambiguation (WSD)Series-1

Common approaches to solve WSD

Written by Asha Ganesh

No responses yet