You do Arithmetics with Numbers, we do it with Text
Natural Language Processing is a subset of Artificial Intelligence. Our company has got a team of 10 symantic and search engineers working in this area. Our major focus in this area is on Text Analytics. Our knowledge in this area can help in other domains like document management systems, knowledge management systems, SMS processing system and CRM Analytics.
We have experience in parsers, POS taggers and Named Entity recognizers on different open source tools like Stanford NLP, OpenNLP, Chaniak, and Lingpipe. Wordnet has been an excellent tool and quite instrumental in implementing techniques like synonym clubbing. We have also implemented algorithms on Pronomial core reference resolution and word sense disambiguation as well as solved problems using HMM (Hidden Markov Model).
Our basic work has been focussed on analyzing CGM (Consumer Generated Media) like blogs, forums and reviews. Our work involves crawling websites, storing the content in the database, and then automatically clustering and categorizing the content (Weka Toolkit).
We have good knowledge of development frameworks like:-
A General Architecture for Text Engineering.
The Unstructured Information Management Architecture.