My current understanding is that it's possible to extract entities from a text document using toolkits such as OpenNLP, Stanford NLP.
However, is there a way to find relationships between these entities?
For example consider the following text :
"As some of you may know, I spent last week at CERN, the European high-energy physics laboratory where the famous Higgs boson was discovered last July. Every time I go to CERN I feel a deep sense of reverence. Apart from quick visits over the years, I was there for three months in the late 1990s as a visiting scientist, doing work on early Universe physics, trying to figure out how to connect the Universe we see today with what may have happened in its infancy."
Entities: I (author), CERN, Higgs boson
Relationships : - I "visited" CERN - CERN "discovered" Higgs boson
Thanks.
entity linking
,anaphora resolution
– Cripplingentity linking
you have to check outName Entity Recognition
nltk.googlecode.com/svn/trunk/doc/book/ch07.html to know that something is an entity. Then linking them would allow you to know which entity are the same. Then you need to find something else that does relationship linking. Possiblyslot-filling
. – CripplingNER
thenentity linking
and thenslot-filling
. Actually the whole set of task could be a fullknowledge base population
task. =) Google more, i'm not an expert in this but i did some work previously. – Crippling