How can I automatically identify tags (keywords) in a given text?

It should behave like the Delicious toolbar for Firefox, which lists suggested tags that you can click to apply.

The code should be able to find keywords for a given text. Is there a good algorithm or open source project you can recommend?

I found this post, but it is a bit too general for my specific need.

Katabatic answered 3/2, 2011 at 3:45 Comment(3)
OK, I think you need to be a little more specific than just "it lists possible tags to click". Delicious is a bookmarking service, and the question you referenced is about data mining/text mining. What are you trying to achieve? Jaala
Edited. I hope the question is clearer now. Katabatic
I thought that these tags were sampled from the tags that other people have already used when bookmarking this URL. Levey

I think you're looking for one of these answers,

In a nutshell, you're looking to extract unigrams from the text that somehow represent the concepts within it. One technique for doing this is called Pointwise Mutual Information (PMI), which is illustrated with an example in the first two links. The Python NLTK framework, which already has a number of these algorithms built in, might be your best starting point.
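
To make the PMI idea concrete, here is a minimal sketch using only the Python standard library. It is one possible reading of the approach, not the method from the linked answers: it assumes you have a handful of background documents to compare against, and it scores each word w in a document d as log2(P(w | d) / P(w)). The names `tokenize`, `suggest_tags`, `background_docs`, `top_k` and `min_count` are made up for illustration. NLTK's collocations module provides PMI scoring for word pairs if you would rather not roll your own.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def suggest_tags(document, background_docs, top_k=5, min_count=2):
    """Rank the unigrams of `document` by PMI(word, document).

    PMI(w, d) = log2(P(w | d) / P(w)), where P(w) is estimated over the
    document plus the background documents (an assumption of this sketch).
    """
    doc_counts = Counter(tokenize(document))
    corpus_counts = Counter(doc_counts)
    for other in background_docs:
        corpus_counts.update(tokenize(other))

    doc_total = sum(doc_counts.values())
    corpus_total = sum(corpus_counts.values())

    scores = {}
    for word, count in doc_counts.items():
        if count < min_count:
            continue  # words seen only once give unreliable PMI estimates
        p_word_given_doc = count / doc_total
        p_word = corpus_counts[word] / corpus_total
        scores[word] = math.log2(p_word_given_doc / p_word)

    # Highest PMI first; ties fall back to document frequency, then the word
    ranked = sorted(scores, key=lambda w: (-scores[w], -doc_counts[w], w))
    return ranked[:top_k]


if __name__ == "__main__":
    document = (
        "Python makes text mining easy. "
        "Text mining with Python starts with tokenizing the text."
    )
    background = [
        "The weather is nice and the sky is clear.",
        "The train with the red doors leaves at noon.",
    ]
    print(suggest_tags(document, background, top_k=3))
```

Words that are frequent in this document but rare elsewhere float to the top, while common words such as "the" or "with" score low or are filtered out by `min_count`. A real tagger would want a much larger background corpus and probably a stop-word list.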

Good luck!

Broadbrim answered 24/6, 2011 at 2:10 Comment(2)
This is by far the most helpful answer for this question. Thanks.Katabatic
Thanks! If you found the answer useful, you might want to accept it. Broadbrim
