Named entities as a feature in text categorization?

With existing supervised text categorization techniques, why don't we consider named entities (NEs) in the text as features in training and testing? Do you think we can improve precision by using NEs as features?

477

asked Apr 09 '12 18:04

samsamara

1 Answer

It depends a lot on the domain you are working in, because the features have to be defined for that domain. Say you are working on a learning-to-rank problem in a search engine, generating a dynamic rank: NEs won't give you any benefit there. It also depends largely on the output category labels you have defined for the supervised learner.

Now say you are classifying documents as Soccer, Movie, Politics and so on. In this case named entities can work. Here is an example: say you are using a neural network that categorizes documents into Soccer, Movie, Politics, etc., and this document comes in: "Lionel Messi was invited to attend the premiere of 'The Social Network'; also present were the cast and crew, including Jesse Eisenberg, Andrew Garfield and Justin Timberlake." Here the connection between the named entities (input features) and Movie (the output label) will be stronger, so the document will be classified as Movie.
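To make "named entities as input features" concrete, here is a minimal sketch in Python. It assumes a tiny hand-written lookup table (`GAZETTEER`, hypothetical) in place of a real NER tagger, and the entity type names (`ACTOR`, `FOOTBALLER`, `FILM`) are made up for illustration. It only shows one way the entity types could sit next to ordinary bag-of-words counts in the feature vector.

```python
import re
from collections import Counter

# Hypothetical toy gazetteer standing in for a real NER system.
# Both the entries and the type names are assumptions for this sketch.
GAZETTEER = {
    "lionel messi": "FOOTBALLER",
    "jesse eisenberg": "ACTOR",
    "andrew garfield": "ACTOR",
    "justin timberlake": "ACTOR",
    "the social network": "FILM",
}

def extract_features(text):
    """Return bag-of-words counts plus one count per entity type found."""
    lowered = text.lower()
    # Ordinary word features.
    features = Counter("word=" + w for w in re.findall(r"[a-z]+", lowered))
    # Entity-type features: count whole-phrase matches from the gazetteer.
    for name, ne_type in GAZETTEER.items():
        hits = len(re.findall(r"\b" + re.escape(name) + r"\b", lowered))
        if hits:
            features["ne_type=" + ne_type] += hits
    return features

doc = ("Lionel Messi was invited to attend the premiere of 'The Social Network'; "
       "also present were the cast and crew, including Jesse Eisenberg, "
       "Andrew Garfield and Justin Timberlake.")
feats = extract_features(doc)
print({k: v for k, v in feats.items() if k.startswith("ne_type=")})
```

The resulting dictionary can be fed to whatever classifier you already use. Swapping the lookup table for a real tagger would only change how the entity types are found, not how they are used as features.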

Another example: say our document is "Tom Cruise is portraying the character of Lionel Messi in the movie 'The last soccer game'." Here is the benefit: say your neural network has learnt that when an actor and a footballer appear together in one document, there is a high probability of it being about a movie. Again, this depends on the data and the training, and it may be the other way round too (but that is what learning is all about: seeing past data).
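The "actor and footballer together" idea can be sketched as pair features over the entity types found in a document. The code below is only an illustration under stated assumptions: it uses a simple multi-class perceptron rather than the multi-layer neural network mentioned above, the five training examples are invented, and the entity types are assumed to have been extracted already by some NER step.

```python
from itertools import combinations

LABELS = ["Movie", "Politics", "Soccer"]

def entity_features(entity_types):
    """One feature per entity type, plus one per pair of co-occurring types."""
    unique = sorted(set(entity_types))
    feats = {"ne_type=" + t for t in unique}
    feats |= {"ne_pair=" + a + "+" + b for a, b in combinations(unique, 2)}
    return feats

def predict(weights, feats):
    """Pick the label with the highest summed weight (ties go to the first label)."""
    return max(LABELS, key=lambda label: sum(weights[label].get(f, 0.0) for f in feats))

def train(examples, epochs=5):
    """Plain perceptron updates: reward the gold label, penalise the wrong guess."""
    weights = {label: {} for label in LABELS}
    for _ in range(epochs):
        for entity_types, gold in examples:
            feats = entity_features(entity_types)
            guess = predict(weights, feats)
            if guess != gold:
                for f in feats:
                    weights[gold][f] = weights[gold].get(f, 0.0) + 1.0
                    weights[guess][f] = weights[guess].get(f, 0.0) - 1.0
    return weights

# Invented training data: (entity types seen in the document, label).
TRAINING = [
    (["FOOTBALLER", "CLUB"], "Soccer"),
    (["ACTOR", "FILM"], "Movie"),
    (["FOOTBALLER"], "Soccer"),
    (["ACTOR", "FOOTBALLER"], "Movie"),
    (["POLITICIAN"], "Politics"),
]

weights = train(TRAINING)
# Entity types for the "Tom Cruise ... Lionel Messi ... The last soccer game" document.
prediction = predict(weights, entity_features(["ACTOR", "FOOTBALLER", "FILM"]))
print(prediction)
```

With this toy data the last document comes out as Movie, because the `ne_pair=ACTOR+FOOTBALLER` feature picked up a positive weight for that label during training. With different training data the same pair could just as well point to Soccer, which is the "other way round" case.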

So my answer would be: try it out. Nobody is stopping you from using named entities as features. It might help in the domain you are working in.

151

answered Nov 08 '22 02:11

Yavar

Tags:

text

machine-learning

classification

named-entity-recognition
