Keyword/keyphrase extraction from text [closed]

1 Answer

It looks like you need to go beyond keywords/key phrases and find the subject and object of each sentence. For subject/object recognition, I recommend the Stanford Parser or the Google Cloud Natural Language API, where you send a string and get a dependency tree back.

You can test the Google API first to see if it works well with your corpus: https://cloud.google.com/natural-language/

The outcome here is a subject-predicate-object (SPO) triplet, where the predicate describes the relationship between the subject and the object. You'll need to write a script that traverses the dependency graph and pulls out the triplet.
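As a rough illustration of that traversal, here is a minimal sketch that assumes the parser output has already been converted into a flat list of tokens, each with a dependency label and the index of its head. The `Token` class and `extract_spo` function are hypothetical names made up for this example, and the labels (`nsubj`, `dobj`, `obj`) are assumed to follow the Stanford/Universal Dependencies conventions; a real parser's response will need its own adapter.

```python
from dataclasses import dataclass


@dataclass
class Token:
    text: str
    dep: str   # dependency label, e.g. "nsubj", "dobj", "ROOT" (assumed label set)
    head: int  # index of the head token; the root points at itself


def extract_spo(tokens):
    """Return (subject, predicate, object) triples from one parsed sentence."""
    triples = []
    for tok in tokens:
        # Start from each nominal subject and look up the verb it attaches to.
        if tok.dep not in ("nsubj", "nsubjpass"):
            continue
        verb_idx = tok.head
        # Any direct object hanging off the same verb completes a triple.
        for other in tokens:
            if other.head == verb_idx and other.dep in ("dobj", "obj"):
                triples.append((tok.text, tokens[verb_idx].text, other.text))
    return triples


# Hand-written parse of "Google released TensorFlow" (not real parser output).
sentence = [
    Token("Google", "nsubj", 1),
    Token("released", "ROOT", 1),
    Token("TensorFlow", "dobj", 1),
]
triples = extract_spo(sentence)
print(triples)  # [('Google', 'released', 'TensorFlow')]
```

This only handles simple active clauses with a single-word subject and object. Multi-word noun phrases, passives, and prepositional objects would need extra rules on top of this.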

Other Packages: I use NLTK, spaCy, and TextBlob frequently. If the corpus is simple, generic, and straightforward, spaCy and TextBlob work well out of the box. If the corpus is highly customized, domain-specific, or messy (incorrect spelling or grammar), I'll use NLTK and spend more time customizing my text processing pipeline with scrubbing, lemmatizing, etc. If you go with one of these packages, you may want to add your own custom dictionary of technology-related keywords and keyphrases so that your parser can catch them.

NLTK Tutorial: http://www.nltk.org/book/

spaCy Quickstart: https://spacy.io/usage/

TextBlob Quickstart: http://textblob.readthedocs.io/en/dev/quickstart.html
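To make the custom dictionary idea concrete, here is a small library-independent sketch using only the Python standard library. The phrase list, `scrub`, and `find_keyphrases` are hypothetical names for illustration; in practice you would plug the same dictionary into whichever package you pick, and the scrubbing step would likely be more involved.

```python
import re

# Hypothetical domain dictionary; replace with your own technology terms.
TECH_PHRASES = ["machine learning", "neural network", "TensorFlow", "NLP"]


def scrub(text):
    """Lowercase the text and collapse runs of whitespace into single spaces."""
    return re.sub(r"\s+", " ", text.lower()).strip()


def find_keyphrases(text, phrases=TECH_PHRASES):
    """Return the dictionary phrases that occur in the text as whole words."""
    cleaned = scrub(text)
    found = []
    for phrase in phrases:
        # Word boundaries stop "nlp" from matching inside a longer token.
        pattern = r"\b" + re.escape(phrase.lower()) + r"\b"
        if re.search(pattern, cleaned):
            found.append(phrase)
    return found


sample = "We trained a  Neural   Network with TensorFlow for an nlp task."
print(find_keyphrases(sample))  # ['neural network', 'TensorFlow', 'NLP']
```

Exact matching like this will miss misspellings and inflected forms, which is where the lemmatizing and cleanup steps mentioned above come in.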

answered Nov 15 '22 08:11

saucy wombat

Tags:

machine-learning

text-extraction

nlp

text-mining

jnlp

Surbhi Singh
