how to automatically detect acronym meaning / extension

1 Answers

Reading your question and the comments I understand that you want to create a mapping from an acronym to its extension.

Assuming you have a collection of textual documents where both the acronym and its expansion occur you can apply an algorithm to extract (acronym,extension) pairs.

A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text by A.S Schwartz and M.A. Hearst, does exactly this by looking at patterns. The Java implementation is available here.

I applied this algorithm to the English Wikipedia, you can see the results here. I also applied it to a collection of Portuguese new articles, results are here.

169

answered Oct 09 '22 11:10

David Batista

Related questions
                            
                                Understanding Word2Vec's Skip-Gram Structure and Output
                            
                                Result Difference in Stanford NER tagger NLTK (python) vs JAVA
                            
                                Intuition behind tf-idf for term extraction
                            
                                Extract grocery list out of free text
                            
                                What exactly are WordNet lexicographer files? Understanding how WordNet works
                            
                                Fuzzy matching a word inside a pyspark dataframe string
                            
                                ValueError: operands could not be broadcast together with shapes in Naive bayes classifier
                            
                                How to recognize entities in text that is the output of optical character recognition (OCR)?
                            
                                What are the inputs to the transformer encoder and decoder in BERT?
                            
                                Can't find model 'en_core_web_md'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory
                            
                                Document Layout Analysis for text extraction
                            
                                Extracting nouns from Noun Phase in NLP
                            
                                How can I tweak Levenshtein distance in classifying linguistically similar words (e.g. verb tenses, adjective comparisons, singular and plural)
                            
                                C++ Sentiment Analysis Library [closed]
                            
                                Intelligent spell checking
                            
                                Interesting NLP/machine-learning style project -- analyzing privacy policies
                            
                                How google recognises 2 words without spaces?
                            
                                Counting with scipy.sparse
                            
                                How do I use the book functions (e.g. concoordance) in NLTK?
                            
                                What does the dependency-parse output of TurboParser mean?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to automatically detect acronym meaning / extension

Tags:

nlp

acronym

information-extraction

Thorsten Niehues

People also ask

1 Answers

David Batista

Recent Activity

Donate For Us