Retrieve definition for parenthesized abbreviation, based on letter count

Tags:

I need to retrieve the definition of an acronym based on the number of letters enclosed in parentheses. For the data I'm dealing with, the number of letters in parentheses corresponds to the number of words to retrieve. I know this isn't a reliable method for getting abbreviations, but in my case it will be. For example:

String = 'Although family health history (FHH) is commonly accepted as an important risk factor for common, chronic diseases, it is rarely considered by a nurse practitioner (NP).'

Desired output: family health history (FHH), nurse practitioner (NP)

I know how to extract parentheses from a string, but after that I am stuck. Any help is appreciated.

 import re

 a = 'Although family health history (FHH) is commonly accepted as an 
 important risk factor for common, chronic diseases, it is rarely considered 
 by a nurse practitioner (NP).'

 x2 = re.findall('(\(.*?\))', a)

 for x in x2:
    length = len(x)
    print(x, length)

520

asked Jun 02 '19 02:06

tenebris silentio

1 Answers

Use the regex match to find the position of the start of the match. Then use python string indexing to get the substring leading up to the start of the match. Split the substring by words, and get the last n words. Where n is the length of the abbreviation.

import re
s = 'Although family health history (FHH) is commonly accepted as an important risk factor for common, chronic diseases, it is rarely considered by a nurse practitioner (NP).'


for match in re.finditer(r"\((.*?)\)", s):
    start_index = match.start()
    abbr = match.group(1)
    size = len(abbr)
    words = s[:start_index].split()[-size:]
    definition = " ".join(words)

    print(abbr, definition)

This prints:

FHH family health history
NP nurse practitioner

answered Nov 07 '22 07:11

Keatinge

Related questions
                            
                                Change Windows 10 background in Python 3
                            
                                How do I implement a PyTorch Dataset for use with AWS SageMaker?
                            
                                Plotting a map using geopandas and matplotlib
                            
                                How to compose a list with conditional elements
                            
                                Python find duplicates which occur more than 3 times
                            
                                How to change marker size/scale in legend when marker is set to pixel
                            
                                Pandas dataframe groupby and sort
                            
                                What does ksize and k mean in cornerHarris?
                            
                                How to fix inconsistent return statement in python?
                            
                                Given a start color and a middle color, how to get the remaining colors? (Python)
                            
                                How to update a Postgres table column using a pandas data frame?
                            
                                Python list comprehension for if else statemets
                            
                                Pause Jupyter Notebook widgets, waiting for user input
                            
                                How to compile the resources.qrc file with pyrcc5
                            
                                Best way to combine a permutation of conditional statements
                            
                                How to get decision function in randomforest in sklearn
                            
                                Remove rows of a dataframe based on the row number
                            
                                Python Fuzzy matching strings in list performance
                            
                                Disabling `@tf.function` decorators for debugging?
                            
                                How exactly does inspect.signature work with classes?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Retrieve definition for parenthesized abbreviation, based on letter count

Tags:

python

regex

text

text-parsing

abbreviation

tenebris silentio

People also ask

1 Answers

Keatinge

Recent Activity

Donate For Us