split string and make key value pair

Tags:

I have a following string in python:

Date: 07/14/1995 Time: 11:31:50 Subject text: Something-cool

I want to prepare a dict() from it with following key: [value]

{"Date":["07/13/1995"], "Time": ["11:31:50"], "Subject text":["Something-cool"]}

If I split the string with : I get the following. How can I get the above desired result?

>>> text.split(": ")
['Date', '07/14/1995 Time', '11:31:50 Subject text', 'Something-cool']

973

asked May 27 '18 03:05

Anthony

1 Answers

Let's use re.findall here:

>>> import re
>>> dict(re.findall(r'(?=\S|^)(.+?): (\S+)', text))
{'Date': '07/14/1995', 'Subject text': 'Something-cool', 'Time': '11:31:50'}

Or, if you insist on the format,

>>> {k : [v] for k, v in re.findall(r'(?=\S|^)(.+?): (\S+)', text)}
{
   'Date'        : ['07/14/1995'],
   'Subject text': ['Something-cool'],
   'Time'        : ['11:31:50']
}

Details

(?=   # lookahead 
\S    # anything that isn't a space
|     # OR
^     # start of line
) 
(.+?) # 1st capture group - 1 or more characters, until...
:     # ...a colon
\s    # space
(\S+) # 2nd capture group - one or more characters that are not wsp

Semantically, this regex means "get me all pairs of items that follow this particular pattern of something followed by a colon and whitespace and a bunch of characters that are not whitespace". The lookahead at the start is so that the groups are not captured with a leading whitespace (and lookbehinds support only fixed-width assertions, so).

Note: This will fail if your values have spaces in them.

If you're doing this for multiple lines in a text file, let's build on this regex and use a defaultdict:

from collections import defaultdict
d = defaultdict(list)

with open(file) as f:
    for text in file:
        for k, v in re.findall(r'(?=\S|^)(.+?): (\S+)', text.rstrip()):
            d[k].append(v)

This will add one or more values to your dictionary for a given key.

answered Sep 29 '22 03:09

cs95

Related questions
                            
                                Create all possible combinations of multiple columns in a Pandas DataFrame
                            
                                Apply mask to image with OpenCv Python
                            
                                Load a single image in a pretrained pytorch net
                            
                                (discord.py) Getting a list of all of the members in a specific voice channel
                            
                                ignoring ensurepip failure pip requires ssl/tls error in Ubuntu 18.04
                            
                                python3.6 fails when creating venv
                            
                                ImportError: No module named 'gdbm' occuring while using source ~/.bashrc
                            
                                Changing value in data frame column in a loop python
                            
                                python3 replacing double backslash with single backslash [duplicate]
                            
                                How to Run Anaconda pompt in Ubuntu
                            
                                dictionary to multi-index pandas dataframe
                            
                                Explode column of lists into multiple columns
                            
                                Error: TensorFlow: tf.enable_eager_execution must be called at program startup
                            
                                Running periodic task at time stored in database
                            
                                Python storing Japanese word into JSON file
                            
                                Pandas rounding decimals not working
                            
                                Django error message: ["'on' value must be either True or False."]
                            
                                Yield from Async Generator in Python AsyncIO
                            
                                How to convert a pyw file to exe?
                            
                                Pandas read excel sheet with multiple header when first column is empty

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

split string and make key value pair

Tags:

python

string

dictionary

split

Anthony

People also ask

1 Answers

cs95

Recent Activity

Donate For Us