Creating custom dictionary from two lists

I have the following two Python lists:

prob_tokens = ['119', '120', '123', '1234', '12345']

complete_tokens = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']

min_len_sec_list = 3
max_len_sec_list = 5

I want to create a dictionary with the elements of the first list as keys, subject to the following constraints:

1. If the key does not exist in the second list, the value will be False.
2. If the key exists in the second list and has variants there, the value will be False.

E.g.:

(i) When checking 123, if 1234 or 12345 exists in the second list (123*), the value of 123 will be False.

(ii) Similarly, when checking 1234, if 12345 exists (1234*), the value will be False.

Here * stands for [0-9]{(max_len-len_token)}.

3. If the key exists in the second list with no variants, the value will be True.

Expected output (final_token_dict):

{'119': False, '120': True, '123': False, '1234': False, '12345': True}

Can I get any suggestions on how to achieve this? Thanks in advance!

You can use a custom function with a dictionary comprehension:

prob_tokens = ['119', '120', '123', '1234', '12345']
complete_tokens = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']

def mapper(val, ref_list):
    if any(x.startswith(val) and (len(x) > len(val)) for x in ref_list):
        return False
    if val in ref_list:
        return True
    return False

res = {i: mapper(i, complete_tokens) for i in prob_tokens}

print(res)

{'119': False, '120': True, '123': False, '1234': False, '12345': True}

If the number of characters criterion is important to you, you can adjust your logic accordingly using chained comparisons and an additional input:

def mapper(val, ref_list, max_len):
    if any(x.startswith(val) and (0 < (len(x) - len(val)) <= max_len) for x in ref_list):
        return False
    if val in ref_list:
        return True
    return False

min_len_sec_list = 3
max_len_sec_list = 5

add_lens = max_len_sec_list - min_len_sec_list

res = {i: mapper(i, complete_tokens, add_lens) for i in prob_tokens}
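To see what the length bound changes, here is a minimal sketch with hypothetical data (the two-element `tokens` list is an assumption made up for illustration, not part of the question). It repeats the bounded `mapper` so it runs on its own, and shows that a longer token only counts as a variant when it adds at most `max_len` characters:

```python
def mapper(val, ref_list, max_len):
    # A variant is a longer token that starts with val and adds
    # between 1 and max_len extra characters.
    if any(x.startswith(val) and 0 < len(x) - len(val) <= max_len for x in ref_list):
        return False
    # No variant found: the value is True only if val itself is listed.
    return val in ref_list

# Hypothetical data: '123456' adds three characters to '123'.
tokens = ['123', '123456']

print(mapper('123', tokens, 2))  # True: '123456' is too long to count as a variant
print(mapper('123', tokens, 3))  # False: '123456' is now within the bound
```

With the lists from the question the bound makes no difference, because no token adds more than two characters to any key.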

You can convert your list into a trie (prefix tree), then check whether any of the keys is a proper prefix in that trie. This will be faster than checking whether it's a prefix of each element in the list individually. More specifically, if you have k elements in your prob_tokens list and n elements in complete_tokens, then this takes only O(n+k), whereas checking each pair is O(n*k).¹

def make_trie(lst):
    trie = {}
    for key in lst:
        t = trie
        for c in key:
            t = t.setdefault(c, {})
    return trie

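def check_trie(trie, key):
    for c in key:
        trie = trie.get(c)
        if trie is None:
            return False  # not in trie
    # Decide only after the whole key is consumed; otherwise a key that
    # runs past a leaf (e.g. '1205' past '120') would wrongly give True.
    return not trie  # True for a leaf, False if longer tokens follow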

prob_tokens = ['119', '120', '123', '1234', '12345']
complete_tokens = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']

trie = make_trie(complete_tokens)
# {'1': {'1': {'2': {}}, '2': {'0': {}, '1': {}, '3': {'3': {}, '4': {'5': {}}, '5': {}}}}}
res = {key: check_trie(trie, key) for key in prob_tokens}
# {'119': False, '120': True, '123': False, '1234': False, '12345': True}

¹ Strictly speaking, the average length of the keys is also a factor, but that is true of both approaches.
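As a sanity check, a trie lookup can be compared against the pairwise definition from the question. The sketch below is self-contained; the names `build_trie`, `is_unextended_token` and `brute_force`, as well as the extra keys '1205' and '12', are assumptions added for illustration:

```python
def build_trie(tokens):
    # Nested dicts: each character maps to the sub-trie of what may follow it.
    trie = {}
    for token in tokens:
        node = trie
        for ch in token:
            node = node.setdefault(ch, {})
    return trie

def is_unextended_token(trie, key):
    # Walk the whole key first; the answer depends on where the walk ends.
    node = trie
    for ch in key:
        node = node.get(ch)
        if node is None:
            return False  # key is not a token and not a prefix of one
    return not node  # an empty node is a leaf: a token with no longer variant

def brute_force(key, tokens):
    # Direct restatement of the rules: listed, and no longer token starts with it.
    return key in tokens and not any(
        t.startswith(key) and len(t) > len(key) for t in tokens
    )

complete_tokens = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']
# The last two keys are hypothetical: one runs past a leaf, one is a bare prefix.
keys = ['119', '120', '123', '1234', '12345', '1205', '12']

trie = build_trie(complete_tokens)
results = {key: is_unextended_token(trie, key) for key in keys}
for key in keys:
    assert results[key] == brute_force(key, complete_tokens)
print(results)
```

Both methods agree on all seven keys, including the two that do not appear in the question.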

Another alternative uses a regular expression:

import re

prob_tokens = ['119', '120', '123', '1234', '12345']

complete_tokens = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']

dictionary = dict()
for tok in prob_tokens:
    # tok followed by at least one more digit, i.e. a longer variant of tok
    variant = re.compile(r'%s\d+' % re.escape(tok))
    if tok not in complete_tokens or any(variant.match(tok2) for tok2 in complete_tokens):
        dictionary[tok] = False
    else:
        dictionary[tok] = True

print(dictionary)

You can use any:

a = ['119', '120', '123', '1234', '12345']
b = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']
new_d = {c:c in b and not any(i.startswith(c) and len(c) < len(i) for i in b) for c in a}

Output:

{'120': True, '1234': False, '119': False, '123': False, '12345': True}
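The `any` check rescans the whole second list for every key. One possible variation, sketched here on the assumption that the tokens are plain strings, precomputes the set of all proper prefixes once, so each key needs only two set lookups. The names `token_set`, `proper_prefixes` and `result` are made up for this example:

```python
prob_tokens = ['119', '120', '123', '1234', '12345']
complete_tokens = ['112', '120', '121', '123', '1233', '1234', '1235', '12345']

# Set of tokens for fast membership tests.
token_set = set(complete_tokens)

# Every proper prefix of every token, e.g. '12345' -> '1', '12', '123', '1234'.
# A key found here has at least one longer variant in the list.
proper_prefixes = {t[:i] for t in complete_tokens for i in range(1, len(t))}

result = {k: k in token_set and k not in proper_prefixes for k in prob_tokens}
print(result)
# {'119': False, '120': True, '123': False, '1234': False, '12345': True}
```

The trade-off is memory: the prefix set holds up to one entry per character of every token, in exchange for constant-time checks per key.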

Tags: python, string, dictionary

Avinash Clinton

4 Answers

jpp, tobias_k, thushv89, Ajax1234
