How does Python provide a maintainable way to pass data structures around in a system?

Tags:

python

I am new to dynamic languages in general, and I have discovered that languages like Python prefer simple data structures, like dictionaries, for sending data between parts of a system (across functions, modules, etc).

In the C# world, when two parts of a system communicate, the developer defines a class (possibly one that implements an interface) that contains properties (like a Person class with a Name, Birth date, etc) where the sender in the system instantiates the class and assigns values to the properties. The receiver then accesses these properties. The class is called a DTO and it is "well- defined" and explicit. If I remove a property from the DTO's class, the compiler will instantly warn me of all parts of the code that use that DTO and are attempting to access what is now a non-existent property. I know exactly what has broken in my codebase.

In Python, functions that produce data (senders) create implicit DTOs by building up dictionaries and returning them. Coming from a compiled world, this scares me. I immediately think of the scenario of a large code base where a function producing a dictionary has the name of a key changed (or a key is removed altogether) and boom- tons of potential KeyErrors begin to crop up as pieces of the code base that work with that dictionary and expect a key are no longer able to access the data they were expecting. Without unit testing, the developer would have no reliable way of knowing where these errors will appear.

Maybe I misunderstand altogether. Are dictionaries a best practice tool for passing data around? If so, how do developers solve this kind of problem? How are implicit data structures and the functions that use them maintained? How do I become less afraid of what seems like a huge amount of uncertainty?

762

asked Mar 13 '14 06:03

Sam Atcheson

1 Answers

Coming from a compiled world, this scares me. I immediately think of the scenario of a large code base where a function producing a dictionary has the name of a key changed (or a key is removed altogether) and boom- tons of potential KeyErrors begin to crop up as pieces of the code base that work with that dictionary and expect a key are no longer able to access the data they were expecting.

I would just like to highlight this part of your question, because I feel this is the main point you are trying to understand.

Python's development philosophy is a bit different; as objects can mutate without throwing errors (for example, you can add properties to instances without having them declared in the class) a common programming practice in Python is EAFP:

EAFP

Easier to ask for forgiveness than permission. This common Python coding style assumes the existence of valid keys or attributes and catches exceptions if the assumption proves false. This clean and fast style is characterized by the presence of many try and except statements. The technique contrasts with the LBYL style common to many other languages such as C.

The LBYL referred to from the quote above is "Look Before You Leap":

LBYL

Look before you leap. This coding style explicitly tests for pre-conditions before making calls or lookups. This style contrasts with the EAFP approach and is characterized by the presence of many if statements.

In a multi-threaded environment, the LBYL approach can risk introducing a race condition between “the looking” and “the leaping”. For example, the code, if key in mapping: return mapping[key] can fail if another thread removes key from mapping after the test, but before the lookup. This issue can be solved with locks or by using the EAFP approach.

So I would say this is a bit of the norm and in Python you expect the objects will behave well and handle themselves with grace (mainly by throwing up lots of exceptions). Traditional "object hiding" and "interface contracts" are not what Python is all about. It is just like learning anything else, you have to acclimate to the programming environment and its rules.

The other part of your question:

Are dictionaries a best practice tool for passing data around? If so, how do developers solve this kind of problem?

The answer here is depends on your problem domain. If your problem domain does not lend itself to custom objects, then you can pass around any kind of container (lists, tuples, dictionaries) around. If however all you have to pass around decorated data ("rich" data) is objects, then your code becomes littered with classes that don't define behavior but rather properties of things.

Oh, by the way - this getting of keys and raising KeyError problem is already solved, as Python dictionaries have a get method, which can return a default value (it returns the sentinel None object by default) when a key doesn't exist:

>>> d = {'a': 'b'}
>>> d['b']
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
KeyError: 'b'
>>> d.get('b')  # None is returned, which is the only
                # object that is not printed by the
                # Python interactive interpreter.
>>> d.get('b','default')
'default'

155

answered Oct 13 '22 00:10

Burhan Khalid

Related questions
                            
                                How to download datasets for sklearn? - python
                            
                                Can I use the same cursor while looping through it?
                            
                                Python function with a lot of arguments
                            
                                Python: Force decorator inheritance?
                            
                                Why doesn't my current directory show up in the path using pytest on Windows?
                            
                                Scrape multiple urls using QWebPage
                            
                                Read files sequentially in order [duplicate]
                            
                                python dateutil unicode warning
                            
                                How to tell the amount of times a pypi package is downloaded by actual users?
                            
                                Copy file into a specific parent folder with Google Drive API?
                            
                                Extract subarray between certain value in Python
                            
                                Behave: How to import steps from another file?
                            
                                Run code in python script on shutdown signal
                            
                                Updating a pyplot 3d scatter plot in a loop, grid lines overlap points
                            
                                How to find and replace multiple lines in text file?
                            
                                Regex match if not before and after
                            
                                Does assertRaises (or assert_raises) exist in nose2
                            
                                Python mockito - Mocking a class which is being instantiated from the testable function
                            
                                Weird pdfs from Generalised Extreme Value (GEV) Maximum Likelihood fitted data
                            
                                how to run local python script on remote machine

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With