I want to write this code as pythonic. My real array much bigger than this example. ( 5+10+20+3+2 ) / 5 <blockquote> print(np.mean(array,key=lambda x:x[1])) TypeError: mean() got an unexpected keyword argument 'key' </blockquote> <pre class="prettyprint"><code>array = [('a', 5) , ('b', 10), ('c', 20), ('d', 3), ('e', 2)] sum = 0 for i in range(len(array)): sum = sum + array[i][1] average = sum / len(array) print(average) import numpy as np print(np.mean(array,key=lambda x:x[1])) </code></pre> How can avoid this? I want to use second example. I'm using Python 3.7

If you are using Python 3.4 or above, you could use the <code>statistics</code> module: <pre class="prettyprint"><code>from statistics import mean average = mean(value[1] for value in array) </code></pre> Or if you're using a version of Python older than 3.4: <pre class="prettyprint"><code>average = sum(value[1] for value in array) / len(array) </code></pre> These solutions both use a nice feature of Python called a generator expression. The loop <pre class="prettyprint"><code>value[1] for value in array </code></pre> creates a new sequence in a timely and memory efficient manner. See PEP 289 -- Generator Expressions. If you're using Python 2, and you're summing integers, we will have integer division, which will truncate the result, e.g: <pre class="prettyprint"><code>>>> 25 / 4 6 >>> 25 / float(4) 6.25 </code></pre> To ensure we don't have integer division we could set the starting value of <code>sum</code> to be the <code>float</code> value <code>0.0</code>. However, this also means we have to make the generator expression explicit with parentheses, otherwise it's a syntax error, and it's less pretty, as noted in the comments: <pre class="prettyprint"><code>average = sum((value[1] for value in array), 0.0) / len(array) </code></pre> It's probably best to use <code>fsum</code> from the <code>math</code> module which will return a <code>float</code>: <pre class="prettyprint"><code>from math import fsum average = fsum(value[1] for value in array) / len(array) </code></pre>

If you do want to use <code>numpy</code>, cast it to a <code>numpy.array</code> and select the axis you want using <code>numpy</code> indexing: <pre class="prettyprint"><code>import numpy as np array = np.array([('a', 5) , ('b', 10), ('c', 20), ('d', 3), ('e', 2)]) print(array[:,1].astype(float).mean()) # 8.0 </code></pre> The cast to a numeric type is needed because the original array contains both strings and numbers and is therefore of type <code>object</code>. In this case you could use <code>float</code> or <code>int</code>, it makes no difference.

With pure Python: <pre class="prettyprint"><code>from operator import itemgetter acc = 0 count = 0 for value in map(itemgetter(1), array): acc += value count += 1 mean = acc / count </code></pre> An iterative approach can be preferable if your data cannot fit in memory as a <code>list</code> (since you said it was big). If it can, prefer a declarative approach: <pre class="prettyprint"><code>data = [sub[1] for sub in array] mean = sum(data) / len(data) </code></pre> If you are open to using <code>numpy</code>, I find this cleaner: <pre class="prettyprint"><code>a = np.array(array) mean = a[:, 1].astype(int).mean() </code></pre>

you can use <code>map</code> instead of list comprehension <pre class="prettyprint"><code>sum(map(lambda x:int(x[1]), array)) / len(array) </code></pre> or <code>functools.reduce</code> (if you use Python2.X just <code>reduce</code> not <code>functools.reduce</code>) <pre class="prettyprint"><code>import functools functools.reduce(lambda acc, y: acc + y[1], array, 0) / len(array) </code></pre>

Is there any pythonic way to find average of specific tuple elements in array?

Tags:

python

arrays

python-3.x

tuples

average

I want to write this code as pythonic. My real array much bigger than this example.

( 5+10+20+3+2 ) / 5

print(np.mean(array,key=lambda x:x[1])) TypeError: mean() got an unexpected keyword argument 'key'

array = [('a', 5) , ('b', 10), ('c', 20), ('d', 3), ('e', 2)]

sum = 0
for i in range(len(array)):
    sum = sum + array[i][1]

average = sum / len(array)
print(average)

import numpy as np
print(np.mean(array,key=lambda x:x[1]))

How can avoid this? I want to use second example.

I'm using Python 3.7

854

asked Apr 25 '19 07:04

Şevval Kahraman

5 Answers

If you are using Python 3.4 or above, you could use the statistics module:

from statistics import mean

average = mean(value[1] for value in array)

Or if you're using a version of Python older than 3.4:

average = sum(value[1] for value in array) / len(array)

These solutions both use a nice feature of Python called a generator expression. The loop

value[1] for value in array

creates a new sequence in a timely and memory efficient manner. See PEP 289 -- Generator Expressions.

If you're using Python 2, and you're summing integers, we will have integer division, which will truncate the result, e.g:

>>> 25 / 4
6

>>> 25 / float(4)
6.25

To ensure we don't have integer division we could set the starting value of sum to be the float value 0.0. However, this also means we have to make the generator expression explicit with parentheses, otherwise it's a syntax error, and it's less pretty, as noted in the comments:

average = sum((value[1] for value in array), 0.0) / len(array)

It's probably best to use fsum from the math module which will return a float:

from math import fsum

average = fsum(value[1] for value in array) / len(array)

answered Oct 19 '22 10:10

Peter Wood

If you do want to use numpy, cast it to a numpy.array and select the axis you want using numpy indexing:

import numpy as np

array = np.array([('a', 5) , ('b', 10), ('c', 20), ('d', 3), ('e', 2)])
print(array[:,1].astype(float).mean())
# 8.0

The cast to a numeric type is needed because the original array contains both strings and numbers and is therefore of type object. In this case you could use float or int, it makes no difference.

answered Oct 19 '22 09:10

Graipher

If you're open to more golf-like solutions, you can transpose your array with vanilla python, get a list of just the numbers, and calculate the mean with

sum(zip(*array)[1])/len(array)

answered Oct 19 '22 09:10

Nick Amin

With pure Python:

from operator import itemgetter

acc = 0
count = 0

for value in map(itemgetter(1), array):
    acc += value
    count += 1

mean = acc / count

An iterative approach can be preferable if your data cannot fit in memory as a list (since you said it was big). If it can, prefer a declarative approach:

data = [sub[1] for sub in array]
mean = sum(data) / len(data)

If you are open to using numpy, I find this cleaner:

a = np.array(array)

mean = a[:, 1].astype(int).mean()

answered Oct 19 '22 10:10

gmds

you can use map instead of list comprehension

sum(map(lambda x:int(x[1]), array)) / len(array)

or functools.reduce (if you use Python2.X just reduce not functools.reduce)

import functools
functools.reduce(lambda acc, y: acc + y[1], array, 0) / len(array)

answered Oct 19 '22 10:10

minji

Related questions
                            
                                MongoDB Connection Management in Python
                            
                                NumPy won't upgrade from 1.5.1 to 1.6.2 on OS X 10.7
                            
                                How to make a click-able graph by networkx?
                            
                                Green to red colormap in matplotlib, centered on the median of the data
                            
                                Image scraping program in Python not functioning as intended
                            
                                Sublime Text 2 plugin won't show up in Command Platte
                            
                                A good blobstore / memcache solution
                            
                                asynchronous logging with python and mongodb
                            
                                how to show argparse subcommands in groups?
                            
                                Python Module imports in Terminal but not through Unix Shell
                            
                                Python scope, dictionary and variables difference?
                            
                                Neo4django Relationship properties
                            
                                Filter rows in Spark dataframe from the words in RDD
                            
                                Where does the `__mro__` attribute of a Python's class come from?
                            
                                TypeError: import_optional_dependency() got an unexpected keyword argument 'errors'
                            
                                Selecting an element by XPath for Selenium Web Scraping
                            
                                python progress bar using tqdm not staying on a single line
                            
                                How to remove \n and \r from a string
                            
                                Convert pandas series of lists to dataframe
                            
                                scrapy from script output in json

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Is there any pythonic way to find average of specific tuple elements in array?

Tags:

python

arrays

python-3.x

tuples

average

Şevval Kahraman

People also ask

5 Answers

Peter Wood

Graipher

Nick Amin

gmds

minji

Recent Activity

Donate For Us