Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Sorting JSON data by keys value

I am currently getting JSON data from the discogs API (mp3 tag data) and wish to sort the results by the key's value. In this case I am trying to get data for a Guns n Roses song and the output has 1988 as the first one while the data actually has a record from 1987. How can I sort this data so that I can get to the sorted data by year (olderst to newest). The code below sorts by either key or value but thats not what I intended to get. Please help.

import json
import urllib2
request = urllib2.Request('http://api.discogs.com/database/search?sort=year&sort_order=asc&artist=%22Guns+N%27+Roses%22&track=%22Sweet+Child+O%27+Mine%22&format_exact=Album&type=master')
request.add_header('User-Agent','Mozilla/4.0 (compatible; MSIE 5.5; Windows NT)')
request.add_header('Content-Type','application/json')
response = urllib2.urlopen(request)
json_raw= response.readlines()
json_object = json.loads(json_raw[0])



for row in json_object['results']:
    try:
        from operator import itemgetter
        for k, v in sorted(row.items(), key=itemgetter(0)):
            print k, v
    except KeyError: 
        pass
like image 858
Prem Minister Avatar asked Nov 18 '12 08:11

Prem Minister


2 Answers

You could use list-comprehension and sorted() function for this:

# filter  json_object['results']  first, as some of the items are missing the key 'year'

In [33]: results = [x for x in json_object['results'] if 'year' in x]

In [34]: sorted(results, key=lambda x: x['year'])

or :

In [79]: from operator import itemgetter

In [80]: sorted(results, key=itemgetter('year'))
like image 128
Ashwini Chaudhary Avatar answered Oct 16 '22 09:10

Ashwini Chaudhary


To sort a list of dictionaries, use a methodcaller with the key on which to sort; you want to sort the results list, not the contained dictionaries. Moreover, some of the entries do not have a year, and that could lead to errors:

from operator import methodcaller

for row in sorted(json_object['results'], key=methodcaller('get', 'year', None)):
    # process the row dictionary

The methodcaller definition will basically do a entry.get('year', None) for each entry in json_object['results'], giving the sorted method the value to sort on.

You should not use readlines() to read your JSON response, it'll mis-interpret newlines. Let the json library do the reading instead (note the .load(), no s at the end):

response = urllib2.urlopen(request)
json_object = json.load(response)
like image 26
Martijn Pieters Avatar answered Oct 16 '22 08:10

Martijn Pieters