I'm running memcached with the following bash command pattern:
memcached -vv 2>&1 | tee memkeywatch2010098.log 2>&1 | ~/bin/memtracer.py | tee memkeywatchCounts20100908.log
to try to track down unmatched gets to sets for keys platform-wide.
The memtracer script is below and works as desired, with one minor issue: watching the intermediate log file, memtracer.py doesn't start receiving input until memkeywatchYMD.log is about 15-18K in size. Is there a better way to read stdin, or perhaps a way to cut the buffer size down to under 1K for faster response times?
#!/usr/bin/python
import sys
from collections import defaultdict

if __name__ == "__main__":
    # Net count per key: a client "set" decrements, a server "sending" increments.
    keys = defaultdict(int)
    GET = 1
    SET = 2
    CLIENT = 1
    SERVER = 2
    for line in sys.stdin:
        key = None
        components = line.strip().split(" ")
        # Lines starting with "<" come from the client; the rest from the server.
        direction = CLIENT if components[0].startswith("<") else SERVER
        if direction == CLIENT:
            command = SET if components[1] == "set" else GET
            key = components[2]
            if command == SET:
                keys[key] -= 1
        elif direction == SERVER:
            command = components[1]
            if command == "sending":
                key = components[3]
                keys[key] += 1
        if key is not None:
            print "%s:%s" % (key, keys[key])
Default buffer sizes: if stdin/stdout are connected to a terminal, the default buffer size is 1024 bytes; otherwise it is 4096 bytes.
sys.stdin.read() reads everything from standard input until EOF (e.g. Ctrl+D on a terminal) and returns it as a single string; it does not return after each line.
By default, writes to stdout pass through a 4096-byte buffer, unless stdout happens to be a terminal/tty, in which case it is line buffered.
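As an illustration, a filter in the middle of a pipeline can defeat the output-side block buffer by flushing after every line. This is a minimal sketch, not the author's script:

import sys

while True:
    line = sys.stdin.readline()  # read one line at a time
    if not line:
        break  # EOF
    sys.stdout.write(line)
    sys.stdout.flush()  # push each line past the 4096-byte block buffer immediately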
You can completely remove buffering from stdin/stdout by using Python's -u flag:
-u : unbuffered binary stdout and stderr (also PYTHONUNBUFFERED=x) see man page for details on internal buffering relating to '-u'
and the man page clarifies:
-u Force stdin, stdout and stderr to be totally unbuffered. On systems where it matters, also put stdin, stdout and stderr in binary mode. Note that there is internal buffering in xreadlines(), readlines() and file-object iterators ("for line in sys.stdin") which is not influenced by this option. To work around this, you will want to use "sys.stdin.readline()" inside a "while 1:" loop.
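Applied to the pipeline from the question, that means either changing the script's shebang line to #!/usr/bin/python -u or invoking it through the interpreter explicitly; a sketch of the latter:

memcached -vv 2>&1 | tee memkeywatch2010098.log | python -u ~/bin/memtracer.py | tee memkeywatchCounts20100908.log

Note that -u only removes buffering in the Python process itself; it cannot affect any buffering done upstream by memcached or tee.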
Beyond this, altering the buffering for an existing file is not supported, but you can make a new file object with the same underlying file descriptor as an existing one, and possibly different buffering, using os.fdopen. I.e.,
import os
import sys

newin = os.fdopen(sys.stdin.fileno(), 'r', 100)
should bind the name newin to a file object that reads the same FD as standard input, but buffered by only about 100 bytes at a time (and you could continue with sys.stdin = newin to use the new file object as standard input from there onwards). I say "should" because this area used to have a number of bugs and issues on some platforms (it's pretty hard functionality to provide cross-platform with full generality), so I'm not sure what its state is now, but I'd definitely recommend thorough testing on all platforms of interest to ensure that everything goes smoothly. (-u, which removes buffering entirely, should work with fewer problems across all platforms, if that meets your requirements.)
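Putting the pieces together, a minimal sketch under the same assumptions (the 100-byte buffer is illustrative, and readline() sidesteps the iterator's internal buffering noted in the man page excerpt above):

import os
import sys

# Re-wrap stdin's underlying file descriptor with a small (~100-byte) buffer.
sys.stdin = os.fdopen(sys.stdin.fileno(), 'r', 100)

while True:
    line = sys.stdin.readline()
    if not line:
        break  # EOF
    sys.stdout.write(line)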
You can simply use sys.stdin.readline() instead of sys.stdin.__iter__():
import sys

while True:
    line = sys.stdin.readline()
    if not line:
        break  # EOF
    sys.stdout.write('> ' + line.upper())
This gives me line-buffered reads using Python 2.7.4 and Python 3.3.1 on Ubuntu 13.04.