Wednesday, November 28, 2007

Thoughts on Agile techniques - I

Over the past few years, I have observed the agile movement and I feel that I have to put down some of my thoughts regarding this. The main reason for me writing it is to clarify my thoughts, and also, but not necessarily, to invite some feedback.
I cannot dispute the fact that in the process of evangelizing it, the people have talked about some software practices that contribute to making better software. But somehow I feel that the useful practices are all those that have to be part of writing software in general, rather than being attributed to a process.
Its my belief that there are mostly only two qualities that a developer has to have in order to deliver successful software. This is of course only in context on enterprise software development where the developer in employed to create and maintain internal software systems within the company. The two qualities are: one - not being afraid to go in and change code and two - constant communication of the status of his work.
The second one is an auxillary capability, since it solely exists to put people mind's at rest, so that they don't bother him/her while he's actually trying to solve a problem. I would think its an equally necessary quality to possess as the primary one because when it comes to deploying a fix or a system, its never only about the code. There's always a dynamic between the current users that have to look for a workaround or need to be informed about systems that will be down, etc.
In such an environment, if the developer possesses the confidence of being able to tackle changing the codebase or developing something new with the confidence that it can always be changed later, there's not many more practices that he needs to religiously follow to deliver working software.

This confidence comes from constantly learning from a good amount of experience as well as possessing a couple of tools that come in handy. I will delve on this aspect more in the next post.

Tuesday, March 06, 2007

The beauty of Python Generators

Say you had to read a file line by line and change the delimiter from comma to semi-colon. The python code would look like this:
f = open("c:\\testfile.txt") 
for line in f.readlines():
';'.join(line.split(','))
Now say you wanted to do the same thing, but only to the first 5 lines of the file. The impulse for a Python newbie would be to write the following code:
f = open("c:\\testfile.txt") 
counter = 0
for line in f.readlines():
if counter <> 5
';'.join(line.split(','))
counter += 1
A more savvy python newbie would write the following code taking the advantage of the beautiful enumerate() function:
f = open("c:\\testfile.txt") 
for counter,line in enumerate(f.readlines()):
if counter <> 5
';'.join(line.split(','))
The problem with is code is that it has two concepts intermingled, one to read 5 lines and the other for the actual process of changing delimiters. If we wanted to keep the concepts separate, we would have to be able to write something like this:
f = open("c:\\testfile.txt") 
for line in f.readfirst5lines():
';'.join(line.split(','))
Here the process of stopping after 5 lines is encapsulated by the readfirst5lines() method in the file object and the method body only does the changing of delimiters as before. Now this code will actually work! The reason that we can write code like this is due the python generators feature. readfirst5lines will look like this:
import itertools 
class myfile(file):
def readfirst5lines(self):
for i in itertools.count():
if i == 5:
break
next = self.readline()
i += 1
yield next
This is of course more lines of code, but the concept is abstracted away nicely. We are separating the conditions for processing the file from the actual processing. And whats more, this method can be slightly changed to take number of lines as the argument. So if you want choose to read the first 2, 3, 5 or how many ever lines, the method will look like this:
import itertools 
class myfile(file):
def readfirstfewlines(self, n):
for i in itertools.count():
if i == n:
break
next = self.readline()
i += 1
yield next
And you can use it like this:
f = myfile("c:\\testfile.txt") 
for line in f.readfirstfewlines(3):
';'.join(line.split(','))
This will process the first 3 lines.

(Note that we used the 'myfile' constructor method instead of the 'open' function to open the file, since we need an object of type 'myfile' and not 'file'. There are other ways to downcast in python but this is probably the simplest way to do it in this case.)

Python generators go a long way in making the code more elegant and encapsulating separate concepts. I will blog about the mechanics of how generators work and its other uses as and when I learn more about them. For Rubyists, this concept of generators is pretty much similar to "blocks".