Skip to main content

Duck typing considered harmful.

I've had a chance to work on a fairly large chunk of Python at this point and have decided that Python (like Perl) is completely untenable at scale.  <rant><rave><drool>  but wait!  I have reasons!

Programmers spend most of their time reading code.  We read massive amounts of code.  After we read massive amounts of code we write one... or change one... To make a change to a piece of code you first have to understand what it does and how it interacts with the system around it.  Then we start reading massive amounts of code again.  Anything you can do to minimize the amount of code a programmer has to understand to make a change becomes a huge gain in productivity.

Duck typing causes the amount of code you need to read to make a change to grow very large.
For example lets look at two functions, one in C++ and one in Python.

First in C++
int function(type1 arg1, type2 arg2) {
  return arg1->method(arg2);
}


In this function we know that it takes two types, type1 and type2.  We also know that type1 has a function called 'method' which takes an argument of type2.  If we need to understand this function we go to type1 and read the code.  If 'method' is virtual then we also read any class which inherits from type1.  Now lets look at an equivalent Python function.


Now in Python
def function(arg1, arg2):
  return arg1.method(arg2)


In this case the function takes any class which has a function called 'method'.  To understand this function you have to read the entire codebase to find classes which have a 'method' function or find every invocation of this function and determine the types used there (non trivial).  Even worse a class might not have 'method' but could gain it later.  To understand this function you really have to read everything.


This isn't a problem when dealing with hundreds of lines of code but when dealing with tens of thousands it becomes extremely time consuming.

Comments

  1. Completely agree with you. We have a large codebase dating back to 2013 and it's getting to be nearly impossible to know where things will be coming from when we call them.

    ReplyDelete

Post a Comment

Popular posts from this blog

Microprocessor Architectures in the top500 list.

The  top500 list has been swamped by x86 machines over the last several years.  Anyone who knows me is aware that I am NO fan of the x86 ISA. Somewhat heartening is that the most recent list seen a resurgence in other architectures.  The Power architecture from IBM has been a fixture on the list for a long and is still a strong contender even though it has fallen to No. 12 from No. 9 last time and No. 1 in November of 2007. A Sparc machine has captured the No. 1 place this year and is actually as powerful as the the No. 2 through No. 6 systems combined. The Chinese MIPS based Loongson processor is starting to make some noise and the Dawning 6000 is reported to eventually reach a Petaflop in computing power. One other note is that an increasing number of machines in the top500 list rely heavily on GPUs.  The No. 2 machine on the current list, the Chinese Tianhe-1a machine, is comprised of 14,336 Intel Xeon processors and 7,168 Nvidia Tesla GPUs. Now ...

Staking Claims: A History of Programming Language Design Claims and Evidence

Found on  Lambda the Ultimate  ( a great programming languages blog btw ) A great article  on programming languages through history and the truth of their claims. From the article: While still a relatively young field, computer science has a vast body of knowledge in the domain of programming languages. When a new language is introduced, its designers make claims which distinguish their language from previous languages. However, it often feels like language designers do not feel a pressing need to back these claims with evidence beyond personal anecdotes. Peer reviewers are likely to agree. In this paper, we present preliminary work which revisits the history of such claims by examining a number of language design papers which span the history of programming language development. We focus on the issue of claim-evidence correspondence, or determining how often claims are or are not backed by evidence. These preliminary results confirm that unsupported claims have been ...