Overdoing C++ Templates
Every decade or so, a new fashion in programming comes along and announces itself the successor to previous discredited paradigms. Again we believe that from now on, software will be more reliable, cheaper to build, and more fun to make. (No one believes it will be smaller or faster.) In the 70s, there was structured programming; in the 80s, object-oriented programming; and from the mid-90s onward, generic programming. This last one gets its name from powerful techniques for code reuse using templates (generic classes and functions).
Parameterized classes and functions (templates) are very useful beasts. A function sqr() can be written that will square anything that defines the multiplication operatorcomplex numbers, matrices, whatever. The standard container classes such as list<> are templatesone doesn't have to rewrite them for every new type. This was a genuine pain in plain old C++, and I think the ISO standard is a great improvement. However, some things were overdone in the process.
For instance, the standard library strings and iostreams are templates parameterized by a "character traits" type. This means that the same basic_string<> class definition can generate ASCII strings, Unicode strings, and Martian Trilobyte strings. (In principle. Many implementations have still only implemented it for ASCII characters, which seems particularly silly.) The standard mandates that these very common classes, which are used in nearly every C++ application, must be implemented as templates.
But there is a price to pay in performance and debuggability. Some experimentation (using Microsoft Visual C++ 6.0) illustrates the problem. This compiler supports the new-style iostreams library, as well as the "classic" iostreams, so we can compare the implementations. The first test program is of course "Hello, World," which compiles more than twice as fast using classic iostreams. A more serious program involved 200 lines, each writing out 10 variables to count. What is most striking is the compilation speed: It took nearly 10 sec using the standard library versus 1.5 sec for the classic implementation. Ten seconds is a lot of time; one can cram a whole highly irritating commercial into that kind of break. Executable size was 115K for standard and 70K for classic. Your mileage may vary, but the overall picture is of slower builds and larger executables when using the new iostreams library. And this isn't a peculiarity of the Microsoft compiler because GCC behaves in the same way.
It's true that executable size is not as important as it used to be, but these days the fastest growing class of programmable devices consists of information appliances (in which memory will remain at a premium for the next few years): hand-helds, cell phones, smart refrigerators, Bluetooth-enabled espresso machines, and so on. The extra size from using the standard iostreams comes from the wholesale inlining of template code, which makes it hard to fine-tune crucial operations without code bloat. For me, the build times are a more crucial problem because it means waiting longer and losing the "conversational flow" that is important in development.
Then, consider debuggability. The template implementation of the standard string class is highly ingenious, but not pretty to the applications debugger. She is faced with compiler and debugger messages using the following fully expanded name:
class std::basic_string<char,struct std::char_traits<char>,class std::allocator<char>>
And as for the very useful map<string,string>, I leave it to your imagination. The name is so long that one gets dozens of compiler warnings about the internal name being truncated. std::string should be as transparent as possible to the beginner, who should not be penalized for treating it as a built-in feature of the language. It is technically feasible to hunt for any defined typedefs in scope when putting out compiler error messages, and I may get around to doing that with the UnderC project. Verity Stob suggests writing a postprocessor to massage C++ error messages, and I hope she's joking. It is simpler just not to use such an overcomplicated type. My secret weapon in C++ development (which I'll confess here in public for the first time) is an interface-compatible string class that replaces the <string> header in my larger projects. Occasionally, I'll rebuild with the standard headers to see if my libraries are still honest, but I generally let other people struggle with being correct at the price of performance.
Let me say that there are indeed applications that need the kind of flexibility offered by std::string programs that manipulate both ASCII and Unicode strings or that need to customize the allocation strategy, etc. These are not generally very common (usually, programs either use ASCII or Unicode), and it seems unfair to burden programmers with the general machinery. It does make the library implementer's job more interesting, but it complicates that of the application programmers. This seems to be an inversion of the general principle that a well-designed library hides the difficulty of its implementation and is straightforward to use. That is, std::string does not hide its implementation sufficiently because application programmers are constantly being made aware of it during their development process. And we cannot assume that the users of these classes are rocket scientists. The standard is insisting on a particular implementation strategy, which goes against the idea of standards ideally specifying only the public interface and expected behavior of classes. Naturally, the general templates can always be available to those who genuinely need them.
These considerations apply as well to the standard containers such as list<>, which have an extra default template argument that defines an allocator. Most people don't need this facility, although it's very useful if you do, of course. Again, these more general versions can be defined as separate templates. I appreciate that this makes the standard library less technically interesting, but libraries must be primarily designed with the end users in mind. To rephrase the C++ mantra, users should not be bothered by what they don't use.
Apart from the danger of making things templates when they don't need to be templates, there is another problem that occurs with generic programming in C++. Most people agree that the standard algorithms can be very useful. If I have a vector of integers, then sort(v.begin(),v.end()) will sort it. Because the comparison is inlined, this generic algorithm will be faster than the old-fashioned qsort(), as well as being easier to use, especially if this was a vector of a user-defined type. copy() will copy anything, anywhere, and in the most efficient way possible.
But some uses are unnecessarily opaque:
copy_if(v.begin(),v.end(),ostream_iterator<int>(cout) bind2nd(greater<int>(),7));
To be pedantic, each name should be qualified with std::, but assume that everything has been brought into the global namespace, either by separate using clauses or by some other unmentionable means) This example from Stroustrup1 can be more conventionally expressed in a form that makes it obvious that all the integers will run together on the output:
vector<int>::iterator li; for (li = v.begin(); li != v.end(); ++li) if (*li > 7) cout << *li;
Stroustrup tells us that using explicit loops is "tedious and error-prone," but I cannot see any advantages in using the first version. Obviously, one can get used to this notation; humans are very adaptable, and as professionals, we have to learn the new idiom. But it is certainly not less tedious, and it is arguably much less readable and flexible. Also, it can constrain design decisions. For example, given that we have a list of Shape * pointers, we can tell them all to draw themselves with either
for_each(ls.begin(),ls.end(), bind2nd(mem_fun(&Shape::draw),canvas));
or
ShapeList::iterator li; for (li = ls.begin(); li != ls.end(); ++li) (*li)->draw(canvas);
Now, if I want to modify this so I only draw shapes matching some criteria (and specifically don't want to put that behavior into the shapes themselves), then I just have to add an if statement in the explicit loop. The only way I can think of doing this using the generic idiom is to actually define a function as the "payload" of the for_each() algorithm. Using the terminology of the software patterns book2, the first example is an internal iterator, and the second is an external iterator. The authors observe that C++ is not particularly good at internal iterators, and I think we should respect the limitations of the language. The problem here is the overenthusiastic application of the generic paradigm in C++again leading to unnecessary difficulties. C++ simply does not do general anonymous functions such as LISP, SmallTalk, Ruby, etc. An anonymous function (or lambda expression) in C++ would look rather like the following3; perhaps one day somebody will implement it:
for_each(ls.begin(),ls.end(), void lambda(Shape *p) { p->draw(canvas); });
C++ is a marvelous language that is found in everything from cell phones to continental switching networks. It flexibly supports many programming styles, but this flexibility can be a problem. The art of good taste in programming is picking an appropriate idiom for the particular problem, just as good style in prose writing needs always to be aware of the audience. I don't wish to undermine the standard, which involved a mammoth effort and has given us a common platform. My issue with the standard is that it is too bound up with the generic programming paradigm, and hence becomes prescriptive about what constitutes good programming style (for instance, that algorithms are to be strongly preferred over explicit loops). It is also burdening programmers with implementation details (such as basic_string<>), which reinforces the perception of C++ as a hardcore engineer's language.