The public sources say the following:
The above together leads to speculation that the WARP is a programmable processor, capable of all the arithmetics operations necessary for transformation and lighting (addition, multiplication, reciprocal or division).
Neither I nor the Matrox programmers think it is practical to use the
current WARPs to do T&L. It already has a lot to do, and it's better
to assign these tasks to the CPU. Basically everything the computer does
may be done in hardware or software. The decision is how to distribute
the tasks to make the sytem fast and not too expensive.
Early Silicon Graphics workstations used fully-hardware based graphics
pipeline. Later cheap models, like Indigo2 or O2, released some tasks to
software running on the main CPU. The latest cheap models, like the x86-based
320, again turn towards more functions in hardware. These decisions were
dictated by technology development and cost, both in terms of time and
money.
I believe that the decision of what tasks should be assigned to WARPs
and the main CPU depends on many factors. It's not that Matrox is so malicious
that it doesn't want the users to have T&L onboard. Under current conditions
the WARP is best used for what it is used - triangle setup. Maybe some
future WARPs will do more or less. Just have a little trust in programmers.
They usually know what they are doing, and they are trying to get the most
of the hardware. And the WARP architecture brings the flexibility needed
for future changes.
The goal is not to have T&L in the graphics chip. The goal is to have the fast graphics solution in a PC, and hardware T&L may or may not be the good method to achieve this.