What follows is the first of what I hope will be a series of occasional excerpts from my current book project, Mechanisms: New Media and the New Textuality. This material is drawn from the first chapter, entitled “Extreme Inscription.” Readers here will note that I’ve been posting a fair amount on magnetic media and disk storage, the subject for this chapter; indeed, bits and pieces of those posts are finding their way into the manuscript (in that regard the blog has been an invaluable freewriting tool). This excerpt is presented in a more polished state, but of course it’s still very much in draft and I’d greatly appreciate comments of any kind.
For simplicity I have omitted most of the notes.
Mechanisms is under contract to the MIT Press. All material is offered here as copyright © Matthew G. Kirschenbaum, all rights reserved. This copyright notice supersedes the Creative Commons license in place for the rest of the blog.
I am referring to the devices we call hard drives, which I will be examining in some detail as the pre-eminent digital storage technology of our day. The hard drive and magnetic media more generally are mechanisms of extreme inscription—that is, they offer a practical limit case for how the inscriptive act can be imagined and executed. The kind of inscription a hard drive actually performs is non-linguistic, invisible without highly-specialized instrumentation, and recursively encoded. Nonetheless, we will see that what happens at the surface of the disk is ultimately inscription, and that the hard drive is very literally a writing machine. Hard drives are in fact unique among magnetic storage technologies in that the mark-making instrument (the read/write-head of the drive) does not make physical contact with the inscription surface: the two are separated by a space a fraction of the width of a human hair. To examine the hard drive at this level is to enter a looking glass world where the Kantian manifold of space and time is measured in millionths of a meter (microns) and milliseconds, a world of experimental-edge engineering rooted in the age-old science of tribology, the study of interacting surfaces in relative motion. Inside of the hermetically sealed recesses of the drive the behaviors of magnetic fields are pushed to their physical limits by technologies with names like giant magnetoresistive cores; while the individual bits themselves are subjected to recondite data encryption schemes to impose digital structure on the relentlessly analog tendencies of magnetic media. Some of the material I will be presenting will be unapologetically technical; this is necessary because my ultimate goal is to situate the hard drive in a critical history of inscription that is millennia old, but which more specifically descends from the telegraph, the telephone, the phonograph, and other extrasensory engines of Victorian modernity. As students of old new media such as Friedrich Kittler or more recently Lisa Gitelman see so clearly, writing, for quite some time now, has meant more than visual transcription, and inscription has meant more than alphabetization—indeed, one way of reading mechanical writing machines, notes Gitelman, is as artifacts of a culture’s “consensual, embodied theories of language” (5). I will not be advancing any such ambitious “reading” of the hard drive here, but I will hope to show that the device has an aesthetic or symbolic as well as a functional dimension and that it is an emblem of a certain kind of human-computer interaction that has its roots in recoverable discourses about technologies and the body. This will involve us in critiques of inscription and instrumentation akin to those advanced by Bruno Latour and Timothy Lenoir in their work on the history of science, for the hard drive represents a rich site for examining technologies of inscription in relation to scientific practices of instrumentation.
Some may object that a decision to focus on hard drives, the most overtly mechanical portion of the computer, is arbitrary, even tendentious. The personal computer era was well underway without them, though the technology has actually been around since the 1950s. They are also, of course, by no means the only storage media in common use today, and there is increasing evidence that they will be surpassed, not only by solid state or laser optical devices, but also by more advanced techniques such as holography. Nonetheless, hard disk devices have been the primary storage media for personal computers since the mid-1980s, and also for countless internet and intranet servers; they are historically central to any narrative of computing and inscription in the twentieth-century. Though their speed, capacity, and reliability have all increased dramatically—increases in the capacity of drives have in fact outstripped the famous Moore’s Law for processor speeds—basic drive technology remains remarkably unchanged since the technology was first introduced by IBM.
Rather than offer up yet another generalized account of electronic textuality then, my objective in this chapter will be to examine one specific new media writing technology in its unique technical and imaginative milieu, and thereby connect to the kind of new histories of inscription being rendered by such diverse critics as Kittler, Gitelman, Latour, Lenoir, Patricia Crain, and Adrian Johns. Put another way, “the computer” as a generic appellation will not do as a starting point for the kind of investigation of electronic writing I am interested in, any more than “the book” by itself alone suffices as a useful rubric for serious students of earlier periods of textuality. Here we will follow the bits all the way down to the metal.
[ . . . ]
What, then, are the essential characteristics—the grammatological primitives, as it were—of the hard disk drive as inscription engine? I propose the following: it is random access; it is a signal processor; it is differential (and chronographic); it is volumetric; it is rationalized (and atomized); it is motion-dependant; and it is non-volatile (but also variable). I gloss each of these in further detail below, while also explaining something of the technical operation of the drive.*
It is random access. Like the codex and vertical file cabinets and vinyl records, unlike the scroll or magnetic tape or a filmstrip, hard drives permit (essentially) instantaneous access to any portion of the physical media, without the need to fast-forward or rewind a sequence. We will discuss the specific technological climate that lead to the development of magnetic disk storage in more detail later in the chapter, but here the point is simply to align the hard drive with one of two age-old traditions in the history of recordable media.
It is a signal processor. The conventional wisdom is that what gets written to a hard disk is a simple magnetic expression of a bit: a one or a zero, aligned as a north or south polarity. In fact, the process is a highly condensed and complex set of symbolic transformations, by which a “bit,” as a binary value in the computer’s memory, is converted to a voltage passed through the drive’s read/write head where it creates an electromagnetic field reversing the polarity of not one but several individual magnetic dipoles—a whole pattern of flux reversals—embedded in the material substrate of the platter. Likewise, to read data from the surface of the disk, these patterns of magnetic fields (actually patterns of magnetic resistance), which are received as analog signals, are interpreted by the head’s detection circuitry as a voltage spike that is then converted into a binary digital representation (a one or a zero) by the drive’s firmware. The relevant points are that writing and reading to the disk is ultimately a form of digital to analog or analog to digital signal processing—not unlike the function of a modem—and that the data contained on the disk is a second-order representation of the actual digital values the data assumes for computation.
It is differential. The read/write head measures reversals between magnetic fields rather than the actual charge of an individual magnetic dipole. In other words, it is a differential device—signification depends upon changes in the value of the signal being received rather than the substance of the signal itself. (Readers may recognize similarities to the classic Saussurian thesis of differential relation in linguistic meaning.) As noted above, the magnetic patterns on the surface of the disk are not a direct representation of bit values but an abstraction of those values, filtered through a range of encoding schemes that have evolved from basic frequency modulation to the current state of the art, which is known as PRML (Partial Response Maximum Likelihood). There are several reasons for this, but the most important concerns the drive head’s need to separate one bit representation from another: if the disk were to store a long, undifferentiated string of ones or zeros, the head would have no good way to determine precisely where in that long string it was located—was it at the 45th zero or the 54th zero? Frequency modulation, which was the first encoding scheme to address the issue, began each bit representation with a flux reversal, and then added another reversal for a one while omitting a second reversal to represent a zero. The result was that even a long string of absolute ones or zeros would consist of frequent flux reversals that the head could use to measure and orient its position. This is known as clock synchronization or simply “clocking,” and thus we can say that there is a sense in which the hard drive is also a chronographic inscription device. Subsequent encoding schemes have found various ways of improving upon the efficiency of these reversal patterns, such that a variable and always minimum number of reversals are used to encode a given bit value. Success in developing more efficient encoding schemes is one important factor in the rapidly escalating storage capacity of hard disk drives. PRML is especially interesting because, as its name implies, it is predictive rather than iterative in nature: rather than detecting the voltage spikes associated with each and every flux reversal, the firmware makes guesses as to the value of the bit representation from a sample of the overall pattern. Obviously this sampling, coupled with sophisticated error detection and correction routines built into the signal processing circuitry, works extremely well—users don’t notice that there is any “guesswork” involved in reading their data—but the performance does not change the essential characteristics of the process, which at this very low level are interpolative and stochastic.
It is volumetric. A hard disk drive is a three-dimensional writing space. The circular platters, sometimes as many as ten, are stacked one atop another, and data is written to both sides (like a vinyl record but unlike a CD-R). The read/write heads sit on the end of an actuator arm known as a slider, and are inserted over and under each of the individual platters. The slider arms themselves all extend from a common axis. Thus, a drive with four platters will also have eight vertically aligned slider arms and a total of eight separate read/write heads. That the hard disk offers a volumetric space for data storage is reflected in commonplace expressions, such as when we say a drive is “empty” or “full.”
The physical capacity of the platter to record bit representations is known as its aerial density (sometimes also bit density or data density), and innovations in drive technology have frequently been driven by the desire to squeeze more and more flux reversals onto ever decreasing surface space (for example, IBM now markets a hard disk device called a Mircodrive, a single platter one inch in diameter). Typical aerial densities are now at around 10,000,000,000 bits (not bytes) per square inch. Technologies or techniques that heighten the sensitivity of the drive head’s detection circuitry are critical to increasing aerial density because as bits are placed closer and closer together their magnetic fields must be weakened so that they don’t interfere with one another; indeed, some researchers speculate that we are about to hit the physical limit of how weak a magnetic field can be and still remain detectable, even by new generations of magnetoresistive drive heads and stochastic decoding techniques such as PRML. It is important to recognize that bit representations have actual physical dimensions at this level, however tiny: measured in units called microns (a millionth of a meter, abbreviated µm), an individual bit representation is currently a rectangular area about 4.0 µm high and .15 µm wide; by contrast, a red blood cell is about 8 µm in diameter, an anthrax spore about 6 µm. Individual bit representations are visible as traceable inscriptions using instrumentation like Magnetic Force Microscopy, which I will be discussing in more detail later in the chapter (the images are striking). While all storage media, including printed books, are volumetric—that is, the surface area and structural dimensions of the media impose physical limitations on its capacity to record data—the history of magnetic media in particular has been marked by a continuous struggle with aerial densities.
It is rationalized. There is no portion of the volumetric space of the drive that is left unmapped by an intricate planar geometry comprised of tracks (sometimes called cylinders) and sectors. Put another way, the spatial tolerances within which data is written onto the drive (and read back from it) are exquisitely rationalized, much more akin to a Cartesian matrix than a blank canvas. Tracks may be visualized as concentric rings around the central spindle of each platter, tens of thousands of them on a typical disk. Sectors, meanwhile, are the radial divisions extending from the spindle to the platter’s edge. The standard size for a sector is 512 bytes or 4096 bits; if we remember that aerial densities of 10,000,000,000 bits per square inch are common, we can get some idea (however abstract) of just how many sectors there in each of the disks many thousands of tracks. (A technique called zoned bit recording allows the outermost tracks, which occupy the greatest linear space, to accommodate proportionately more sectors than the inner tracks.) Formatting a disk, an exercise which many will have performed with floppies, is the process by which the track and sector divisions—which are themselves simply flux reversals—are first written onto the media. There is thus no such thing as writing to the disk anterior to the overtly rationalized gesture of formatting. There is in addition a very low-level type of formatting, always done at the factory, called servo writing. This entails writing a unique identifier (called a servo code) for each separate track so that the head can orient itself on the surface of the platter. Formatting a disk in the way that most are familiar with the process does not alter the servo codes, which the drive’s firmware prevent a user from even accessing. This information is permanently embedded in the platter for the practical life of the drive. Thus, digital inscription, even on the scale of flux reversals embedded in magnetic media, is never a homogenous act.
Every formatted hard disk stores its own self-representation, a table of file names and addresses known (on Windows systems) as the File Allocation Table (FAT). The FAT, which dates back to DOS (which itself stands for Disk Operating System, the software layer that moved data back and forth between disk storage and the computer’s semiconductor RAM), is the skeleton key to the drive’s content. It lists every file on the disk, together with its address. The notorious eight character/three character file naming convention of DOS and early Windows systems was a direct artifact of their FAT. The basic unit for file storage is not the sector but rather clusters, larger groupings of typically 32 or 64 contiguous sectors in a track. Since the size of a file rarely corresponds exactly to a multiple of the size of a cluster, most files have empty sectors appended after the logical end of the file—these unused sectors are called slack space and sometimes contain data remnants from previous files. Clusters, furthermore, are not necessarily contiguous; larger files may be broken up into clusters scattered all over volumetric interior of the drive. Thus, a file ceases to have much meaning at the level of the platter; instead the links of its cluster chain are recorded in the FAT, where files exist only as strings of relative associations. Defragmenting a disk, another maintenance task with which readers will be familiar, is the process of moving far flung clusters closer to one another in order to improve the performance of the drive (note that the only active mechanical motion the slider arm performs is moving the heads from one track to another; the more this motion can be kept to a minimum therefore, the faster the access times). The FAT, and the data structures it maps, are arguably the apotheosis of a rationalization and atomization of writing space that began with another random access device, the codex.
One final point: it is well known that “deleting” a file does not actually remove it from the disk, even after emptying the so-called Recycle Bin. Instead, in keeping with the volumetric nature of disk storage, the delete command simply tells the FAT to make the clusters associated with a given file available again for future use—a special hex character (E5h) is affixed to the beginning of the file name, but the data itself stays intact on the platter. Common desktop utilities work by removing the special character and restoring files to the FAT as allocated clusters; more advanced techniques are sometimes capable of deeper recoveries, even after the clusters have been rewritten. We will be looking at these matters in more detail later in the chapter, but for now the point is simply the master role played by the FAT, itself a purely grammatological construct, in legislating the writing space of the drive.
It is motion-dependent. As many commentators have pointed out computing is a culture of speed, and hard drives are no exception. Motion and raw speed are integral aspects of their operation as inscription technologies. Once the computer is turned on, the hard disk is in near constant motion. The spindle motor rotates the platters at up to 10,000 revolutions per minute. This motion is essential to the functioning of the drive for two reasons. First, while the read/write head is moved laterally across the platter by the actuator arm when seeking a particular track, the head depends upon passive motion to access individual sectors: that is, once the head is in position at the appropriate track it simply waits for the target sector to rotate past. (Incidentally, platters spin counter-clockwise; note that this means that head actually reads and writes right to left.) In the past, heads were not sensitive enough to read sectors fast enough as they spun by, which lead to elaborate encoding schemes that “interleaved” or staggered the sectors such that sequential pieces of the file were accessed over the course of multiple rotations. Due to a number of factors, heads are now more than sensitive enough to read each sector in passing, and interleaving is no longer necessary.
Motion is also fundamental to the operation of the drive in a second and even more basic sense. Unlike other forms of magnetic media such as video or audio tape, or even floppy disks, where the read/write heads physically touch the surface of the recording medium, the head of a hard disk drive “flies” above the platter at a distance a tiny fraction of the width of a human hair. (The actual distances are measured in units called nanometers. Earlier we encountered microns; one micron equals 1000 nanometers. Thus, even the length and breadth of bit representations vastly exceed the flying height of the drive head. If these distances are scaled upward we arrive at a picture of a jumbo jet flying a few millimeters above the surface of the Earth.) The rapid motion of the disk creates an air cushion that floats the head of the drive. Just as a shark must swim to breathe, a hard drive must be in motion to receive or return data. This air bearing technology, as it is called (pioneered at IBM in the 1950s), explains why dust and other contaminants must be kept out of the drive casing at all costs. If the heads touch the surface of the drive while it is in motion the result is what is known as a head crash: the head, which it must be remembered is moving at speeds upward of one hundred miles per hour, will plow a furrow across the platter, and data is almost impossible to recover. Thus, a key aspect of the hard drive’s materiality as an agent of digital inscription is quite literally created out of thin air.
It is non-volatile (but variable). Though magnetic media are subject to physical deterioration, tape reels kept in archival storage conditions have been known to preserve their data for upwards of fifty years. The advent of a random access storage device with non-volatile recording capabilities was a crucial catalyst for what we now consider “interactive” computing, a point to which I shall return shortly. Magnetic core memories, which were bulky mechanical precursors to disk storage, were random access but with much lower storage capacities and permanent data had to be rewritten with each successive access. The development of magnetic tape storage was roughly contemporaneous with disk technologies, but of course magnetic tape (and paper tape, which was used earlier) is a serial medium.
Just as important as magnetic disk storage’s non-volatility was the fact that the same volumetric area could be recycled and rewritten. Though the tendency in discussions of storage media is to fixate on permanence and preservation, it is worth remembering that the ability to erase and change data rapidly was a key characteristic of the computer as envisioned by pioneers like Norbert Wiener. Punched cards and paper tape clearly did not meet these criteria. Therefore, alongside of its non-volatility, we must also acknowledge magnetic media’s variability. Interestingly, holographic storage, which some see as eventually replacing magnetic media—data is stored in a solid array of crystals—is not generally reusable. One speculation is that holographic storage will be so cheap and capacious that it will not be functionally or economically necessary to ever erase anything. (With holographic storage, aerial density thus becomes a three-dimensional metric.) Such a technology would explode current conventions of data storage, re-conceiving human computer interaction as fundamentally as random-access non-volatile (but variable) storage media did in the 1950s. A glimpse of that future is perhaps to be had in Microsoft researcher Gorden Bell’s MyLifeBits project, described as “a lifetime store of everything. . . . Gordon Bell has captured a lifetime’s worth of articles, books, cards, CDs, letters, memos, papers, photos, pictures, presentations, home movies, videotaped lectures, and voice recordings and stored them digitally. He is now paperless, and is beginning to capture phone calls, television, and radio.”
I'm drinking this in ... great stuff. I was at a zoo today and suddenly realized that the term "fledgling" has an orinthological origin -- it's not a metaphor to apply the term to birds. It's amazing that I've been using that word for decades and it never occurred to me. Thanks for similarly making me understand the term "hard drive crash".
A post on netwoman reads:
Dale Spender and Helen Fallon (1998) also assert that terminology such as 'abort', 'chaining', 'thrashing', 'execute', 'head crash', and 'kill' portray negative images of sex and violence to women, creating an uncomfortable and unfamiliar terrain. http://www.netwomen.ca/Blog/2003_09_01_archive.html#106427418616569858
I haven't read the specific article referenced, but I wonder if your description of the technology of computers as a physical environment (on the micro level) would place the percieved violence of computer terminology into another context.
I have lost so many files to my hard drive, and I know I've been lazy in backing up a hard drive at home that's dying, so I know I'll suffer more data loss. I imagine if I were to get cancer, I'd want to know all I could about the disease, just so I could know what it was doing to my body. I realize that "my hard drive crashed" is the modern equivalent of "my dog ate my homework," but non-geeks are generally mystified by what goes on in their computers.
"Incidentally, platters spin counter-clockwise; note that this means that head actually reads and writes right to left."
1) Possibly a missing word... should that be "this means that THE head"?
2) I remember once some old college friends of mine had a disagreement about when a shower curtain was "open" -- was the curtian "open" when it was crumpled up to permit access to the shower, or whas the curtain "open" when it was unfolded, permitting the curtain to dry?
Maybe I just can't visualize the "right to left" motion you describe. When I watch the second hand sweeping around the face of a clock, from my perspective the hand is moving left to right as frequently as it is moving from right to left. A clock has a conventional neutral position -- 12:00 -- that can be a reference point to describe the motion of the second hand as starting off moving from left to right (even though it makes as much sense to say the second hand starts off moving from up to down). But the stylus of a phonograph album is probably a better analogy for the read/write head in a hard drive, and that stylus is generally on the right side, not either on the top or bottom -- so the vinyl is moving vertically through the position occupied by the needle. I presume that placement of the stylus is because most people are right-handed.
Please don't interpret my questions as nitpicks -- and if the notes or other sections will handle any of my objections, I'm happy to wait for the book. But I am curious... is there a similar reference point in a hard drive that makes the "right to left" perspective obvious?
Posted by: Dennis G. Jerz at October 13, 2003 11:40 PM | Link to CommentThanks for this, Dennis--just the kind of feedback I was hoping for in terms of letting me know where the techical description breaks down. "Right to left" because the head will be motionless as the surface of the platter, spinning counter-clockwise, rotates past underneath; if you visualize that it necessitates a right to left read/write pattern.
Posted by: MGK at October 14, 2003 12:22 AM | Link to CommentHmm... that works if you envision the r/w head like the hand of a clock pointing up to 12. But if you envision the r/w head as a hand pointing down to 6, then wouldn't it be equally possible to say that the surface is moving past the head left to right? I'm not saying that it makes more sense to envision the r/w head as a hand pointing down, but I don't know why it WOULD make more sense to envision a hand pointing up. Even saying something like "hard drives are conventionally mounted such that the head reads and writes from right to left" would satisfy my idle curiosity.
Posted by: Dennis G. Jerz at October 14, 2003 12:44 AM | Link to CommentOh, I see what you're asking . . . the head remains in a relatively fixed position at the end of the actuator arm. It seeks to find tracks, but waits passively for sectors to spin past--it doesn't circumnavigate the platter. So the head does in fact align at a roughly 12 o'clock position, and it's more or less fixed there except for some lateral movement.
Next time you a drive fails and you're done with it go ahead and break it open.
Posted by: MGK at October 14, 2003 08:39 AM | Link to CommentOK, that answers that question... thanks for the clarification. I've got a hard drive that's slowly dying on my home computer, so maybe I will soon get the chance to perform a post-mortem.
Posted by: Dennis G. Jerz at October 15, 2003 12:16 AM | Link to CommentWell, I haven't read the Kozierok, but this is the most lucidly materialist explanation of this particular (popularly mystified) component I've ever read. Nice work, Matt.
Posted by: Steve Jones at October 17, 2003 09:10 AM | Link to CommentIf files that are deleted remain on the hard drive doesn't it fill up? When you check properties to see how much memory is left on the HD there is a pie chart showing used and available portions. As the disk fills up with the deleted files (assuming you saved an awful lot of them!) is this reflected in the piecharts used section? And, finally, is there anywhere where you can look to see how much memory the deleted files are taking up?
Hi Andy,
Good questions. The files remain on the drive, but they are re-allocated as "free" space and are gradually overwritten as more data is saved. There are, however, a variety of techniques for recovering and restoring files even after they have been overwritten, though the effectiveness of such techniques diminishes over time. Check out Norton Utilities for a basic starting point.
Posted by: MGK at October 31, 2003 10:02 AM | Link to CommentI talked to someone the other night about whether the deleted files eventually got overwritten, and she said they did. Of course, it would be interesting to know at what point they were overwritten: i.e. whether it was when the HD was nearly full or whether it was to do with the fragmented state of the drive.
I find watching the defragmentation process quite therapeutic as the various pieces of files get moved about and larger spaces are created. In itself the thought that a file might be split up into various pieces and slotted into spaces all over the computer, but is magically "joined together" when you open it up again is facinating. Of course, it may be that although the file appears on the screen of all-of-a-piece, in reality it is still in pieces, so to speak: the bytes of the particular "length" of the bit stream in reality remaining separated. Although you may have a piece of text of a certain length, say 20 lines, which is a meaningful whole in cognitive terms, in reality it may be divided up on the hard-drive in a completely unmeaningful way, in the middle of sentences and words, even.
Sorry, getting carried away here.