Junctional DNA.

Posted on April 25, 2007 by T. Ryan Gregory

JR Minkel at the Scientific American blog has responded to the post on Evolgen about his earlier story regarding “junk DNA” (did you catch all that?). At the end of the post, he asks:

Scientists and scientist bloggers: Again, do you care [if journalists call it junk DNA]? If so, what term would you propose instead, or how would you make the distinction between functional and nonfunctional noncoding DNA clear to a popular audience?

Yes, I care, and here are my suggestions. If you mean the general category without any speculation either way about function, then it is simply and accurately “noncoding DNA”. If it has a function, then you specify what that function is: “regulatory DNA” or “structural DNA” or what have you. If the type of sequence is known, then you can use that as well or instead: “transposable elements” or “mobile DNA” or “pseudogenes” or “introns”. Maybe readers won’t know what those terms mean. This is a good opportunity to inform them.

What is missing is a term to describe a given collection of noncoding DNA for which there is thought to be some function, but for which that function and/or the type of sequence is unknown. This would reside somewhere between “junk DNA” (in the vernacular sense) and “functional DNA” (to which specific names can be applied). I therefore suggest the neologism “junctional DNA” to encompass this category. Note that Petsko (2003) suggested “funk DNA” to represent “functionally unknown DNA”, but I think “junctional DNA” is a little less, uh, funky.

Let me be even more specific. The proposed term “junctional DNA” derives from a dual etymology: 1) a simple portmanteau of “junk” and “functional”; 2) an indication that the sequences so described reside at the crossroads between DNA with no evident function and that with a clear function.

Two terms in one day — “the onion test” and “junctional DNA” — how ’bout that.

Incidentally, my annoyance with such reports has less to do with the terminology than with the fact that the highly conserved sequences in question make up about 5% of the total genome. To jump from this to imply that all noncoding DNA is recognized as functional is inappropriate and misleading. I also wish they would cite the source papers they reference; some of us would like to look up the primary material when we see a summary in a news story.

_______________

Update: Other bloggers (RPM of Evolgen in personal correspondence, Sandwalk) seem to think this term is not needed. I point out that this post was given in direct response to Minkel’s appeal for a term that would “make the distinction between functional and nonfunctional noncoding DNA clear to a popular audience”. In light of the fact that a journalist sees the need for such a term, and that it was coined in response to that need, I think ‘junctional DNA’ could be a useful term.

The onion test.

Posted on April 25, 2007 by T. Ryan Gregory

I am not sure how official this is, but here is a term I would like to coin right here on my blog: “The onion test”.

The onion test is a simple reality check for anyone who thinks they have come up with a universal function for non-coding DNA¹. Whatever your proposed function, ask yourself this question: Can I explain why an onion needs about five times more non-coding DNA for this function than a human?

The onion, Allium cepa, is a diploid (2n = 16) plant with a haploid genome size of about 17 pg. Human, Homo sapiens, is a diploid (2n = 46) animal with a haploid genome size of about 3.5 pg. This comparison is chosen more or less arbitrarily (there are far bigger genomes than onion, and far smaller ones than human), but it makes the problem of universal function for non-coding DNA clear².

Further, if you think perhaps onions are somehow special, consider that members of the genus Allium range in genome size from 7 pg to 31.5 pg. So why can A. altyncolicum make do with one fifth as much regulation, structural maintenance, protection against mutagens, or [insert preferred universal function] as A. ursinum?

Left, A. altyncolicum (7 pg); centre, A. cepa (17 pg); right, A. ursinum (31.5 pg).
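For anyone who wants to check the arithmetic behind the onion test, here is a minimal sketch in Python. It uses only the haploid genome sizes quoted above; the dictionary and the function name `fold_difference` are my own illustrative choices, and total genome size is used as a rough stand-in for non-coding DNA on the assumption that the coding fraction is small in all four species.

```python
# Haploid genome sizes in picograms, exactly as quoted in the post.
GENOME_SIZE_PG = {
    "Homo sapiens": 3.5,
    "Allium altyncolicum": 7.0,
    "Allium cepa": 17.0,
    "Allium ursinum": 31.5,
}


def fold_difference(larger, smaller, sizes=GENOME_SIZE_PG):
    """Return how many times bigger the first species' genome is than the second's."""
    return sizes[larger] / sizes[smaller]


# Onion versus human: the "about five times" in the onion test.
onion_vs_human = fold_difference("Allium cepa", "Homo sapiens")

# Largest versus smallest Allium listed: the spread within a single genus.
within_allium = fold_difference("Allium ursinum", "Allium altyncolicum")

print(f"A. cepa / H. sapiens: {onion_vs_human:.2f}x")
print(f"A. ursinum / A. altyncolicum: {within_allium:.2f}x")
```

The two ratios come out at roughly 4.9 and 4.5, which is where "about five times" and "one fifth" come from. Any proposed universal function has to account for both.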

There you have it. The onion test. To be applied to any ambitious claims that a universal function has been found for non-coding DNA.

____________

¹ I do not endorse the use of the term “junk DNA”, which I think has deviated far too much from its original meaning and is now little more than a loaded buzzword; the descriptive term “non-coding DNA” is what I use to refer to the majority of eukaryotic sequences (of various types) that do not encode protein products.

² Some non-coding DNA certainly has a function at the organismal level, but this does not justify a huge leap from “this bit of non-coding DNA [usually less than 5% of the genome] is functional” to “ergo, all non-coding DNA is functional”.

From "Pangenesis" to "Genome".

Posted on April 18, 2007 by T. Ryan Gregory

The term “genetics” has been used in reference to the branch of science dealing with “the physiology of heredity and variation” since 1905. It was coined by the British biologist William Bateson, first in a 1905 letter (see Bateson 1928), and then publicly the following year (Bateson 1906). It was derived directly from the Greek for “birth” (or “origins”).

Straightforward enough. But what about “gene” and “genome”? These terms are interesting because they illustrate the evolution of both concept and language in science and involve both co-option and hybridization.

First, “gene”. Even after the term “genetics” was in use, it was not entirely clear what practitioners of the science were studying. Indeed, the concept of a fundamental physical and functional unit (or “determiner”) of heredity remained very vague. In 1909, Danish biologist Wilhelm Johannsen sought to pin down a term to describe these genetic elements. Although some people attribute the origin of “gene” to the same etymology as “genetics”, there is more to the story. In actuality, “gene” was derived indirectly from Darwin’s (incorrect) theory of heredity known as “pangenesis”. Indirectly, because it morphed through the term “pangens” coined by the Dutch botanist Hugo de Vries in 1889 in reference to genetic units and as an homage to Darwin, even though his theory of heredity differed markedly from pangenesis (de Vries was a Mendelian).

According to Johannsen (1909, p.143), he came up with the term “gene” by choosing to isolate

the last syllable ‘gene’, which alone is of interest to us, from Darwin’s well known word (Pangenesis) and thereby replace the less desirable ambiguous word ‘determiner’. Consequently, we will speak of ‘the gene’ and ‘the genes’ instead of ‘pangen’ and ‘the pangens’. The word gene is completely free from any hypothesis; it expresses only the evident fact that, in any case, many characteristics of the organism are specified in the germ cells by means of special conditions, foundations, and determiners which are present in unique, separate, and thereby independent ways – in short, precisely what we wish to call genes. [Translation as in Portugal and Cohen 1977].

Johannsen (1909) was also responsible for the terms “genotype” and “phenotype”. As he summarized in 1911,

I have proposed the terms ‘gene’ and ‘genotype’ … to be used in the science of genetics. The ‘gene’ is nothing but a very applicable little word, easily combined with others, and hence it may be useful as an expression for the ‘unit-factors’, ‘elements’ or ‘allelomorphs’ in the gametes, demonstrated by modern Mendelian researches. A ‘genotype’ is the sum total of all the ‘genes’ in a gamete or in a zygote.

So, we have an evolution of the term from “pangenesis” (Darwin) to “pangens” (de Vries) to “genes” (Johannsen), passing through an incorrect theory of heredity to a term “completely free from any hypothesis” about inheritance to Mendelian genetics.

What about “genome”?

According to the Oxford English Dictionary, the term “genom(e)” was coined by the German botanist Hans Winkler in 1920 as a portmanteau of gene and chromosome (the latter term having been coined by Wilhelm Waldeyer in 1888). This story has been repeated by many authors (including yours truly; Gregory 2001), but has been challenged by Lederberg and McCray (2001), who suggest that Winkler probably merged gene with the generalized suffix ‘ome (referring to “the entire collectivity of units”), and not ‘some (“body”) from chromosome. In either case, Winkler’s intent was to “propose the expression Genom for the haploid chromosome set, which, together with the pertinent protoplasm, specifies the material foundations of the species” (translation as in Lederberg and McCray 2001).

Based on this initial formulation, “genome” can accurately be taken to mean either the total gene complement (interchangeably with Johannsen’s “genotype”), or the total DNA amount per haploid chromosome set – but not both, as we now know that these are not correlated with one another. This latter issue remains the subject of active study, and I shall have much more to say about it in future postings.

__________

References

Bateson, W. 1906. A text-book of genetics. Nature 74: 146-147.

Bateson, W. 1928. Letter to Sedgwick, April 18, 1905. In William Bateson, F.R.S.: His Essays and Addresses (ed. B. Bateson), pp. 93. Cambridge University Press, Cambridge.

De Vries, H. 1889. Intrazelluläre Pangenesis. Fischer, Jena.

Gregory, T.R. 2001. The bigger the C-value, the larger the cell: genome size and red blood cell size in vertebrates. Blood Cells, Molecules, and Diseases 27: 830-843.

Johannsen, W. 1909. Elemente der Exakten Erblichkeitslehre. Fischer, Jena.

Johannsen, W. 1911. The genotype conception of heredity. American Naturalist 45: 129-159.

Lederberg, J. and A.T. McCray. 2001. ‘Ome sweet ‘omics — a genealogical treasury of words. The Scientist 15: 8.

Portugal, F.H. and J.S. Cohen. 1977. A Century of DNA. MIT Press, Cambridge, MA.

Winkler, H. 1920. Verbreitung und Ursache der Parthenogenesis im Pflanzen- und Tierreiche. Verlag Fischer, Jena.

The discovery of DNA.

Posted on April 13, 2007 by T. Ryan Gregory

In the mid- to late 1800s (and to an extent, well into the 20th century), proteins were considered the most significant components of cells. Their very name reflects this fact, being derived from the Greek proteios, meaning “of the first importance”. In 1869, while developing techniques to isolate nuclei from white blood cells (which he obtained from pus-filled bandages, a plentiful source of cellular material in the days before antiseptic surgical techniques), 25-year-old Swiss biologist Friedrich Miescher stumbled across a phosphorus-rich substance which, he stated, “cannot belong among any of the protein substances known hitherto” (quoted in Portugal and Cohen 1977 [1]). To this substance he gave the name nuclein, and published his results in 1871 after confirmation of the remarkable finding by his advisor, Felix Hoppe-Seyler (for reviews, see Mirsky 1968; Portugal and Cohen 1977; Lagerkvist 1998; Wolf 2003) [2, 3].

Miescher continued his work on nuclein for many years, in part refuting claims that it was merely a mixture of inorganic phosphate salts and proteins. Yet Miescher never departed from the common proteinocentric wisdom, and instead suggested that the nuclein molecule served as little more than a storehouse of cellular phosphorus. In 1879, Walther Flemming coined the term chromatin (Gr. “colour”) in reference to the coloured components of cell nuclei observed after treatment with various chemical stains, and in 1888 Wilhelm Waldeyer used the term chromosome (Gr. “colour body”) to describe the threads of stainable material found within the nucleus. For some time, debate existed over whether or not chromatin and nuclein were one and the same. The argument was largely settled when Richard Altmann obtained protein-free samples of nuclein in 1889. As part of this work, Altmann proposed a more appropriate (and familiar) term for the substance, nucleic acid. Over time, the components of the nucleic acid molecules were deduced, and by the 1930s, nuclein had become desoxyribose nucleic acid, and later, deoxyribonucleic acid (DNA).

The important developments that took place over the ensuing decades are well documented (e.g., Portugal and Cohen 1977; Judson 1996), including early hypotheses of DNA’s structure (such as Phoebus Levene’s failed tetranucleotide hypothesis, or the incorrect helical model of Linus Pauling), Erwin Chargaff’s discovery of the constant ratio of the two purines with their respective pyrimidines, Rosalind Franklin’s x-ray crystallography of the DNA molecule, and other key developments leading up to Watson and Crick’s monumental synthesis in 1953 and the subsequent deciphering of the genetic code.

Miescher died of tuberculosis in 1895 at the age of 51. His was a major contribution to biology, as were the discoveries of countless other individuals up to and beyond the elucidation of DNA’s physical structure and the dawn of molecular genetics.

————

Notes

[1] I stumbled across this book at a used bookstore in Madison, Wisconsin at the 1999 SSE meeting. That was in the days before searches on Amazon.com, Google, and Wikipedia were easy and routine, and I was unaware that the book existed so I considered it quite a lucky find.

[2] Hoppe-Seyler also had his own journal, in which Miescher’s results were published, but was not a co-author on the paper. My, how things have changed!

[3] For more information about Miescher, see the following:

References

Judson, H.F. 1996. The Eighth Day of Creation. CSHL Press, Plainview, NY.

Lagerkvist, U. 1998. DNA Pioneers and Their Legacy. Yale University Press, New Haven, CT.

Miescher, F. 1871. Über die chemische Zusammensetzung der Eiterzellen. Hoppe-Seyler’s medizinisch-chemischen Untersuchungen 4: 441-460.

Mirsky, A.E. 1968. The discovery of DNA. Scientific American 218 (June): 78-88.

Portugal, F.H. and J.S. Cohen. 1977. A Century of DNA. MIT Press, Cambridge, MA.

Tracy, K. 2005. Friedrich Miescher and the Story of Nucleic Acid. Mitchell Lane Publishers.

Wolf, G. 2003. Friedrich Miescher, the man who discovered DNA.

A word about "junk DNA".

Posted on April 11, 2007 by T. Ryan Gregory

“It seems as though ‘junk DNA’ has become a legitimate jargon in a glossary of molecular biology. Considering the violent reactions this phrase provoked when it was first proposed in 1972, the aura of legitimacy it now enjoys is amusing, indeed.”

– Ohno and Yomo, 1991

The origin of “junk DNA”

Two main problems struck Susumu Ohno as particularly important in his seminal work on the genetics of evolutionary diversification. The first was the lack of correspondence between genome size (amount of DNA) and morphological complexity (taken as a proxy for gene number), which was a prominent topic of discussion in the early 1970s. As he noted in 1972, “If we take the simplistic assumption that the number of genes contained is proportional to the genome size, we would have to conclude that 3 million or so genes are contained in our genome. The falseness of such an assumption becomes clear when we realize that the genome of the lowly lungfish and salamanders can be 36 times greater than our own” (Ohno 1972a). In fact, Ohno and his colleagues were well aware that much of the DNA in the mammalian genome could not code for proteins, lest the mutational load become fatally high (e.g., Comings 1972; Ohno 1972b, 1974).

The second problem related to the conservative force of purifying selection and the limitations it places on the diversification of species. Ohno (1973) attempted to kill both of these vexatious birds with a single conceptual stone:

The points I wish to make are: 1) Natural selection is an extremely conservative force. So long as a particular function is assigned to a single gene locus in the genome, natural selection only permits trivial mutations of that locus to accompany evolution. 2) Only a redundant copy of a gene can escape from natural selection and while being ignored by natural selection can accumulate meaningful mutation to emerge as a new gene locus with a new function. Thus, evolution has been heavily dependent upon the mechanism of gene duplication. 3) The probability of a redundant copy of an old gene emerging as a new gene, however, is quite small. The more likely fate of a base sequence which is not policed by natural selection is to become degenerate. My estimate is that for every new gene locus created about 10 redundant copies must join the ranks of functionless DNA base sequence. 4) As a consequence, the mammalian genome is loaded with functionless DNA.

The corpulent genomes of dipnoans and urodele amphibians were similarly accounted for under this view: “Lungfish and salamanders clearly show the tragic consequences of exclusive dependence upon tandem duplication” (Ohno 1970, p.96). Of course, this differs from current thinking about lungfish and salamander genome size, but that’s another story.

To Ohno, this situation not only permitted, but also paralleled, the evolution of life at large. As he put it, “The earth is strewn with fossil remains of extinct species; is it any wonder that our genome too is filled with the remains of extinct genes?” (Ohno 1972a). The primary outcome of this gene duplication mechanism would not be the generation of new genes, but the deactivation of redundant copies – just as extinction has been the fate of more than 99% of species that have ever lived (Raup 1991). Once purifying selection ceased to shelter gene sequences from change, they would be free to mutate and, if one imagines a set of three gene copies initially sharing the same sequence, it is likely that “in a relatively short time, two of the three duplicates would join the ranks of ‘garbage DNA’” (Ohno 1970, p.62).

In Ohno’s usage, as in the vernacular, “garbage” refers to both the loss of function and the lack of any further utility (it was once useful, but now it isn’t). “Garbage DNA” proved to be an unsuccessful meme, but its essence remains in the wildly popular term coined by Ohno two years later – “junk DNA”. Thus, as Ohno (1972b) stated, “at least 90% of our genomic DNA is ‘junk’ or ‘garbage’ of various sorts”. Interestingly, Ohno mentioned “junk DNA” only in the titles of two of his papers (1972a, 1973), and invoked the term only once in passing in a third (1972b). Comings (1972), on the other hand, gave what must be considered the first explicit discussion of the nature of “junk DNA”, and was the first to apply the term to all non-coding DNA.

There are several independent mechanisms by which non-coding DNA can accumulate in the genome. Gene duplication and deactivation is one such mechanism, but this, we now know, applies to only a minority of the non-coding sequences. Nevertheless, the term “junk DNA” was used in some early general descriptions of non-coding elements, including heterochromatin. For example, Comings (1972) noted that:

It has frequently been suggested that the DNA of genetically inactive heterochromatin represents the degenerate and useless DNA of the genome. However, heterochromatin rarely constitutes more than 20% of the genome. This suggests that there are two categories of junk DNA, (1) DNA of constitutive heterochromatin which is neither transcribed nor translated, and (2) nonheterochromatic junk DNA which is probably transcribed, but not translated. This distinction adds one more dimension to the mystery of heterochromatic DNA. Why is it singled out to be nontranscribable when being nontranslatable seems adequate for most of the junk DNA? Perhaps there is clustered junk (heterochromatic DNA) and nonclustered junk, just as there is clustered repetitious DNA (satellite DNA) and nonclustered repetitious DNA.

Later, Ohno himself began applying the term “junk” to heterochromatic, intergenic, and intronic sequences: “Much of this junk DNA occurs as large heterochromatin blocks, often localized in pericentric regions of mammalian chromosomes, or as intergenic spacers and intervening sequences within genes.” (Ohno 1985).

It is clear, however, that Ohno (1982) believed all these sequences were produced by gene duplication:

This great preponderance of intergenic spacers in the euchromatic region is due mostly to the extreme inefficacy of the mechanism of gene duplication as a means of creating new genes with altered active sites. For every redundant copy of the pre-existent gene that emerged triumphant as a new gene, hundreds of other copies must have degenerated to join the rank of junk DNA.

This mechanism alone was considered capable of explaining the vast intergenic regions of eukaryotic genomes. According to Ohno (1985):

Indeed, the abundance of pseudogenes (recent degenerates) attests to the inefficacy of gene duplication as a means of acquiring new genes with novel functions. The net consequence of hundreds of millions of years of continuous gene duplication is the desertification of the euchromatic region of modern vertebrates; the average distance between still functioning gene loci becoming progressively longer.

Junk DNA, function, and non-function

“Junk DNA” had a specific meaning when it first was formulated. It was meant to describe the loss of protein-coding function by deactivated gene duplicates, which in turn were believed to constitute the bulk of eukaryotic genomes. As different types of non-coding DNA were identified, the concept of gene duplication as their source – and therefore “junk DNA” as their descriptor – found new and broader application. However, it is now clear that most non-coding DNA is not produced by this mechanism, and is therefore not accurately described as “junk” in the original sense.

The term “pseudogene” (the technical term for functionless gene copies) was not coined until 1977 (Jacq et al. 1977), and the more explicit definition of these sequences that specified non-function in terms of protein-coding emerged almost a decade later. So, although Ohno’s original description of “junk DNA” obviously involved what are now called “pseudogenes”, there was no initial requirement for non-function. As Comings (1972) put it, “Being junk doesn’t mean it is entirely useless. Common sense suggests that anything that is completely useless would be discarded.” (This is what Sydney Brenner meant by the distinction between “trash” or “rubbish”, which one throws away, and “junk”, which one keeps; Brenner 1998). Of course, Ohno did reject the notion of protein-coding function for the extinct genes. As he described it, “a functional gene locus is defined as that DNA base sequence which may sustain deleterious mutations”, and from this it followed that “a DNA base sequence in which all sorts of mutational changes are permissible is obviously not contributing to the well-being of an organism, and for this very reason, it has no function” (Ohno 1973). On the other hand, and in the same publication, Ohno (1973) suggested a different role for non-coding DNA: “The bulk of functionless DNA in the mammalian genome may serve as a damper to give a reasonably long cell generation time (12 hours or so instead of several minutes)”.

From the very beginning, the concept of “junk DNA” has implied non-functionality with regard to protein-coding, but left open the question of sequence-independent impacts (perhaps even functions) at the cellular level. “Junk DNA” may now be taken to imply total non-function and is rightly considered problematic for that reason, but no such tacit assumption was present in the term when it was coined.

Two groups of people, though maximally divergent in their reasons for so doing, have been driven by a philosophical need to identify functions for all non-coding DNA. The first includes strict adaptationists, among whom it was often assumed that all non-coding DNA, by virtue of its very existence, must be endowed with some as-yet-unknown function of critical importance: “The very fact that amplified sequences have been maintained, withstanding rigours of selection, indicates some adaptive significance” (Sharma 1985).

We may also consider the following discussion comments recorded at the end of Ohno (1973):

Yunis: “This is what I emphasized earlier, that this DNA must have a functional value since nothing is known so widespread and universal in nature that has proven useless.”

Fraccaro: “Well, there is an exception to that rule. A lot of us have permanent positions at the University but are considered by others (mainly by students) meaningless and of no utility whatsoever.”

These examples aside, it seems likely that most evolutionary biologists today could tolerate a conclusion, if such were rendered, that a significant fraction of non-coding DNA is functionless. This is not true of the second group in question, whose passion for function is unrivaled. As Dawkins (1999) suggested, “creationists might spend some earnest time speculating on why the Creator should bother to litter genomes with untranslated pseudogenes and junk tandem repeat DNA”. In fact, many have done so (e.g., Gibson 1994; Wieland 1994; Batten 1998; Jerlström 2000; Walkup 2000; Woodmorappe 2000; Bergman 2001). Although apparently “not enough is yet known about eukaryotic genomes to construct a comprehensive creationist model of pseudogenes” (Woodmorappe 2000), the theme that undergirds all of these discussions is that all non-coding DNA must, a priori, be functional.

To satisfy this expectation, creationist authors (borrowing, of course, from the work of molecular biologists, as they do no such research themselves) simply conflate the various types of non-coding DNA, and mistakenly suggest that functions discovered for a few examples of some types of non-coding sequences indicate functions for all (see Max 2002 for a cogent rebuttal to these creationist confusions). Case in point: a few years ago, much ado was made of Beaton and Cavalier-Smith’s (1999) titular proclamation, based on a survey of cryptomonad nuclear and nucleomorphic genomes, that “eukaryotic non-coding DNA is functional”. The point was evidently lost that the function proposed by Beaton and Cavalier-Smith (1999) was based entirely on coevolutionary interactions between nucleus size and cell size.

Those who complain about a supposed unilateral neglect of potential functions for non-coding DNA simply have been reading the wrong literature. In fact, quite a lengthy list of proposed functions for non-coding DNA could be compiled (for an early version, see Bostock 1971). Examples include buffering against mutations (e.g., Comings 1972; Patrushev and Minkevich 2006) or retroviruses (e.g., Bremmerman 1987) or fluctuations in intracellular solute concentrations (Vinogradov 1998), serving as binding sites for regulatory molecules (Zuckerkandl 1981), facilitating recombination (e.g., Comings 1972; Gall 1981; Comeron 2001), inhibiting recombination (Zuckerkandl and Hennig 1995), influencing gene expression (Britten and Davidson 1969; Georgiev 1969; Nowak 1994; Zuckerkandl and Hennig 1995; Zuckerkandl 1997), increasing evolutionary flexibility (e.g., Britten and Davidson 1969, 1971; Jain 1980; reviewed critically in Doolittle 1982), maintaining chromosome structure and behaviour (e.g., Walker et al. 1969; Yunis and Yasmineh 1971; Bennett 1982; Zuckerkandl and Hennig 1995), coordinating genome function (Shapiro and von Sternberg 2005), and providing multiple copies of genes to be recruited when needed (Roels 1966).

Does non-coding DNA have a function? Some of it does, to be sure. Some of it is involved in chromosome structure and cell division (e.g., telomeres, centromeres). Some of it is undoubtedly regulatory in nature. Some of it is involved in alternative splicing (Kondrashov et al. 2003). A fair portion of it in various genomes shows signs of being evolutionarily conserved, which may imply function (Bejerano et al. 2004; Andolfatto 2005; Kondrashov 2005; Woolfe et al. 2005; Halligan and Keightley 2006). On the other hand, the largest fraction is composed of transposable elements, some of which become co-opted by the host genome, some of which play a major role in generating genomic variation, some of which may be involved in cellular stress response, and yet others of which remain detrimental to host fitness (Kidwell and Lisch 2001; Biémont and Vieira 2006). The upshot is that some non-coding DNA is most certainly functional, but when it is, this usually makes sense only in an evolutionary context, particularly through processes like co-option. More broadly, those who would attribute a universal function to non-coding DNA must bear the following in mind: any proposed function for all non-coding DNA must explain why an onion or a grasshopper needs five times more of it than anyone reading this sentence.

Should “junk” be thrown out?

There is nothing wrong with a word taking on a new meaning as knowledge changes – that is, unless reference to an original (and outmoded) sense lingers as a source of confusion, or the term expands so much as to lose contact with an initially accurate definition. Indeed, even the term “evolution” is technically a misnomer since its etymology implies an “unfolding”, as of a pre-determined developmental program (see Bowler 1975). The objection raised here is not to terms that change in usage per se, but to those whose shifting usage involves collecting or retaining unwanted conceptual baggage. This is especially relevant when the baggage is toted surreptitiously (note that no serious biologist takes “evolution” to mean a pre-determined unfolding but that ideas of inherent “progress” have been almost impossible to shake; see Gould 1996; Ruse 1996).

“Junk DNA”, which originally was coined in reference to now-functionless gene duplicates (i.e., true broken-down “junk”), is now used as “a catch-all phrase for chromosomal sequences with no apparent function” (Moore 1996). Its current usage also implies a lack of function which is accurate by definition for pseudogenes in regard to protein-coding, but which does not hold for all non-coding elements. The term has deviated from or outgrown its original use, and its continued invocation is non-neutral in its expression – and generation – of conceptual biases.

“Junk DNA” is not the only offender. Non-coding DNA has been called by many names that have had the same pejorative undertones (intentional or not) implying uselessness, if not outright wastefulness. Examples include excess DNA (Zuckerkandl 1976; Doolittle and Sapienza 1980), surplus or nonessential or degenerate or silent DNA (Comings 1972; Gilbert 1978), quiet DNA (Lefevre 1971), garbage DNA (Ohno 1970), non-informational or nonsense DNA (Ohno 1972b), worthless DNA (Ohno 1973), trivial DNA (Ohno 1974), vestigial DNA (Loomis 1973), redundant DNA (Vinogradov 1998), supplementary DNA (Hutchinson et al. 1980), secondary DNA (Hinegardner 1976), and incidental DNA (Jain 1980).

As Gould (2002, p.503) stated, “A rose may retain its fragrance under all vicissitudes of human taxonomy, but never doubt the power of a name to shape and direct our thoughts”. Because it is generally no longer applied in its original meaningful sense, because the type of DNA to which it actually relates now has a more descriptive name (pseudogenes), and because of its connotations of total phenotypic inertness, the term “junk DNA” should probably be abandoned in favour of less subjective terminology. “Non-coding DNA” serves this purpose quite well.

Concluding remarks

It is an exciting time in genome biology. Aspects of genomic form and function that were largely inconceivable only a few decades ago are now being revealed on a daily basis. It should come as no surprise (and indeed, it probably does not) that new roles are being discovered for non-coding DNA and that some of yesterday’s buzzwords — including “junk DNA” — are destined for the dustbin. However, extrapolating each report that a given small segment of DNA may be functional to mean that all non-coding DNA is vital is as counterproductive as dismissing non-coding DNA as totally non-functional. Genomes are complex, and there is little use in approaching them from a simplistic point of view.

——

Andolfatto, P. 2005. Adaptive evolution of non-coding DNA in Drosophila. Nature 437: 1149-1152.

Batten, D. 1998. ‘Junk’ DNA (again). Creation Ex Nihilo Technical Journal 12: 5.

Beaton, M.J. and T. Cavalier-Smith. 1999. Eukaryotic non-coding DNA is functional: evidence from the differential scaling of cryptomonad genomes. Proceedings of the Royal Society of London, Series B 266: 2053-2059.

Bejerano, G., M. Pheasant, I. Makunin, S. Stephen, W.J. Kent, J.S. Mattick, and D. Haussler. 2004. Ultraconserved elements in the human genome. Science 304: 1321-1325.

Bennett, M.D. 1982. Nucleotypic basis of the spatial ordering of chromosomes in eukaryotes and the implications of the order for genome evolution and phenotypic variation. In Genome Evolution (eds. G.A. Dover and R.B. Flavell), pp. 239-261. Academic Press, New York.

Bergman, J. 2001. The functions of introns: from junk DNA to designed DNA. Perspectives on Science and Christian Faith 53: 170-178.

Biémont, C. and C. Vieira. 2006. Junk DNA as an evolutionary force. Nature 443: 521-524.

Bostock, C. 1971. Repetitious DNA. Advances in Cell Biology 2: 153-223.

Bowler, P.J. 1975. The changing meaning of “evolution”. Journal of the History of Ideas 36: 95-114.

Bremmerman, H.J. 1987. The adaptive significance of sexuality. In The Evolution of Sex and its Consequences (ed. S.C. Stearns), pp. 135-161. Birkhauser Verlag, Basel.

Brenner, S. 1998. Refuge of spandrels. Current Biology 8: R669.

Britten, R.J. and E.H. Davidson. 1969. Gene regulation for higher cells: a theory. Science 165: 349-357.

Britten, R.J. and E.H. Davidson. 1971. Repetitive and non-repetitive DNA sequences and a speculation on the origins of evolutionary novelty. Quarterly Review of Biology 46: 111-138.

Castillo-Davis, C.I. 2005. The evolution of noncoding DNA: how much junk, how much func? Trends in Genetics 21: 533-536.

Comeron, J.M. 2001. What controls the length of noncoding DNA? Current Opinion in Genetics & Development 11: 652-659.

Comings, D.E. 1972. The structure and function of chromatin. Advances in Human Genetics 3: 237-431.

Dawkins, R. 1999. The “information challenge”: how evolution increases information in the genome. Skeptic 7: 64-69.

Doolittle, W.F. and C. Sapienza. 1980. Selfish genes, the phenotype paradigm and genome evolution. Nature 284: 601-603.

Doolittle, W.F. 1982. Selfish DNA after fourteen months. In Genome Evolution (eds. G.A. Dover and R.B. Flavell), pp. 3-28. Academic Press, New York.

Gall, J.G. 1981. Chromosome structure and the C-value paradox. Journal of Cell Biology 91: 3s-14s.

Georgiev, G.P. 1969. On the structural organization of operon and the regulation of RNA synthesis in animal cells. Journal of Theoretical Biology 25: 473-490.

Gibbs, W.W. 2003. The unseen genome: gems among the junk. Scientific American 289(5): 46-53.

Gibson, L.J. 1994. Pseudogenes and origins. Origins 21: 91-108.

Gilbert, W. 1978. Why genes in pieces? Nature 271: 501.

Gould, S.J. 1996. Full House. Harmony Books, New York.

Gould, S.J. 2002. The Structure of Evolutionary Theory. Harvard University Press, Cambridge, MA.

Halligan, D.L. and P.D. Keightley. 2006. Ubiquitous selective constraints in the Drosophila genome revealed by a genome-wide interspecies comparison. Genome Research 16: 875-884.

Hinegardner, R. 1976. Evolution of genome size. In Molecular Evolution (ed. F.J. Ayala), pp. 179-199. Sinauer Associates, Inc., Sunderland.

Hutchinson, J., R.K.J. Narayan, and H. Rees. 1980. Constraints upon the composition of supplementary DNA. Chromosoma 78: 137-145.

Jacq, C., J.R. Miller, and G.G. Brownlee. 1977. A pseudogene structure in 5S DNA of Xenopus laevis. Cell 12: 109-120.

Jain, H.K. 1980. Incidental DNA. Nature 288: 647-648.

Jerlström, P. 2000. Pseudogenes: are they non-functional? Creation Ex Nihilo Technical Journal 14: 15.

Kidwell, M.G. and D.R. Lisch. 2001. Transposable elements, parasitic DNA, and genome evolution. Evolution 55: 1-24.

Kondrashov, F.A. and E.V. Koonin. 2003. Evolution of alternative splicing: deletions, insertions and origin of functional parts of proteins from intron sequences. Trends in Genetics 19: 115-119.

Kondrashov, A.S. 2005. Fruitfly genome is not junk. Nature 437: 1106.

Lefevre, G. 1971. Salivary chromosome bands and the frequency of crossing over in Drosophila melanogaster. Genetics 67: 497-513.

Loomis, W.F. 1973. Vestigial DNA? Developmental Biology 30: F3-F4.

Makalowski, W. 2003. Not junk after all. Science 300: 1246-1247.

Max, E.E. 2002. Plagiarized errors and molecular genetics: another argument in the evolution-creation controversy. Talk.Origins Archive.

Moore, M.J. 1996. When the junk isn’t junk. Nature 379: 402-403.

Nowak, R. 1994. Mining treasures from ‘junk DNA’. Science 263: 608-610.

Ohno, S. 1970a. Evolution by Gene Duplication. Springer-Verlag, New York.

Ohno, S. 1970b. The enormous diversity in genome sizes of fish as a reflection of nature’s extensive experiments with gene duplication. Transactions of the American Fisheries Society 1970: 120-130.

Ohno, S. 1972. So much “junk” DNA in our genome. In Evolution of Genetic Systems (ed. H.H. Smith), pp. 366-370. Gordon and Breach, New York.

Ohno, S. 1973. Evolutional reason for having so much junk DNA. In Modern Aspects of Cytogenetics: Constitutive Heterochromatin in Man (ed. R.A. Pfeiffer), pp. 169-173. F.K. Schattauer Verlag, Stuttgart, Germany.

Ohno, S. 1974. Chordata 1: protochordata, cyclostomata, and pisces. In Animal Cytogenetics, Vol. 4 (ed. B. John), pp. 1-92. Gebrüder Borntraeger, Berlin.

Ohno, S. 1982. The common ancestry of genes and spacers in the euchromatic region: omnis ordinis hereditarium a ordinis priscum minutum. Cytogenetics and Cell Genetics 34: 102-111.

Ohno, S. 1985. Dispensable genes. Trends in Genetics 1: 160-164.

Patrushev, L.I. and I.G. Minkevich. 2006. Eukaryotic noncoding DNA sequences provide genes with an additional protection against chemical mutagens. Russian Journal of Bioorganic Chemistry 32: 1068-1620.

Petsko, G.A. 2003. Funky, not junky. Genome Biology 4: 104.

Raup, D.M. 1991. Extinction. W.W. Norton & Co., New York.

Roels, H. 1966. “Metabolic” DNA: a cytochemical study. International Review of Cytology 19: 1-34.

Ruse, M. 1996. Monad to Man. Harvard University Press, Cambridge, MA.

Shapiro, J.A. and R. von Sternberg. 2005. Why repetitive DNA is essential to genome function. Biological Reviews 80: 227-250.

Sharma, A.K. 1985. Chromosome architecture and additional elements. In Advances in Chromosome and Cell Genetics (eds. A.K. Sharma and A. Sharma), pp. 285-293. Oxford and IBH Publishing Co., New Delhi.

Slack, F.J. 2006. Regulatory RNAs and the demise of ‘junk’ DNA. Genome Biology 7: 328.

Vinogradov, A.E. 1998. Buffering: a possible passive-homeostasis role for redundant DNA. Journal of Theoretical Biology 193: 197-199.

Walker, P.M.B., W.G. Flamm, and A. McLaren. 1969. Highly repetitive DNA in rodents. In Handbook of Molecular Cytology (ed. A. Lima-de-Faria), pp. 52-66. North-Holland Publishing Co., Amsterdam.

Walkup, L.K. 2000. Junk DNA: evolutionary discards or God’s tools? Creation Ex Nihilo Technical Journal 14: 18-30.

Wickelgren, I. 2003. Spinning junk into gold. Science 300: 1646-1649.

Wieland, C. 1994. Junk moves up in the world. Creation Ex Nihilo Technical Journal 8: 125.

Woodmorappe, J. 2000. Are pseudogenes ‘shared mistakes’ between primate genomes? Creation Ex Nihilo Technical Journal 14: 55-71.

Woolfe, A., M. Goodson, D.K. Goode, P. Snell, G.K. McEwen, T. Vavouri, S.F. Smith, P. North, H. Callaway, K. Kelly, K. Walter, I. Abnizova, W. Gilks, Y.J.K. Edwards, J.E. Cooke, and G. Elgar. 2005. Highly conserved non-coding sequences are associated with vertebrate development. PLoS Biology 3: e7.

Yunis, J.J. and W.G. Yasmineh. 1971. Heterochromatin, satellite DNA, and cell function. Science 174: 1200-1209.

Zuckerkandl, E. 1976. Gene control in eukaryotes and the C-value paradox: “Excess” DNA as an impediment to transcription of coding sequences. Journal of Molecular Evolution 9: 73-104.

Zuckerkandl, E. and W. Hennig. 1995. Tracking heterochromatin. Chromosoma 104: 75-83.

Zuckerkandl, E. 1997. Junk DNA and sectorial gene expression. Gene 205: 323-343.

__________

Update: At Sandwalk, Larry Moran argues that the term “junk DNA” is “a good term”, “an accurate term”, and “a useful term”. You can read my response in the comments section of the original post or in my re-post on this blog.
