Acessibilidade / Reportar erro

Autapomorphies and the retention index: Solving a limit via l’Hôpital-Bernoulli’s rule

ABSTRACT

The retention index (r or ri) is used to compute the amount of similarity in a character that can be interpreted as a synapomorphy on a given cladogram. In the case of autapomorphies, the retention index assumes the form 0 / 0, which is a mathematically indeterminate form. Originally, this issue was resolved by assigning a value of one to the retention index in such cases. However, some authors did not follow this original interpretation and assumed that, in this case, the retention index must assume a value of zero. In this note, we present two arguments supporting the original definition. The first argument is an application of the concept of limit from calculus to determine the assumed value of the retention index function in this specific situation. The second argument is a biological one that invokes the original definitions of autapomorphy and synapomorphy.

KEY WORDS:
Calculus; homology; homoplasy; indeterminate form; mathematically undefined; synapomorphy

In phylogenetic systematics, several indices are used to measure the “fit”, i.e., the degree of homoplasy and/or the degree of synapomorphies, of a given character or phylogenetic tree, also interpreted as goodness of fit measures (Sober 2002Sober E (2002) Instrumentalism, parsimony, and the Akaike framework. Philosophy of Science 69: S112-S123. https://doi.org/10.1086/341839
https://doi.org/10.1086/341839...
, Turjak and Trontelj 2012Turjak M, Trontelj P (2012) A method for measuring support for synapomorphy using character state distributions on phylogenetic trees. Cladistics 28: 627-638. https://doi.org/10.1111/j.1096-0031.2012.00403.x
https://doi.org/10.1111/j.1096-0031.2012...
). For a good summary and review, see Kitching et al. (1998Kitching I, Forey P, Humphries C, Williams D (1998) Cladistics: The theory and practice of parsimony analysis. Oxford University Press, New York, 2nd ed., 228 pp.). Among these indices, the retention index (r or ri) is widely applied to compute “the fraction of apparent synapomorphy in the character that is retained as synapomorphy on the tree”, as originally defined (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
). See the original definition of synapomorphy (Hennnig 1966Hennnig W (1966) Phylogenetic Systematics. University of Illinois Press, Urbana, 263 pp.):

“We will call the characters or character conditions from which transformation started (a, b) in a monophyletic group plesiomorphous, and the derived conditions (a’, a”, b’, b”) apomorphous. […] We will call the presence of plesiomorphous characters in different species symplesiomorphy, the presence of apomorphous characters synapomorphy, always with the assumption that the compared characters belong to one and the same transformation series. […] It is evident that the presence of corresponding characters in two or more species is a basis for assuming that these species form a monophyletic group only if the characters are apomorphous, if their correspondence rests on synapomorphy.” ( Hennnig 1966 Hennnig W (1966) Phylogenetic Systematics. University of Illinois Press, Urbana, 263 pp. : 89, 90).

See also a more recent and logical definition of synapomorphy:

“Synapomorphy-the quality of sister taxa sharing a derived character state-is the central tenet of phylogenetic systematics.[…] A character state represents a synapomorphy for a group if the character state’s distribution pattern does not conflict with placement of the group in the taxonomic structure ( Patterson, 1982 Patterson C (1982) Morphological characters and homology. In: Joysey KA, Friday AE (Eds) Problems of phylogenetic reconstruction. Academic Press, London, 21-74. ). This means that, in an ideal case, a synapomorphic character state will be shared by all of the clade’s members (terminal taxa) and at the same time will not occur outside that clade.” ( Turjak and Trontelj 2012 Turjak M, Trontelj P (2012) A method for measuring support for synapomorphy using character state distributions on phylogenetic trees. Cladistics 28: 627-638. https://doi.org/10.1111/j.1096-0031.2012.00403.x
https://doi.org/10.1111/j.1096-0031.2012...
: 627, 628).

The retention index ranges from zero (which corresponds to the maximum apparent homoplasy in the given character) to one (which corresponds to the maximum apparent synapomorphy in the given character), and it can be easily computed via the following function:

ri=g-s/g-m, see (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
: 417).

The variables are m, which “represents the minimum amount of change that the character may show on any tree”; s, which “denotes the amount of change in the character (for an integral character, number of steps) required parsimoniously by the considered tree”; and g, which “denotes the greatest amount of change that the character may require on any tree, that is, the greatest possible value of s” (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
). The value of g can be calculated by the number of terminals with the less prevalent character states.

The ensemble or overall retention index (R or RI) is the general summation of the retention index among all characters in a given tree topology (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
). These indices are normally computed for characters optimized on a parsimonious tree. However, it can be computed for any given tree, including suboptimal trees or trees generated by other methods for comparative purposes, such as maximum likelihood or Bayesian inference (although direct optimization is based on the maximum parsimony criterium). The same principle is also applied in the case of the consistency index (c or ci), the rescaled consistency index (rc), which is the product of ci and ri, the ensemble or overall consistency index (C or CI), and the ensemble or overall rescaled consistency index (RC) (Kluge and Farris 1969Kluge AG, Farris JS (1969) Quantitative phyletics and the evolution of anurans. Systematic Zoology 18: 1-32. https://doi.org/10.2307/2412407
https://doi.org/10.2307/2412407...
, 1989).

The problem

A binary autapomorphic character is one with a given state that is present in only one terminal (i.e., an operational taxonomic unit, OTU). In this situation, g = m = s = 1. A similar phenomenon occurs in the case of multistate autapomorphic characters, which have more than two states, all but one of which are only present in a single terminal. In this situation, g = m = s, and they have a value higher than one.

Therefore, in both of these situations (binary and multistate autapomorphic, non-homoplastic characters), since g = m = s, the retention index assumes the form 0 / 0, which is, formally, a mathematically undefined situation (specifically, an indeterminate form). There is a simple way one can understand 0 / 0 as an indeterminate form; just note that any real (ℝ) number x satisfies the following equation: 0 . x = 0 (any real number multiplied by zero equals zero).

There is also the case when two or more distantly related terminals independently present the same character state; we have a case of non-apparent autapomorphies, but in this case, it is also a homoplasy. Note that this can only be inferred after phylogenetic reconstruction. In this situation, g = s ≠ m, the retention index naturally assumes a value of zero; we do not have an indeterminate form.

One should keep in mind that the most common application of cladistics is to perform phylogenetic analyses with discrete characters, but this reasoning can be applied to continuous characters as well (Kluge and Farris 1969Kluge AG, Farris JS (1969) Quantitative phyletics and the evolution of anurans. Systematic Zoology 18: 1-32. https://doi.org/10.2307/2412407
https://doi.org/10.2307/2412407...
, Ackerly and Donoghue 1988Ackerly DD, Donoghue MJ (1988) Leaf size, sapling allometry, and Corner’s rules: phylogeny and correlated evolution in maples (Acer). The American Naturalist 152: 767-791. https://doi.org/10.1086/286208
https://doi.org/10.1086/286208...
, Klingenberg and Gidaszewski 2010Klingenberg CP, Gidaszewski NA (2010) Testing and quantifying phylogenetic signals and homoplasy in morphometric data. Systematic Biology 59: 245-261. https://doi.org/10.1093/sysbio/syp106
https://doi.org/10.1093/sysbio/syp106...
). In this sense, g, m, and s are values in the domain of positive real numbers (ℝ+).

Note that in the case of multistate autapomorphic non-homoplastic characters, this description is suitable considering unordered (nonadditive) characters, i.e., the Fitch (1971Fitch WM (1971) Toward defining the course of evolution: Minimum change for a specified tree topology. Systematic Zoology 20: 406-416. https://doi.org/10.2307/2412116
https://doi.org/10.2307/2412116...
) optimization. The abovementioned situation can be easily explored for other parsimony variations (Camin and Sokal 1965Camin JH, Sokal RR (1965) A method for deducing branching sequences in phylogeny. Evolution 19: 311-326. https://doi.org/10.2307/2406441
https://doi.org/10.2307/2406441...
, Kluge and Farris 1969Kluge AG, Farris JS (1969) Quantitative phyletics and the evolution of anurans. Systematic Zoology 18: 1-32. https://doi.org/10.2307/2412407
https://doi.org/10.2307/2412407...
, Farris 1977Farris JS (1977) Phylogenetic analysis under Dollo’s law. Systematic Zoology 26(1): 77-88. https://doi.org/10.2307/2412867
https://doi.org/10.2307/2412867...
) and character weighting schemes (Farris 1969Farris JS (1969) A successive approximations approach to character weighting. Systematic Zoology 18: 374-385. https://doi.org/10.2307/2412182
https://doi.org/10.2307/2412182...
).

In Farris (1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
), this indeterminate form was clearly resolved, assuming that in the case where g = m = s, the retention index assumes the value of one: “If g = m, then s = g, and r is taken to be unity, so that d (the distortion coefficient of Farris 1973Farris JS (1973) On comparing the shapes of taxonomic trees. Systematic Zoology 22: 50-54. https://doi.org/10.2307/2412378
https://doi.org/10.2307/2412378...
) is taken to be 0. I shall call r the retention index” (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
). This original solution to the indeterminate form of 0 / 0 was followed by some authors, such as (Kitching et al. 1998Kitching I, Forey P, Humphries C, Williams D (1998) Cladistics: The theory and practice of parsimony analysis. Oxford University Press, New York, 2nd ed., 228 pp.). It is also the same way the TNT 1.5 software reports (Goloboff and Catalano 2016Goloboff PA, Catalano SA (2016) TNT version 1.5, including a full implementation of phylogenetic morphometrics. Cladistics 32: 221-238. https://doi.org/10.1111/cla.12160
https://doi.org/10.1111/cla.12160...
).

However, some authors did not follow this original interpretation for the case of the retention index applied to autapomorphies. To cite just a few examples: in the textbooks of Amorim (2002Amorim DS (2002) Fundamentos de Sistemática Filogenética. Holos Editora, Ribeirão Preto, 2nd ed., 156 pp.), Forey (2006Forey P (2006) Cladistics for Palaeontologists: Part 4 - Optimisation. Twitchett R editor. The Palaeontology Newsletter 63: 26-35.), Wiley and Lieberman (2011Wiley EO, Lieberman BS (2011). Phylogenetics: Theory and Practice of Phylogenetic Systematics. Wiley-Blackwell, Hoboken, 2nd ed., 406 pp. https://doi.org/10.1002/9781118017883
https://doi.org/10.1002/9781118017883...
), Morrrone (2013Morrrone JJ (2013) Sistemática: Fundamentos, métodos, aplicaciones. Prensas de Ciencias, Universidad Nacional Autónoma de México, Facultad de Ciencias, Ciudad de México, 508 pp.), and Brower and Schuh (2021Brower AVZ, Schuh RT (2021) Biological Systematics: Principles and Applications. Cornell University Press, Ithaca and London, 3rd ed., 436 pp.), it is clearly assumed that, in the case of autapomorphies, the retention index must assume the value of zero without a proper justification. In the textbook of Wiley et al. (1981Wiley EO, Siegel-Causey D, Brooks DR, Funk VA (1981) The Compleat Cladist: A primer of phylogenetic procedures. The University of Kansas, Museum of Natural History, Dyche Hall, Special Publications No. 19, Lawrence, 158 pp.) and in the PAUP* 4.0 software (Swofford and Bell 2017Swofford DL, Bell CD (2017) PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods). Version 4.0a169. Sinauer Associates, Sunderland.), a conservative approach is found, the retention index is reported as “0 / 0” and simply termed undefined, avoiding further complications. Finally, the WinClada v.1.00.08 software reports it as “uninformative” (Nixon 2002Nixon KC (2002) WinClada. Version 1.00.08. Renner and Weerasooriya, New York.).

The confusion regarding the value of the retention index applied to autapomorphies has persisted in phylogenetic studies. Some recent studies have reported a value of zero for the retention index in the case of autapomorphies, e.g., de Medeiros and Vanin (2020de Medeiros BAS, Vanin SA (2020) Systematic revision and morphological phylogenetic analysis of Anchylorhynchus Schoenherr, 1836 (Coleoptera, Curculionidae: Derelomini). Zootaxa 4839: 1-98. https://doi.org/10.11646/zootaxa.4839.1.1
https://doi.org/10.11646/zootaxa.4839.1....
) and Zeballos et al. (2023Zeballos LF, Roza AS, Campello-Gonçalves L, Vaz S, da Fonseca CRV, Rivera SC, da Silveira LFL (2023) Phylogeny of Scissicauda species, with eight new species, including the first Photinini fireflies with biflabellate antennae (Coleoptera: Lampyridae). Diversity 15: 1-43. https://doi.org/10.3390/d15050620
https://doi.org/10.3390/d15050620...
), while others stated that the retention index cannot be calculated for autapomorphies, e.g., Sayad and Yassin (2019Sayad SA, Yassin A (2019) Quantifying the extent of morphological homoplasy: A phylogenetic analysis of 490 characters in Drosophila. Evolution Letters 3: 286-298. https://doi.org/10.1002/evl3.115
https://doi.org/10.1002/evl3.115...
). Therefore, we can ask the following question: What is the most suitable value for the retention index in the case of autapomorphies? One? Zero? 0 / 0?

In this rapid communication, we present two arguments supporting the original definition (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
) to use the value of one for the retention index in the case of autapomorphies. A similar rationale can be extended to the distortion coefficient, d (Farris 1973, 1989), a complement of the retention index.

Argumentation and solution

Indeterminate forms are not mathematical dead ends. They can be solved for specific cases, given a specific function. With that in mind, the first argument is straightforward, applying the calculus’ limit concept, e.g., the epsilon-delta (ε, δ) definition of limit (see Felscher 2000Felscher W (2000) Bolzano, Cauchy, Epsilon, Delta. In: Shenitzer A, Stillwell J (Eds) The Evolution of… The American Mathematical Monthly 107: 844-862. https://doi.org/10.2307/2695743
https://doi.org/10.2307/2695743...
), to determine the assumed value of the function of the retention index in any desired case. We suspect that it is the position of Farris (1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
), but it was not originally (or otherwise) addressed by the author. In this view, one can promptly compute the limit of a function of the retention index via l’Hôpital-Bernoulli’s rule, which uses derivatives in the numerator and denominator to find the limit of a particular indeterminate form (not just 0 / 0; it can also be used for all other indeterminate forms: ∞ / ∞; 0 . ∞; ∞ - ∞; 0 0 ; 1 ; and 0 ); this method is indeed a robust way to address this problem (see Struik 1963Struik DJ (1963) The origin of L’Hôpital’s rule. The Mathematics Teacher 56: 257-260. https://doi.org/10.5951/MT.56.4.0257
https://doi.org/10.5951/MT.56.4.0257...
). Therefore, step by step:

limgm=sg-sg-m,,

Then,

limgmg-mg-m,

The l’Hôpital-Bernoulli’s rule general definition is:

limxcfxgx=limxcddxfxddxgx,

All four necessary conditions for l’Hôpital-Bernoulli’s rule are met:

1. limxcfx=limxcgx=0 or±,

2. ƒ(x) and g(x) are differentiable,

3. ddxgx0,

4. limxcddxfxddxgx is an existing limit.

Applying the l’Hôpital-Bernoulli’s rule, we have:

limgmddgg-mddgg-m,

Applying the subtraction rule of differentiation, we have:

limgmddgg-ddgmddgg-ddgm,

Solving the derivatives:

limgm1-01-0,

Simplify,

limgm11, limgm1,

Given that the limit of a constant function is equal to the constant itself,

limxac=c,

Then,

limgm1=1,

Therefore,

lim g m = s g - s g - m = 1

The second argument is biological or phylogenetic in nature. Let us assume the original formulation (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
), where in the retention index, the value of zero is equivalent to the “maximum of apparent homoplasy in the given character” and the value of one is equivalent to the “maximum of apparent synapomorphy in the given character”. Considering that an autapomorphic character is an apomorphic character that is as exclusive as possible (i.e., present in only one specific or supraspecific terminal), it is most appropriate to assume the value of one. This interpretation is consistent with the original definition of autapomorphy and synapomorphy (Hennig 1966Hennnig W (1966) Phylogenetic Systematics. University of Illinois Press, Urbana, 263 pp.), see also Yeates (1992Yeates D (1992) Why remove autapomorphies? Cladistics 8(4): 387-389. https://doi.org/10.1111/j.1096-0031.1992.tb00080.x
https://doi.org/10.1111/j.1096-0031.1992...
):

“The apomorphous features characteristic for a particular monophyletic group (present only in it) can be ignored in discussing its relations to other groups; we will call such characters the autapomorphous characters of a monophyletic group. Naturally these characters are ‘autapomorphous’ only if the group in question is compared with other groups; so long as we are trying to show that the group itself is monophyletic, these same characters would be ‘synapomorphous characters’ of the species making up the group” ( Hennnig 1966 Hennnig W (1966) Phylogenetic Systematics. University of Illinois Press, Urbana, 263 pp. : 90).

This point of view is also apparently supported by Farris (1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
), in which the author also traced back the homology concept, which can be logically interpreted as synonymous with synapomorphy (see de Pinna 1991de Pinna MCC (1991) Concepts and tests of homology in the cladistic paradigm. Cladistics 7: 367-394. https://doi.org/10.1111/j.1096-0031.1991.tb00045.x
https://doi.org/10.1111/j.1096-0031.1991...
, Brower and de Pinna 2012Brower AVZ, de Pinna MCC (2012) Homology and errors. Cladistics 28: 529-538. https://doi.org/10.1111/j.1096-0031.2012.00398.x
https://doi.org/10.1111/j.1096-0031.2012...
, 2013Brower AVZ, de Pinna MCC (2013) About nothing. Cladistics 30: 330-336. https://doi.org/10.1111/cla.12050
https://doi.org/10.1111/cla.12050...
, but see also Nixon and Carpenter 2012aNixon KC, Carpenter JM (2012a) On homology. Cladistics 28: 160-169. https://doi.org/10.1111/j.1096-0031.2011.00371.x
https://doi.org/10.1111/j.1096-0031.2011...
, 2012bNixon KC, Carpenter JM (2012b) More on homology. Cladistics 28: 225-226. https://doi.org/10.1111/j.1096-0031.2011.00388.x
https://doi.org/10.1111/j.1096-0031.2011...
, 2012cNixon KC, Carpenter JM (2012c) More on errors. Cladistics 28: 539-544. https://doi.org/10.1111/j.1096-0031.2012.00409.x
https://doi.org/10.1111/j.1096-0031.2012...
, 2013Nixon KC, Carpenter JM (2013) More on absences. Cladistics 29: 1-6. https://doi.org/10.1111/j.1096-0031.2012.00430.x
https://doi.org/10.1111/j.1096-0031.2012...
):

“The interpretation of the retention index can be seen from a simple argument. On a tree for which s = m, r = 1, there is no homoplasy, and all similarities between terminals in the character are homologous” ( Farris 1989 Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
: 417).

Additionally, this interpretation is congruent with a proper formal logical definition of synapomorphy (see definition “1” in Turjak and Trontelj 2012Turjak M, Trontelj P (2012) A method for measuring support for synapomorphy using character state distributions on phylogenetic trees. Cladistics 28: 627-638. https://doi.org/10.1111/j.1096-0031.2012.00403.x
https://doi.org/10.1111/j.1096-0031.2012...
).

We are fully aware of the dependence of this argumentation strategy on the applied concept/definition of synapomorphy (Hennig 1966Hennnig W (1966) Phylogenetic Systematics. University of Illinois Press, Urbana, 263 pp.). However, exploring the variations and validity of other understandings beyond a Hennigian argument goes far beyond the scope of this communication. However, at least to us, creating an equivalence between autapomorphy and homoplasy (i.e., the case if we assume ri = 0 / 0 = 0) seems more problematic.

Final remarks

It is well known that, when the ensemble retention index (R or RI) is calculated for a given tree, this summation is not influenced by autapomorphies (Farris 1989Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
https://doi.org/10.1111/j.1096-0031.1989...
, Naylor and Kraus 1995Naylor G, Kraus F (1995) The relationship between s and m and the retention index. Systematic Biology 44: 559-562. https://doi.org/10.1093/sysbio/44.4.559
https://doi.org/10.1093/sysbio/44.4.559...
). Despite that, it is not adequate just to compute an ensemble retention index for a given tree. A more complete overview is necessary by measuring and exploring several indices and properties in a phylogenetic tree and individual characters, including metrics influenced by autapomorphies. It should be noted that autapomorphies are included in more recent morphological and paleontological studies, e.g., Costa et al. (2014Costa C, Vanin SA, Rosa SP (2014) Description of a new genus and species of Cerophytidae (Coleoptera: Elateroidea) from Africa with a cladistic analysis of the family. Zootaxa 3878: 248-260. https://doi.org/10.11646/zootaxa.3878.3.2
https://doi.org/10.11646/zootaxa.3878.3....
), Sayad and Yassin (2019Sayad SA, Yassin A (2019) Quantifying the extent of morphological homoplasy: A phylogenetic analysis of 490 characters in Drosophila. Evolution Letters 3: 286-298. https://doi.org/10.1002/evl3.115
https://doi.org/10.1002/evl3.115...
), de Medeiros and Vanin (2020de Medeiros BAS, Vanin SA (2020) Systematic revision and morphological phylogenetic analysis of Anchylorhynchus Schoenherr, 1836 (Coleoptera, Curculionidae: Derelomini). Zootaxa 4839: 1-98. https://doi.org/10.11646/zootaxa.4839.1.1
https://doi.org/10.11646/zootaxa.4839.1....
), Černý and Simonoff (2023Černý D, Simonoff AL (2023) Statistical evaluation of character support reveals the instability of higher level dinosaur phylogeny. Scientific Reports 13: 1-13. https://doi.org/10.1038/s41598-023-35784-3
https://doi.org/10.1038/s41598-023-35784...
), Zeballos et al. (2023Zeballos LF, Roza AS, Campello-Gonçalves L, Vaz S, da Fonseca CRV, Rivera SC, da Silveira LFL (2023) Phylogeny of Scissicauda species, with eight new species, including the first Photinini fireflies with biflabellate antennae (Coleoptera: Lampyridae). Diversity 15: 1-43. https://doi.org/10.3390/d15050620
https://doi.org/10.3390/d15050620...
).

In some older cladistic studies, autapomorphies were often neglected and altogether excluded, as they did not influence at all the topology of a tree generated via maximum parsimony (Kitching et al. 1998Kitching I, Forey P, Humphries C, Williams D (1998) Cladistics: The theory and practice of parsimony analysis. Oxford University Press, New York, 2nd ed., 228 pp.). We briefly present some arguments in favor of always including autapomorphic characters; see Yeates (1992Yeates D (1992) Why remove autapomorphies? Cladistics 8(4): 387-389. https://doi.org/10.1111/j.1096-0031.1992.tb00080.x
https://doi.org/10.1111/j.1096-0031.1992...
) for a more complete argument: 1) autapomorphies have the potential to be diagnostic characters to terminals (higher taxa, and species), therefore, they have taxonomic importance (e.g., Ferreira et al. 2023Ferreira VS, Barbosa FF, Bocakova M, Solodovnikov A (2023) An extraordinary case of elytra loss in Coleoptera (Elateroidea: Lycidae): discovery and placement of the first anelytrous adult male beetle. Zoological Journal of the Linnean Society 199: 553-566. https://doi.org/10.1093/zoolinnean/zlad026
https://doi.org/10.1093/zoolinnean/zlad0...
); 2) it is well-known that the consistency index (c or ci) is “inflated” by the inclusion of autapomorphies (Brooks et al. 1986Brooks DR, O’Grady RT, Wiley EO (1986) A measure of the information content of phylogenetic trees, and its use as an optimality criterion. Systematic Zoology 35(4): 571-581. https://doi.org/10.2307/2413116
https://doi.org/10.2307/2413116...
, Carpenter 1988Carpenter JM (1988) Choosing among multiple equally parsimonious cladograms. Cladistics 4: 291-296. https://doi.org/10.1111/j.1096-0031.1988.tb00476.x
https://doi.org/10.1111/j.1096-0031.1988...
), but there is no need to exclude them since most popular phylogenetic software, e.g., TNT 1.5 (Goloboff and Catalano 2016Goloboff PA, Catalano SA (2016) TNT version 1.5, including a full implementation of phylogenetic morphometrics. Cladistics 32: 221-238. https://doi.org/10.1111/cla.12160
https://doi.org/10.1111/cla.12160...
) and PAUP* 4.0 (Swofford and Bell 2017Swofford DL, Bell CD (2017) PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods). Version 4.0a169. Sinauer Associates, Sunderland.), report the consistency index both with and without autapomorphies (so the inclusion of autapomorphies is not detrimental in any point and also does not significantly increase tree search time); 3) in maximum likelihood (Felsenstein 1973Felsenstein J (1973) Maximum likelihood and minimum-steps methods for estimating evolutionary trees from data on discrete characters. Systematic Zoology 22(3): 240-249. https://doi.org/10.2307/2412304
https://doi.org/10.2307/2412304...
, 1981Felsenstein J (1981) Evolutionary trees from DNA sequences: A maximum likelihood approach. Journal of Molecular Evolution 17: 368-376. https://doi.org/10.1007/BF01734359
https://doi.org/10.1007/BF01734359...
) and Bayesian (Rannala and Yang 1996Rannala B, Yang Z (1996) Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference. Journal of Molecular Evolution 43: 304-311. https://doi.org/10.1007/BF02338839
https://doi.org/10.1007/BF02338839...
, Yang and Rannala 1997Yang Z, Rannala B (1997) Bayesian phylogenetic inference using DNA sequences: A Markov chain Monte Carlo method. Molecular Biology and Evolution 14: 717-724. https://doi.org/10.1093/oxfordjournals.molbev.a025811
https://doi.org/10.1093/oxfordjournals.m...
) approaches, autapomorphies inform branch lengths and equilibrium state frequencies, moreover, in the context of these methods, autapomorphies are phylogenetically informative (Felsenstein 1978Felsenstein J (1978) Cases in which parsimony or compatibility methods will be positively misleading. Systematic Zoology 27: 401-410. https://doi.org/10.2307/2412923
https://doi.org/10.2307/2412923...
, Lewis 2001Lewis PO (2001) A likelihood approach to estimating phylogeny from discrete morphological character data. Systematic Zoology 50: 913-925. https://doi.org/10.1080/106351501753462876
https://doi.org/10.1080/1063515017534628...
).

Therefore, to close our argumentation, we emphasize that we need robust proofs and well-defined properties for a metric applied in phylogenetic systematics. In this approach, some conceptual issues were reviewed, and we properly clarified a more appropriate way to interpret the retention index applied to the case of autapomorphies and concerning the inclusion of these characters in phylogenetic analyses.

ACKNOWLEDGEMENTS

We are very grateful to Elliot Santovich Scaramal (Instituto de Filosofia e Ciências Sociais, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil) and Lucas dos Anjos (Department of Renewable Resources, University of Alberta, Edmonton, Alberta, Canada) for revising a preliminary version of the text. This work was supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Education Ministry of Brazil (CAPES), Finance Code 001. We are also thankful to the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for processes 312786/2022-0 to J.R.M.M. and 310567/2018-1 to C.A.M.R. We are also thankful to Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro (FAPERJ) processes E-26/010.001887/2019, SEI-260003/001170/2020, SEI-260003/012995/2021 to C.A.M.R. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

LITERATURE CITED

  • Ackerly DD, Donoghue MJ (1988) Leaf size, sapling allometry, and Corner’s rules: phylogeny and correlated evolution in maples (Acer). The American Naturalist 152: 767-791. https://doi.org/10.1086/286208
    » https://doi.org/10.1086/286208
  • Amorim DS (2002) Fundamentos de Sistemática Filogenética. Holos Editora, Ribeirão Preto, 2nd ed., 156 pp.
  • Brooks DR, O’Grady RT, Wiley EO (1986) A measure of the information content of phylogenetic trees, and its use as an optimality criterion. Systematic Zoology 35(4): 571-581. https://doi.org/10.2307/2413116
    » https://doi.org/10.2307/2413116
  • Brower AVZ, de Pinna MCC (2012) Homology and errors. Cladistics 28: 529-538. https://doi.org/10.1111/j.1096-0031.2012.00398.x
    » https://doi.org/10.1111/j.1096-0031.2012.00398.x
  • Brower AVZ, de Pinna MCC (2013) About nothing. Cladistics 30: 330-336. https://doi.org/10.1111/cla.12050
    » https://doi.org/10.1111/cla.12050
  • Brower AVZ, Schuh RT (2021) Biological Systematics: Principles and Applications. Cornell University Press, Ithaca and London, 3rd ed., 436 pp.
  • Camin JH, Sokal RR (1965) A method for deducing branching sequences in phylogeny. Evolution 19: 311-326. https://doi.org/10.2307/2406441
    » https://doi.org/10.2307/2406441
  • Carpenter JM (1988) Choosing among multiple equally parsimonious cladograms. Cladistics 4: 291-296. https://doi.org/10.1111/j.1096-0031.1988.tb00476.x
    » https://doi.org/10.1111/j.1096-0031.1988.tb00476.x
  • Černý D, Simonoff AL (2023) Statistical evaluation of character support reveals the instability of higher level dinosaur phylogeny. Scientific Reports 13: 1-13. https://doi.org/10.1038/s41598-023-35784-3
    » https://doi.org/10.1038/s41598-023-35784-3
  • Costa C, Vanin SA, Rosa SP (2014) Description of a new genus and species of Cerophytidae (Coleoptera: Elateroidea) from Africa with a cladistic analysis of the family. Zootaxa 3878: 248-260. https://doi.org/10.11646/zootaxa.3878.3.2
    » https://doi.org/10.11646/zootaxa.3878.3.2
  • de Medeiros BAS, Vanin SA (2020) Systematic revision and morphological phylogenetic analysis of Anchylorhynchus Schoenherr, 1836 (Coleoptera, Curculionidae: Derelomini). Zootaxa 4839: 1-98. https://doi.org/10.11646/zootaxa.4839.1.1
    » https://doi.org/10.11646/zootaxa.4839.1.1
  • de Pinna MCC (1991) Concepts and tests of homology in the cladistic paradigm. Cladistics 7: 367-394. https://doi.org/10.1111/j.1096-0031.1991.tb00045.x
    » https://doi.org/10.1111/j.1096-0031.1991.tb00045.x
  • Farris JS (1969) A successive approximations approach to character weighting. Systematic Zoology 18: 374-385. https://doi.org/10.2307/2412182
    » https://doi.org/10.2307/2412182
  • Farris JS (1973) On comparing the shapes of taxonomic trees. Systematic Zoology 22: 50-54. https://doi.org/10.2307/2412378
    » https://doi.org/10.2307/2412378
  • Farris JS (1977) Phylogenetic analysis under Dollo’s law. Systematic Zoology 26(1): 77-88. https://doi.org/10.2307/2412867
    » https://doi.org/10.2307/2412867
  • Farris JS (1989) The retention index and the rescaled consistency index. Cladistics 5: 417-419. https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
    » https://doi.org/10.1111/j.1096-0031.1989.tb00573.x
  • Felscher W (2000) Bolzano, Cauchy, Epsilon, Delta. In: Shenitzer A, Stillwell J (Eds) The Evolution of… The American Mathematical Monthly 107: 844-862. https://doi.org/10.2307/2695743
    » https://doi.org/10.2307/2695743
  • Felsenstein J (1973) Maximum likelihood and minimum-steps methods for estimating evolutionary trees from data on discrete characters. Systematic Zoology 22(3): 240-249. https://doi.org/10.2307/2412304
    » https://doi.org/10.2307/2412304
  • Felsenstein J (1978) Cases in which parsimony or compatibility methods will be positively misleading. Systematic Zoology 27: 401-410. https://doi.org/10.2307/2412923
    » https://doi.org/10.2307/2412923
  • Felsenstein J (1981) Evolutionary trees from DNA sequences: A maximum likelihood approach. Journal of Molecular Evolution 17: 368-376. https://doi.org/10.1007/BF01734359
    » https://doi.org/10.1007/BF01734359
  • Ferreira VS, Barbosa FF, Bocakova M, Solodovnikov A (2023) An extraordinary case of elytra loss in Coleoptera (Elateroidea: Lycidae): discovery and placement of the first anelytrous adult male beetle. Zoological Journal of the Linnean Society 199: 553-566. https://doi.org/10.1093/zoolinnean/zlad026
    » https://doi.org/10.1093/zoolinnean/zlad026
  • Fitch WM (1971) Toward defining the course of evolution: Minimum change for a specified tree topology. Systematic Zoology 20: 406-416. https://doi.org/10.2307/2412116
    » https://doi.org/10.2307/2412116
  • Forey P (2006) Cladistics for Palaeontologists: Part 4 - Optimisation. Twitchett R editor. The Palaeontology Newsletter 63: 26-35.
  • Goloboff PA, Catalano SA (2016) TNT version 1.5, including a full implementation of phylogenetic morphometrics. Cladistics 32: 221-238. https://doi.org/10.1111/cla.12160
    » https://doi.org/10.1111/cla.12160
  • Hennnig W (1966) Phylogenetic Systematics. University of Illinois Press, Urbana, 263 pp.
  • Kitching I, Forey P, Humphries C, Williams D (1998) Cladistics: The theory and practice of parsimony analysis. Oxford University Press, New York, 2nd ed., 228 pp.
  • Klingenberg CP, Gidaszewski NA (2010) Testing and quantifying phylogenetic signals and homoplasy in morphometric data. Systematic Biology 59: 245-261. https://doi.org/10.1093/sysbio/syp106
    » https://doi.org/10.1093/sysbio/syp106
  • Kluge AG, Farris JS (1969) Quantitative phyletics and the evolution of anurans. Systematic Zoology 18: 1-32. https://doi.org/10.2307/2412407
    » https://doi.org/10.2307/2412407
  • Lewis PO (2001) A likelihood approach to estimating phylogeny from discrete morphological character data. Systematic Zoology 50: 913-925. https://doi.org/10.1080/106351501753462876
    » https://doi.org/10.1080/106351501753462876
  • Morrrone JJ (2013) Sistemática: Fundamentos, métodos, aplicaciones. Prensas de Ciencias, Universidad Nacional Autónoma de México, Facultad de Ciencias, Ciudad de México, 508 pp.
  • Naylor G, Kraus F (1995) The relationship between s and m and the retention index. Systematic Biology 44: 559-562. https://doi.org/10.1093/sysbio/44.4.559
    » https://doi.org/10.1093/sysbio/44.4.559
  • Nixon KC (2002) WinClada. Version 1.00.08. Renner and Weerasooriya, New York.
  • Nixon KC, Carpenter JM (2012a) On homology. Cladistics 28: 160-169. https://doi.org/10.1111/j.1096-0031.2011.00371.x
    » https://doi.org/10.1111/j.1096-0031.2011.00371.x
  • Nixon KC, Carpenter JM (2012b) More on homology. Cladistics 28: 225-226. https://doi.org/10.1111/j.1096-0031.2011.00388.x
    » https://doi.org/10.1111/j.1096-0031.2011.00388.x
  • Nixon KC, Carpenter JM (2012c) More on errors. Cladistics 28: 539-544. https://doi.org/10.1111/j.1096-0031.2012.00409.x
    » https://doi.org/10.1111/j.1096-0031.2012.00409.x
  • Nixon KC, Carpenter JM (2013) More on absences. Cladistics 29: 1-6. https://doi.org/10.1111/j.1096-0031.2012.00430.x
    » https://doi.org/10.1111/j.1096-0031.2012.00430.x
  • Patterson C (1982) Morphological characters and homology. In: Joysey KA, Friday AE (Eds) Problems of phylogenetic reconstruction. Academic Press, London, 21-74.
  • Rannala B, Yang Z (1996) Probability distribution of molecular evolutionary trees: A new method of phylogenetic inference. Journal of Molecular Evolution 43: 304-311. https://doi.org/10.1007/BF02338839
    » https://doi.org/10.1007/BF02338839
  • Sayad SA, Yassin A (2019) Quantifying the extent of morphological homoplasy: A phylogenetic analysis of 490 characters in Drosophila Evolution Letters 3: 286-298. https://doi.org/10.1002/evl3.115
    » https://doi.org/10.1002/evl3.115
  • Sober E (2002) Instrumentalism, parsimony, and the Akaike framework. Philosophy of Science 69: S112-S123. https://doi.org/10.1086/341839
    » https://doi.org/10.1086/341839
  • Struik DJ (1963) The origin of L’Hôpital’s rule. The Mathematics Teacher 56: 257-260. https://doi.org/10.5951/MT.56.4.0257
    » https://doi.org/10.5951/MT.56.4.0257
  • Swofford DL, Bell CD (2017) PAUP*. Phylogenetic Analysis Using Parsimony (*and other methods). Version 4.0a169. Sinauer Associates, Sunderland.
  • Turjak M, Trontelj P (2012) A method for measuring support for synapomorphy using character state distributions on phylogenetic trees. Cladistics 28: 627-638. https://doi.org/10.1111/j.1096-0031.2012.00403.x
    » https://doi.org/10.1111/j.1096-0031.2012.00403.x
  • Wiley EO, Siegel-Causey D, Brooks DR, Funk VA (1981) The Compleat Cladist: A primer of phylogenetic procedures. The University of Kansas, Museum of Natural History, Dyche Hall, Special Publications No. 19, Lawrence, 158 pp.
  • Wiley EO, Lieberman BS (2011). Phylogenetics: Theory and Practice of Phylogenetic Systematics. Wiley-Blackwell, Hoboken, 2nd ed., 406 pp. https://doi.org/10.1002/9781118017883
    » https://doi.org/10.1002/9781118017883
  • Yang Z, Rannala B (1997) Bayesian phylogenetic inference using DNA sequences: A Markov chain Monte Carlo method. Molecular Biology and Evolution 14: 717-724. https://doi.org/10.1093/oxfordjournals.molbev.a025811
    » https://doi.org/10.1093/oxfordjournals.molbev.a025811
  • Yeates D (1992) Why remove autapomorphies? Cladistics 8(4): 387-389. https://doi.org/10.1111/j.1096-0031.1992.tb00080.x
    » https://doi.org/10.1111/j.1096-0031.1992.tb00080.x
  • Zeballos LF, Roza AS, Campello-Gonçalves L, Vaz S, da Fonseca CRV, Rivera SC, da Silveira LFL (2023) Phylogeny of Scissicauda species, with eight new species, including the first Photinini fireflies with biflabellate antennae (Coleoptera: Lampyridae). Diversity 15: 1-43. https://doi.org/10.3390/d15050620
    » https://doi.org/10.3390/d15050620
  • Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Education Ministry of Brazil (CAPES)-Finance Code 001 Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for processes 312786/2022-0 to J.R.M.M. and 310567/2018-1 Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro (FAPERJ) processes E-26/010.001887/2019, SEI-260003/001170/2020, SEI-260003/012995/2021 This work was supported by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior, Education Ministry of Brazil (CAPES), Finance Code 001. We are also thankful to the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) for processes 312786/2022-0 to J.R.M.M. and 310567/2018-1 to C.A.M.R. We are also thankful to Fundação Carlos Chagas Filho de Amparo à Pesquisa do Estado do Rio de Janeiro (FAPERJ) processes E-26/010.001887/2019, SEI-260003/001170/2020, SEI-260003/012995/2021 to C.A.M.R. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript

ADDITIONAL NOTES

Edited by

Editorial responsibility

Walter A.P. Boeger

Publication Dates

  • Publication in this collection
    23 Sept 2024
  • Date of issue
    2024

History

  • Received
    20 Feb 2024
  • Accepted
    21 June 2024
Sociedade Brasileira de Zoologia Caixa Postal 19020, 81531-980 Curitiba PR Brasil, Tel./Fax: (55 41) 3266-6823 - Curitiba - PR - Brazil
E-mail: sbz@sbzoologia.org.br