PRIMARY IDENTIFICATION CHARACTERS

Identification Characters and Taxonomic Characters

It is well known that identification relies on characters that are different from those used in taxonomy. Taxonomic characters should make it possible to decide that a particular group of individuals represents a separate taxon. Identification characters should make it possible to place an unknown specimen in the existing taxon to which it belongs.

Character is a very ambiguous word (Colless, 1985), but it is used here in the traditional sense of both a characteristic that can be used to differentiate taxa and the instantiation of this character in a particular individual. See Character representation.

The difference between taxonomic and identification characters is well known but not always applied: taxonomic splitters create taxa merely because they can be differentiated (identified). (This is about as absurd as pretending that races exist in Homo sapiens just because some groups of H. sapiens specimens can be identified.) Conversely, persons interested in identification sometimes complain that new taxa are proposed without adequate accompanying descriptions showing the tendencies that characterize them. Actually, taxa should be proposed for taxonomic (phylogenetic) reasons. Whether they can be identified by traditional means (naked eye or microscopic examination of morphological-anatomical characteristics) is irrelevant.

That said, most species can be identified from morphological-anatomical characters. Even when sibling species are indistinguishable from each other by traditional means, the group they form (super-species) can be distinguished from the other species in the genus. Therefore, it is necessary to address the topic of morphological-anatomical identification characters.

The Quest for the Perfect Identification Character

Obviously, for a character to be used for identification of a taxon it must be present in the members of the taxon and absent outside this taxon. There must be a "gap" between the range of character states observed in one taxon and the range observed for the same character in another taxon.
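
The "gap" requirement can be illustrated with a minimal sketch. The function name has_gap and the measurements are hypothetical, chosen only for illustration; the sketch assumes that the observed minimum and maximum adequately summarize the range of a character in each taxon.

```python
def has_gap(values_a, values_b):
    """Return True if the observed ranges of two taxa do not overlap."""
    lo_a, hi_a = min(values_a), max(values_a)
    lo_b, hi_b = min(values_b), max(values_b)
    # A gap exists when one range ends strictly before the other begins.
    return hi_a < lo_b or hi_b < lo_a


# Hypothetical measurements (arbitrary units) of one character in three taxa.
taxon_a = [18.0, 19.5, 21.0, 22.5]
taxon_b = [27.0, 28.5, 30.0]
taxon_c = [21.5, 24.0, 26.0]

print(has_gap(taxon_a, taxon_b))  # True: the character separates A from B
print(has_gap(taxon_a, taxon_c))  # False: the ranges of A and C overlap
```

With these made-up values the character would separate taxon A from taxon B but not from taxon C, so its usefulness depends on which taxa are being compared.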

Like Thiele (1993) and other authors, we do not consider that there is a difference in nature between quantitative and qualitative characters. Most (all?) qualitative characters can be expressed quantitatively (by ratios, as suggested by Thiele; by transforms, for outlines; by numbers, with absent = 0 and present = 1 or more; etc.) and, conversely, any quantitative value can be stated in a qualitative way (as in small, numerous, etc.).

Beyond this well-known "gap" requirement, we distinguish between "easy" identification characters and characters that are not so easy to observe or measure in specimens of the taxon.

Primary Identification Characters

The concept of Primary Identification Characters was proposed in Fortuner (1989) and Fortuner (1993) (see publications). The basic idea is that identification should rely mostly on characters that are easy to observe in the unknown specimen, which means that they present a minimum risk of error.

Using our representation of characters, a Primary Identification Character must:

describe a reasonably visible structure; obviously, the length of a clearly visible organ is more to be trusted than the length of a structure that has indistinct outlines, is not visible by conventional means, or blends into its background (similar color or texture);

represent a basic property that is not ambiguous in the taxon considered; for example, "presence" of seeds in raisins is very ambiguous, as "seedless" raisins do have seeds, but very small ones;

have low variability in the taxon, or at least variability that is well described and clearly delimited; there must be a "gap" between the range of character states/values in the specimens that belong to the taxon and the range in specimens that belong to other taxa.

Selection of Primary Identification Characters

Primary Identification Characters are taxon-specific, as visibility of structures, ambiguity of properties, and range of states/values vary from taxon to taxon. In Genisys, structure visibility and basic property ambiguity are entered as metadata. Range of states/values can be computed from the database.
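
As a rough sketch of how the three conditions might be combined for one taxon, the record below stores the two metadata flags and computes the "gap" from stored values. The class CharacterRecord, its field names, and the sample values are assumptions made for illustration; they do not reproduce the actual Genisys schema, which is not described here.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class CharacterRecord:
    """Hypothetical record for one character in one taxon (not the Genisys schema)."""
    name: str
    structure_visible: bool      # metadata: is the structure reasonably visible?
    property_unambiguous: bool   # metadata: is the basic property unambiguous?
    values_in_taxon: List[float]
    values_outside_taxon: List[float]


def is_primary(rec: CharacterRecord) -> bool:
    """Apply the three conditions: visibility, unambiguity, and a gap."""
    if not (rec.structure_visible and rec.property_unambiguous):
        return False
    lo, hi = min(rec.values_in_taxon), max(rec.values_in_taxon)
    # Gap: no value observed in another taxon falls inside the taxon's range.
    return all(v < lo or v > hi for v in rec.values_outside_taxon)


# Made-up example values.
clear = CharacterRecord("length", True, True, [18.0, 20.0, 22.5], [12.0, 27.0, 30.0])
hidden = CharacterRecord("length", False, True, [18.0, 20.0, 22.5], [12.0, 27.0, 30.0])
overlapping = CharacterRecord("ratio", True, True, [1.0, 2.0], [1.5, 3.0])

print(is_primary(clear))        # True
print(is_primary(hidden))       # False: structure not reasonably visible
print(is_primary(overlapping))  # False: no gap with other taxa
```

Because the record is tied to one taxon, the same character can qualify as primary in one taxon and as secondary in another.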

Use of Primary Identification Characters and Secondary Characters

Different identification tasks may use different characters. For example, elimination is a very drastic procedure, one that is most susceptible to the adverse effects of errors. Pure dichotomous methods have no graceful degradation at all: one wrong choice in a couplet of a key eliminates the correct answer. It is therefore important that elimination rely on primary identification characters only. The other characters (secondary identification characters, those that fail at least one of the conditions above) should not be used at all for elimination purposes.
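
The fragility of elimination can be sketched as follows. The function eliminate, the taxon names, the character names, and all ranges are hypothetical; the sketch assumes each taxon is described by a (minimum, maximum) range per character and that the set of primary characters is known for each taxon.

```python
def eliminate(candidates, observations, primary, primary_only=True):
    """Return the taxa whose ranges contain every observed value that is checked.

    With primary_only=True, secondary characters are never used to eliminate.
    """
    remaining = []
    for taxon, ranges in candidates.items():
        keep = True
        for char, value in observations.items():
            if primary_only and char not in primary[taxon]:
                continue  # skip secondary characters
            lo, hi = ranges[char]
            if not (lo <= value <= hi):
                keep = False  # a single mismatch eliminates the taxon
                break
        if keep:
            remaining.append(taxon)
    return remaining


# Hypothetical ranges; "ratio" is assumed to be a secondary character.
candidates = {
    "Taxon A": {"length": (18.0, 23.0), "ratio": (1.0, 2.0)},
    "Taxon B": {"length": (27.0, 31.0), "ratio": (1.5, 3.0)},
}
primary = {"Taxon A": {"length"}, "Taxon B": {"length"}}

# The specimen belongs to Taxon A, but "ratio" was measured wrongly.
observations = {"length": 20.0, "ratio": 2.6}

print(eliminate(candidates, observations, primary))                      # ['Taxon A']
print(eliminate(candidates, observations, primary, primary_only=False))  # []
```

In this made-up case, one erroneous observation on a secondary character is enough to eliminate the correct taxon when all characters are allowed to eliminate.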

Comparison by, for example, a similarity coefficient is a less risky procedure, as the taxa are ranked but none is eliminated. Also, a coefficient is computed on several characters taken simultaneously, which makes for built-in graceful degradation. For example, if a coefficient is computed from 20 equally weighted characters, an error made on one character would still leave the coefficient 95% correct. Accuracy can be improved even further by attributing a higher weight to primary identification characters. Secondary characters can be used, but a lower weight should be attributed to them. The coefficient of endorsement could be used to weight characters.
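
The arithmetic can be checked with a minimal sketch of a weighted matching coefficient. This is a simple weighted proportion of matching characters, used here only as a stand-in: it is not claimed to be the coefficient used by the author, and the weights 2.0 and 1.0 are placeholders rather than values derived from the coefficient of endorsement.

```python
def weighted_similarity(matches, weights):
    """Weighted proportion of characters on which specimen and taxon agree."""
    total = sum(weights)
    return sum(w for m, w in zip(matches, weights) if m) / total


# 20 characters, a single observation error on the last one.
matches = [True] * 19 + [False]

equal_weights = [1.0] * 20
print(weighted_similarity(matches, equal_weights))  # 0.95

# Placeholder weights: 10 primary characters (2.0) then 10 secondary (1.0).
weights = [2.0] * 10 + [1.0] * 10

# Error on a secondary character (the last one): 29/30, about 0.967.
print(round(weighted_similarity(matches, weights), 3))

# Error on a primary character (the first one): 28/30, about 0.933.
error_on_primary = [False] + [True] * 19
print(round(weighted_similarity(error_on_primary, weights), 3))
```

Under these assumed weights, an error on a secondary character, which is the kind of character more likely to be misobserved, lowers the coefficient less than the same error would with equal weights.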

_____________________
Colless, D.H. (1985). On "character" and related terms. Syst. Zool. 34: 229-233.

Thiele, K. (1993). The holy grail of the perfect character: the cladistic treatment of morphometric data. Cladistics 9: 275-304.