Character is a very ambiguous word (Colless, 1985), but it is used here in the traditional sense of both a characteristic that can be used to differentiate taxa and the instantiation of this character in a particular individual. See Character representation.
The difference between taxonomic and identification characters is well known but not always applied: taxonomic splitters create taxa because they can be differentiated (identified). (This is about as absurd as pretending that races exist in Homo sapiens just because some groups of H. sapiens specimens can be identified.) At the opposite, persons interested in identification sometimes complain that some new taxa are proposed without adequate accompanying descriptions showing tendencies that characterize the taxa. Actually, taxa should be proposed for taxonomic (phylogenetic) reasons. Whether they can be identified by traditional means (naked eye or microscopic examination of morphological-anatomical characteristics) is irrelevant.
This being said, it remains that most species can be identified from morphological-anatomical characters. Even with sibling species undistinguishable from each other by traditional means, the group they form (super-species) can be identified from the other species in the genus. Therefore, it is necessary to address the topic of morphological-anatomical identification characters.
The Quest for the Perfect Identification Character
Obviously, for a character to be used for identification of a taxon it
must be present in the members of the taxon and absent outside this
taxon. There must be a "gap" between the range of character states
observed in one taxon and the range observed for the same character in
another taxon.
Like Thiele (1993) and other authors, we do not consider that there is a difference in nature between quantitative and qualitative characters, as most (all?) qualitative characters can be expressed quantitatively ‹ by ratios, as suggested by Thiele, transforms (for outlines), numbers (absent = 0, present = 1 or more), etc.‹ and that, conversely, any quantitative value can be stated in a qualitative way ‹ as in small, numerous, etc.
Beyond this well known "gap" requisite, we make a difference between "easy" identification characters and the characters that are not so easy to observe/measure in specimens of the taxon.
Primary Identification Characters
The concept of Primary Identification Characters was proposed in
Fortuner 1989 and Fortuner 1993 (See publications). The basic idea is that identification
should rely mostly on characters that are easy to observe in the
unknown, which means that they present a minimum risk of errors.
Using our representation of characters, a Primary Identification Character must:
Describe a reasonably visible structure;
obviously, the length of a
clearly visible organ is more to be trusted than the length of a
structure that has indistinct outlines, is not visible by conventional
means, or blends into its background (similar color or texture);
represent a basic property that is not
ambiguous in the taxon
considered; for example, "presence" of seeds in raisins is very
ambiguous as "seedless" raisins do have seeds, but very small ones.
have low variability in the taxon, or at least
its variability should
be well described and clearly delimited and there must be a "gap"
between the range of character states/values in the specimens that
belong to the taxon and the specimens that belong to other taxa.
Selection of Primary Identification Characters
Primary Identification Characters are taxon-specific, as visibility of
structures, ambiguity of properties, and range of states/values vary
from taxon to taxon. In Genisys, structure visibility and basic
property ambiguity are entered as metadata. Range of states/values can
be computed from the database.
Use of Primary Identification Characters and Secondary Characters
Different identification tasks may use different characters. For
example, elimination is a very drastic procedure, one that is most
susceptible to the adverse effects of errors. Pure dichotomous methods
have absolutely no graceful degradation and one wrong choice in a
couplet of a key eliminates the correct answer. It is therefore
important that elimination relies on primary identification characters
only. The other characters (secondary identification characters, those
that failed in at least one of the conditions above) should not be used
at all for elimination purposes.
Comparison by, e.g., similarity coefficient is a less risky procedure as the taxa are ranked but none are eliminated. Also, a coefficient is computed on several characters taken simultaneously, which makes for built-in graceful degradation. For example, if a coefficient is computed from 20 characters, an error made on one character would result in a 95% correct coefficient. Accuracy can be improved even further by attributing a higher weight to primary identification characters. Secondary characters can be used but a lower weight should be attributed to them. The coefficient of endorsement could be used to weight characters.
_____________________
Colless, DH (1985). On "character" and related terms. Syst. Zool. 34:
229-233.
Thiele, K. (1993). The holy grail of the perfect character: the cladistic treatment of morphometric data. Cladistics 9: 275-304.