Sorting is not categorization
A critique of the claim that Brazilians have
fuzzy racial categories
Francisco J. Gil-White
Solomon Asch Center
for the Study of Ethnopolitical Conflict
University of Pennsylvania
3815 Walnut Street
Philadelphia Pennsylvania, 19104-6196
(215) 573-2622
Abstract:
As a result of a spate of studies geared to investigating Brazilian racial
categories, it is now believed by many that Brazilians reason about race in a
manner quite different to that of Americans. This paper will argue that this
conclusion is premature, as the studies in question have not, in fact,
investigated Brazilian categories. What they have done is elicit sorting tasks
on the basis of appearances, but the cognitive models of respondents have not
been investigated in order to determine what are the boundaries of their
concepts. Sorting based on appearances is not sufficient to infer the
boundaries of concepts whenever appearance is not a defining criterion for the
concepts in question, as the case appears to be for racial and ethnic
categories. Following a critique of the methods used, I review a terminological
and theoretical confusion concerning the use of the terms ‘emic’ and ‘etic’ in
anthropology that appears directly responsible for the failure so far to choose
methods appropriate to parsing the conceptual domain of ‘race’ in Brazil.
Do people use
the same kinds of cognitive resources when thinking about ‘race’ in different
parts of the world? The claim has been made that Brazilians have quite
different racial categories from those familiar to Americans, even though the
phenotypic contrasts in Brazil are very similar to those in the US. Many social
scientists now consider this to have been established, and the notion itself
approaches that of a cultural common-sense. This paper will examine the
evidence on which the claim rests, and will argue that it is not very good. This paper will also
make the Moreover,
it will make an argument that we perhaps we should not
expect Brazilian concepts of race to be very different from American ones oncewhen the evidence,
properly gathered, is finally in. A growing body of evidence suggests that humans are
essentialist when it comes to racial and ethnic categories, and that the bias
to essentialize such categories may be a universal aspect of human cognition.
Such evidence is being
gathered at the intersection of evolutionary psychology and the strands of
cognitive anthropology that intersect with it (e.g. Tooby &
Cosmides 1992; Cosmides & Tooby 1994; Atran 1998, 1990; Hirschfeld 1996;
Sperber 1996; Boyer 1994; Brown 1991). These intersections have focused on what is universal to humans, rather than
on the variation, and in so doing have brought at least the possibility of more
balance to the conceptualization of humans from the anthropological
perspective, which so far has been in the tight grip of the assumptions
inherited from the Boasian tradition: human universals are unlikely, and
the importance of cultural differences is great (cf. Brown 1991).
Significantly, the cognitive turn has made students of culture who are
informed by these theoretical turns of events more careful about apparent
differences and similarities that may exist between different cultures.
The main new bias, however, is certainly to ignore differences in order to
focus on what appears general. Perhaps predictably, this risks becoming a
new dogma that fails to do justice to the reality and importance of
cultural diversity (Gil-White 2001c; Richerson & Boyd 2000)-a grave danger
because what may sometimes appear as trifling differences can turn out to
have profound effects for the historical trajectories and system behavior
of different societies (e.g. Nisbett & Cohen 1996; Kelly 1985). But, that
danger aside, there is no question that the cognitive revolution has
provided a needed counterbalance, as well as better methods. Cognitive
anthropologists have moved beyond mere elicitations or demands that
informants introspect about their (often unconscious) mental processes. In
their stead has grown a concern for procedures that will make people use
their cognitive models, allowing us to infer their content and structure.
Evolutionary psychology and the strands of
cognitive anthropology that intersect with it have in the tight,
even suffocating, grip
of the assumptions inherited from the in
which the Boasian tradition: human universals are unlikely, and the importance of
cultural differences is great (cf. Brown 1991). S
has held much of cultural anthropology
(e.g. Tooby & Cosmides 1992; Cosmides & Tooby 1994; Atran 1998, 1990;
Hirschfeld 1996; Sperber 1996; Boyer 1994; Brown 1991). These
two disciplines (and their myriad intersections) have
focused on what is universal to humans, rather than on the variation, and in so
doing have brought at least the possibility of more balance to the
conceptualization of humans. Significantly,
the cognitive turn has made students of culture who are informed by these
theoretical turns of events more careful about apparent differences and
similarities that may exist between different cultures. The main new bias,
however, is certainly to ignore surface differences
in order to focus on what is appears general. Perhaps predictably, Tthis risks
becoming a new dogma that fails to do justice to the reality and importance of
cultural diversity (Gil-White 2001c; Richerson & Boyd 2000)—a grave danger
because ; what may sometimes appear as
trifling differences can turn out to have profound effects for the historical
trajectories and system behavior of different societies (e.g. Nisbett &
Cohen 1996; Kelly 1985). But, that danger aside, there is no question that the
cognitive revolution has provided a needed counterbalance, as well as better
methods. equipped us with a new way of formulating and
asking questions. Cognitive anthropologists have moved beyond mere
elicitations or demands that informants introspect about their (often
unconscious) mental processes. In their stead has grown a concern for
procedures that will make people use their cognitive models, allowing us
to infer their content and structure.
In a cumulative science, improved perspectives and
methods call for a reassessment of earlier conclusions. Given
these new perspectives and methods now at our disposal, it is imperative to
make reassessments of the conventional wisdoms that may have emerged from
studies based on prior methodologies. One such
area that may call for reexamination is the literature on the cognitive models
of race in Brazil (Harris et al. 1995, 1993; Byrne et al. 1995; Harris 1970;
Sanjek 1971; Harris & Kottack 1963). The consensus that has emerged from
this research is that racial categories in Brazil are fluid (interpersonally,
and perhaps also inter-temporally variable), fuzzy (without clear boundaries), and
ambiguous (admitting of different, imperfectly overlapping labels). To me, this
was surprising, because I found ethnic categories in Mongolia to be crisp
classical categories with necessary and sufficient conditions of membership,
and a very strong overlap across most individuals. Specifically, I found that
for the overwhelming majority of respondents, membership in an ethnie was a
matter of patrilineal biological descent from another member of the category.
It could be that Mongolia and Brazil, being different places in far flung
corners of the world, have developed very different cultural models for
thinking about race and ethnicity. But it could also be that, had I used
Harris’s methods in Mongolia, I would have reached his conclusions, and that,
if I were to use my own methods in Brazil, the conclusions would be very
similar to those I reached in Mongolia. This possibility raises the question of
whether the difference is an artifact of the methods chosen, and so we must
examine and compare in order to decide which set of methods is better at
addressing the central question: what is the native system of categories? That
will be the purpose of this paper.
Before I go further, I must clarify what may appear as a
confusion of domains. Above I have noticed
a disparity between the claims of Harris and colleagues, who were
investigating ‘race’, and my own
investigations, which have to do with ‘ethnicity’. In order for my findings to
be truly discrepant with those of Harris, ‘race’ and ‘ethnicity’ should be synonymous
in some sense, otherwise Harris’s and my own are just different findings
corresponding to different domains of cognition and experience. To clarify this
issue I offer the following distinctions.
‘Race’ in the technical sense. To the
biologist a race is a subspecies, identified by a morphological and
genetic discontinuity between two populations in the same species. The lay
categories of race that humans variously think they see in their own species do
not meet the criteria of a morphological or genetic discontinuity (see Boyd
& Silk 2000; Brown & Armelagos 20010).
There are no human races in the technical, biological sense.
The lay category of ‘race’. This is a category of people made on the
basis of phenotypic attributes. It is a category of people whose bodies ‘look’
a certain way. By using Since Harris
was using depictions of humans with different kinds of features
and asking his Brazilian respondents to label them, Harris was
attempting to he was investigate ing lay categories of race. But Aa lay category of race is always more than just
a category of people with certain physical traits. Membership in the category
is held to be explanatory or predictive of other things. This is why
‘Brunettes’ are not a ‘race’—there isn’t much in the way of content or
expectations,
—implicitly or explicitly, associated
with a member of the category. It is possible, moreover, that But a lay category of race is more may be even more than just a
phenotypic category with explanatory or predictive beliefs attached; the people
who use it may believe that they are looking at a biologically meaningful
grouping. That is, laypeople may believe that their lay categories
of race are in fact ‘natural’, even if the intuition does not carry all of the
sophistication of the technical understanding described above. Hirschfeld’s
(1996) investigations suggest this is so, and that the intuitions to process
our phenotypic categories as if they cut our the human species
at important natural joints are active at a very young age and may even be
innate. Whether or not in any particular culture some phenotypic categories are
essentialized as natural categories is an empirical
question—one that Harris did not pursue for Brazilian categories.
‘Ethnicity’ in the technical sense. It is not without controversy for me to advance a technical definition of ethnicity because the experts on this topic actually spend much time trying to undermine each other’s definitions. However, I have tried to show before (Gil-White 1999:813) that despite what appear to be lengthy, ink-spilling disagreements between totally opposite camps, all scholars of ethnicity actually implicitly agree with the following definition: “ethnic groups are agglomerations of people who, at a minimum, represent themselves as vertically reproducing historical units implying cultural ‘peoplehood’.”
The lay category of ‘ethnicity’. Since, as noted above, analysts recognize an ‘ethnic group’ when they think they have found a set of laypeople who organize themselves around an ‘ethnic identity’, there is not here, as in race, the possibility of a sharp distinction between the analytic and lay understandings of ethnicity. But at least one can say that the cultural distinctiveness, the purity of vertical reproduction, and the historical depth of the groups may be greatly exaggerated by native participants in an ethnic category system, and the analyst has no need to go along with such exaggerations and/or reifications. As before, a lay category of ethnicity may be, cognitively, more than just the claim of a cultural distinction and its mode of reproduction. Laypeople may essentialize their ethnic categories as corresponding to natural, biological cuts of humanity. This amounts to saying that laypeople may racialize their ethnic boundaries. Again, whether they do or not in any particular locality is an empirical question.
I have argued before that the extant empirical data favors
the interpretation that laypeople generally racialize their ethnic categories
(Gil-White 2001a,
1999). And sub-populations
characterized as representatives of a ‘race’ in a particular times and places so-called
races in particular times and places easily develop all the
trappings of ethnic groups (e.g. the white/black boundary in the US). This
convergence between the two domains is reflected in the past—and sometimes
current—practice whereby ethnic groups were referred to as ‘races’. And English
is far
from being not the only language to uncertainly
apply the same term to both domains (see below). All of this argues for the relevance
of cognitive investigations in the domain of ‘race’ to those which study
‘ethnicity’ and vice-versa. If it seems likely from some evidence that the lay
domains of race and ethnicity are really—from the point of view of the
cognitive processes that underlie them—one single domain, then we must pay
attention to findings that appear to open this to question. My data about the
cognition of ethnicity in Mongolia appear to contradict rather directly
Harris’s data about ‘race’ in Brazil, and therefore finding out whether this is
a genuine contradiction or a methodological obfuscation deserves the attention
I will give it below.
I begin by reviewing briefly the methods I used in Mongolia, the results, and their interpretation. The presentation is quite abridged as my concern here is with broad methodological priorities, but the full details may be found in Gil-White (2001b).
My respondents are Torguud nomads in the district of Bulgan CumBulgan Sum,
XovdHovd
Province, Republic of Mongolia. Torguuds are a relatively small, Mongol ethnic
group, and they share the district of Bulgan CumBulgan Sum
with KazaxKazakhs,
who make up about 30% of the population in the district. Like Torguuds, some KazaxKazakhs
are nomadic herders and others are town-dwellers. Although there are a few KazaxKazakh
families in what is essentially Torguud territory, on the whole Torguuds and KazaxKazakhs
are territorially segregated—even Torguud and KazaxKazakh
town-dwellers live in separate parts of town. Relations between the groups are
outwardly friendly, if for the most part distant. Trade between them is very
limited, as most conduct a subsistence lifestyle and their economies are almost
identical, resulting in an absence of trade efficiencies. Intermarriage is
entirely absent (in a population of 10,000, there was only one reported case of
intermarriage and I was not able to corroborate it). There is very little
day-to-day interaction between Torguuds and KazaxKazakhs,
especially among herders, and they don’t know too much about each other’s
customs and norms.
The purpose of my investigation was to find out what Torguuds considered to be the rule or rules for being a member of an ethnic group, whether their own or another one. Much ethnographic data from around the world suggests that criteria of blood are paramount when making ethnic ascriptions. However, it has become something of a tradition in the social sciences to claim that ethnic membership is fluid, contextual, situational, and even rationally chosen (see Gil-White 1999 for a review). I decided not to pit criteria of blood against rational choices, but against enculturation. If enculturation from birth into a group—absent a tie of blood—will not confer membership, then rational choices are a moot point. This is therefore a very strong test of the ‘primordialist’ hypothesis, which maintains that ethnic actors will require biological descent from a category member in order to consider a given individual a member of the ethnic category.
The procedure consisted in asking a sample of Torguuds the following questions:
Question (1) If the father is KazaxKazakh
and the mother Mongol, what is the ethnicity of the child?
Question (2) The father is KazaxKazakh,
the mother Mongol, but
everybody around the family is Mongol and the child has
never even seen a KazaxKazakh,
outside of the father. The child will learn Mongol customs and language. What
is the ethnicity of this
child?;
Question (3) A KazaxKazakh
couple has a child that they don’t want. They give it in adoption to a Mongol
couple when the child is under a year old. Around the Mongol family there only
are Mongols and the child grows up never meeting or seeing a
single KazaxKazakh.
He is never told of the adoption and thinks that his biological father and
mother are the Mongol adopters. He grows up learning
Mongol customs and language. What is the ethnicity of this child?
Because in the first question my respondents unanimously
answered that the child would be KazaxKazakh,
revealing a patrilineal model for ethnic ascription, the second and third
questions were crafted so as to pit descent against enculturation, with
question 3 presenting the most extreme contrast.
The first time around a sample of 59 Torguuds were asked
these questions in a setup where the child is male, and the questions are
were
asked in sequence. In a second test, a new sample of 41 Torguuds was
asked the same questions but the order was randomized, the ethnicities were
switched (so that the father was now Mongol and enculturation KazaxKazakh),
and the child was made female. A majority in both tests believed that a child’s
ethnicity was that of his biological parents (principally his/her father’s), no
matter what the circumstances of enculturation, or what the child may have been
told. For question 3, this majority was a strong one in the second test (76%,
versus 59% in the first test), which randomized the order of the questions and
therefore did not create an expectation that the researcher, by making the
circumstances of enculturation progressively extreme, was fishing for an
enculturation-based answer.
Although this establishes that one’s ethnic ascription is
a matter of patrilineal descent, this could might be
nothing more than a way to flag one’s community of biological origin. To know
whether the ethnic categories are essentialized one needs to investigate if
respondents attach to ethnic membership the expectation of a behavioral
disposition. A subsequent test asked a sample of those who had answered that
descent always trumped enculturation whether the child in question 3 would
behave exactly like members of its community of adoption, or somewhat
different. A strong majority (74%) replied that the child in question 3 could
not or would not behave exactly like members of its adopting ethnie, but would
instead behave somewhat like the ethnie of its biological parents. This,
despite the child having absolutely no contact with this ethnie, and despite
being unaware of any such ancestry. Ethnographic data further suggest that even
those who gave explicit enculturation-based answers may have implicit intuitions
to match those of respondents who gave explicit descent-based/essentialist
models (Gil-White, 2001a).
The bulk of these findings suggests that ethnic categories
may be ‘naturalized’. Specifically, iIt may even be
that we treat them as ‘folk species’ categories akin to the categories bear
and mouse
(see Gil-White 2001a for a defense of this argument; for a
different interpretation of ethnic and racial essentialism see Hirschfeld 1996,
Gelman & Hirschfeld 1999). Here I restrict myself to commenting on what
this means for a reassessment of past methodology to understand ethnic
cognition.
Categorization processes in living kinds have two interesting properties where the importance of ‘appearance’ is concerned: (1) category membership is independent of appearances (Gil-White 2001a; Medin & Atran 1999; Atran 1998, 1990; Keil 1989; Gelman 1996, 1991; Medin & Ortony 1989); but (2) membership in a given category nevertheless carries an expectation of category-specific morphological uniqueness (Markman 1989, 1990).
The first point says that appearance is not the criterion
for categorization, although we may use appearances to guide our first
guess as to where a given object belongs. This is because category membership
is conceived as resulting from possession of the category ‘essence’ (believed
to reside ‘inside’), not a set of external features. It is true that the essence
is believed to typically cause typical surface
features, including appearance, and therefore that common category membership
will lead to the expectation of surface similarity between any two objects. But
since this expectation is not defining, a common appearance is neither
necessary nor sufficient for category inclusion. A dolphin, for example, looks
like a fish, but isn’t. Appearances are not a matter of fuzzy membership
either. A dolphin’s appearance does not make it partly a fish. It is not
a fish at all. And this is also not a question of modern versus old biology
(Atran , 1998). It is our readiness to accept
the statement or discovery by an expert source that dolphins are not, after
all, fish (and our immediate assumption that there must be something intrinsic
to dolphin ‘nature’ that makes this true) that sets living-kind categories
apart—not the actual theory we may have about the taxonomic position of
dolphins.
To see the distinction between sorting (which involves
identification procedures) and categorization (which involves reasoning about
definitional criteria), suppose I present you with a new stimulus and ask you
“what is this?” Assuming I give you nothing to go on beyond what is apparent on
the stimulus itself, you will give me
the best answer you can, with the available information. Using the Given
the appearance of the object as your guide, you will provisionally sort
it into the category that you think it most likely belongs in, and this will be
your answer. That however—and this is crucial—does not mean that the cues you
use to make this initial tentative sorting constitute the actual criteria
for category membership (i.e. the definition of an object inside the category X
is not necessarily equivalent to “possession of the cues a, b, c, ..., which are used to make the tentative
sorting”). If presence of these cues is not coextensive with the definition of
a member of category X, then I can give you additional information about that
object that may lead you to revise the slot that you had put it in at first (for
the difference between identification procedures and concept representation,
see Smith & Medin, 1981:40-43; Armstrong, Gleitman, & Gleitman, 1983)
My data suggests that if Dorj, a Torguud Mongol, were to
encounter a man who was dressed like a KazaxKazakh,
spoke KazaxKazakh,
and behaved in every obvious way like a KazaxKazakh,
Dorj would think to himself: “Lo, a KazaxKazakh.”
However, if told subsequently that this man’s biological parents were both
Mongol and that he had been adopted at an early age into the KazaxKazakh
community, Dorj would then think: “Aha, so this man is really a Mongol!” Dorj at first sorted the man into the KazaxKazakh
slot with the available information. He made the best guess he could make under
the circumstances. But finding that Dorj makes such guesses does not mean that
Dorj thinks “if it talks like a KazaxKazakh
and walks like a KazaxKazakh, then
it is a KazaxKazakh.”
Clearly, Dorj doesn't think this. A sorting guess made on the basis of
appearance, and categorization, are not the same thing.
It is precisely the fact that appearances are insufficient
for categorization, plus the expectation of behavioral regularities resulting merely from
the fact
of descent from Mongols alone (in
the absence of any contact with Mongols), which suggests that we are looking
here at an essentialized ‘natural kind’ category. The fact that the ‘essence’
is taken to be passed on reproductively suggests a species-like category, for i. In such categories, membership is unequivocally established believed to result from possession of the essence,
and is established from those facts as will determine if the the condition s for having the essence has been
ascertained, which condition amounts to the essence. In the case of
species categories, this is biological descent from another category member.
Notwithstanding the fact that appearances are insufficient
for categorization in ‘living kinds’, there is certainly a perceived causal
link between the essence and a given typical appearance; two ‘living kind’
categories will be expected to have
different appearances by virtue of their having two different essences, even if
appearance is not defining for any one individual. This may create a bias
bigger than the baseline category accentuation effect (Hogg, 1988:19) that
obtains generally and which makes us see members of different categories as
more contrasting than they actually are. In Bulgan CumBulgan Sum
I found that my respondents unanimously expected the biological child of
Mongols, if adopted into a KazaxKazakh
community, to look physically different from its adoptive community. However,
when I tested discrimination ability with a set of anonymous photographs, I
found that there was considerable phenotypic overlap between the groups
(Gil-White, 2001b). This suggests that information about category membership
may, in living kinds, greatly exaggerate perceptions of
dissimilarity.
The above findings strongly imply that when it comes to
ethnicity, we need to dig deeper than appearances and classifications of
appearances. Given the above-mentioned strong similarities and even common
semantic blurring and confusion between lay categories of ethnicity and race,
as well as experimental data supporting the idea that races are likewise
essentialized (Hirschfeld,
1996), the same admonition applies to racial categories.
There is now a corpus of work on the basis of which at least many some scholars believe that racial
categories in Brazil are quite different from their counterparts in the US
(Harris et al 1995, 1993; Byrne et al 1995; Harris 1970; Sanjek 1971; Harris
& Kottack 1963). The main contention is that racial categories in Brazil
are highly ambiguous with very fuzzy boundaries. In light of my findings, I
shall now argue that this conclusion is premature and, given the methodology
employed so far
by Harris and colleagues, not yet supported by any data. I begin with a
detailed discussion of Harris (1970) since that paper is emblematic of the
others.
Harris (1970) This study elicited
‘racial’ classifications from 100 Brazilian respondents by using
...a deck of 72 full face drawings constructed out of the combination of three skin tone, three hair form, two lip, two nose, and two sex types (...) Responses were initiated without using terms presumed to be part of the domain...In general it was found that a question involving a request for the 'qualidade' or 'tipo' or 'raça' of the person depicted in the drawing was adequate to prime the response process. The word 'cor' was used only as a last resort.
There is either Harris seems to have little concern for
priming the domain of race, or else Harris feels confident that how the question
is put will not really
affect the results much. But to assume that the words
‘qualidade’, ‘tipo’, ‘raça’, and ‘cor’—which translate into English roughly as
‘quality’, ‘kind, ‘race’, and ‘color’[1]—are
interchangeable as stimulants of responses in the same domain is strange. In Bulgan CumBulgan Sum,
for certain, I would not get answers in the same domain if I asked some people
“what is the ündecten (ethnie/race)
of this person? ” and others “what is the öngö
(color) of this person?” It appears that which question was used was ad hoc and depended on the idiosyncratic
comprehension problems of individual participants. Harris does not code his
data according to which responses were elicited by which kind of question, and
he reports no evidence of their interchangeability or a test of their effects
on responses. He obtained quite a wild variation in his responses—restricting
ourselves just to those terms used more than 100 times, these are still a grand
total of 12. However, this much
variation could be at least partly the product of asking four different
questions, not all of them priming the same domain, rather than the product of
a large universe of Brazilian ‘racial’ terms, as he claims.
Harris also finds that there is little intersubjective
agreement about the boundaries of the different categories. Perhaps. But by
showing people pictures and asking them to classify them he is more likely
testing identification procedures rather than cognitive models of category
membership. It is possible for the entire population to be in intersubjective
agreement about what makes a ‘branco’ a ‘branco’, but for different individuals
to have different thresholds—on the basis of appearance—for making a guess that
a person is branco or not. For example, suppose you and I agree that someone is
branco (‘white’) if and only if his/her parents are also branco; however, you
and I have different thresholds of increasing brownness at which we stop
guessing that someone is a branco. Should we be told the facts of descent, you
and I will agree on who is and isn’t branco, but when Harris shows us the
pictures, we will make different guesses. Because Harris’s study does not delve
deeper than the initial appearance-based
sorting guesses on the basis of appearance, the
underlying cognitive models are not really explored, and his data are therefore
silent on the intersubjective agreement that may exist for these. Aggravating
this problem is the fact that using black & white drawings rather than
photographs (or at least more realistic color drawings) is likely to increase
the ambiguity in people’s guesses because people are likely to have individually
varying biases to imaginatively complete the missing information.
Another possibility is that category X is understood by everybody to have the same conditions of membership, and that most of the respondents did not even disagree with each other during the identification process, but people name the categories differently. For example, you and I agree on who is a ‘preto’ (‘black’) but when Harris asks us, I say ‘preto’ and you say ‘moreno’, meaning the same category. The looseness of intersubjective variability in naming of the category is not the same as looseness in identification of the category or its membership requirements—especially if one usage is more polite than another and some respondents are trying to be polite. In Mexico, for example, people of African descent are usually referred to as ‘negros’ but, more politely, ‘morenos’ or ‘prietos’ (the two latter words have other usages and that is what makes them polite). Similarly, in the US, it would not be sensible for a researcher to assume intersubjective disagreement if some respondents identified a picture as ‘African American’, others as ‘black’, and yet others as ‘colored’—it is anything but far-fetched to suggest that many such respondents would agree if questioned further that they were talking about the same category. Harris interprets his data as pointing to differences in American and Brazilian racial categories, but a little introspection suggests that applying his methods in the US would result in qualitatively similar data.
Yet another possibility is that some of these words may function sometimes as racial category labels and other times as literal color descriptors. This seems plausible for the word ‘moreno’ ( ‘tanned’ or ‘brown’), and it may account for Harris’s (1963) claim that sometimes full siblings in Brazil receive different ‘racial’ labels. The terms almost certainly don’t function as ‘racial’ labels at all in these cases. It should be obvious that not all phenotypic distinctions are ‘racial’. When we speak of blondes and brunettes we are not, by that token, separating people into different ‘racial’ categories. Likewise, what Harris is referring to almost certainly involves people making non-racial phenotypic distinctions between siblings, and the terms used then function merely as labels to distinguish the surface appearance of siblings naturally considered to be of the same ‘race’.
This is certainly how color labels function within ethnic/racial categories in Mexico, where I grew up. In Mexico the upper-class is overwhelmingly ‘white’. Among whites, I am considered ‘moreno’ because I have dark eyes and hair, and white but usually olive skin due to easy tanning. One of my sisters had blond hair when she was young and still has blue eyes and very white skin. Among whites, she was called, as a girl, a ‘güera’, which is the term used by this group for any blonde. However, to an Indian or a mestizo we are both ‘güeros’ (‘whites’) and that is what Indians and mestizo’s who don’t know me and who want to attract my attention always call me ‘Hey, güerito!’—literally ‘whitey’ (although in Mexico no offense is taken, partly because of the double meaning). The term has this connotation especially for members of Mexico’s lower classes. A similar effect may take place in Harris’s data because his respondents are from different socio-economic strata and may thus, as in Mexico, have class-based usage variations for the same terms.
The residents of Bulgan CumBulgan Sum
are at the geographic center of Asia, and in their history they have migrated
far and wide. As a consequence they display astounding phenotypic diversity—at
least half have either blue or green eyes; there are blondes and brunettes;
some have epicantic folds over their eyes, others don’t; there are very pale
and very dark-skinned people, etc.; and also every conceivable combination of
traits (e.g. very dark-skinned people with green eyes and epicantic folds are
not uncommon). All this naturally leads people to employ, as in Brazil, a quite
rich vocabulary of phenotypic description. Individuals are said to be yellow,
brown, red, white, black… Many a person’s nickname is a diminutive form of
these color terms. But to describe a girl as ‘yellow’ (e.g. ‘Shar Oxio’ or
‘yellow daughter’) and her brother as ‘black’ (e.g. Xaraa or ‘blackie’) hardly
means they put them in different racial or ethnic categories. They are both
Mongol.
For another angle, consider that the residents of Bulgan CumBulgan Sum
are aware through schooling that Africans exist and some of them have even seen
one or two in the flesh because they came to this area as tourists. They refer
to them as ‘blacks’. The fact that some Mongols are also called ‘black’ does
not imply any ambiguity about their ethnoracial categories. Within Mongols, ‘black’ refers to a
person’s hue, not a person’s racial category. This is an elementary observation
in Mongolia, where the contrast between a Mongol referred to as ‘black’ and an
African is obvious. The contrast between Mongols and KazaxKazakhs,
on the other hand, is far from obvious, and corresponds to an objectively fuzzy
continuum comparable to the Brazilian contrast between, say, ‘pretos’ and the
intermediate category ‘morenos’. But since the conceptual distinction between
Mongols and KazaxKazakhs
is not fuzzy, we must consider it at least possible, on the strength of the
Mongolian evidence, that in Brazil objective phenotypic fuzziness in the racial domain
may likewise not imply a fuzzy conceptual boundary.
Finally, when Harris finds that a given drawing was
identified by most as ‘branco’ but by a substantial minority as ‘sarara’, does
this mean that Brazilian racial categories are ambiguous, or that some
respondents are giving the basic-level category and others a more specific
color description? To use a comparison example, if he showed them a picture of
a laptop computer and asked them ‘what is this?’, would finding that some
people answer ‘computer’ and others ‘laptop’ mean that they have different ways
of parsing the world? No. In this case it would simply mean that some people
chose a subordinate category and others a superordinate category, but that they
both agree that it is a computer and, more specifically, a laptop. In the same
way, perhaps, a person can be a ‘branco’ and, more specifically, a ‘sarara’ (in
other words, ‘sarara’ is the kind of ‘branco’ that she is). Which interpretation
is correct, however, cannot be determined unless one looks to see if any of the labels
given stand in inclusion relationships to each other. Harris did not
attempt this.
The above alternative explanations for Harris’s results are certainly not mutually exclusive. A second look at his data will illustrate this.

Figure 1. The numbers on the X-axis represent the descending
rank-order of the drawings in terms of number of ‘branco’ responses (Harris’s
numbers are different because he kept his original numbered identifying labels;
the order of the drawings is the same as Harris’s, however). For each drawing,
the number of ‘branco’ responses can be found on the Y-axis. The branco curve
is a close approximation of Harris’s smoothed line. For the other curves,
Harris provides the actual data points and they are reproduced here.
<<Insert Figure 1 about here>>
The figure above superimposes Harris’s ‘moreno’ responses on the ‘branco’-‘preto’ axis. I will refer to the drawings 1-15 as the ‘exclusive branco’ range because ‘branco’ is the most common response throughout that range. Similarly, I will call the drawings 32-36 as the ‘exclusive preto’ range. The ‘extended branco’ (where ‘branco’ is more common than ‘preto’ but not necessarily more common than ‘moreno’) is the range of drawings 1-21; the ‘extended preto’ is 21-36. The ‘moreno’ range is the set of drawings 15-32. Notice that although the extended branco and preto ranges meet almost at the midpoint, the exclusive preto range is very small because ‘preto’ responses dip very quickly. This is also because on average, there are more ‘moreno’ responses inside the extended preto range than in the extended branco range.
How can we explain the above distributions? One interpretation could go as follows. If the term ‘moreno’ sometimes refers to a racial category intermediate between ‘branco’ and ‘preto’ we should see a moreno range in the middle where ‘moreno’ responses swamp both ‘branco’ and ‘preto’. This is apparent above. If ‘moreno’, within a racial category, simply means ‘intermediate’ for that category, then we should see a ‘moreno’ spike in the middle of the branco range, and another one in the middle of the preto range. The first is apparent in the figure above (drawings 8-9). The second might be the spike at 31-32, except that it is not really occurring within the exclusive preto range whereas the first one occurred in the middle of the exclusive branco range. This might be because ‘moreno’ is sometimes used politely for people who are considered racially ‘preto’. So the spike and its base are both raised, the ‘preto’ curve lowered, and as a result the size of the exclusive preto range is cut shorter. This interpretation is bolstered by the fact that even with the most extreme drawing on the preto side (#36) there appears to be a reluctance to use the term ‘preto’ (32 instances against 50 of ‘branco’ for the comparison drawing on the branco side). Harris claimed that there was no relationship that he could find for the moreno responses on the ‘branco’-‘preto’ axis. But superimposing them as I do above (which Harris doesn’t do for his readers) suggests this interpretation as a possibility, though it is not one admitting of a statistical test. More data needed to be gathered about the respondents’ cognitions for such tests to be meaningful. There is simply too much missing information, and too much potential noise, in this data set.
The main reason for the noise is that Harris has asked his
respondents to sort a lot of perceptual stimuli. But sorting is not
categorization—that is, not necessarily. To find out if what emerges from a
sorting task of perceptual stimuli corresponds to the racial categories we need
to investigate how people think about the resulting ‘lumps’. If we don’t do
this all we have is data about categorization sorting guesses
that result from identification procedures—these are , not
necessarily the categories in people’s heads.
Harris wants us to accept the data as evidence of a racial categorization
system that, by comparison to, say, the American, is very wild. But this is
premature. We need to know what the racial categories are. These cannot be
inferred willy nilly from appearance categories—it is not written anywhere that
racial categories and first-guess piles due to sorting on the basis of
appearance should match exactly.
In a later study, Harris et al (1993), with a different methodology, again fail to rigorously rule out the confounding of
color descriptors and racial labels. They have two different but roughly
comparable samples in terms of phenotype. People in both samples are first
asked to self-identify freely (the authors do not report the question used).
Then one sample is asked to self-identify again in terms of just the four forced
choices of ‘branco’, ‘preto’, ‘amarelo’, and ‘pardo’ (the ‘pardo option’),
while the other sample does the same with the forced choices of ‘branco’,
‘preto’, ‘amarelo’, and ‘moreno’ (the ‘moreno option’). The question is: of
those who initially freely self-identify as ‘moreno’ how many will reidentify
as ‘pardo’ in the ‘pardo option’?
Harris et al find s that
only 37.4% of those who at first freely-identify as ‘moreno’ later reidentify
as ‘pardo’ in the ‘pardo option’. They His conclusion conclude is that
the racial categories are ambiguous, and he criticize s an
earlier academic effort (Silva, , 1988)
that led to the Brazilian census categories for having disregarded this by
discarding the ambiguous term ‘moreno’— and
adopting ‘pardo’ instead—as
the label for the intermediate racial category in the list of forced choices
that appear on census forms.
Harris’sThe conclusion
is rather startling. However, In the first
place, Silva (1988) argued that ‘moreno’ sometimes acts as
a racial label and sometimes as a literal color descriptor (this is a
recognition of ambiguity, but of a different kind), and he was trying to avoid
confounding the two. Because he thought that ‘pardo’ primed the racial domain
better by avoiding this confusion, the recommendation for the census became to
use ‘pardo’ instead of ‘moreno’ for the intermediate racial category. One may
argue that Silva did not rigorously establish that ‘pardo’ was the best choice,
but there is no question that he was concerned with distinguishing racial labels
from color descriptors. Harris again seems to have no concern for this, and it
leads him to his perplexing practice of treating any color terms applied to
people as ‘racial’ labels without any need for further evidence. He thus seems
unaware of
the fact that his own data in fact support
Silva’s contention that ‘moreno’ is sometimes a racial label and sometimes
a color descriptor (Telles, 1995 makes the same point), and that ‘pardo’
narrowly primes the racial domain. It is precisely under such
circumstances the latter circumstance that we should
expect a substantial proportion of freely self-identifying ‘morenos’ to later
reidentify as ‘branco’ or ‘preto’ in the pardo option, as Harris found.
And what about the moreno option? It is plausible that asking people to freely self-identify will cause many to answer in terms of color rather than race (especially if there is not much rigor in how the question is posed), but that being presented with Harris’s four forced choices (even when one of them is ‘moreno’) will better prime the basic-level domain of racial categories. This hypothesis would be supported if there was a noticeable decrease in the number of people who reidentify as ‘moreno’ even in the moreno option. However, Harris et al do not even report this crucial evidence.
Some suggestions for fixing the problems with the above methodologies are as follows. First, if the investigation is about race, one must be careful to elicit data in such a way that other domains are not primed. Sticking to the word ‘race’ is a good idea, but even better than that would be a pretest of how different words may lead to different kinds of elicitations. The use of color photographs, rather than black-and-white drawings, should be considered; but if drawings are used, the more information in them the better—realistic color is best. The drawings should be used merely as an instrument for respondents to express, if they can, a set of mutually exclusive categories that fully specify the domain. Proceeding this way is likely to establish the set of mutually exclusive categories at the ‘basic level’, for if respondents are asked to sort all photographs into categories they will almost certainly privilege the basic level (for discussions of basic-level categories see Lakoff, 1987; Smith & Medin, 1981; Rosch, 1978). This however, should not be taken for granted; one should proceed with further investigations. For example, if more than one clump received the same or similar names, or if a set of photographs is called by one name by some respondents, and a different name by others, respondents should be challenged to explain. Following that, one can ask if it is possible to name this or that category with other labels in some contexts, and what would these be. Careful attention should be paid to possible inclusion relationships to see which of the resulting categories parse the domain at the same taxonomic rank-level. Then one can ask about membership conditions to these categories—a very important question that helps distinguish appearance-based from ‘essentialized’ categories, and determines whether or not the categories obtained are mutually exclusive. Finally, one can compare the results of many individuals to see how much intersubjective variability there is in basic-level categories found and membership conditions given. Whether or not the categories show intersubjective agreement in the phenomenological boundaries is much less interesting than whether the cognitive models of respondents agree, and whether they would resolve borderline cases in the same way should they be given the same background information about the individuals that constitute the stimuli.
It is true that one does not find in the US as varied and prolific a vocabulary for describing people’s phenotypes as one does in Brazil. It is also true that the US system features hypodescent, which prevents the emergence of intermediate racial categories. These are quite hoary observations of the differences between the US and Brazil. However, it is possible that this is the extent of the difference. Harris simply assumed that all of the terms he got referred to ‘racial’ categories and did not bother to examine people’s cognitive models, limiting his study to an elicitation of labels. (This posture explains Harris’s remarkable claim that in Brazil full siblings may be classed in different ‘racial’ categories, an interpretation that stretches thin not only the data he has, but also the concept of ‘race’ as usually understood.) It seems quite possible that the differences in terms of cognitive models of race between Brazil and the US may be smaller than Harris thinks.
The problems alluded to here appear to result at least in part from Harris’s reinterpretation of Kenneth Pike’s original distinction between emic and etic. Harris keeps the terms but his definitions in fact are quite orthogonal to Pike’s approach, have a different theoretical purpose, and importantly influence the methodological road taken. Revisiting these concepts is helpful both in terms of understanding Harris’s approach and its pitfalls, and to clarifying the useful work that cognitive anthropology can do to understand alien category systems. The words emic and etic, unfortunately, have come to mean almost anything to almost anyone (a sad commentary on the triumph of rhetoric over science in much anthropology). Headland (1990:20-22) reviews an astounding plethora of uses with almost nothing in common. In this discussion, however, I will focus on the original meaning, and on Harris’s early departure from it.
Before I do, an important caveat. In anthropology it is customary to spill much ink trying to wrest control of the meaning of terms of art from other anthropologists, as if defining a word were equivalent to providing an explanation or making an intellectual contribution. Many proceed as if the meaning of a word—itself apparently of profound importance—was something to be discovered through careful application of quasi-philosophical reflection, and the anthropologist was the professional whose task was to discover it (Headland et al, 1990 is an egregious example of this sort of thing). But this exercise does not advance science and it is not my purpose here. I am not advocating that the terms ‘emic’ and ‘etic’ be used in any particular way, and I don’t think the words themselves have any mysterious power. I merely want to show (1) that Pike and Harris are talking about completely different things despite the fact that they use the same terms, and (2) that Harris, though he defines the terms differently, nevertheless expects ‘emic’ and ‘etic’ to do the same work for him that they do for Pike. This causes serious confusion about what needs to be done in order to explicate an alien category system, and is directly responsible for the methodological problems outlined above. In a nutshell, I will argue that Harris uses emic/etic to distinguish between a ‘folk’ and an ‘analytic’ category , whereas Pike uses emic/etic exclusively to analyze the boundaries of folk categories.
Kenneth Pike’s original innovation was the result of applying his understanding of one system of categories, phonemes, to other systems of categories to see if this could generate any insights. A phoneme is not a sound but a collection of sounds. The ‘p’ in ‘pit’ and the ‘p’ in ‘spit’ are not the same sound, and they are produced by a different combination of vocal articulations. However, to an English speaker they both count as the same phoneme /p/. We therefore have that there is a collection of different sounds (phonetics) that a speaker will parse into a smaller number of functional/grammatical units (phonemics). The suffixes were used by Pike as technical terms to distinguish the variation in any domain—etic phenomena—that are parsed by people in a given culture into the same categorical slot—or emic unit. Thus, Pike was extending an insight of linguistics—the explication of the grammatical properties of the smallest units of speech, which resulted from understanding how a collection of different sounds constituted the same category of grammar—to other cultural ‘grammars’.
In all cultural domains, there is a lumping of objects,
performances, or events —which
are phenomenally and measurably different —into
the same category. A useful illustration offered by Pike (1990:28-29) is a
‘strike’ in baseball, as there are many different kinds of performances that
all count as ‘a strike’.
Individual emic units may have variants without losing their
status as the 'same' emic unit for a particular person or subculture.—Pike
(1990:28-29)
This insight may not appear as a very deep one until one
realizes that there may be nothing obvious at all about the actual parsing
processes and the categories themselves. For example, to many most native English
speakers, the contention that the ‘p’ in ‘pit’ and the ‘p’ in ‘spit’ are two
different sounds will come as a surprise. Thus, for Pike, the
‘insider/outsider’ distinction is directly tied to the fact that category
systems may be opaque to those who use them daily.
Insiders act but mostly don't understand what they know. I use and categorize sounds spoken by English speakers correctly into the phonemes of English. This is because I am a competent English speaker, an insider of the English language. But I don't have any insight into this tacit knowledge until I analyze my behavior in the manner of an outsider, and treat what I implicitly know as something requiring explanation.—Pike (1990:33-34)
These observations were important to anthropologists because the ethnographer, typically coming from a different culture than the one she studies, is an outsider, and therefore in a good position to find alien category systems as strange and demanding explanation, and also to see at work the categorization processes that go unnoticed by natives. However, there is a danger that the anthropologist will miss native categories if she superimposes her own. Thus, a determination of the native parsing of the world is indispensable to a proper understanding of an alien culture.
Again, think of phonemes. For English speakers, the sounds
which are orthographically glossed as ‘r’, and those which are glossed as ‘l’,
belong to two different phonemes. However, to a native Japanese speaker these sets are both
included in the same phoneme (it is
exactly like the ‘p’ in ‘pit’ and the ‘p’ in ‘spit’ to for a native English
speaker). This explains why Japanese learners of English so often get their
‘r’s and ‘l’s confused, and why this is so saliently strange and amusing to
native English speakers, but so hard to notice for the
Japanese. Similarly, one should expect that people from different cultures will
often parse the world of objects, events, and performances in different ways
that will surprise and confound an ethnographer. It is also possible that some
native categories that should surprise
the ethnographer never will because they go unnoticed—unless the ethnographer
has a method for discovering how natives parse the world. Therefore, it is
important, in the effort to understand an alien culture, to establish what
people will call ‘the same’. But it is equally important, because much of this
may not be conscious, not to trust the natives’ own reports of what is the same because, as insiders, natives may not
always be able to explain their own categories. Cognitive tasks that make
people reveal the underlying and perhaps cryptic categories by using them
are a much better methodological route than asking informants to introspect on
the meanings of things.
To sum up, Pike points out the importance of finding out what objects, performances, or events are lumped as ‘the same’ into one category. To speak of this problem he refers to the measurable variation as etic phenomena, and to the categories in which this variation is lumped by natives as emic units.
I now turn to Harris’s use of ‘emic’ and ‘etic’, which clearly departs from Pike. Below is Harris’s view, consistent over the years (Harris, 1990:53-54; my emphasis):
Grammatical rules...have the same epistemological status as phonemes, since the test of their validity, no matter how abstract the formulation, is whether they generate utterances that native speakers recognize as meaningful and appropriate. Such tests, however, are completely irrelevant to etic analyses, which stand or fall on their contribution to predictive or retrodictive nomothetic theories about the evolution of sociocultural differences and similarities.
Do not be fooled by the reference to phonemes—we are no longer talking about the same distinction that Pike has drawn. Harris distinguishes between what makes sense to the native, and what makes sense to the scientist; a distinction between two different languages, rather than two aspects of the same language (which is Pike’s distinction). ‘Emic’ and ‘etic’, for Harris, stand for two entirely separate ways of looking at the world. Emic is the native’s parsing of the world, and its measurement; etic is the scientist’s parsing of the world (dictated by her hypothesis), and its measurement. This is evident in the fact that Harris considers emic analyses “completely irrelevant” to etic analyses.
Could Pike ever say that emic and etic phenomena are irrelevant to each other? No. In Pike’s definitions ‘emic’ refers to a native category the boundaries of which are determined by looking at the ‘etic’ variation which it encloses. Remember, he starts with an analysis of phonemes: the phonetic variation is the collection of different variants (different along some domain that we can measure) that native categorizers lump together in the same concept (a given phoneme). Thus, ‘etic’, for Pike, does not refer to an alternative categorization with entirely different purposes that a scientist may have, but rather to the ‘stuff’ that the native emic unit is made of. Pike’s project is to find out what natives will consider ‘the same’. The measurement of the (Pikean) ‘etic’ variation is therefore indispensable—rather than completely irrelevant—to finding out which and how many (Pikean) ‘emic’ units natives will parse this variation into.
It is apparent that what Harris has really seized on is
Pike’s reference to an outsider/insider distinction. But notice the difference.
By the outsider/insider distinction Pike merely means to observe that the
outsider (the scientist) can measure the etic variation, but the native cannot
perform this analysis because she is trapped in the perception of the world
that her categories impose. Harris, on the other hand, has taken the words
‘etic’ and ‘emic’ to stand for the perspectives or categorical worldviews
of the outsider and the insider, respectively. For Harris , the
scientist is the
(outsider), and she has has
a hypothesis that dictates one choice of categories, and a system
of measurement that determines one set of claims; , while the
native is the (insiderr), with a separate has
her own—perhaps quite orthogonal—system of categories, and an—also
perhaps quite orthogonal—value system and set of priorities that may lead to
very different claims. This interpretation becomes even more evident as Harris
(1990:53-54; my emphases and brackets) continues:
This does not mean that etic analysis necessarily results in descriptions that are at variance with participants' sense of appropriateness and historical truth. In many domains, but especially technological processes, emic versions of cultural practices and behavior stream events [the native’s claims] correspond closely to etic versions of the same [the scientist’s measurements]. As shown by Allen Johnson in a study of Brazilian peasant farmers (1974), elicited rules for planting crops on particular kinds of lands and elicited descriptions of past planting activity sometimes corresponds closely with etically observed behavior. But as Johnson emphasized, the facts of correspondence or noncorrespondence raise equally important questions: Why are some rules followed while others are broken? Why do some individuals follow rules while others break them? Why are some rules and concepts widely shared while others vary from individual to individual? (Johnson, 1974:100).
Again, notice the contrast. ‘Emic’, in the above quote,
refers to how the native sees the world, and the language she employs. ‘Etic’
refers to how the scientist sees the world, and the language she
employs. For Harris ‘emic’ = what
people say they believe and do, and ‘etic’ = what people actually do as measured by an outside observer.
So, for Harris, if a farmer says he believes rule X about planting crops, that
is ‘emic’; the farmer’s actual behavior, when, for example, he is observed by
a scientist chooses (for example) to
violate that rule, is ‘etic’. But for the question of what rule X itself is
(and this is Pike’s question, and the raison d’être for his emic/etic
distinction), it matters not one whit whether people in fact go about breaking
rule X or not
because, for. Pike, ‘etic’ refers not to what people do but to how
people count what they do. An etic instance of rule X is anything that counts
as satisfying it according to the natives. A native who steals
may obviously count what she did as ‘stealing’ and therefore as breaking the
rule “thou shalt not steal.”Whether or
not I, in point of fact, steal property has no bearing on what kinds of
behaviors I consider to count as ‘stealing
property’. Thus, if Pike was
interested in the native emic unit ‘crop planting rule X’, in what sense would
a violation be ‘etic’? For Pike, an etic instance of rule X is anything that
counts as satisfying it (and violations, by definition, do not satisfy a rule). Her behavior is
irrelevant to Pike’s analysis; what matters is how she counts it.
The point is not that recording what people do, or whether they follow their own expressed rules, is unimportant or uninteresting. The point is simply that this is not what Pike’s emic/etic distinction is about. We may be interested in the degree to which people follow their rules, or to which they behave consistent with their salient concepts (Harris’s question), but this is not the same thing as having established what they understand their rules to be, or what the boundaries of their concepts are (Pike’s question).[2] I hope to have shown that we are faced here with two very different research problems, both of which quite unfortunately use the same two terms of art.
Now, if one wishes to conduct a study of how native
Brazilians parse the domain of ‘race’, one could understandably begin by
thinking that the Pikean emic/etic distinction will be useful for framing the
work ahead. The project will be to look at the Pikean etic variation, and find
out what tokens within that variation natives will call ‘the same’, in order to
determine what the Pikean emic units are. However, if one frames the problem
merely by using the words ‘emic’ and ‘etic’, while foisting a
different distinction on them, the problem will not be framed in the same way
at all. If the distinction used is Harris’s, one will say that etic is
the scientist’s perspectivesystem of categories,
having no necessary relation to the native’s system of categories. ,
while emic is the worldview of
the natives. This distinction may be handy for some purposes, but
it does not usefully frame the methodological task ahead for someone who is
about to engage in the decoding the boundaries of
a native system of categories (which was, in fact, Harris’s project). On
the contrary, if ‘emic’ is to ‘etic’ as the native’s claims are to the worldview
is to the scientist’s measurements, one may conclude that, since
we are after the emic (to the exclusion of the etic!), the proper
methodological route is to privilege the first answer the native gives
without challenge or further investigation. Who is the scientist, after all, to
impose his own ‘etic’ understanding on the native, if the native does not share
it? This stance seems to be directly responsible for Harris’s methodological
pitfalls.
Here are Harris et al (1993:453) explaining the application of his emic/etic distinction to the study of race:
In conducting a racial census, researchers can follow an etic or emic strategy. The most rigorous etic approach would employ biologically valid categories based on clusters of alleles as revealed through genetic testing (e.g. blood types, mitochondrial DNA). (...) In a more practical if less rigorous etic approach, racial categories, can be defined by clusters of phenotypical traits (e.g. skin color, hair form, nose width) by trained observers.
In an emic approach, on the other hand, racial identity
is established by eliciting respondents' categorizations of themselves or of
others using terms that respondents regard as appropriate, whether or not
observers find them to be ambiguous or contradictory.—Harris
For Harris et al an ‘etic’ approach to ‘race’ entails a form of analysis where the natives are not consulted, whereas ‘emic’ is a different form of analysis that requires the researcher to ask the natives about ‘race’, and privilege their answers.[3] The difference between Pike and Harris becomes starkly clear in this passage. Harris calls the studies I have criticized above ‘emic’ because he is recording peoples’ answers, rather than taking some other kind of measurement. However, Pike would say that what Harris obtained is a sample of ‘etic’ variation and, since Harris has not tried to establish where the conceptual cleavages and boundaries are in that variation, he has not yet found a single ‘emic’ unit. That is, in Pike’s terms, Harris has elicited many (etic) behavioral responses, but has not found—yet—any (emic) categories. To see this, it suffices to consider merely that Harris has not sought to establish whether any of the myriad terms his respondents gave are synonyms, or whether any exist in taxonomic inclusion relationships to each other, only one rank-level of which people consider to be ‘race’. Pike’s project, remember, is to find out what people will call ‘the same’, and thus leaving the possibility of synonyms and inclusion relationships unexplored reveals that Harris’s methods do not even attempt Pike’s project, even though, ostensibly, Harris claims to be after the native system of categories.
The above interpretation is bolstered by the fact
that Now, it is interesting that on at
least two recent occasions (Harris et al , 1993;
Byrne & Harris et al,
1995) Harris defines emic/etic for his readers with the historical link to phonetics. He does not cite
Pike despite the fact that it is Pike who made the derivation from phonetics,
and despite having had a public debate with him about the use of the terms
(Headland et al, 1990). Moreover, Harris uses the
definition he has used over the years rather than keep it in line with the
linguistic analogy. If Harris is talking about something else, why keep the
same terms? And why keep an alleged link to phonetics? Probably because the
historical link to phonetics creates the impression that the emic/etic
distinction is useful for doing work that explicates alien category systems (since
that was Pike’s explicit program), and Harris is ostensibly trying to
understand one such system.
Perhaps this is why we fight over words in anthropology:
we hope that words alone will do theory for us. But they can’t and don’t. One
cannot base a study of an alien categorical system on Pike’s terms (much less
only one of them). One has to base it on his linked definitions.
The lexemes ‘etic’ and ‘emic’ are themselves immaterial and have no special
power. It is what they stand for, in Pike’s exposition, that is useful.
Harris’s methodology elicited labels on the basis of appearance, but this is
not enough for establishing the nature or the boundaries of the categories,
nor is it sufficient to determine whether the responses really are all in the
same domain, or at the same taxonomic level. The reason is that it fails to
investigate whether and unless it seeks to establish whether and which which
of those elicited terms count as ‘the same’ in people’s minds.
Anthropologists should be wary of applying their own (Pikean)
emic system—that is, their own culturally-based native system of
categories—to an alien culture, but they should be equally wary of making the
facile assumption that the native system must be different than from their own.
Harris may have been too ready to believe that Brazilian racial categories are
radically different from American ones, and his studies y make s little
effort to prove him wrong.
Cognitive anthropologists need methods that go
beyond mere elicitation. Especially in domains such as natural categories where
appearances underlie first-guess identification rather than conceptual parsing,
elicitations of labels based on appearances (no matter how rigorous and careful
the preparation of elicitation materials) will fall short in our attempts to
map native categorical systems.
I found in Bulgan CumBulgan Sum
that people believe ethnic membership to be, first, strongly a question of
biological origin; second, predictive of behavior despite the fact that a
member may have no exposure to his ethnic group and may not even know himself
to be a member. This suggests that ethnic categories may be essentialized in
the manner of species categories and, like them, not defined by appearances.
These latter serve merely to assist first-guess identification, and therefore ethnic
categories they cannot be studied with methods
that rely solely
or even primarily on identification procedures without delving deeper
into the cognitive models of respondents. Due to the close empirical and theoretical overlap
between racial and ethnic categories, and to the plausibility that from the
cognitive point of view they are indistinguishable, the same admonition applies
to racial categories.
I
have critiqued here what I see as problems with previous research that
attempted to investigate the cognitive
models of race
in Brazil ethnicity by eliciting behavioral
responses to ‘appearance’ stimuli, and have suggested alternative methodology
that is more likely to succeed in the effort to map the conceptual boundaries
in naturalized or essentialized domains. I have also sought to trace the
historical source of the methodological road taken by Harris and colleagues,
which consists in its essence springs from of a theoretical misconception
resulting from using the terms emic/etic with different meanings than those
originally proposed by Kenneth Pike, while at the same time expecting them to
do the same work that, with the original meanings, they did for Pike in terms
of addressing the work that needs to be done to understand an alien category
system.
The most important lesson here is that the cognitive anthropologist needs methods that go beyond mere elicitation. Especially in domains such as natural categories where appearances underlie first-guess identification rather than conceptual parsing, elicitations of labels based on appearances (no matter how rigorous and careful the preparation of elicitation materials) will fall short in our attempts to map native categorical systems.
Armstrong, S., Gleitman, L., & Gleitman, H. (1983). What some concepts might not be. Cognition, 13, 263-308.
Atran, S. (1990). Cognitive foundations of natural history. New York: Cambridge University Press.
Atran, S. (1998). Folk-biology and the anthropology of science: Cognitive universals and cultural particulars. Behavioral and brain sciences, 21, 547-609.
Boyer, P. (1994). The naturalness of religious ideas. Berkeley: University of California Press.
Brown, D. E. (1991). Human
Universals.: McGraw Hill.
Brown, R. A., and G. J. Armelagos. (2001). Apportionment of racial diversity: A review. Evolutionary anthropology 10, 34–40.
Byrne, B., Harris, M., Consorte, J. G., & Lang, J. (1995). What's in a name? The consequences of violating Brazilian emic color-race categories in estimates of social well-being. Journal of anthropological research, 51, 389-397.
Cosmides, L., & Tooby, J. (1994). Origins of domain-specificity: The evolution of functional organization. In L. A. Hirschfeld & S. A. Gelman (Eds.), Mapping the mind: Domain-specificity in cognition and culture (pp. 85-116). New York: Cambridge University Press.
Gelman, S. A. (1988). The development of induction within natural kin and artifact categories. Cognitive psychology,, 20, 65-95.
Gelman, S. A. (1996). Concepts and theories. In R. Gelman & T. Kit-Fong (Eds.), Perceptual and cognitive development (2 ed., pp. 117-150). San Diego: Academic Press, Inc.
Gelman, S. A., & Hirschfeld, L. A. (1999). How biological is essentialism? In D. Medin & S. Atran (Eds.), Folkbiology.: MIT Press.
Gelman, S. A., & Wellman, H. (1991). Insides and essences: Early understandings of the non-obvious. Cognition(38), 213-244.
Gil-White, F. J. (1999). How thick is blood? The plot thickens...: If ethnic actors are primordialists, what remains of the circumstantialist/primordialist controversy? Ethnic and Racial Studies, 22(5), 789-820.
Gil-White, F. J. (2001a). Are ethnic groups 'species' to the human brain?: Essentialism in our cognition of some social categories. In press, Current Anthropology.
Gil-White, F. J. (2001b). The cognition of ethnicity: Native category systems under the field-experimental microscope. In press, Field Methods.
Gil-White, F. J. (2001c). L'evolution culturelle a-t-elle des règles? La Rechérche, Hors-Série No. 5(Avril).
Harris, M. (1970). Referential ambiguity in the calculus of Brazilian racial terms. In N. E. Whitten (Ed.), Afro-American anthropology (pp. 75-85). New York: Free Press.
Harris, M. (1990). Emics and etics revisited. In T. N. Headland & T. N. Pike & M. Harris (Eds.), Emics and etics: The insider/outsider debate (Vol. 7, pp. 48-61). Newbury Park and London: Sage.
Harris, M., Gomes Consorte, J., Lang, J., & Byrne, B. (1993). Who are the whites? Imposed census categories and the racial demography of Brazil. Social forces, 72(2), 451-462.
Harris, M., Gomes Consorte, J., Lang, J., & Byrne, B. (1995). A reply to Telles. Social Forces, 73(4), 1609-1611.
Harris, M., & Kottack, C. (1963). The structural significance of Brazilian racial categories. Sociologia(25).
Headland, T. N. (1990). A dialogue between Kenneth Pike and Marvin Harris on emics and etics. In T. N. Headland & K. L. Pike & M. Harris (Eds.), Emics and etics: The insider/outsider debate (Vol. 7).
Headland, T. N., Pike, K. L., & Harris, M. (Eds.). (1990). Emics and etics: The insider/outsider debate ( Vol. 7).
Hirschfeld, L. (1996). Race in the making: Cognition, culture, and the child's construction of human kinds. Cambridge MA: The MIT Press.
Hogg, M. A., & Abrams, D. (1988). Social motivation, self-esteem and social identity. In D. Abrams & M. A. Hogg (Eds.), Social identity theory: Constructive and critical advances (pp. 28-47). New York: Harvester Wheatsheaf.
Johnson, A. (1974). Ethnoecology and planting practices in a swidden agricultural system. American ethnologist, 1, 87-101.
Kelly, R. (1985). The Nuer conquest: Structure and development of an expansionist system. Ann Arbor: University of Michigan Press.
Lakoff, G. (1987). Women, fire, and dangerous things : what categories reveal about the mind. Chicago: University of Chicago Press.
Markman, E. M. (1989). Categorization and naming in children: Problems of induction. Cambridge, MA: MIT Press, Bradford Books.
Markman, E. M. (1990). Constraints children place on word meanings. Cognitive science, 14(1), 57-77.
Medin, D., & Atran, S. (1999). Folkbiology.: MIT Press.
Medin, D. L., & Ortony, A. (1989). Psychological essentialism. In S. Vosniadou & A. Ortony (Eds.), Similarity and analogical reasoning. Cambridge: Cambridge University Press.
Nisbett, R. E., & Cohen, D. (1996). Culture of honor: The psychology of violence in the South. Boulder, CO: Westview Press.
Pike, T. N. (1990). On the emics and etics of Pike and Harris. In T. N. Headland & T. N. Pike & M. Harris (Eds.), Emics and etics: The insider/outsider debate (Vol. 7, pp. 28-47). Newbury Park and London: Sage.
Richerson, P. J., & Boyd, R. (2000). Climate, culture, and the evolution of cognition. In C. Heyes & L. Huber (Eds.), The evolution of cognition. Cambridge, MA: MIT Press.
Rosch, E. (1978). Principles of categorization. In E. Rosch & B. B. Lloyd (Eds.), Cognition and categorization. Hillsdale, NJ: Lawrence Erlbaum Associates.
Sanjek, R. (1971). Brazilian racial terms: Some aspects of meaning and learning. American anthropologist, 73, 1126-1143.
Silva, N. d. V. (1988). Cor e processo de relizacão socio-economica. In C. Hassenbalg & N. d. V. Silva (Eds.), Estrutura social, mobilidade, e raça.: Editora Revista Dos Tribunais.
Smith, E. E., & Medin, D. L. (1981). Categories and concepts. Cambridge, MA
London, England: Harvard University Press.
Sperber, D. (1996). Explaining culture: A naturalistic approach. Oxford: Blackwell.
Telles, E. E. (1995). Who are the morenas? Social Forces, 73(4), 1609-1611.
Tooby, J., & Cosmides, L. (1992). The psychological foundations of culture. In J. H. Barkow & L. Cosmides & J. Tooby (Eds.), The adapted mind: Evolutionary psychology and the generation of culture. (pp. 627-637). New York and Oxford: Oxford University Press.
Figure 1. The
numbers on the X-axis represent the descen-ding rank-order of the drawings in
terms of number of ‘branco’ responses (Harris’s numbers are different because
he kept his original numbered identifying labels; the order of the drawings is
the same as Harris’s, however). For each drawing, the number of ‘branco’
responses can be found on the Y-axis. The branco curve is a close approximation
of Harris’s smoothed line. For the other curves, Harris provides the actual
data points and they are reproduced here.
[1] Whitlam, J., V. Davies, & M. Harland (1991). Portuguese dictionary: English-Portuguese, Portuguese-English. New York: Harper Collins.
[2] Nevertheless, Johnson’s last quoted question is important to Pike’s distinction. Suppose that the same labeled rule X, encompasses, for individual A, a different universe of actual performances than for individual B. This will sometimes be true. After all, we know that in the domain of speech no two people speak the same idiolect. A speech community is just a group of speakers where the overlap in idiolects is great, and where the overlap between their idiolects and those of people in other communities is small by comparison. In other words, a speech community is a cluster of idiolectic variation. The same is true for the grammar of any other cultural domain. What the ethnographer has to do is, for the domain in question, ascertain the emic system or grammar of each individual in the sample, then measure the degree of overlap between individuals to see if there is reasonable intersubjective agreement. When there is, we may claim to have found the emics of a domain for a particular culture, just as we claim to explain the grammar of a particular dialect. The degree to which a rule is a local ‘cultural’ rule, in this sense, is the quantitative measure of intersubjective overlap in individual emic systems. All of this, of course, using emic/etic according to Pike.
[3] Harris et al seem to suggest that etic and emic
approaches to ‘race’ are two different ways of studying the same thing.
However, when you do what Harris advocates as etic, you are doing one of two
different things: either you are studying the clustering of genetic variation,
or the clustering of phenotypic variation. When you carry out what he calls an
emic analysis, you are studying quite another thing: people’s naming of social
categories. That these things all get called ‘race’ is merely a confusion
resulting from the use of a colloquial term for three different things that
science ought to distinguish. One question has to do with the categories of
social relevance and worth that people divide themselves into, and the
assumptions and reasoning processes that underlie them, which may or may not be
based on phenotypic traits. Another question has to do with the distribution of
phenotypic traits, which may or may not objectively cluster. And finally, we
have the distribution of genetic variation, which may or may not result in
sufficiently distinct and intercorrelated clusters that we would like to call
them ‘races’ the way this word is used for other species in the biological
literature (biologists think this use is not justified in humans; see Boyd
& Silk , 1999; Brown &
Armelagos 2001). Simply using the word ‘race’ for the three questions
does not mean they are the same thing. What Harris calls ‘emic’ and ‘etic’
approaches to ‘race’ are not, in fact, two ways of studying the same thing, but
in fact different questions entirely that are not even being asked of the same
domain.