Behavioral Ecology Advance Access originally published online on November 29, 2006
Behavioral Ecology 2007 18(1):271-275; doi:10.1093/beheco/arl076
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Forum |
Testes size in birds: quality versus quantityassumptions, errors, and estimates
Department of Animal and Plant Sciences, University of Sheffield, Sheffield S10 2TN, UK
Address correspondence to S. Calhim. E-mail: s.calhim{at}sheffield.ac.uk.
Received 31 January 2006; revised 23 June 2006; accepted 25 September 2006.
| INTRODUCTION |
|---|
|
|
|---|
Across a wide range of both invertebrate and vertebrate taxa, relative larger testes are associated with female and/or male promiscuity (e.g., mammals: Short 1979
Estimating testes size is particularly difficult in taxa where the gonads are located internally and show enormous seasonal changes in size as in birds (Lofts and Murton 1973
) and reptiles (James and Shine 1985
). One of the main difficulties is that to gain access to the gonads, one must use invasive approaches, such as dissection or laparotomy (e.g., Wingfield and Farner 1976
), although as technology improves it may be possible to use noninvasive magnetic resonance imaging techniques (Czisch et al. 2001
).
Data on testes size are available from a variety of sources, from detailed field studies of particular species to museum skins. Comparative studies of avian testes size have used different estimation methods, several sources of data, and hitherto unchecked methodological assumptions to compile species-specific testes size estimates. Our aim is not to criticize previous work, which has been pioneering in its attempts to obtain these estimates for a wide range of avian taxa, but rather to 1) demonstrate how and why the reliability and accuracy of those estimates differ both across and within studies and 2) point out how the quality of avian testes size data might affect the results obtained when larger, low-quality data sets are used in comparative analyses.
All the statistical and simulation analyses were conducted using R v.2.0.1 (R Development Core Team 2004) and (S)MATR (Falster et al. 2003
) computer software packages. Results are presented as means with standard errors, unless specified otherwise.
| VARIATION IN ESTIMATES |
|---|
|
|
|---|
In a careful revision of the quality of the available estimates of mass of both testes (combined testes mass, hereafter referred to as CTM) from all currently available data sets (Cartar 1985
Intrinsic assumptions
Because detailed studies of natural testes size cycles in wild birds are relatively rare, most currently available data have been compiled from museum collections (e.g., 75% of the Dunn et al. [2001]
data set). Museum data generally consist of linear dimensions (length and/or width) of at least one testis (often only the larger of the pair) of males caught from diverse geographic locations and throughout the year. The use of these sources must therefore be made with caution because CTM estimates are obtained through indirect calculation from linear dimensions and assuming the use of a representative sample of males. Previous authors assumed such data to be reliable on the basis of several untested assumptions (Table 1). We checked these assumptions using a data set of 45 species for which we had detailed testicular dimensions (linear and wet mass) and examples from the literature (see supplementary material). It is difficult to assess quantitatively the magnitude of the errors associated with the failure to meet each assumption (see Table 1), but it is clear that ignoring them will clearly lead to erroneous CTM estimates.
|
Estimating "peak" CTM and sources of data
The seasonal change in CTM generally takes the form of either an inverted U-shaped curve in seasonal breeders or a series of discrete multiple peaks in nonseasonal breeders (Murton and Westwood 1977
The Means method, used by Møller (1991)
, Stutchbury and Morton (1995)
, and Dunn et al. (2001)
, estimates peak CTM by averaging CTM across individuals sampled within a predefined "breeding season." Intermale differences in breeding cycle will result in an underestimation of that species' CTM, through the inclusion of individuals sampled during the growth and regression phases of their testicular cycle, despite all being within the predefined calendar date limits of the species' breeding season. Møller (1991)
defined breeding season as the period from 2 weeks preceding the start of egg laying until the last egg of the last clutch is laid. This definition is probably reasonable for taxa breeding in temperate regions, which tend to be more synchronous than those breeding at lower latitudes (e.g., Stutchbury and Morton 1995
). However, tropical species may lack a clear breeding season (e.g., Ward 1969) or may show marked geographic asynchrony in breeding (Stutchbury and Morton 2001; Moore et al. 2005). As a result, males of the same species are likely to be at different stages of their seasonal testis cycle at any time.
The Polynomial method, first used by Møller and Briskie (1995)
, involves fitting second-order polynomial curves to plots of testis size against calendar date within the breeding season (sensu Møller 1991
). The maxima of the fitted curve are then considered that species' peak testis size. Møller and Briskie (1995)
used this approach to determine testis peak length and peak width separately, which were then converted into CTM using Møller's (1991)
formula. However, it is more logical that the Polynomial method be used on the data for wet (or indirectly calculated) mass rather than on the 2 linear measures independently. The Polynomial method better allows for exclusion of males sampled outside their peak in testicular cycle than the Means method.
We compared the 2 CTM estimation approaches in terms of their accuracy and reliability and also investigated the effect of using these 2 methods on 2 types of data sources: literature on natural testis cycles and museum collections, referred to here as "study" and "museum" sources, respectively. Two different data sets for dark-eyed juncos (Junco hyemalis) were used: the study raw data were kindly provided by Deviche et al. (2000)
and the museum data set for the same species was compiled from museum Web pages (see supplementary material for more details).
To investigate the reliability of the 2 CTM estimation techniques with different sample sizes, we conducted resampling simulations where either an average value (Means method) or the maxima of a polynomial curve fit (Polynomial method) were obtained at increasing sample sizes (minimum n = 5 and n = 10 for each method, respectively; 1000 repeats; nonreplacement method). The resulting bootstrapped estimate and the variance between estimates were computed at every n. The variance in CTM across repeated estimates decreases exponentially with increasing sample size, for both methods and data sources, but these simulations also revealed that sample sizes below 20 or 30 individuals clearly provide inconsistent (i.e., more variable) CTM estimates for both methods. Also, for both Means and Polynomial, the museum data set always provided more variable CTM estimates than the study data set at every n.
For both study and museum data sets, we also tested if restricting the data set in the direction of a presumed peak window affects the accuracy of the Means approach, by sequentially trimming the values on either side of the plot of testes mass against date until a sample of about 20 individuals remained. We averaged CTM across these increasingly small subsets and compared it with a benchmark estimate for the dark-eyed junco (CTM = 0.386 g; this estimate was obtained by applying the Polynomial method to the study data set). The results suggest that the accuracy of the Means method is very dependent on the correct identification of the peak window in testes size: the more restricted the sampling period used is toward this peak, the more similar the estimate obtained from the Means method is to the benchmark CTM. Interestingly, CTM values obtained from the museum data set show the same rate of increasing accuracy with restriction of data set as estimates obtained from the study data set (the slopes of CTM against sample size do not differ significantly: t = 0.02, P = 0.987); however, museum estimates consistently show larger deviances from the benchmark estimate than the study data (different intercepts: t = 20.47, P < 0.001). Using data for dark-eyed juncos and 7 other passerine species for which both a study and a museum CTM data were available (see supplementary material), we found that the Polynomial approach when applied to museum data sets did not result in significantly different CTM estimates from those obtained from study data sets (Wilcoxon signed-rank test: Z = 0.14, P = 0.889, n = 8). In contrast, the Means method applied to museum data significantly underestimated CTM (Z = 2.52, P = 0.012, n = 8) and unpredictably so (range = 5% to 52% of the correct estimate).
In summary, the Means approach is accurate only if one can be sure the males sampled are at their peak: for most museum sources, this is unfeasible. In contrast, the Polynomial method provides more accurate estimates, without the need for such detailed information on the stage of the reproductive cycle of the males used in the sample, although sampling throughout the breeding season is necessary. In short, the Polynomial method is the most reliable approach, particularly when estimating CTM from museum sources.
| DATA SET SIZE VERSUS DATA SET QUALITY |
|---|
|
|
|---|
If we are to robustly test hypotheses relating to relative testes size in birds, it is essential that we obtain reliable and sufficiently large data sets of CTM for use in comparative studies. However, the large data sets available (e.g., n > 1000 in Dunn et al. 2001
Our results show that in 6 out of 10 analyses, the smaller data set yield results that differ significantly from those previously found using a larger, potentially biased data set (Table 2). In one-third of these (2 out of 6), the sample size of the larger data set was sufficient to significantly detect the corresponding EF. We can therefore be confident that these previously reported patterns are not an artifact of errors in testes size estimates. We cannot, however, be equally confident of the validity of the reported associations between relative testes size and the remaining 4 traits: breeding density, clutch size, extrapair paternity, and migratory behavior.
|
| ALLOMETRIC RELATIONSHIPS |
|---|
|
|
|---|
The allometric relationship between testes mass and body mass, obtained from the data sets used in comparative studies, provides the necessary value of residual testes mass in order to decide whether a particular (novel) species has relatively large or relatively small testes (e.g., Sheldon and Birkhead 1994
There were also no significant differences in the allometric relationship between data sets differing in size and quality (see supplementary material). However, the smaller data set provides the most conservative avian testes size allometric relationship, because these estimates are more accurate, and the body size range is virtually the same as in the larger data set. Moreover, we suggest that confidence intervals (CIs) of the slope should also be considered. For instance, a conservative approach for deciding whether a particular species has a relatively large or small testes size would be to consider those species within the area within CIs to have residual testes size of zero; above this area, positive residuals; and below this area, negative residuals. Supplementary material lists the allometric relationships (regression models I and II, with 90%, 95%, and 99% CI) obtained from different data sets of varying quality. We pragmatically suggest that in order to qualitatively assess a particular (novel) species' relative testes size, the model II common allometric slope and its 95% CI obtained from the good-quality data set be used: log CTM (g) = 1.81 + 0.85 (0.780.93) x log body mass (g) (R2 = 0.76, P < 0.0001).
| CONCLUSION |
|---|
|
|
|---|
Researchers often face trade-offs between the quality and quantity of data they usea problem exemplified in the study of the evolution of testes size in birds. Avian testes size is difficult to measure, and previous studies have relied on several assumptions and applied different techniques to obtain estimates of avian testes size. We demonstrate how accuracy and reliability of testes mass estimates are influenced by 1) previously unchecked methodological assumptions, 2) the estimation approach used, and 3) the quality of the source of testes size data.
A review of the currently available avian testes mass estimates showed that reliable values are scarce (11% of the 1044 species) because the majority of estimates have been obtained indirectly (from linear dimensions) from the more convenient and widely available museum skin tags, using a method that averages across individuals (Means). Museum collections have allowed researchers to obtain testes size data for about 10% of all avian taxa. However, these large data sets do not compensate for the poor quality of data that museum specimens generate if used uncritically. We recommend the use of more stringent methodological criteria for estimating meaningful species-specific testes sizes. Estimates should be obtained directly as wet mass of both testes, from a sample of at least 30 adult males from a breeding population from a specific geographic location. This sample must exclude juveniles, nonbreeders, and first-year breeding males, and body mass data should originate from the same source. In order to obtain reliable estimates of a species' peak testes size, the Polynomial approach should always be used whenever 1) sampling spans the breeding season, 2) sample sizes are sufficient (n
30), but 3) details of the birds' breeding stage is unknown. We recommend that the Means method not be used on museum sources because it is particularly prone to errors.
Testing evolutionary hypotheses relating to sexual selection relies heavily on accurate measures of reproductive investment and/or levels of sperm competition, such as relative testes size. Until now, our understanding of these patterns across avian taxa has been based on the results of comparative studies that have employed large data sets. The fact that we have identified serious errors and biases in these data sets means, first, that the results from these studies should be treated with caution and, second, there is no longer any excuse for their uncritical use. Large samples have greater statistical power, but unless the data are of high quality, they merely create an illusion of greater precision. For this particular trade-off, more is certainly not better.
| SUPPLEMENTARY MATERIAL |
|---|
|
|
|---|
Supplementary material can be found at http://www.beheco.oxfordjournals.org/.
| ACKNOWLEDGEMENTS |
|---|
We thank S. Nakagawa and J. Hadfield for statistical advice; I.P.F. Owens, T.E. Pitcher, P.O. Dunn, B.J.M. Stutchbury, A.P. Møller, J.M. Briskie, R. Montgomerie, and S. Nakagawa for valuable comments on the manuscript; and P. Deviche, T.E. Pitcher, and P.O. Dunn for the use of their unpublished data. S.C. was supported by a doctorate research grant from the Portuguese Science and Technology Foundation (FCT, Portugal, SFRH/BD/10040/2003).
| REFERENCES |
|---|
|
|
|---|
Balshine S, Leach BJ, Neat F, Werner NY, Montgomerie R. (2001) Sperm size of African cichlids in relation to sperm competition. Behav Ecol 12:726731.
Birkhead TR and Møller AP. (1992) Sperm competition in birds(Academic Press, London).
Briskie JV and Montgomerie R. (1992) Sperm size and sperm competition in birds. Proc R Soc Lond B Biol Sci 247:8995.[Medline]
Byrne PG, Roberts JD, Simmons LW. (2002) Sperm competition selects for increased testes mass in Australian frogs. J Evol Biol 15:347355.[CrossRef][Web of Science]
Cartar RV. (1985) Testis size in sandpipers. Naturwissenschaften 72:157158.[CrossRef][Web of Science]
Cohen J. (1988) Statistical power analysis for the behavioral sciences(Erlbaum, Hillsdale (NJ)).
Czisch M, Coppack T, Berthold P, Auer DP. (2001) In vivo magnetic resonance imaging of the reproductive organs in a passerine bird species. J Avian Biol 32:278281.[CrossRef]
Deviche P, Wingfield JC, Sharp PJ. (2000) Year-class differences in the reproductive system, plasma prolactin and corticosterone concentrations and onset of prebasic molt in male dark-eyed juncos (Junco hyemalis) during the breeding period. Gen Comp Endocrinol 118:425435.[CrossRef][Web of Science][Medline]
Dunn PO, Whittingham LA, Pitcher TE. (2001) Mating systems, sperm competition, and the evolution of sexual dimorphism in birds. Evolution 55:161175.[CrossRef][Web of Science][Medline]
Falster DS, Warton DI, Wright IJ. 2003. (S)MATR: standardised major axis tests and routines. Version 1.0. Available from: http://www.bio.mq.edu.au/ecology/SMATR. Accessed 2005.
Faul F and Erdfelder E. (1992) GPOWER: a priori, post-hoc, and compromise power analyses for MS-DOS(Bonn University, Department of Psychology, Bonn (FRG)).
Felsenstein J. (1985) Phylogenies and the comparative method. Am Nat 125:115.
Frith AJ, Carpenter SM, Braithwaite LW. (1976) Sexual cycles of pigeons in arid and semiarid Australia. Aust J Zool 24:331351.[CrossRef]
Gage MJG. (1994) Association between body size, mating pattern, testis size and sperm lengths across butterflies. Proc R Soc Lond B Biol Sci 258:247254.
Garamszegi LZ, Eens M, Hurtrez-Boussès S, Møller AP. (2005) Testosterone, testes size, and mating success in birds: a comparative study. Horm Behav 47:389409.[CrossRef][Medline]
Harcourt AH, Harvey PH, Larson SG, Short RV. (1981) Testis weight, body weight and breeding system in primates. Nature 293:5557.[CrossRef][Medline]
Harvey PH and Harcourt AH. (1984) Sperm competition, testes size, and breeding system in primates. In Smith RL (Ed.). Sperm competition and the evolution of animal mating systems(Academic Press, Orlando (FL)) pp. 589600.
Heske EJ and Ostfield RS. (1990) Sexual dimorphism in size, relative size of testes, and mating systems in North American voles. J Mammal 71:510519.[CrossRef]
Hosken DJ. (1997) Sperm competition in bats. Proc R Soc Lond B Biol Sci 264:385392.[Medline]
Hosken DJ. (1998) Testes mass in megachiropteran bats varies in accordance with sperm competition theory. Behav Ecol Sociobiol 44:169177.[CrossRef][Web of Science]
James C and Shine R. (1985) The seasonal timing of reproduction: a tropical comparison in Australian lizards. Oecologia 67:464474.[CrossRef][Web of Science]
Jennions MD and Passmore NI. (1993) Sperm competition in frogs: testis size and a sterile male experiment on Chiromantis xerampelina (Rhacophoridae). Biol J Linn Soc 50:211220.[CrossRef]
Kenagy GJ and Trombulak SC. (1986) Size and function of mammalian testes in relation to body size. J Mammal 67:122.[CrossRef][Web of Science]
Lofts B and Murton RK. (1973) Reproduction in birds. In Farner DS, King JR, Parkes KC (Eds.). Avian biology(Academic Press, London) pp. 1107.
McFarlane RW. (1963) The taxonomic significance of avian sperm. Proc XIII Int Ornithol Congr 11:91102.
Møller AP. (1991) Sperm competition, sperm depletion, paternal care, and relative testis size in birds. Am Nat 137:882906.[CrossRef][Web of Science]
Møller AP and Briskie JV. (1995) Extra-pair paternity, sperm competition and the evolution of testis size in birds. Behav Ecol Sociobiol 36:357365.[CrossRef][Web of Science]
Moore IT, Bonier F, Wingfield JC. (2005) Reproductive asynchrony and population divergence between two tropical bird populations. Behav Ecol 16:755762.
Morrow EH, Pitcher TE, Arnqvist G. (2003) No evidence that sexual selection is an engine of speciation in birds. Ecol Lett 6:228234.[CrossRef][Web of Science]
Murton RK and Westwood NJ. (1977) Avian breeding cycles(Clarendon Press, Oxford).
Olsson M and Madsen T. (1998) Sexual selection and sperm competition in reptiles. In Birkhead TR and Møller AP (Eds.). Sperm competition and sexual selection(Academic Press, London) pp. 503577.
Parker GA. (1970) Sperm competition and its evolutionary consequences in the insects. Biol Rev 45:525567.[CrossRef]
Pitcher TE, Dunn PO, Whittingham LA. (2005) Sperm competition and the evolution of testes size in birds. J Evol Biol 18:557567.[CrossRef][Web of Science][Medline]
Pitnick S. (1996) Investment in testes and the cost of making long sperm in Drosophila. Am Nat 148:5780.
Poulin R and Morand S. (2000) Testes size, body size and malemale competition in acanthocephalan parasites. J Zool 250:551558.
R Development Core Team. (2004) R: a language and environment for statistical computing(R Foundation for Statistical Computing, Vienna, Austria) (www.R-project.org.).
Sheldon BC and Birkhead TR. (1994) Reproductive anatomy of the chaffinch in relation to sperm competition. Condor 96:10991103.[CrossRef][Web of Science]
Short RV. (1979) Sexual selection and its component parts, somatic and genital selection, as illustrated by man and the great apes. Adv Study Behav 9:131158.[CrossRef]
Short RV. (1981) Sexual selection in man and the great apes. In Graham CE (Ed.). Reproductive biology of the great apes(Academic Press, New York) pp. 319341.
Stockley P, Gage MJG, Parker GA, Moller AP. (1997) Sperm competition in fishes: the evolution of testis size and ejaculate characteristics. Am Nat 149:933954.[CrossRef][Web of Science][Medline]
Stutchbury BJ and Morton ES. (1995) The effect of breeding synchrony on extra-pair mating systems in songbirds. Behaviour 132:675690.[CrossRef][Web of Science]
Stutchbury BJ and Morton ES. (2001) Behavioral ecology of tropical birds(Academic Press, New York).
Ward P. (1969) The annual cycle of the yellow-vented bulbul Pycnonotus goiavier in a humid equatorial environment. J Zool 159:2545.
Wingfield JC and Farner DS. (1976) Avian endocrinologyfield studies and methods. Condor 78:570573.[CrossRef][Web of Science]
Witschi E. (1935) The origin of asymmetry in the reproductive system of birds. Am J Anat 56:114119.
Wolfson A. (1954) Sperm storage at lower-than-body temperature outside the body cavity in some passerine birds. Science 120:6871.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
L. Z. Garamszegi, S. Calhim, N. Dochtermann, G. Hegyi, P. L. Hurd, C. Jorgensen, N. Kutsukake, M. J. Lajeunesse, K. A. Pollard, H. Schielzeth, et al. Changing philosophies and tools for statistical inferences in behavioral ecology Behav. Ecol., November 2, 2009; (2009) arp137v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Schradin, A.A. Kinahan, and N. Pillay Cooperative Breeding in Groups of Synchronously Mating Females and Evolution of Large Testes to Avoid Sperm Depletion in African Striped Mice Biol Reprod, July 1, 2009; 81(1): 111 - 117. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Calhim and T. R. Birkhead Intraspecific variation in testis asymmetry in birds: evidence for naturally occurring compensation Proc R Soc B, June 22, 2009; 276(1665): 2279 - 2284. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Fitzpatrick, R. Montgomerie, J. K. Desjardins, K. A. Stiver, N. Kolm, and S. Balshine From the Cover: Female promiscuity promotes the evolution of faster sperm in cichlid fishes PNAS, January 27, 2009; 106(4): 1128 - 1132. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Hegyi, L. Z. Garamszegi, and M. Eens The roles of ecological factors and sexual selection in the evolution of white wing patches in ducks Behav. Ecol., November 1, 2008; 19(6): 1208 - 1216. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



