>
View Item |
Downloads per month over past year
View more statistics
The University of Glasgow is a registered Scottish charity: Registration Number SC004401
Discover the world's research
Study motivation, study design, data collection, analysis, reporting, and interpretation of sex differences, conclusions, acknowledgments.
Janet W Rich-Edwards, Ursula B Kaiser, Grace L Chen, JoAnn E Manson, Jill M Goldstein, Sex and Gender Differences Research Design for Basic, Clinical, and Population Studies: Essentials for Investigators, Endocrine Reviews , Volume 39, Issue 4, August 2018, Pages 424–439, https://doi.org/10.1210/er.2017-00246
A sex- and gender-informed perspective increases rigor, promotes discovery, and expands the relevance of biomedical research. In the current era of accountability to present data for males and females, thoughtful and deliberate methodology can improve study design and inference in sex and gender differences research. We address issues of motivation, subject selection, sample size, data collection, analysis, and interpretation, considering implications for basic, clinical, and population research. In particular, we focus on methods to test sex/gender differences as effect modification or interaction, and discuss why some inferences from sex-stratified data should be viewed with caution. Without careful methodology, the pursuit of sex difference research, despite a mandate from funding agencies, will result in a literature of contradiction. However, given the historic lack of attention to sex differences, the absence of evidence for sex differences is not necessarily evidence of the absence of sex differences. Thoughtfully conceived and conducted sex and gender differences research is needed to drive scientific and therapeutic discovery for all sexes and genders.
A sex- and gender-informed perspective increases rigor, promotes discovery, and expands the relevance of biomedical research
Methods exist to test sex and gender differences as interactions; inference from sex- and gender-stratified data should be viewed with caution
Without careful methodology, the pursuit of sex and gender difference research as a poorly considered mandate will result in a literature of contradiction
However, given the paucity of sex and gender differences research, the absence of evidence for differences is not necessarily evidence of the absence of differences
Many compelling publications have argued why sex and gender should be considered in preclinical, clinical, and population research ( 1–4 ). Both sex (the biological attributes of females and males) and gender (socially constructed roles, behaviors, and identities in a spectrum, including femininity and masculinity) affect molecular and cellular processes, clinical traits, response to treatments, health, and disease ( 1 ). Since 2010, the Canadian Institutes of Health Research has mandated that all grant applicants address whether they had considered sex and/or gender in their applications ( 5 ). In 2014, the European Commission issued the Horizon 2020 guideline, which makes explicit the rules for sex and gender inclusion as elements of European Union grant evaluation and monitoring ( 6 , 7 ). Although the 1993 National Institutes of Health (NIH) Revitalization Act required the inclusion of women in NIH-funded clinical research, it was not until 2015 that the NIH announced policies requiring the consideration of sex as a biological variable in study design, analysis, and reporting ( 1 , 8–10 ). Such mandates to include females are not mere political correctness ( 11 ). A sex-informed and gender-informed perspective is essential to increase rigor, promote discovery, expand the relevance of research, and improve patient care. At the very least, it will allow readers of the scientific literature to critically assess the validity of what they read.
Investigators who wish to—or now find themselves required to—include both sexes in their studies are faced with a number of methodological questions, including issues of motivation, subject selection, sample size, data collection, analysis, and interpretation. We provide an overview of these issues in this review as they pertain to basic, clinical, and population research ( Table 1 ). This review builds on earlier discussions of sex differences research methodology ( 11–18 ) in several ways: we consider gender as well as sex differences; we examine the entire research process, from motivation to analysis and presentation; and we discuss nuances of statistical design and interpretation, particularly how to plan robust tests of sex or gender interactions that can help minimize statistical artifacts. Rather than assume ubiquitous sex and gender differences in biology, health, and disease, we propose methods and interpretation that will increase the likelihood of detecting true differences where they exist.
Methodological Considerations in Investigations of Sex and Gender Differences
Research Step . | Best Practices . |
---|---|
Motivation | Consider known sex differences in disease incidence, prevalence, and survival. |
Review existing literature on sex and gender differences, alert to the fact that many hypotheses have not been well tested. Read carefully to consider likelihood of false-positives (especially in context of multiple testing) and false-negatives (especially where statistical power is low). | |
Apply a life course perspective to consider the timing of exposures that might interact with sex and gender in specific developmental windows. | |
Subject selection | Consider sex-specific age incidence of disease to maximize statistical power. |
Consider reproductive stages and cycles, particularly where they may modify the impact of the main exposure being investigated. | |
Consider the impact of gendered social environment for the distribution of factors that may interact with the main exposure. | |
For basic and preclinical studies, review options for classical gonadectomy, knockouts, or four-core genotype experiments. | |
Consider whether sex of cell lines is known, relevant, and generalizable. | |
Randomization (if applicable) | In smaller studies, stratified randomization by sex or gender will ensure balance, even if different numbers of males and females are included. |
Sample size | True tests of sex differences need to be large enough to test interaction between sex and the main exposure or treatment; such tests typically require several times the sample size to be adequately powered, compared with studies of main effects. |
Studies to small to detect interaction can still report the main effects of the exposure or treatment by sex; however, they cannot claim to have tested a sex difference. Be alert to the risk of false-negatives in underpowered sex strata. | |
Studies too small to detect even the main effects of sex can provide sex-specific data to generate hypotheses or contribute to meta-analyses of sex differences. | |
“Big data” studies, where the variable of sex is often available, need to be conducted thoughtfully to avoid contributing false-positives to the sex difference literature. | |
Data collection | Consider sex and gender differences in disease presentation. |
Consider whether exposures mean the same thing in both sexes and genders. | |
Be aware of sex and gender differences in pharmacokinetics and pharmacodynamics; the same dose may have different impact in males and females or may vary by body size. | |
Collect data on exogenous hormones: contraceptives, menopausal hormone therapy, testosterone, and other steroid use. | |
Consider recording data on reproductive cycle (follicular/luteal), and stage (prepuberty, puberty, pregnancy, lactation, premenopause and postmenopause). | |
Collect data on influential covariates that may vary by sex and gender in the study population. | |
Analysis, reporting, and interpretation | Prespecify tests of sex differences to reduce type I error. |
Account for confounding by factors associated with sex and gender. | |
Investigate intermediate “pathway” variables to understand apparent sex differences. | |
Admit when sex differences were tested as exploratory analyses. | |
Make opportunities to replicate sex difference findings. | |
Interpret apparent sex and gender differences in the light of biological plausibility and social context. |
Research Step . | Best Practices . |
---|---|
Motivation | Consider known sex differences in disease incidence, prevalence, and survival. |
Review existing literature on sex and gender differences, alert to the fact that many hypotheses have not been well tested. Read carefully to consider likelihood of false-positives (especially in context of multiple testing) and false-negatives (especially where statistical power is low). | |
Apply a life course perspective to consider the timing of exposures that might interact with sex and gender in specific developmental windows. | |
Subject selection | Consider sex-specific age incidence of disease to maximize statistical power. |
Consider reproductive stages and cycles, particularly where they may modify the impact of the main exposure being investigated. | |
Consider the impact of gendered social environment for the distribution of factors that may interact with the main exposure. | |
For basic and preclinical studies, review options for classical gonadectomy, knockouts, or four-core genotype experiments. | |
Consider whether sex of cell lines is known, relevant, and generalizable. | |
Randomization (if applicable) | In smaller studies, stratified randomization by sex or gender will ensure balance, even if different numbers of males and females are included. |
Sample size | True tests of sex differences need to be large enough to test interaction between sex and the main exposure or treatment; such tests typically require several times the sample size to be adequately powered, compared with studies of main effects. |
Studies to small to detect interaction can still report the main effects of the exposure or treatment by sex; however, they cannot claim to have tested a sex difference. Be alert to the risk of false-negatives in underpowered sex strata. | |
Studies too small to detect even the main effects of sex can provide sex-specific data to generate hypotheses or contribute to meta-analyses of sex differences. | |
“Big data” studies, where the variable of sex is often available, need to be conducted thoughtfully to avoid contributing false-positives to the sex difference literature. | |
Data collection | Consider sex and gender differences in disease presentation. |
Consider whether exposures mean the same thing in both sexes and genders. | |
Be aware of sex and gender differences in pharmacokinetics and pharmacodynamics; the same dose may have different impact in males and females or may vary by body size. | |
Collect data on exogenous hormones: contraceptives, menopausal hormone therapy, testosterone, and other steroid use. | |
Consider recording data on reproductive cycle (follicular/luteal), and stage (prepuberty, puberty, pregnancy, lactation, premenopause and postmenopause). | |
Collect data on influential covariates that may vary by sex and gender in the study population. | |
Analysis, reporting, and interpretation | Prespecify tests of sex differences to reduce type I error. |
Account for confounding by factors associated with sex and gender. | |
Investigate intermediate “pathway” variables to understand apparent sex differences. | |
Admit when sex differences were tested as exploratory analyses. | |
Make opportunities to replicate sex difference findings. | |
Interpret apparent sex and gender differences in the light of biological plausibility and social context. |
There is ample evidence of sex differences—at the level of the cell, organism, and population—to motivate sex differences research. Sex chromosomes encode sexual differentiation through three mechanisms: (1) presence of Y genes; (2) increased dose of X genes in XX vs XY cells; and (3) X chromosome inactivation and imprinting ( 12 ). These primary chromosomal differences lead to sexual differentiation and the somatic and gonadal expressions of sex ( 19 ). The resulting “sexome” produces differences in all organ systems and across the lifespan, influencing how our bodies interact with the environment to determine health ( 20 ). The sex-informed framework considers sex differences in anatomy and physiology, understood within a lifespan perspective of sensitive periods of fetal and childhood development, differential pace and timing of puberty, reproductive events, and senescence. This is critical given that timing is everything when it comes to identifying developmental sex effects ( 14 , 21 , 22 ). Furthermore, sex differences in treatment abound: pharmacokinetics and pharmacodynamics of medications often vary by sex, as may effects of other treatment modalities ( 23 ).
Gender, too, is a determinant of health, influencing the physical and social environments to which individuals are exposed, their access to resources that affect health, their agency to seek health care and receive treatment, and the equitability of research that drives medical discovery ( 14 , 17 , 24–26 ).
This sex- and gender-informed perspective is necessitated by widespread differences in disease incidence, prevalence, and survival that have been reviewed elsewhere ( 2 , 26–28 ). There are sex and gender differences in symptoms and clinical presentations of illness, reliability of diagnostic tests, and response to treatment. There is “sex bias’” down to the level of epigenetic marking and gene expression ( 28 ). In short, the rigor of research depends on researchers’ understanding of the ways in which sex and gender influence the biologic systems they study.
Investigators seeking to construct a sex- and gender-informed framework for their research may be disappointed by a lack of systematic evidence regarding sex and gender differences in the literature. There is a particular dearth of true gender-difference studies; in fact, literature searches on “gender differences” largely turn up studies on sex differences that have used the term “gender” to refer to biologic sex. The historic neglect of women in clinical studies and the sex of animals and cells in basic research should be kept in mind when gathering evidence of sex and gender differences. Although data to interrogate sex differences may exist in some studies, they have yet to be examined. In other cases, sex-informed questions have yet to be posed. Furthermore, as argued below, the proliferation of ill-considered and often unplanned sex difference inquiries leads to a literature of contradictions. Thus, the absence of evidence for sex differences is not necessarily evidence of the absence of sex differences.
In most cases, the choice of overarching study design, whether experimental or observational, is little affected by considerations of sex and gender. Exceptions to this are experiments precluded by ethical considerations, such as inclusion of pregnant women for trials of potentially teratogenic drugs. However, nearly every other feature of study design necessitates a sex-informed perspective, including subject selection, randomization, sample size, and data collection.
Inclusion of both sexes is more nuanced than deciding that the sample should be equally divided by sex. Sex-specific age incidence of disease, reproductive stage, reproductive cycle, and environment need to be considered to optimize the validity, generalizability, and efficiency of a study sample. More often than not, investigators have to compromise between competing goals of validity (by narrowing subject selection to increase the likelihood that findings are true for a specific population) and generalizability (by widening subject selection to make broad inference at the risk of overgeneralizing across true differences between groups.) There are also compromises between large scientific goals and restricted available funds. Such trade-offs are best made as choices informed by already known sex and gender differences. The most efficient subject selection will pick the minimum number of each sex or gender necessary to make valid inferences about sex and gender differences; a 50/50 split between males and females may not be the most efficient, as discussed below.
Sex differences in incidence and age-incidence trajectories are important considerations in subject selection. For example, at ages 55 to 64 men have more than double the rate of coronary heart disease (CHD) of women. By ages 85 to 94, male CHD rates are only 10% higher than those of females ( 20 ). Thus, an investigator wishing to enroll a cohort of 50-year-olds to study CHD incidence will need to enroll two to three times as many women as men to ensure equivalent statistical power, or consider selecting older women. For example, the Vitamin D and Omega-3 Trial study of dietary supplements to reduce heart disease, stroke, and cancer includes women aged 55 or older and men aged 50 or older to account for the later onset of disease in women ( 29 ). Sex differences in disease incidence exist in animals as well. For example, in the nonobese diabetic mouse, diabetes is more prevalent in females, so that more male mice must be included to yield the same number of affected animals of each sex ( 30 ).
Females outlive males in most vertebrate species ( 31 ). In mammals, the heterogametic (XY) sex may have a shorter lifespan because of the unguarded expression of harmful recessive alleles on the Y sex chromosome. Similarly, the homogametic (XX) sex may be protected by the stochastic X-inactivation that creates mosaics of females; although female neonates are a 1:1 mosaic of maternal and paternal allele expression, over time that ratio becomes skewed to favor the cellular population whose active X presumably confers a survival advantage ( 32–35 ).
Sex differences in the rate of aging and the incidence of disease onset are reflected at the cellular level. For example, there are sex differences in the length of telomeres, noncoding DNA sequences that cap and protect chromosomes, the length of which are correlated with longevity. Although similar at birth, male telomeres shorten faster during the lifespan than do female telomeres ( 35 ). This difference could be the result of sex or gender; most likely, it is a combination of biological sex differences and gendered experiences (such as smoking) ( 36 ). Similarly, although stem cell populations decline with aging, this loss is earlier and more rapid in male than in female mice ( 37 ). Methylation patterns also differ between the sexes, likely influencing DNA expression over the life course ( 38 , 39 ). As research further clarifies sex-specific or sex-dependent mechanisms of senescence, investigators may want to consider sex differences in the cellular age and methylation patterns of their subjects, be they cells, animals, or people.
All animals, regardless of sex or species, go through a process of reproductive maturation whose timing, duration, and outcome are subject to physical and social cues from the environment. In mammals, puberty involves sex-specific, but variable, changes in central neural systems, gonadal steroid production, and the emergence of secondary sexual characteristics, including behaviors. When a study investigates adolescence or young adulthood, accounting for sex differences in the pace and timing of puberty will be critical for identifying sex effects ( 14 ).
Mature mammals of both sexes have variations in gonadal steroid levels that may affect subject selection. In males, testosterone levels have circadian and perhaps seasonal variations and vary with age, physical activity, and energy homeostasis ( 40 , 41 ). Reproductive age females have menstrual or estrous cycles. On top of natural variability, women may use hormonal contraceptives or menopausal hormone therapy; many men use exogenous androgens and anabolic steroids. These factors are important in subject selection if an investigator wants to understand how the exposure–outcome associations under study are impacted by sex hormones. Researchers may decide to include a representative range of reproductive phases or cycles. For example, cyclical patterns of DNA synthesis and rates of cell division and death would not have been discovered if females in different cycle phases had not been studied ( 42 , 43 ). The knowledge that natural killer cell activity peaks during the luteal phase came from studies of cycling women ( 44 ). Understanding of the roles of neurokinin B and kisspeptin in reproduction has been facilitated by studying male and female animals at varying reproductive stages, with and without gonadectomy ( 45 ).
Sex differences in physiology and behavior have been observed even in the prepubertal and peripubertal periods, before the pubertal activation of the hypothalamic–pituitary–gonadal axis and production of gonadal sex steroid hormones. These prepubertal sex differences have been largely attributed to the effects of prenatal and perinatal activity of the hypothalamic–pituitary–gonadal axis and resultant sex steroid hormone production and actions. Among the best described effects are the so-called activational and organizational effects of gonadal hormones on brain development ( 46 ). The first robust sex difference described in the mammalian brain was the sexually dimorphic nucleus of the preoptic area ( 47 ). More recently, a sexually dimorphic population of kisspeptin neurons was identified that is present in higher numbers in the anteroventral periventricular nucleus in prepubertal females than in males, to which the sexually dimorphic preovulatory luteinizing hormone surge that occurs in adult females but not males is attributed ( 48 ).
Thus, to the extent that hormone levels affect study outcomes, researchers may need to examine subjects who are premenopausal or postmenopausal, in the follicular or luteal phase, and with or without hysterectomy or gonadectomy. Including or excluding participants using hormonal therapies, such as contraceptives and female and male hormone replacement or suppression therapies, is another potentially important design choice. To fully capture between-sex variability, it may be of use to compare men to two or more groups of women. For example, a study of brain activity in the stress response circuitry found few differences between healthy men and women in the early follicular phase, but striking differences between men and the same women at midcycle ( 49 ). Furthermore, sex differences in brain activity in memory circuitry were statistically significant in premenopausal and perimenopausal women, but attenuated in postmenopausal women compared with men ( 50 ). To capture within-sex variability, studies compare the same females at different cyclical stages, perhaps in crossover fashion.
Note that the effects of sex steroid hormones extend beyond estradiol and testosterone. There are multiple types of estrogens produced by the ovaries and other tissues, as well as multiple androgens beyond testosterone. Progesterone levels also need to be considered. Furthermore, there is target tissue specificity in the actions of estrogens, which can be attributed to tissue-specific expression patterns of estrogen receptors (ERs), including ER α , ER β , and estrogen membrane receptors such as membrane ER α and the G protein–coupled receptor GPER1/GPR30 ( 51 , 52 ).
In addition to the multiple ERs, tissue-specific responses to estrogens can occur through the presence of modulating proteins such as coactivators and corepressors, among others. Varying tissue-specific responses are exemplified by the action of synthetic agonists and antagonists such as the selective ER modulators, including tamoxifen, raloxifene, and toremifene. These selective ER modulators are competitive inhibitors of estrogen binding to ERs, with mixed agonist and antagonist activity, depending on the target tissue ( 53 ). For example, tamoxifen is used in the prevention and treatment of breast cancer as an ER antagonist, but it has ER agonist activity in some other tissues such as bone and endometrium. Progesterone also acts through multiple receptors, which are generated as splice variants from a single gene ( 54 ). The actions of testosterone, through the androgen receptor, are modulated at the local tissue level through local activity of the enzyme 5 α -reductase, which catalyzes the formation of the more potent androgen receptor agonist, dihydrotestosterone ( 55 ).
Many determinants of disease, both physical and social, are differentially distributed by gender. Some of these factors may confound experiments if not carefully accounted for in study design and analysis. For example, in many societies, women are more likely to have vitamin D deficiency ( 56 ), affecting multiple tissues and systems, and men are more likely to smoke cigarettes and drink alcohol. Men and women are exposed differentially to types of violence and trauma ( 57–61 ). Such stressors may affect gonadal steroid secretion in a sex- and hormone-dependent fashion ( 12 ). In the case of powerful covariates strongly associated with gender or sex, investigators may want to select participants to ensure these covariates are balanced in male and female samples.
Historical reliance on male animal models ( e.g. , mice, rats) has resulted in incomplete data to guide human subject research in both men and women. Basic studies can complement clinical studies by investigating mechanisms of sex-dependent or sex-specific processes in greater depth by manipulating genotypic and phenotypic sex experimentally ( 12 ). Beyond simply studying both male and female animals as they age naturally, studies can include classic gonadectomy with or without hormone replacement: prenatally and perinatally to address developmental effects; in juvenile animals to study postnatal developmental and differentiation effects; in adults to assess the effects of sex steroid hormones at the time of testing; and in aging animals to study effects of sex steroids in models of aging. Several new genetic and epigenetic animal models have increasing translational validity to represent human ovarian failure and menopause ( 62 ). Some alternative models of menopause or ovarian failure include Foxl2-deficient mice with accelerated rates of decline in ovarian reserve ( 63 ).
Another frequent approach is to study “knockout” mice (or other species) that lack a specific sex steroid receptor. ER α knockout mice have shown that the absence of ER α promotes adiposity in male and female animals and, in turn, the progression of breast cancer in females ( 64 ). Animals with “conditional knockout” or “conditional knockin” of a specific sex steroid receptor can be used to target specific tissues or life stages.
Additionally, targeted mutagenesis can be used to address the role of specific domains or specific functions of a sex steroid receptor. For example, although ER α has traditionally been thought of as a nuclear, ligand-dependent transcription factor acting through estrogen response elements in gene promoters, the molecular mechanisms of action are more complex. Estradiol actions can be mediated by other “nonclassical” ER α pathways: (1) ligand-independent ER α signaling, in which gene activation alters phosphorylation of ERs via second-messenger pathways that affect intracellular kinase and phosphatase activity; (2) rapid, nongenomic effects through a membrane-associated ER; and (3) genomic, estrogen response element–independent signaling, in which ER α regulates genes via protein–protein interaction with other transcription factors, including c-Fos/c-Jun B (AP-1), Sp1, and nuclear factor κ B ( 65 ). For example, as noted above, estradiol is critical to the regulation of energy balance and body weight. In an experiment with female mice, ER α -null mutant mice become obese, with decreased energy expenditure and locomotion, increased adiposity, hyperleptinemia, and altered glucose homeostasis, characteristics similar to the propensity of postmenopausal women to develop obesity and type 2 diabetes. Interestingly, knockin mice that express a mutant ER α that can signal only through a nonclassical pathway ( i.e. , without direct estrogen response element binding) restored the metabolic parameters to normal or near-normal values, including energy expenditure. These findings indicate that nonclassical ER α signaling mediates major effects of estradiol on energy balance, raising the possibility that selective ER α modulators may be developed to reduce the risks of obesity and metabolic disturbances in postmenopausal women ( 66 ).
“The hormonal environment of cultured cells…can affect experimental outcomes.”
Although it is facile to insist that basic researchers use and report on both XX and XY cells in their experiments, this is not always possible ( 11 ). In fact, cell lines are a poor model with which to study sex differences, even when the sex of the lineage is specified. By definition, immortalized cell lines, chosen for their peculiarities and derived from a single organism, may be inherently impossible to cull or create from a second organism of any sex. Even where it is possible to create cell lines from a male and female similar enough to interrogate a particular question, inferences about sex differences cannot reliably be made. As with a clinical study with n = 2, a comparison of a male and a female cell line, because each is derived from a single individual, cannot distinguish sex differences from other genetic, epigenetic, or environmental characteristics of the founding individuals from which they were derived. Cell lines may have sex-dependent features other than the sex chromosome complement, including differences in hormone production or hormone responses related to variation in steroidogenic enzyme expression or expression of sex steroid hormone receptors. There may also be differences in expression of other genes related to imprinting or epigenetic differences. Moreover, each cell line is clonal in origin and has unique characteristics based on the experimental conditions in which it was derived and propagated—even two cell lines derived from the same organism may have different characteristics.
It is more reasonable to request that investigators specify the sex of a cell line used in a study ( i.e. , derived from a male vs female, or XX vs XY in sex chromosome complement), as the sex of many cell lines has been established ( 70 ). However, even this is not always possible, as cell lines can lose their sex chromosome complement over time ( 11 ). Although primary cultures can isolate cells directly from the body, permitting the creation of a small population of male or female cells, the procedure may be technically difficult and time-consuming, and the cells may be short-lived, limited in number, difficult to manipulate, and can change their characteristics over time in culture. Furthermore, Miller et al. ( 14 ) caution that the hormonal environment of cultured cells, including some media, can affect experimental outcomes. Finally, comparisons of isolated male and female cells oversimplify the question of sex, let alone gender, because such cells are removed from the complex interactions with other cells, hormones, neurotransmitters, nutrients, pathogens, and environmental exposures, which themselves vary in living organisms by sex and gender ( 11 ). In such cases, the absence of evidence for sex differences in vitro may well be absence of any evidence at all, a straw man (or woman) of an experiment purportedly about sex.
Experimentalists, particularly those conducting studies with >100 subjects, may wish to randomize the sexes separately to ensure similar distributions of treated and untreated males and females. In preclinical experiments, this is known as a factorial design ( 15 ). Such stratified randomization retains the advantages of standard random allocation, effectively creating a mini-trial within each sex stratum ( 71 ). Stratified randomization can accommodate a study plan with unequal numbers of male and female subjects, especially helpful when men and women join a study at different rates or in different time periods. Stratified randomization can also be used to balance follicular vs luteal phase participants, or any other marker of sex or gender.
Most studies are planned from the outset with a sample size just large enough to afford 80% statistical power to detect the main effect of the primary exposure. Unless preplanned, most studies are underpowered to examine associations separately for males and females. This is particularly true of secondary data analyses of studies never designed to examine subgroup differences. This lack of statistical power to detect sex and gender differences can lead to the premature conclusion that such differences do not exist; in fact, most studies are simply too small to fairly test all but the most pronounced sex and gender differences. In the current era of accountability to analyze and present sex-stratified data, it is worth considering ideal practice and reality with respect to power and sample sizes to detect sex differences. Although most researchers will find that limited samples and funds constrain their ability to investigate sex and gender differences, we will also address the special case of “big data,” where problems may ensue from an abundance of statistical power to detect trivial differences, rather than too little power to detect meaningful differences.
Epidemiologists and clinical researchers are familiar with the concepts of effect modification and interaction, although the terminology may differ between disciplines. However, basic investigators, whose aim is usually to limit all variation other than the exposure under examination, may be less familiar with these issues. “Effect modification” refers to the ability of a third variable (here, sex) to modify or interact with the “main effect” of the exposure (say, treatment) on outcome (usually, disease). For example, the association of diabetes with cardiovascular disease (CVD) is stronger for women than men ( 72 , 73 ); it is said that sex “interacts” with diabetes to cause CVD or that sex “modifies” the diabetes–CVD association.
Although stratifying data by sex to examine the exposure–disease association separately for males and females allows the investigator to eyeball effect modification by sex, such estimation gives no indication of the extent to which any observed sex differences are due to chance. To gauge this likelihood, many researchers test the statistical significance of sex differences by incorporating into their statistical models a (usually multiplicative) “interaction” term that represents the intersection of exposure and sex. For example, if the main effect of treatment is represented as a binary variable (0 if untreated; 1 if treated) and the main effect of sex as a binary variable (0 if male; 1 if female), then an interaction term (treatment × sex) which equals 1 only for treated females will, when modeled with the main effects of treatment and sex, capture the additional increment or decrement in the risk of the outcome that is attributable to both treatment and female sex, that is, the sex difference in the association between treatment and disease. By convention, P values <0.05 for such interaction terms are indicative of a statistically significant sex difference, one that is unlikely due to chance alone. Such tests of effect modification or interaction by sex can (and should) be as easily incorporated into basic research as in clinical and population research. The difficulty is having the statistical power to do so.
The sample size required to detect statistically significant sex differences (interactions by sex) is considerably larger than that required to detect the main effects of treatment or sex alone. Statistical power to detect a sex difference depends on the prevalence of the exposure, outcome, and sex, as well as the strength of the associations between them. Software is freely available to calculate sample sizes to detect interactions ( 74 , 75 ). However, the rule of thumb is that it takes fourfold the sample size to detect an interaction than it does to detect main effects ( i.e. , treatment or sex alone) ( 76 ). Investigators need to take into account differential disease rate by sex and the expected magnitude of the main effect in each sex; statistical power to detect either main effects or a sex interaction may not be optimized by recruiting half women and half men. In planning, investigators may have to make “best guesses” at the magnitude of expected sex differences, based on the literature and biologic understanding. As with any power calculation, it is best to input a range of likely main effects and interactions to evaluate the impact of sample size on the ability to detect a sex interaction.
A study that is large enough to detect a sex interaction, if one exists, represents the “ideal” in sex difference studies. Few studies are planned with the power to detect statistically significant sex differences. Many studies that have attempted to test interactions by sex have been woefully underpowered to do so. Unfortunately, researchers easily forget that an interaction P value >0.05 often says as much about the design and size of the study as it does about the presence or absence of a sex difference.
Even where a study is too small to test for sex interaction effects, it may still have enough statistical power to examine the main effects of exposure within sex strata. This is simple sex stratification to examine exposure–disease associations for each sex. (Does diabetes predict CVD among males? Does diabetes predict CVD among females?) A study may find a statistically significant beneficial impact of treatment on disease among males and fail to find a significant effect of treatment among females (or, in extreme cases, find statistically significant benefits or harms that vary by sex). However, if the study lacks power to test an interaction by sex, investigators cannot claim that they have detected a difference between males and females that meets conventional standards for ruling out chance. As discussed, the detection of a statistically significant interaction by sex is a high bar. However, apparent contrasts in sex-stratified data—differential main effects of treatment by sex—can suggest the presence of sex differences. At the least, they provide a rationale for larger studies powered to detect sex interactions, or incentivize data collection across studies for meta-analyses of interactions by sex ( 15 ).
To plan a study with adequate statistical power to detect main effects by sex is straightforward: simply calculate sample sizes needed to detect main effects in men and women separately (and add them up), taking into account sex differences in rates of disease, expected size of impact of exposure, and, for observational studies, expected prevalence of exposure.
Many studies analyze their data by sex as an afterthought. Such subgroup analyses of main effects stratified by sex are often underpowered, which heightens the risk of type II error, or false-negative results. This is true even when the original analysis, in which all subjects are analyzed together, regardless of sex, reports a statistically significant association of exposure with disease. For example, in a study in which the exposure–disease association approaches statistical significance (say, a 2 standard error difference in outcome between study arms), splitting subjects into two groups of similar size will yield a one in three chance that the association will be sizeable and statistically significant ( P < 0.05) in one group and inconsequential in the other (less than a standard error difference) ( 77 ).
“Defining gender in human studies is both difficult and controversial.”
Studies underpowered to detect even sex-stratified main effects can still make available data and/or analyses stratified by sex, particularly in supplemental material, without making inferences regarding sex differences per se. Such data may serve as preliminary analyses for future studies adequately powered to detect sex differences and may be used in meta-analyses.
We have entered an era in which enormous datasets are increasingly available. Many of them include the variable “sex.” Two cautions are important to emphasize. First, such datasets, while deep in sample size, are often narrow in breadth, lacking the variables (discussed below) helpful to contextualize and understand sex differences. Second, the temptation in very large datasets to stratify by any variable is strong, as it is easy to detect statistically significant interactions, including sex differences, of clinically trivial and meaningless magnitude ( 78 ). Sound motivation to test sex differences, discussed earlier, is essential. So is conservative interpretation of statistically significant findings. To whom much data are given, much common sense is demanded: extra caution needs to be exercised in interpreting studies with enormous statistical power to detect minute differences between subgroups.
Truly sex-informed research is more than just stratifying by sex or gender. Researchers should collect the data to characterize the ways in which exposures, diseases, and contributing environmental factors vary by sex and gender.
Although it can be difficult to determine the sex of subjects in some species, for the most part, the sex of humans and nonhuman subjects in biomedical research is known. Categories of sex include males, females, intersexual individuals born with male and female characteristics, and people who undergo interventions to reassign their sex ( 25 ). In some instances, syndromes resulting from atypical sexual development can complicate categorization of sex ( 79 ).
Defining gender in human studies is both difficult and controversial. Indeed, some have argued that sex and gender are “irreducibly entangled,” and that even the most seemingly straightforward presentation of sex as a biological variable in human studies is inevitably a mix of sex and gender ( 24 , 25 , 80 ). Sociologists Westbrook and Saperstein ( 81 ), observing the tendency of large surveys to conflate sex and gender, call the state of measurement a “conceptual muddle” that is fraught with essentialist treatment of sex and gender as synonymous, obvious, easily determined by others, and unchanging over the life course.
The very concept of gender is subtle, complex, and shifting. It has been suggested that gender comprises at least three distinct, but interrelated components, the “three dimensions of gender” ( 82 ). These include: (1) our physical bodies, how we experience them, and how others interact with them; (2) our gender identity, our internal sense of ourselves as female, male, a blend of both, or neither; and (3) our gender expression, how we present our gender and how society interacts with the gender we present. Note that these dimensions are independent of sexual orientation. We are likely to see new measures of gender emerge; however, at present, there are few studies that have attempted to relate nuanced dimensions of gender to health and disease ( 81 , 83 ).
In the meantime, some researchers have ventured measures of gender that are intended to be distinct from sex ( 84–86 ). For example, several gendered factors correlated with poor health among women have been proposed as proxies for gender influences on health, including income, education, labor force participation, single-headed household, unpaid child and elder care, unpaid housework, political participation, and access to education or health ( 86 ). Particularly problematic has been the identification of proxies for male gender that might influence men’s health. The prevalence of gun ownership, for example, has been proposed ( 84 ). Such measures of gender are often measures of gender inequality. Many times they are based on national or state-level statistics, rather than more granular individual or household data ( 86 ).
Pelletier et al. ( 85 ) have proposed a method to measure individual-level gender as “psychosocial sex,” in contrast to “biological sex.” They argue that, as gender roles and attitudes—components that might comprise a gender index—depend on culture, age, and era, no single gender scoring system is broadly applicable. Rather, a method for defining gender within a study population is a better approach to measure gender. Drawing from extensive questionnaires completed by their study participants, the researchers identified a set of seven variables (including income, hours doing housework, and scores on a sex role inventory survey) that resulted in a continuous gender score ranging from masculine to feminine characteristics. Independent of sex, a high gender score (more feminine characteristics) was associated with increased risk of diabetes, hypertension, and depression and anxiety symptoms ( 85 ). In fact, once gender was accounted for, sex no longer predicted these health outcomes. Although the study was not large enough to exclude a modest interaction between sex and gender ( i.e. , did the gender score predict outcomes more among males or among females?), the authors observed that the higher femininity score appeared to predict outcomes for men as well as for women (Louise Pilote, personal communication). This study was possible only because of the extensive collection of economic and psychosocial covariates related to gender. To the extent possible, studies should be designed to collect data on gender. However, lack of data with which to construct a comprehensive gender measure does not absolve investigators of considering gender in their interpretation of data regarding sex differences.
Is gender relevant to animal studies? If it is hard to measure gender in human beings, it would seem entirely alien to do so in other species. However, a few investigators have attempted to design exposures that mimic human gendered experiences. For example, Shors et al. ( 87 ) developed an animal model (sexual conspecific aggressive response, or SCAR) to examine the effects of sexual aggression on the brain and learned behaviors. Pubescent female rodents are paired with sexually experienced adult males. The female releases high levels of adrenal stress hormones. Her ability to learn, including to learn maternal caring behavior, was suppressed. The authors suggested that such experiments are aimed at understanding how sexual trauma impacts mechanisms that shape the female brain. Although other interpretations of that animal model are possible, studies have reported that women with a history of childhood sexual trauma exhibit changes in brain and associated physiology. Women with a history of childhood sexual trauma, a highly prevalent exposure, have irregularities to cortical and subcortical tissue and long-term alterations to their hypothalamic–pituitary adrenal axis, compared with women without childhood sexual trauma ( 88 , 89 ). Sexual assault occurs to all sexes and genders, but considerably more often to girls and women, and therefore constitutes a gendered exposure ( 58–61 ). National surveys show that physical child abuse is also common, often more so for boys than girls ( 59 , 60 ). Other violent exposures, such as combat casualties and war-time trauma, also have gendered distributions and implications ( 90 , 91 ).
It is essential to capture outcomes in sufficient detail to detect sex and gender differences in disease presentation. The classic example in clinical research is CHD, one of the leading causes of morbidity and mortality for men and women in the United States. Traditionally, myocardial infarction was characterized as the result of obstruction of the large coronary arteries. However, up to one third of women with a myocardial infarction and two thirds of women with chest pain had unobstructed arteries upon angiography ( 92 , 93 ). It is now recognized that myocardial ischemia may result from disease of the coronary microvessels. The Women’s Ischemia Syndrome Evaluation study reported that roughly half of the women with angina and ischemia without coronary artery obstruction evidenced microvascular dysfunction ( 94 ). Instead of the classic chest-crushing sensation of coronary artery obstruction, women with microvessel disease may present with shortness of breath and fatigue, nonspecific symptoms easy to misdiagnose and often dismissed. Thus, an investigator studying CHD in both sexes needs to consider the symptoms and diagnostic tests that will capture the presentation of disease in women and men ( 95 ).
Another example of differential disease presentation by sex is the tendency for prolactinomas to be detected as microadenomas among women, but macroadenomas among men. This results, at least in part, from the earlier detection among women, in whom small elevations in prolactin can cause infertility, menstrual disturbances, and/or galactorrhea. In contrast, in men, prolactinomas may progress to macroadenomas before they become symptomatic with headaches, double vision, or vision loss resulting from the mass of the tumor pressing on neurologic structures in the brain. Although this difference between the sexes is largely attributed to differences in diagnostic timing, the possibility that prolactinomas are more aggressive in men has not been entirely excluded ( 96 , 97 ). In animal models, sex differences in the expression and activity of pituitary transforming growth factor β 1 may contribute to sex differences in prolactinoma incidence ( 98 ).
“The failure to consider exogenous hormone use…may contribute to the lack of reproducibility in many studies.”
The use of exogenous hormones, such as oral contraceptives, menopausal hormone therapy, testosterone, and anabolic steroids, is particularly important to document. Taken systemically, by mouth, injection, or patch, such drugs affect reproductive and nonreproductive systems throughout the body, and they could be important to investigations in which the exposure–disease relationship could be affected by sex hormones. In the United States, sales of testosterone, available as oral medicine, gel, patch, or injection, grew 77% from 2010 to 2013, with 2.3 million prescriptions filled ( 103 ). It is estimated that 2.9 to 4 million Americans, largely men, have used anabolic steroids in their lifetime ( 104 ).
In addition to their direct impact on brain, bone, muscle, metabolism, immune, cardiovascular, and reproductive function ( 105–107 ), reproductive steroids often interact with other drugs. For example, among patients with growth hormone deficiency, women taking oral estrogen require twice as much growth hormone as men or women not taking oral estrogen to achieve the same levels of insulin-like growth factor 1 ( 108 ); current guidelines for treatment of adult growth hormone deficiency now recommend the consideration of estrogen status in dosing ( 109 ). Basic scientists may find it illuminating to vary the levels of exogenous hormone exposures in their experiments to mimic widespread human exposures.
In addition to intentional exogenous hormone exposure, there is an increasing body of literature suggesting that exposure to endocrine disrupting chemicals in the environment may affect human health. For example, phthalates are a nearly ubiquitous class of chemicals used in the manufacturing of household products, including food packaging and personal care products such as cosmetics and nail polish. Exposure to phthalates may depend on occupation and use of personal care products; higher urine concentrations of phthalate metabolites have been reported among women compared with men ( 110 , 111 ). Phthalate exposures have been associated with insulin receptor and glucose oxidation in the Chang liver cell line (unspecified sex) ( 112 ), signs of diabetes and endocrine disruption in female rats ( 113 ), and with insulin resistance and diabetes in men and women ( 111 , 114 , 115 ).
The failure to consider exogenous hormone use, endogenous hormones, and/or markers of hormonal status (such as menopause) may contribute to the lack of reproducibility in many studies. For example, investigators found that the serum concentrations of 68% of 171 serum biomarkers associated with chronic disease were affected by sex, oral contraceptive use, menstrual phase, or menopausal status ( 116 ). They estimated up to 40% false discoveries in biomarkers when sex was ignored and up to 41% false discoveries when oral contraceptive use was ignored. Heeding this caution, investigators may want to collect data on menstrual or estrus phase, menopausal status, use of exogenous hormones, and/or levels of circulating hormones ( 14 ).
There are nuances to asking women to report their menstrual cycle and menopausal status. For example, as the duration of the luteal phase varies less than that of the follicular phase, menstrual cycle timing is best recorded retrospectively from the first day of the next menstrual period ( 117 ). As menstrual cycles may be suppressed or dictated by hormonal contraceptives (including hormonal intrauterine devices), or breastfeeding, it is useful to record these variables when assessing menstrual timing. Menopause may occur naturally or may result from hysterectomy, oophorectomy, or chemotherapy, and it cannot be determined until a year after the last menstrual period. Measurement of menstrual cycle phase and menopausal transition are covered elsewhere ( 117–119 ). Archived biospecimens should include information about such variables ( e.g. , time of blood draw, day of menstrual cycle).
Some factors that vary by sex or gender can influence the exposure–disease association under study, either as confounders (easily mistaken for sex differences) or effect modifiers (covariates that interact with sex to change outcome). Some of these sex-dependent covariates are obvious, such as parity. Other aspects of reproductive history may affect nonreproductive systems under study ( 120 ). For example, history of the hypertensive pregnancy disorder preeclampsia predicts twofold higher risk of CVD in women affected by the disorder ( 121 ). A woman’s history of preeclampsia might modify the impact of an antihypertensive drug. Exposures to exogenous endocrine drugs, such as those administered in the course of assisted reproductive technologies such as in vitro fertilization, might affect systems under study.
The degree to which other exposures, such as cigarette smoking, alcohol use, physical activity, socioeconomic position, caretaking responsibilities, and medication use, vary by sex and gender will depend on the population under study. For example, in some, but not all, cultures and climates, circulating 25-hydroxyvitamin D concentrations may differ considerably between men and women as a result of gender differences in factors such as clothing, time spent outdoors, and supplement use ( 122–124 ); depending on the country, these differences may be equalized by dietary vitamin D intake, particularly of fortified foods. Additionally, as shown in a study in the Netherlands, lower 25-hydroxyvitamin D levels among women may be explained by their higher adiposity levels, a difference that could be attributed either to sex (a biological difference) or gender (as many social determinants of adiposity are gendered) ( 125 ).
Particularly important to consider are sex or gender differences in the distribution of comorbidities that might influence an exposure–disease association. For example, compared with diabetic women, diabetic men have lower prevalence of depression and anxiety, gendered psychosocial factors that impede self-care activity and treatment success ( 126 ). Thus, it would be wise for studies examining sex differences in CVD to account for major depression history, especially when depression is associated with the main exposure under consideration ( 21 ).
Subgroup analyses from studies thoughtfully designed to query sex differences, particularly once replicated, can provide sound evidence of benefit or protection from harm for women and men. Alternatively, post hoc sex difference analyses, devoid of theoretical basis and sound construction, may create more noise than light. A recent analysis of sex differences presented in Cochrane reviews of clinical trials suggested that few met the stringent criteria of documenting statistically significant interactions by sex ( 127 ); this criticism of sex differences research is cautionary. Whether the absence of sex differences reflects fact, indiscriminate testing, lack of sample size, or the decades it takes for sex differences observed in basic or population research to reach clinical testing remains to be seen ( 128 ).
The risk of a blanket mandate to require all studies to stratify all results by sex is that the literature-wide type I error, that is, the risk of detecting false sex differences, will skyrocket. Furthermore, as we increase the size and statistical power of our studies to detect true sex interactions (minimizing type II error), we court the risk of finding sex differences where none exist (type I error). As mentioned earlier, type I error is a particular hazard of a theoretical “big data” analysis.
If we pursue sex difference analysis as a poorly considered mandate, a literature of contradiction will follow. The field of sex differences research risks discredit from unthinking and profligate enthusiasm. How, then, can we encourage sex differences research that is thoughtful, conservative, and consistent over time?
In any subgroup analysis, including sex and gender, tests of interaction should be limited and prespecified in statistical analytic plans ( 129 ). Although this does not guarantee that tests of hypotheses will be well constructed, it does help to protect against post hoc fishing and data-derived hypothesis testing (themselves self-fulfilling prophecies). “Surprise” subgroup findings should be presented as such, and interpreted with caution—the basis for further study, not for instant translation to clinic or policy. There seems almost a reflexive tendency of researchers to view male and female as the fundamental dichotomy of the biologic world ( 24 ). We need to approach the question of sex differences with curiosity and skepticism, rather than unquestioning assumption.
In creating an a priori hypothesis, it is best practice to prespecify the expected direction and magnitude of the sex difference. Should there be subgroups within sex, such as nulliparous vs parous, or premenopausal vs postmenopausal? Careful a priori hypothesizing is important for observational studies, experiments, and trials, and it serves to maintain scientific transparency. In large studies, some statistically significant sex differences may arise by chance. Thus, prespecifying the form of the expected interaction helps guard against indiscriminate post hoc scrambling.
“…Researchers will be collecting and analyzing data by sex, but the onus is on investigators to address this adequately and at all levels of basic, clinical, and population research.”
Some variables may be intermediates between sex or gender and the outcomes under study, and their treatment in analysis deserves special consideration. For example, sex and gender are two of many factors that determine body size and composition. Not only are body size and adiposity determined by sex steroids, sex steroid receptors, adipokines, and other differences between males and females ( 134 ), but gender differences in physical activity also affect body size and composition ( 135 ). [Interestingly, there are also sex differences in the voluntary physical activity of rodents: females exercise more than males. ( 135 )]Whether to control for body size in studies of sex and gender effects is a nuanced decision.
For example, in a study of differences between men and women in the impact of Ambien and impaired driving, should the investigator adjust for body size in evaluating sex or gender differences? Alternatively, body size is (at least partially) a product of sex and gender, and adjusting for it might obscure the most important pathway (body size) through which sex and gender impact the metabolism of the drug. Additionally, adjustment for body size might reveal other mechanisms (some of which are discussed in “Sex and gender differences in drug exposures and metabolism” above) through which sex or gender affect drug clearance. Statistical methods can help segregate or ‘decompose’ the impact of such intermediate variables, also known as mediators ( 136 , 137 ).
The critical importance of replication has been addressed by others ( 78 ). This is particularly true in a scientific climate that encourages, or even mandates, subgroup analysis.
As with any study finding, apparent sex differences (or lack thereof) need to be interpreted with caution ( 138 ). The magnitude and direction of apparent sex effects need to be placed in the context of prior knowledge. Biological mechanisms, hopefully outlined a priori , need to be discussed. The adequacy of the study to rule out bias, confounding, and chance needs to be frankly addressed. Even statistically significant sex differences may be due to chance or bias, instead of true heterogeneity of exposure–disease associations or of treatment effects ( 129 ).
This is especially true when interpreting main effects stratified by sex, where a study lacks statistical power to test interaction by sex. In this case, the play of chance is often overlooked and findings are overinterpreted. As an example, Assman et al. ( 139 ) cite a subgroup analysis of a trial that followed myocardial infarction survivors ( 140 ). The investigators, laudably, considered the differential impact of treatment on the mortality of men and women, including both sexes and prespecifying stratification by sex in the analyses. However, they did not plan to test an interaction by sex. Their intervention had no overall impact on mortality. There was no association of treatment with cardiac mortality among men ( P = 0.94). However, in women, the authors observed what they interpreted as a “possible harmful impact of the intervention” on women’s cardiac mortality ( P = 0.06). Later, Assman et al. used their data to calculate a proper test of interaction by sex, which revealed no statistically significant evidence of an interaction between treatment and sex ( P for interaction = 0.21), indicating that chance could well explain the seeming sex effect. Thus, despite the disparate associations and P values in the two sex strata, the study was simply too small to test whether the impact of the treatment on cardiac mortality truly differed by sex.
Most importantly, surprise subgroup findings need to be acknowledged as such. Sex differences that were discovered as the result of post hoc poking around in the data need to be treated with caution until they are replicated as pre hoc tests in other studies. In this event, supplemental tables stratified by sex, data repositories, and meta-analyses may extend the impact of any single study. In other words, investigators can make available sex-stratified data to spur the generation of new hypotheses, without presenting sex-stratified analyses that overreach the intent and design of their original study.
New governmental mandates mean that researchers will be collecting and analyzing data by sex, but the onus is on investigators to address this adequately and at all levels of basic, clinical, and population research. If we fail, the “noise” created by multiple testing across all our datasets may drown out the signal of true sex differences. Furthermore, in human studies it is important to investigate the impact of both sex and gender to illuminate fundamental, modifiable causes of disease and avoid a reflexive attribution of seeming sex differences solely to biology. If we address these design and analytic issues skillfully, then we have the chance for new insights for men and women that will be critical for the next generation of scientific and therapeutic discoveries in this age of precision medicine.
coronary heart disease
cardiovascular disease
estrogen receptor
National Institutes of Health
Disclosure Summary: The authors have nothing to disclose.
National Institutes of Health. Consideration of sex as a biological variable in NIH-funded research. Available at: grants.nih.gov/grants/guide/notice-files/NOT-OD-15-102.html . Accessed 17 April 2017.
Legato MJ , Johnson PA , Manson JE . Consideration of sex differences in medicine to improve health care and patient outcomes . JAMA . 2016 ; 316 ( 18 ): 1865 – 1866 .
Google Scholar
Goldstein JM , Holsen L , Handa R , Tobet S . Fetal hormonal programming of sex differences in depression: linking women’s mental health with sex differences in the brain across the lifespan . Front Neurosci . 2014 ; 8 : 247 .
Institute of Medicine . Exploring the Biological Contributions to Human Health: Does Sex Matter? Washington, DC : National Academies Press ; 2001 .
Google Preview
Johnson J , Sharman Z , Vissandjée B , Stewart DE . Does a change in health research funding policy related to the integration of sex and gender have an impact ? PLoS One . 2014 ; 9 ( 6 ): e99900 .
European Commission Directorate-General for Research and Innovation. H2020 programme: guidance on gender equality in Horizon 2020. Available at: eige.europa.eu/sites/default/files/h2020-hi-guide-gender_en.pdf . Accessed 5 April 2018.
Rabesandratana T. Adding sex-and-gender dimensions to your research. Available at: www.sciencemag.org/careers/2014/03/adding-sex-and-gender-dimensions-your-research . Accessed 12 February 2018.
National Institutes of Health. Enhancing reproducibility through rigor and transparency. Available at: grants.nih.gov/grants/guide/notice-files/NOT-OD-15-103.html . Accessed 17 April 2017.
National Institutes of Health/Agency for Healthcare Research and Quality. Implementing rigor and transparency in NIH & AHRQ research grant applications. Available at: grants.nih.gov/grants/guide/notice-files/NOT-OD-16-011.html . Accessed 17 April 2017.
National Institutes of Health/Agency for Healthcare Research and Quality. Implementing rigor and transparency in NIH & AHRQ career development award applications. Available at: grants.nih.gov/grants/guide/notice-files/NOT-OD-16-012.html . Accessed 17 April 2017.
Ritz SA , Antle DM , Côté J , Deroy K , Fraleigh N , Messing K , Parent L , St-Pierre J , Vaillancourt C , Mergler D . First steps for integrating sex and gender considerations into basic experimental biomedical research . FASEB J . 2014 ; 28 ( 1 ): 4 – 13 .
Becker JB , Arnold AP , Berkley KJ , Blaustein JD , Eckel LA , Hampson E , Herman JP , Marts S , Sadee W , Steiner M , Taylor J , Young E . Strategies and methods for research on sex differences in brain and behavior . Endocrinology . 2005 ; 146 ( 4 ): 1650 – 1673 .
Ouyang P , Wenger NK , Taylor D , Rich-Edwards JW , Steiner M , Shaw LJ , Berga SL , Miller VM , Merz NB . Strategies and methods to study female-specific cardiovascular health and disease: a guide for clinical scientists . Biol Sex Differ . 2016 ; 7 ( 1 ): 19 .
Miller VM , Kaplan JR , Schork NJ , Ouyang P , Berga SL , Wenger NK , Shaw LJ , Webb RC , Mallampalli M , Steiner M , Taylor DA , Merz CN , Reckelhoff JF . Strategies and methods to study sex differences in cardiovascular structure and function: a guide for basic scientists . Biol Sex Differ . 2011 ; 2 ( 1 ): 14 .
Miller LR , Marks C , Becker JB , Hurn PD , Chen WJ , Woodruff T , McCarthy MM , Sohrabji F , Schiebinger L , Wetherington CL , Makris S , Arnold AP , Einstein G , Miller VM , Sandberg K , Maier S , Cornelison TL , Clayton JA . Considering sex as a biological variable in preclinical research . FASEB J . 2017 ; 31 ( 1 ): 29 – 34 .
Cornelison TL , Clayton JA . Considering sex as a biological variable in biomedical research . Gender and the Genome. 2017 ; 1 ( 2 ): 89 – 93 .
Nieuwenhoven L , Klinge I . Scientific excellence in applying sex- and gender-sensitive methods in biomedical and health research . J Womens Health (Larchmt) . 2010 ; 19 ( 2 ): 313 – 321 .
Legato MJ . Principles of Gender-Specific Medicine: Gender in the Genomic Era .3rd ed. London, UK : Academic Press ; 2017 .
Arnold AP , Lusis AJ . Understanding the sexome: measuring and reporting sex differences in gene systems . Endocrinology . 2012 ; 153 ( 6 ): 2551 – 2555 .
Mosca L , Barrett-Connor E , Wenger NK . Sex/gender differences in cardiovascular disease prevention: what a difference a decade makes . Circulation . 2011 ; 124 ( 19 ): 2145 – 2154 .
Tobet SA , Handa RJ , Goldstein JM . Sex-dependent pathophysiology as predictors of comorbidity of major depressive disorder and cardiovascular disease . Pflugers Arch . 2013 ; 465 ( 5 ): 585 – 594 .
Anastario M , Salafia CM , Fitzmaurice G , Goldstein JM . Impact of fetal versus perinatal hypoxia on sex differences in childhood outcomes: developmental timing matters . Soc Psychiatry Psychiatr Epidemiol . 2012 ; 47 ( 3 ): 455 – 464 .
Whitley H , Lindsey W . Sex-based differences in drug activity . Am Fam Physician . 2009 ; 80 ( 11 ): 1254 – 1258 .
Springer KW , Mager Stellman J , Jordan-Young RM . Beyond a catalogue of differences: a theoretical frame and good practice guidelines for researching sex/gender in human health . Soc Sci Med . 2012 ; 74 ( 11 ): 1817 – 1824 .
Krieger N . Genders, sexes, and health: what are the connections—and why does it matter ? Int J Epidemiol . 2003 ; 32 ( 4 ): 652 – 657 .
Sen G , Ostlin P . Gender inequity in health: why it exists and how we can change it . Glob Public Health . 2008 ; 3 ( sup1 , Suppl 1 ) 1 – 12 .
Garcia M , Mulvagh SL , Merz CN , Buring JE , Manson JE . Cardiovascular disease in women: clinical perspectives . Circ Res . 2016 ; 118 ( 8 ): 1273 – 1293 .
Jazin E , Cahill L . Sex differences in molecular neuroscience: from fruit flies to humans . Nat Rev Neurosci . 2010 ; 11 ( 1 ): 9 – 17 .
Manson JE , Bassuk SS , Lee IM , Cook NR , Albert MA , Gordon D , Zaharris E , Macfadyen JG , Danielson E , Lin J , Zhang SM , Buring JE . The VITamin D and OmegA-3 TriaL (VITAL): rationale and design of a large randomized controlled trial of vitamin D and marine omega-3 fatty acid supplements for the primary prevention of cancer and cardiovascular disease . Contemp Clin Trials . 2012 ; 33 ( 1 ): 159 – 171 .
King AJ . The use of animal models in diabetes research . Br J Pharmacol . 2012 ; 166 ( 3 ): 877 – 894 .
Clutton-Brock TH , Isvaran K . Sex differences in ageing in natural populations of vertebrates . Proc Biol Sci . 2007 ; 274 ( 1629 ): 3097 – 3104 .
Lyon MF . Gene action in the X-chromosome of the mouse ( Mus musculus L.) . Nature . 1961 ; 190 ( 4773 ): 372 – 373 .
Busque L , Mio R , Mattioli J , Brais E , Blais N , Lalonde Y , Maragh M , Gilliland DG . Nonrandom X-inactivation patterns in normal females: lyonization ratios vary with age . Blood . 1996 ; 88 ( 1 ): 59 – 65 .
Abkowitz JL , Taboada M , Shelton GH , Catlin SN , Guttorp P , Kiklevich JV . An X chromosome gene regulates hematopoietic stem cell kinetics . Proc Natl Acad Sci USA . 1998 ; 95 ( 7 ): 3862 – 3866 .
Barrett EL , Richardson DS . Sex differences in telomeres and lifespan . Aging Cell . 2011 ; 10 ( 6 ): 913 – 921 .
Astuti Y , Wardhana A , Watkins J , Wulaningsih W ; PILAR Research Network . Cigarette smoking and telomere length: a systematic review of 84 studies and meta-analysis . Environ Res . 2017 ; 158 : 480 – 489 .
Resende MM , Taylor DA . Building solutions for cardiovascular disease in women . Tex Heart Inst J . 2013 ; 40 ( 3 ): 285 – 287 .
El-Maarri O , Becker T , Junen J , Manzoor SS , Diaz-Lacava A , Schwaab R , Wienker T , Oldenburg J . Gender specific differences in levels of DNA methylation at selected loci from human total blood: a tendency toward higher methylation levels in males . Hum Genet . 2007 ; 122 ( 5 ): 505 – 514 .
Zhu ZZ , Hou L , Bollati V , Tarantini L , Marinelli B , Cantone L , Yang AS , Vokonas P , Lissowska J , Fustinoni S , Pesatori AC , Bonzini M , Apostoli P , Costa G , Bertazzi PA , Chow WH , Schwartz J , Baccarelli A . Predictors of global methylation levels in blood DNA of healthy subjects: a combined analysis . Int J Epidemiol . 2012 ; 41 ( 1 ): 126 – 139 .
Smith RP , Coward RM , Kovac JR , Lipshultz LI . The evidence for seasonal variations of testosterone in men . Maturitas . 2013 ; 74 ( 3 ): 208 – 212 .
Winston AP , Wijeratne S . Hypogonadism, hypoleptinaemia and osteoporosis in males with eating disorders . Clin Endocrinol (Oxf) . 2009 ; 71 ( 6 ): 897 – 898 .
Masters JR , Drife JO , Scarisbrick JJ . Cyclic variation of DNA synthesis in human breast epithelium . J Natl Cancer Inst . 1977 ; 58 ( 5 ): 1263 – 1265 .
Anderson TJ . Mitotic activity in the breast . J Obstet Gynaecol . 1984 ; 4 ( Suppl 2 ): S114 – S118 .
Sulke AN , Jones DB , Wood PJ . Variation in natural killer activity in peripheral blood during the menstrual cycle . Br Med J (Clin Res Ed) . 1985 ; 290 ( 6472 ): 884 – 886 .
Ruiz-Pino F , Navarro VM , Bentsen AH , Garcia-Galiano D , Sanchez-Garrido MA , Ciofi P , Steiner RA , Mikkelsen JD , Pinilla L , Tena-Sempere M . Neurokinin B and the control of the gonadotropic axis in the rat: developmental changes, sexual dimorphism, and regulation by gonadal steroids . Endocrinology . 2012 ; 153 ( 10 ): 4818 – 4829 .
de Vries GJ , Södersten P . Sex differences in the brain: the relation between structure and function . Horm Behav . 2009 ; 55 ( 5 ): 589 – 596 .
McCarthy MM , Pickett LA , VanRyzin JW , Kight KE . Surprising origins of sex differences in the brain . Horm Behav . 2015 ; 76 : 3 – 10 .
Semaan SJ , Tolson KP , Kauffman AS .The development of kisspeptin circuits in the mammalian brain. In: Kauffman AS , Smith JT , eds. Kisspeptin Signaling in Reproductive Biology . New York, NY : Springer ; 2013 : 221 – 252 .
Holsen LM , Lancaster K , Klibanski A , Whitfield-Gabrieli S , Cherkerzian S , Buka S , Goldstein JM . HPA-axis hormone modulation of stress response circuitry activity in women with remitted major depression . Neuroscience . 2013 ; 250 : 733 – 742 .
Jacobs EG , Weiss BK , Makris N , Whitfield-Gabrieli S , Buka SL , Klibanski A , Goldstein JM . Impact of sex and menopausal status on episodic memory circuitry in early midlife . J Neurosci . 2016 ; 36 ( 39 ): 10163 – 10173 .
Heldring N , Pike A , Andersson S , Matthews J , Cheng G , Hartman J , Tujague M , Ström A , Treuter E , Warner M , Gustafsson JA . Estrogen receptors: how do they signal and what are their targets . Physiol Rev . 2007 ; 87 ( 3 ): 905 – 931 .
Barton M , Filardo EJ , Lolait SJ , Thomas P , Maggiolini M , Prossnitz ER . Twenty years of the G protein-coupled estrogen receptor GPER: historical and personal perspectives . J Steroid Biochem Mol Biol . 2018 ; 176 : 4 – 15 .
Cosman F , Lindsay R . Selective estrogen receptor modulators: clinical spectrum . Endocr Rev . 1999 ; 20 ( 3 ): 418 – 434 .
Jacobsen BM , Horwitz KB . Progesterone receptors, their isoforms and progesterone regulated transcription . Mol Cell Endocrinol . 2012 ; 357 ( 1-2 ): 18 – 29 .
Marks LS . 5α-Reductase: history and clinical importance . Rev Urol . 2004 ; 6 ( Suppl 9 ): S11 – S21 .
Looker AC , Johnson CL , Lacher DA , Pfeiffer CM , Schleicher RL , Sempos CT . Vitamin D status: United States, 2001–2006 . NCHS Data Brief . 2011 ; 59 ( 59 ): 1 – 8 .
Tolin DF , Foa EB . Sex differences in trauma and posttraumatic stress disorder: a quantitative review of 25 years of research . Psychol Bull . 2006 ; 132 ( 6 ): 959 – 992 .
Breiding MJ , Smith SG , Basile KC , Walters ML , Chen J , Merrick MT . Prevalence and characteristics of sexual violence, stalking, and intimate partner violence victimization—national intimate partner and sexual violence survey, United States, 2011 . MMWR Surveill Summ . 2014 ; 63 ( 8 ): 1 – 18 .
Black MC , Basile KC , Breiding MJ , Smith SG, Walters ML, Merrick MT, Chen J, Stevens MR. The National Intimate Partner and Sexual Violence Survey (NISVS): 2010 Summary Report . Atlanta, GA : National Center for Injury Prevention and Control, Centers for Disease Control and Prevention ; 2011 .
Afifi TO , MacMillan HL , Boyle M , Taillieu T , Cheung K , Sareen J . Child abuse and mental disorders in Canada . CMAJ. 2014 ; 186 ( 9 ): E324 – E332 .
World Health Organization . Global and regional estimates of violence against women: prevalence and health effects of intimate partner violence and non-partner sexual violence . Geneva, Switzerland : World Health Organization ; 2013 .
Diaz Brinton R . Minireview: translational animal models of human menopause: challenges and emerging opportunities . Endocrinology . 2012 ; 153 ( 8 ): 3571 – 3578 .
Uhlenhaut NH , Treier M . Foxl2 function in ovarian development . Mol Genet Metab . 2006 ; 88 ( 3 ): 225 – 234 .
Drew BG , Hamidi H , Zhou Z , Villanueva CJ , Krum SA , Calkin AC , Parks BW , Ribas V , Kalajian NY , Phun J , Daraei P , Christofk HR , Hewitt SC , Korach KS , Tontonoz P , Lusis AJ , Slamon DJ , Hurvitz SA , Hevener AL . Estrogen receptor (ER)α-regulated lipocalin 2 expression in adipose tissue links obesity with breast cancer progression . J Biol Chem . 2015 ; 290 ( 9 ): 5566 – 5581 .
McDevitt MA , Glidewell-Kenney C , Jimenez MA , Ahearn PC , Weiss J , Jameson JL , Levine JE . New insights into the classical and non-classical actions of estrogen: evidence from estrogen receptor knock-out and knock-in mice . Mol Cell Endocrinol . 2008 ; 290 ( 1–2 ): 24 – 30 .
Park CJ , Zhao Z , Glidewell-Kenney C , Lazic M , Chambon P , Krust A , Weiss J , Clegg DJ , Dunaif A , Jameson JL , Levine JE . Genetic rescue of nonclassical ERα signaling normalizes energy balance in obese Erα-null mutant mice . J Clin Invest . 2011 ; 121 ( 2 ): 604 – 612 .
Itoh Y , Mackie R , Kampf K , Domadia S , Brown JD , O’Neill R , Arnold AP . Four core genotypes mouse model: localization of the Sry transgene and bioassay for testicular hormone levels . BMC Res Notes . 2015 ; 8 ( 1 ): 69 .
Chen X , McClusky R , Chen J , Beaven SW , Tontonoz P , Arnold AP , Reue K . The number of X chromosomes causes sex differences in adiposity in mice . PLoS Genet . 2012 ; 8 ( 5 ): e1002709 .
Li J , Chen X , McClusky R , Ruiz-Sundstrom M , Itoh Y , Umar S , Arnold AP , Eghbali M . The number of X chromosomes influences protection from cardiac ischaemia/reperfusion injury in mice: one X is better than two . Cardiovasc Res . 2014 ; 102 ( 3 ): 375 – 384 .
Shah K , McCormack CE , Bradbury NA . Do you know the sex of your cells ? Am J Physiol Cell Physiol . 2014 ; 306 ( 1 ): C3 – C18 .
Kernan WN , Viscoli CM , Makuch RW , Brass LM , Horwitz RI . Stratified randomization for clinical trials . J Clin Epidemiol . 1999 ; 52 ( 1 ): 19 – 26 .
Peters SA , Huxley RR , Woodward M . Diabetes as risk factor for incident coronary heart disease in women compared with men: a systematic review and meta-analysis of 64 cohorts including 858,507 individuals and 28,203 coronary events . Diabetologia . 2014 ; 57 ( 8 ): 1542 – 1551 .
Huxley RR , Peters SA , Mishra GD , Woodward M . Risk of all-cause mortality and vascular events in women versus men with type 1 diabetes: a systematic review and meta-analysis . Lancet Diabetes Endocrinol . 2015 ; 3 ( 3 ): 198 – 206 .
Garcia-Closas M, Lubin JH. POWER V3.0 software. Available at: dceg.cancer.gov/tools/design/power . Accessed 5 April 2018.
VanderWeele T. Tools and tutorials. Available at: www.hsph.harvard.edu/tyler-vanderweele/tools-and-tutorials/ . Accessed 5 April 2018.
Brookes ST , Whitely E , Egger M , Smith GD , Mulheran PA , Peters TJ . Subgroup analyses in randomized trials: risks of subgroup-specific analyses; power and sample size for the interaction test . J Clin Epidemiol . 2004 ; 57 ( 3 ): 229 – 236 .
Peto R .Statistical aspects of cancer trials. In: Halnan KE , ed. Treatment of Cancer . London, UK : Chapman and Hall ; 1982 : 867 – 871 .
Ioannidis JP . Why most published research findings are false . PLoS Med . 2005 ; 2 ( 8 ): e124 .
MacLaughlin DT , Donahoe PK . Sex determination and differentiation . N Engl J Med . 2004 ; 350 ( 4 ): 367 – 378 .
Hankivsky O , Doyal L , Einstein G , Kelly U , Shim J , Weber L , Repta A . The odd couple: using biomedical and intersectional approaches to address health inequities . Global Health Action . 2017 ; 10 ( Suppl 2 ): 1326686 .
Westbrook L , Saperstein A . New categories are not enough: rethinking the measurement of sex and gender in social surveys . Gend Soc . 2015 ; 29 ( 4 ): 534 – 560 .
Gender Spectrum. Understanding gender. Available at: www.genderspectrum.org/quick-links/understanding-gender/ . Accessed 4 April 2018.
Gender Identity in U.S. Surveillance (GenIUSS) Group. Best Practices for Asking Questions to Identify Transgender and Other Gender Minority Respondents on Population-Based Surveys. Herman JL, ed. Los Angeles, CA: The Williams Institute, 2014 .
Phillips SP . Defining and measuring gender: a social determinant of health whose time has come . Int J Equity Health . 2005 ; 4 ( 1 ): 11 .
Pelletier R , Ditto B , Pilote L . A composite measure of gender and its association with risk factors in patients with premature acute coronary syndrome . Psychosom Med . 2015 ; 77 ( 5 ): 517 – 526 .
Tamambang L , Auger N , Lo E , Raynault M-F . Measurement of gender inequality in neighbourhoods of Québec, Canada . Int J Equity Health . 2011 ; 10 ( 1 ): 52 .
Shors TJ , Tobόn K , DiFeo G , Durham DM , Chang HYM . Sexual conspecific aggressive response (SCAR): a model of sexual trauma that disrupts maternal learning and plasticity in the female brain . Sci Rep . 2016 ; 6 ( 1 ): 18960 .
Blanco L , Nydegger LA , Camarillo G , Trinidad DR , Schramm E , Ames SL . Neurological changes in brain structure and functions among individuals with a history of childhood sexual abuse: a review . Neurosci Biobehav Rev . 2015 ; 57 : 63 – 69 .
Heim C , Nemeroff CB . The role of childhood trauma in the neurobiology of mood and anxiety disorders: preclinical and clinical studies . Biol Psychiatry . 2001 ; 49 ( 12 ): 1023 – 1039 .
Tepe V , Yarnell A , Nindl BC , Van Arsdale S , Deuster PA . Women in combat: summary of findings and a way ahead . Mil Med . 2016 ; 181 ( Suppl 1 ): 109 – 118 .
Cross JD , Johnson AE , Wenke JC , Bosse MJ , Ficke JR . Mortality in female war veterans of operations enduring freedom and Iraqi freedom . Clin Orthop Relat Res . 2011 ; 469 ( 7 ): 1956 – 1961 .
Chokshi NP , Iqbal SN , Berger RL , Hochman JS , Feit F , Slater JN , Pena-Sing I , Yatskar L , Keller NM , Babaev A , Attubato MJ , Reynolds HR . Sex and race are associated with the absence of epicardial coronary artery obstructive disease at angiography in patients with acute coronary syndromes . Clin Cardiol . 2010 ; 33 ( 8 ): 495 – 501 .
Sharaf BL , Pepine CJ , Kerensky RA , Reis SE , Reichek N , Rogers WJ , Sopko G , Kelsey SF , Holubkov R , Olson M , Miele NJ , Williams DO , Merz CN ; WISE Study Group. Detailed angiographic analysis of women with suspected ischemic chest pain (pilot phase data from the NHLBI-sponsored Women’s Ischemia Syndrome Evaluation [WISE] Study Angiographic Core Laboratory) . Am J Cardiol . 2001 ; 87 ( 8 ): 937 – 941 .
Reis SE , Holubkov R , Conrad Smith AJ , Kelsey SF , Sharaf BL , Reichek N , Rogers WJ , Merz CN , Sopko G , Pepine CJ ; WISE Investigators . Coronary microvascular dysfunction is highly prevalent in women with chest pain in the absence of coronary artery disease: results from the NHLBI WISE study . Am Heart J . 2001 ; 141 ( 5 ): 735 – 741 .
Sanghavi M , Gulati M . Sex differences in the pathophysiology, treatment, and outcomes in IHD . Curr Atheroscler Rep . 2015 ; 17 ( 6 ): 34 .
Ciccarelli A , Daly AF , Beckers A . The epidemiology of prolactinomas . Pituitary . 2005 ; 8 ( 1 ): 3 – 6 .
Klibanski A . Prolactinomas . N Engl J Med . 2010 ; 362 ( 13 ): 1219 – 1226 .
Recouvreux MV , Faraoni EY , Camilletti MA , Ratner L , Abeledo-Machado A , Rulli SB , Díaz-Torga G . Sex differences in the pituitary TGFβ1 system: the role of TGFβ1 in prolactinoma development . Front Neuroendocrinol . 2017 ; S0091-3022(17)30063-8 .
Greenblatt DJ , Harmatz JS , Singh NN , Steinberg F , Roth T , Moline ML , Harris SC , Kapil RP . Gender differences in pharmacokinetics and pharmacodynamics of zolpidem following sublingual administration . J Clin Pharmacol . 2014 ; 54 ( 3 ): 282 – 290 .
Cubała WJ , Wiglusz M , Burkiewicz A , Gałuszko-Wegielnik M . Zolpidem pharmacokinetics and pharmacodynamics in metabolic interactions involving CYP3A: sex as a differentiating factor . Eur J Clin Pharmacol . 2010 ; 66 ( 9 ): 955 .
Olubodun JO , Ochs HR , von Moltke LL , Roubenoff R , Hesse LM , Harmatz JS , Shader RI , Greenblatt DJ . Pharmacokinetic properties of zolpidem in elderly and young adults: possible modulation by testosterone in men . Br J Clin Pharmacol . 2003 ; 56 ( 3 ): 297 – 304 .
U.S. Food and Drug Administration. Questions and answers: risk of next-morning impairment after use of insomnia drugs; FDA requires lower recommended doses for certain drugs containing zolpidem (Ambien, Ambien CR, Edluar, and Zolpimist). Available at: www.fda.gov/Drugs/DrugSafety/ucm334041.htm#q6 . Accessed 17 April 2017.
U.S. Food and Drug Administration. FDA drug safety communication. FDA cautions about using testosterone products for low testosterone due to aging; requires labeling change to inform of possible increased risk of heart attack and stroke with use. Available at: www.fda.gov/Drugs/DrugSafety/ucm436259.htm . Accessed 17 April 2017.
Pope HG Jr , Kanayama G , Athey A , Ryan E , Hudson JI , Baggish A . The lifetime prevalence of anabolic-androgenic steroid use and dependence in Americans: current best estimates . Am J Addict . 2014 ; 23 ( 4 ): 371 – 377 .
Fish EN . The X-files in immunity: sex-based differences predispose immune responses . Nat Rev Immunol . 2008 ; 8 ( 9 ): 737 – 744 .
Power ML , Schulkin J . Sex differences in fat storage, fat metabolism, and the health risks from obesity: possible evolutionary origins . Br J Nutr . 2008 ; 99 ( 5 ): 931 – 940 .
Mendelsohn ME , Karas RH . Molecular and cellular basis of cardiovascular gender differences . Science . 2005 ; 308 ( 5728 ): 1583 – 1587 .
Cook DM , Ludlam WH , Cook MB . Route of estrogen administration helps to determine growth hormone (GH) replacement dose in GH-deficient adults . J Clin Endocrinol Metab . 1999 ; 84 ( 11 ): 3956 – 3960 .
Molitch ME , Clemmons DR , Malozowski S , Merriam GR , Vance ML ; Endocrine Society . Evaluation and treatment of adult growth hormone deficiency: an Endocrine Society clinical practice guideline . J Clin Endocrinol Metab . 2011 ; 96 ( 6 ): 1587 – 1609 .
Silva MJ , Barr DB , Reidy JA , Malek NA , Hodge CC , Caudill SP , Brock JW , Needham LL , Calafat AM . Urinary levels of seven phthalate metabolites in the U.S. population from the National Health and Nutrition Examination Survey (NHANES) 1999–2000 . Environ Health Perspect . 2004 ; 112 ( 3 ): 331 – 338 .
Huang T , Saxena AR , Isganaitis E , James-Todd T . Gender and racial/ethnic differences in the associations of urinary phthalate metabolites with markers of diabetes risk: National Health and Nutrition Examination Survey 2001–2008 . Environ Health . 2014 ; 13 ( 1 ): 6 .
Rengarajan S , Parthasarathy C , Anitha M , Balasubramanian K . Diethylhexyl phthalate impairs insulin binding and glucose oxidation in Chang liver cells . Toxicol In Vitro . 2007 ; 21 ( 1 ): 99 – 102 .
Gayathri NS , Dhanya CR , Indu AR , Kurup PA . Changes in some hormones by low doses of di (2-ethyl hexyl) phthalate (DEHP), a commonly used plasticizer in PVC blood storage bags & medical tubing . Indian J Med Res . 2004 ; 119 ( 4 ): 139 – 144 .
Stahlhut RW , van Wijngaarden E , Dye TD , Cook S , Swan SH . Concentrations of urinary phthalate metabolites are associated with increased waist circumference and insulin resistance in adult U.S. males . Environ Health Perspect . 2007 ; 115 ( 6 ): 876 – 882 .
James-Todd T , Stahlhut R , Meeker JD , Powell SG , Hauser R , Huang T , Rich-Edwards J . Urinary phthalate metabolite concentrations and diabetes among women in the National Health and Nutrition Examination Survey (NHANES) 2001–2008 . Environ Health Perspect . 2012 ; 120 ( 9 ): 1307 – 1313 .
Ramsey JM , Cooper JD , Penninx BW , Bahn S . Variation in serum biomarkers with sex and female hormonal status: implications for clinical tests . Sci Rep . 2016 ; 6 ( 1 ): 26947 .
Gangestad SW , Haselton MG , Welling LLM ,et al. . How valid are assessments of conception probability in ovulatory cycle research? Evaluations, recommendations, and theoretical implications . Evol Hum Behav . 2016 ; 37 ( 2 ): 85 – 96 .
Harlow SD , Gass M , Hall JE , Lobo R , Maki P , Rebar RW , Sherman S , Sluss PM , de Villiers TJ ; STRAW + 10 Collaborative Group . Executive summary of the Stages of Reproductive Aging Workshop + 10: addressing the unfinished agenda of staging reproductive aging . J Clin Endocrinol Metab . 2012 ; 97 ( 4 ): 1159 – 1168 .
Shifren JL , Schiff I . The aging ovary . J Womens Health Gend Based Med . 2000 ; 9 ( Suppl 1 ): S3 – S7 .
Rich-Edwards JW , Fraser A , Lawlor DA , Catov JM . Pregnancy characteristics and women’s future cardiovascular health: an underused opportunity to improve women’s health ? Epidemiol Rev . 2014 ; 36 ( 1 ): 57 – 70 .
Bellamy L , Casas J-P , Hingorani AD , Williams DJ . Pre-eclampsia and risk of cardiovascular disease and cancer in later life: systematic review and meta-analysis . BMJ . 2007 ; 335 ( 7627 ): 974 .
Holvik K , Meyer HE , Haug E , Brunvand L . Prevalence and predictors of vitamin D deficiency in five immigrant groups living in Oslo, Norway: the Oslo Immigrant Health Study . Eur J Clin Nutr . 2005 ; 59 ( 1 ): 57 – 63 .
Calvo MS , Whiting SJ , Barton CN . Vitamin D intake: a global perspective of current status . J Nutr . 2005 ; 135 ( 2 ): 310 – 316 .
Hilger J , Friedel A , Herr R , Rausch T , Roos F , Wahl DA , Pierroz DD , Weber P , Hoffmann K . A systematic review of vitamin D status in populations worldwide . Br J Nutr . 2014 ; 111 ( 1 ): 23 – 45 .
van Dam RM , Snijder MB , Dekker JM , Stehouwer CD , Bouter LM , Heine RJ , Lips P . Potentially modifiable determinants of vitamin D status in an older population in the Netherlands: the Hoorn Study . Am J Clin Nutr . 2007 ; 85 ( 3 ): 755 – 761 .
Kautzky-Willer A , Harreiter J . Sex and gender differences in therapy of type 2 diabetes . Diabetes Res Clin Pract . 2017 ; 131 : 230 – 241 .
Wallach JD , Sullivan PG , Trepanowski JF , Steyerberg EW , Ioannidis JP . Sex based subgroup differences in randomized controlled trials: empirical evidence from Cochrane meta-analyses . BMJ . 2016 ; 355 : i5826 .
Miller VM , Tannenbaum C , Regitz-Zagrosek V . Sex based subgroup differences in randomized controlled trials: empirical evidence from Cochrane meta-analyses; response to authors comment . BMJ . 2016 ; 355 : i5826 .
Alosh M , Fritsch K , Huque M ,et al. . Statistical considerations on subgroup analysis in clinical trials . Stat Biopharm Res . 2015 ; 7 ( 4 ): 286 – 303 .
Rothman KJ , Greenland S , Lash TL . Modern Epidemiology .3rd ed. Philadelphia, PA : Lippincott Williams & Wilkins ; 2008 .
Austin PC . An introduction to propensity score methods for reducing the effects of confounding in observational studies . Multivariate Behav Res . 2011 ; 46 ( 3 ): 399 – 424 .
Fewell Z , Davey Smith G , Sterne JA . The impact of residual and unmeasured confounding in epidemiologic studies: a simulation study . Am J Epidemiol . 2007 ; 166 ( 6 ): 646 – 655 .
Ding P , VanderWeele TJ . Sensitivity analysis without assumptions . Epidemiology . 2016 ; 27 ( 3 ): 368 – 377 .
Shi H , Seeley RJ , Clegg DJ . Sexual differences in the control of energy homeostasis . Front Neuroendocrinol . 2009 ; 30 ( 3 ): 396 – 404 .
Rosenfeld CS . Sex-dependent differences in voluntary physical activity . J Neurosci Res . 2017 ; 95 ( 1-2 ): 279 – 290 .
Valeri L , Vanderweele TJ . Mediation analysis allowing for exposure–mediator interactions and causal interpretation: theoretical assumptions and implementation with SAS and SPSS macros . Psychol Methods . 2013 ; 18 ( 2 ): 137 – 150 .
Richiardi L , Bellocco R , Zugna D . Mediation analysis in epidemiology: methods, interpretation and bias . Int J Epidemiol . 2013 ; 42 ( 5 ): 1511 – 1519 .
Buyse ME . Analysis of clinical trial outcomes: some comments on subgroup analyses . Control Clin Trials . 1989 ; 10 ( 4 , Suppl ) 187S – 194S .
Assmann SF , Pocock SJ , Enos LE , Kasten LE . Subgroup analysis and other (mis)uses of baseline data in clinical trials . Lancet . 2000 ; 355 ( 9209 ): 1064 – 1069 .
Frasure-Smith N , Lespérance F , Prince RH , Verrier P , Garber RA , Juneau M , Wolfson C , Bourassa MG . Randomised trial of home-based psychosocial nursing intervention for patients recovering from myocardial infarction . Lancet . 1997 ; 350 ( 9076 ): 473 – 479 .
Month: | Total Views: |
---|---|
April 2018 | 99 |
May 2018 | 25 |
June 2018 | 19 |
July 2018 | 12 |
August 2018 | 1,199 |
September 2018 | 760 |
October 2018 | 495 |
November 2018 | 112 |
December 2018 | 35 |
January 2019 | 52 |
February 2019 | 65 |
March 2019 | 49 |
April 2019 | 61 |
May 2019 | 37 |
June 2019 | 42 |
July 2019 | 32 |
August 2019 | 361 |
September 2019 | 427 |
October 2019 | 390 |
November 2019 | 399 |
December 2019 | 431 |
January 2020 | 555 |
February 2020 | 974 |
March 2020 | 1,009 |
April 2020 | 1,361 |
May 2020 | 989 |
June 2020 | 1,386 |
July 2020 | 1,314 |
August 2020 | 1,588 |
September 2020 | 1,766 |
October 2020 | 2,301 |
November 2020 | 1,933 |
December 2020 | 2,010 |
January 2021 | 2,040 |
February 2021 | 2,017 |
March 2021 | 2,329 |
April 2021 | 2,367 |
May 2021 | 2,148 |
June 2021 | 2,029 |
July 2021 | 2,180 |
August 2021 | 2,605 |
September 2021 | 3,502 |
October 2021 | 2,862 |
November 2021 | 2,311 |
December 2021 | 1,696 |
January 2022 | 1,773 |
February 2022 | 1,904 |
March 2022 | 2,529 |
April 2022 | 2,596 |
May 2022 | 2,364 |
June 2022 | 1,917 |
July 2022 | 1,363 |
August 2022 | 1,281 |
September 2022 | 1,490 |
October 2022 | 2,077 |
November 2022 | 1,724 |
December 2022 | 1,100 |
January 2023 | 1,074 |
February 2023 | 910 |
March 2023 | 870 |
April 2023 | 778 |
May 2023 | 784 |
June 2023 | 709 |
July 2023 | 667 |
August 2023 | 617 |
September 2023 | 666 |
October 2023 | 642 |
November 2023 | 655 |
December 2023 | 524 |
January 2024 | 628 |
February 2024 | 480 |
March 2024 | 561 |
April 2024 | 574 |
May 2024 | 588 |
June 2024 | 442 |
July 2024 | 373 |
August 2024 | 144 |
Citing articles via.
Oxford University Press is a department of the University of Oxford. It furthers the University's objective of excellence in research, scholarship, and education by publishing worldwide
Sign In or Create an Account
This PDF is available to Subscribers Only
For full access to this pdf, sign in to an existing account, or purchase an annual subscription.
4866 Accesses
8 Citations
Explore all metrics
“Achieve gender equality and empower all women and girls” is essential to reduce gender disparity and improve the status of women. But it remains a challenge to narrow gender differences and improve gender equality in academic research. In this paper, we propose that the impact of articles is lower and writing style of articles is less positive when the article’s first author is female relative to male first authors, and writing style mediates this relationship. Focusing on the positive writing style, we attempt to contribute and explain the research on gender differences in research performance. We use BERT-based textual sentiment analysis to analyse 87 years of 9820 articles published in the top four marketing journals and prove our hypotheses. We also consider a set of control variables and conduct a set of robustness checks to ensure the robustness of our findings. We discuss the theoretical and managerial implications of our findings for researchers.
Avoid common mistakes on your manuscript.
As one of the Sustainable Development Goals (SDGs), the SDG 5 (Gender Equality): “achieve gender equality and empower all women and girls” is essential to reduce gender disparity and improve the status of women (UnitedNations, 2015 ). On International Women’s Day in 2021, Elsevier, a renowned information services provider, and publisher, released a report titled “Researcher Journey Through a Gender Lens,” which shows that there are gender differences in scientific research (Elsevier, 2020 ). On the one hand, gender differences exist across various subject areas, but the extent of these differences varies. On the other hand, the gender differences between men and women vary between countries. Japan, for example, has larger gender differences in research performance than the United States and China (Elsevier, 2020 ).
Researchers have made considerable efforts to promote gender equality (Badar et al., 2014 ; Kou et al., 2019 ; Lopez & Pereira, 2021 ; Myers et al., 2020 ; Restrepo et al., 2021 ). However, previous studies on gender equality in academia have three limitations. First, we observe that academic achievement studies have primarily focused on science, technology, engineering, and mathematics (STEM) disciplines, while business disciplines receive relatively little attention (Gruber et al., 2021 ). Second, there has been extensive prior research on gender differences, but limited effort has been made to explain the underlying mechanisms. As of now, some factors have been suggested as contributing to these differences, including such as age, country, institution, productivity (Lopez & Pereira, 2021 ; Myers et al., 2020 ; Restrepo et al., 2021 ), as well as differences in language use (Lerchenmueller et al., 2019 ; Newman et al., 2008 ; Urquhart-Cronish & Otto, 2019 ). Previous studies have also examined gender differences in research performance, but they tend to focus more on revealing the phenomenon than explaining its underlying mechanisms. Meanwhile, only a few of those studies have concentrated on writing style and have addressed the relationship between author gender and writing style. There are no studies that have examined the relationship between writing style and research performance. Based on those studies about gender differences in writing style (Lerchenmueller et al., 2019 ), we address two research questions in this study in an attempt to explain the gender differences in research performance through writing style. Here are the research questions:
Research Question 1: Are there gender differences in the research performance of male and female academics in business?
Research Question 2: What role does article’s writing style play in explaining the gender differences in research performance?
The academic status of female authors in marketing is also inferior to that of their male counterparts (Elsevier, 2020 ). We propose that the articles with female first authors have a lower impact than the articles with male first authors. Academics of different genders exhibit different levels of confidence in their academic work (Heath et al., 2022 ; Hoops et al., 2019 ; Meyerson et al., 2017 ; Sawdon & Finn, 2014 ). According to (Ehrlinger & Dunning, 2003 ), women express significantly lower confidence than men, and we thus propose that writing style of articles with female first authors is less positive than that of articles with male first authors. Finally, female authors use fewer positive words in their academic writing, and their writing style is less positive. Combined with the “self-confidence effect”, self-confidence predicts success in the future (Meisha & Al-dabbagh, 2021 ). An article with a positive writing style reflects the writer's confidence, and one with a more confident expression is more likely to be approved by the reader. Therefore, we propose that the writing style mediates the positive effect of gender differences on research performance.
Using BERT-based textual sentiment analysis, an analysis of 86 years’ worth of 9,820 articles from the top four marketing journals addresses these research questions. The results prove our hypotheses. We control for factors related to the articles’ writing style and research impact, including many factors at the author level, article level, journal level, and affiliate level, and conduct a set of robustness checks, further ensuring the robustness of findings.
Gender inequalities in academia.
Due to gender differences, men develop their careers more rapidly than women (van den Besselaar & Sandstrom, 2016 ). Access to valuable resources is differentially distributed among male and female scientists (Shauman & Xie, 2003 ). Additionally, the increased participation of women in STEM fields has also led to larger gender differences relating to productivity and impact (Elsevier, 2017 ; Huang et al., 2020 ; van Arensbergen et al., 2012 ). Literature on gender differences in research performance suggests that men outperform women (Abramo et al., 2015 ).
In recent decades, the gender context of academic science has substantially changed, with more female scientists entering the field (Elsevier, 2017 ; Huang et al., 2020 ; Lariviere et al., 2013 ) and occupying high-level academic positions (Diezmann & Grieshaber, 2019 ; Zippel, 2020 ). However, gender imbalances are still evident in the production of knowledge (Dinu, 2021 ; Koseoglu et al., 2019 ). According to Paswan and Singh ( 2020 ), women’s representation varies by field, with biology (37%) having a relatively higher percentage of female authors compared with engineering (20%), information science (21%), and mathematics (22%). The degree of gender differences varies fundamentally by discipline. There is still a significant underrepresentation of women in academic medicine and life science (Ha et al., 2021 ; Lerchenmueller & Sorenson, 2018 ; Lerchenmueller et al., 2019 ). Gender differences also persist in other disciplines. Ghiasi et al. ( 2015 ) report men produce 80% of all scientific production in engineering. Women in the biomedical field have fewer publications on COVID-19 (Muric et al., 2021 ). In addition, female authors and reviewers are underrepresented in entomology journals (Walker, 2020 ).
Bibliometric studies have focused on gender differences in academic performance. Despite this, these studies are rarely able to explain these phenomena in terms of their underlying mechanisms, sticking instead to revealing the characteristics of these phenomena. Additionally, these studies are focused primarily on engineering and medicine, with little emphasis on the business sector. This article focuses on the discipline of marketing in the business. The academic status of female authors in marketing is also inferior to that of their male counterparts (Elsevier, 2020 ). We propose that articles with female first authors have a lower impact than the articles with male first authors. Here is the hypothesis.
The impact of articles with female first authors is lower than that of articles with male first authors.
To eliminate these imbalances, we need first to explain the mechanism of this phenomenon. In this study, we focus on writing style and try to explain the research performance resulting from gender differences.
Multiple factors have been proposed as contributing to gender differences in research performance. The author’s characteristics, such as age, country, institution, productivity, country of origin, the field of study, and position in the academic system, can affect gender differences in research performance (Abramo et al., 2021 ; Lopez & Pereira, 2021 ; Myers et al., 2020 ; Restrepo et al., 2021 ; van Arensbergen et al., 2012 ). For example, women who work in research and those who have young children have had a significant decline in time devoted to research (Myers et al., 2020 ) and are less effective at technology development activities (Kou et al., 2019 ). Lopez and Pereira ( 2021 ) contend that female researchers are even less capable of transferring knowledge profitably and efficiently from a business standpoint. Researchers of male researchers (collaborating primarily with same-sex scientists) adhere to the principle of gender homophily, but females do not (Abramo et al., 2019b ; Jung et al., 2017 ; Kwiek & Roszka, 2021 ).
This paper focuses on the characteristics of the articles. Concerning the topic, Shang et al. ( 2022 ) explore gender balance and differences among first authors within the SDG 5-oriented research. Compared with the other 16 SDGs, the field of the SDG 5 produces relatively fewer scientific publications, with most of the first authors being female. Regarding the aim, Zhang et al., ( 2022a , 2022b ) find that male researchers more often value and engage in research geared mainly toward scientific progress, which is more cited. However, female researchers more often value and engage in research mainly aimed at contributing to societal progress, which has more abstract views (usage). Regarding language use, some researchers give considerable attention to writing style (Lerchenmueller et al., 2019 ; Newman et al., 2008 ; Urquhart-Cronish & Otto, 2019 ). The writing style in academic articles is studied across a wide range of disciplines, including medical and life science (Cao et al., 2021 ; Lerchenmueller et al., 2019 ; Wen & Lei, 2022b ), political science (Weidmann et al., 2018 ), and cross-cultural psychology (Holtz et al., 2017 ). For example, using sentiment analysis to examine the diachronic change in linguistic positivity, Yuan and Yao ( 2022 ) show that academic writing style in research articles in the journal science has become significantly more positive in the past 25 years.
Several earlier studies examine differences between the writing styles of male and female authors. According to some studies, gender differences exist in writing style, including levels of readability and concreteness (DeJesus et al., 2021 ; Joshi et al., 2020 ; Kolev et al., 2019 ), the extent of self-promotion (Cheng et al., 2017 ; Scharff, 2015 ), and the use of positive words (DeJesus et al., 2021 ; Lerchenmueller et al., 2019 ). By examining how gender differences affect the presentation of scientific research in positive ways, Lerchenmueller et al. ( 2019 ) discover that authors use more positive words to describe their research in scientific titles and abstracts, including “novel,” “unique,” “unprecedented,” etc. Furthermore, Dehdarirad and Yaghtin ( 2022 ) report that women use fewer positive terms in citing research findings in papers. When citing papers, men were significantly more likely to use positive terms.
We summarize research on gender differences in research performance and writing style. Previous studies have also examined gender differences in research performance, but they tend to focus more on revealing the phenomenon than explaining its underlying mechanisms. In addition, although many studies have concentrated on writing style, very few have addressed the relationship between author gender and writing style. Meanwhile, no studies have examined the relationship between writing style and research performance. In this paper, we attempt to explain the gender differences in research performance through the writing style.
Academics of different genders exhibit varying degrees of confidence in their academic work. A gender-based “confidence gap” in medicine is characterized by differences between performance and self-concept (i.e., how an individual sees himself) (Heath et al., 2022 ). Despite similar performance metrics, women consistently self-assess themselves as lower than men (Hoops et al., 2019 ; Meyerson et al., 2017 ; Sawdon & Finn, 2014 ). Women in various fields, including science, engineering, economics, athletics, and academia, report low self-esteem and self-confidence regardless of their abilities or competencies (Hubble & Zhao, 2016 ; Lerchenmueller et al., 2019 ). Females tend to have lower levels of confidence (Dunn et al., 2021a , 2021b ; Walker, 2020 ), and are also routinely less confident in their abilities and products than their male peers (Beyer & Bowden, 1997 ; Huang, 2013 ; Instone et al., 1983 ; Stankov & Lee, 2014 ) in math and science domains (Ehrlinger et al., 2018 ; Ellis et al., 2016 ; Else-Quest et al., 2010 ; Micari et al., 2007 ).
Across two preregistered studies with more than 900 active researchers in psychology, Dunn et al. ( 2021a , 2021b ) show that more self-confident researchers select larger prior means, in part due to gender differences in researcher self-confidence. Furthermore, women express significantly lower confidence than men, which leads to lower confidence in their work quality than their male peers (despite performing equally well on the test) (Ehrlinger & Dunning, 2003 ). Therefore, we propose that the writing style of articles with female first authors is less positive than that of articles with male first authors. Here is the hypothesis.
The writing style of articles with female first authors is less positive than that of articles with male first authors.
Writing style influences research impact (Morris et al., 2021 ; Parsons & Baglini, 2021 ). For example, using regression analysis and pairwise comparisons, Dehdarirad and Yaghtin ( 2022 ) show that male-authored papers receive a significantly higher positive sentiment compared with female-authored papers. Parsons and Baglini ( 2021 ) point out the importance of neutral language in peer review and provide examples of nonneutral linguistic and stylistic devices that emphasize a reviewer's personal response to the manuscript rather than their objective assessment. Back to writing style, female authors use fewer positive words in their academic writing, and their writing style is less positive. Referring to the “self-confidence effect”, self-confidence predicts success in the future (Meisha & Al-dabbagh, 2021 ). An article with a positive writing style reflects the writer's confidence, and one with a more confident expression is more likely to be approved by the reader. We, therefore, propose that writing style mediates the positive effect of gender differences on research performance. Here is the hypothesis.
The writing style of articles mediates the positive effect of gender differences on research performance.
The procedures of data processing are presented in Fig. 1 .
Framework for data collection and processing
Top journals are more influential and representative, which means a high position in the research system (Mauleon & Bordons, 2006 ; Mayer & Rathmann, 2018 ; Nielsen, 2017 ). For our study, we select the top four journals in the marketing field. There have been no previous studies on the research performance of female scholars in leading journals. Although there are many high-quality marketing journals, four journals have been selected for this study: Journal of Consumer Research (JCR), Journal of Marketing (JM), Journal of Marketing Research (JMR), and Marketing Science (MS). Among the leading marketing journals in the world, these four are widely recognized (Bauerly & Johnson, 2005 ; Stremersch & Verhoef, 2005 ; Tellis et al., 1999 ; Yoo, 2009 ).
Through the years, bibliometric studies have designed several methodologies to analyse scholarly output (Halevi, 2019 ). The article information data is obtained from the Web of Science (WoS), including the article, author, journal, and affiliation. We collected articles from the four journals Footnote 1 founded throughout 84 years, from 1936 to 2021. Footnote 2 To minimize the potential effect of a time interval on measuring the impact of publications, all the data were collected once on October 16, 2021. The corpus consists of 9,820 research articles (see supplementary materials for the descriptive statistics). We download full text from EBSCO. Footnote 3
Determining the first author researcher’s gender, why the first author.
According to Baerlocher et al. ( 2007 ), the order of the authors’ names appearing in a paper generally indicates the extent to which each author contributed to the work (Larivière et al., 2016 ). It is not easy to quantify the contributions of each author. Current studies examining the relationship between authorship characteristics and article impact tend to focus on specific author positions, such as first authors, last authors, corresponding authors, senior authors, and so on (Skitka et al., 2021 ).
The first author is typically the one who leads the research and writing process. Most bibliometric studies focus on the first author in the current literature (Decullier & Maisonneuve, 2021 ; Jemielniak et al., 2022 ; Liu et al., 2022 ; Nguyen et al., 2021 ; Thelwall & Maflahi, 2022 ; Thelwall & Mas-Bleda, 2020 ; Thelwall et al., 2019 ; Thelwall, 2018 , 2020a , 2020b ). Shang et al. ( 2022 ) explore gender balance and differences among first authors within the SDG5-oriented research during the first five years after the implementation of the SDG5 in 2016. According to Zhang et al. ( 2021 ), there is an upward trend in the number of articles with a Chinese first author in international journals. Considering female and male first authors, Fox and Paine ( 2019 ) test whether gender predicts the outcomes of editorial and peer review for > 23,000 research manuscripts submitted to six journals in ecology and evolution from 2010 to 2015. Zeina et al. ( 2020 ) analyze the relationship between the first author’s gender, ethnicity, and the chance of publication of rapid responses in the British Medical Journal (BMJ).
Besides, researchers have also considered authors in other positions when considering collaboration between genders. For example, the last author and the first author are often followed simultaneously (Sebo & Clair, 2023 ). Lerchenmueller et al., ( 2019 ) analyze whether men and women differ in how positively they frame their research findings and analyze whether the positive framing of research is associated with higher downstream citations. Specifically, they estimate the relative probability of positive framing as a function of the gender composition of the first and last authors. Andersen et al. ( 2020 ) report the results of an analysis that compares the gender distribution of authors in 1893 medical papers related to the pandemic with that papers published in the same journals in 2019, for papers with first authors and last authors from the United States. Research in pharmaceuticals and life sciences generally employs this approach.
In addition, some studies have also focused on other authors, such as corresponding authors (Edwards et al., 2018 ; Fox & Paine, 2019 ), senior authors (Polanco et al., 2020 ; Powell et al., 2022 ), solo authors (Nunkoo et al., 2020 ), middle authors, and mentee authors (Lopez-Padilla et al., 2021 ), co-first, senior, and co-senior authors (DeFilippis et al., 2021 ). While different types of other authors are taken into consideration, the first author is one that is emphasized by almost all authors. For example, Powell et al. ( 2022 ) investigated trends in female authorship in three journals over the past 25 years by using data for both first and senior authors. Lopez-Padilla et al. ( 2021 ) determine the changing patterns in gender differences and factors associated with the positioning of authors. They analyzed in four scenarios: first authors, last authors, middle authors, and mentee authors.
First authors play a significant role in bibliometric studies, and their importance cannot be overstated. In addition, since the sample articles in this study are mainly from marketing journals, the authors are not generally arranged alphabetically in the marketing field. In this study, we use the first authors to represent the gender attribute of a paper, considering those researchers make major contributions to scientific publications (Shang et al., 2022 ). We are concerned about articles with a female first author (AFFA).
Considering part of the authors’ names are abbreviated previously in WoS. To improve the quality of the authors’ names used in our study, we further conduct author disambiguation procedures. We obtain the authors’ full names from the Crossref Footnote 4 database using the DOI number of the article. After the name disambiguation, we get the first names of all authors. Code and a data demo are provided to demonstrate how we obtained this information at OSF: https://osf.io/bw8gx/ .
Gender identification is an enormous challenge, given that bibliographic data does not reveal it (Halevi, 2019 ). New bibliometric literature applying various gender-determination methods to authors and authorships (Elsevier, 2020 ; Halevi, 2019 ) provides new data-driven insights into gender disparities in science. Like other studies (Shang et al., 2022 ), the binary genders are considered and used in our analysis as well (Santamaría & Mihaljević, 2018 ). If no gender information could be inferred from an author, the gender was considered unknown (Shang et al., 2022 ).
A person’s first name can be a strong signal of his/her gender (Liu & Ruths, 2013 ). Zeina et al. ( 2020 ) analyze the relationship between the first author's gender estimated from the first name and the chance of publication of rapid responses in the British Medical Journal. For each author in our sample, we use a new model architecture to identify the author’s gender. The gender classifier is implemented using Character-level Multilayer long short-term memory (LSTM). It depends on NumPy, Scipy and, TensorFlow, Python packages for scientific computing. We use training data that a million names with gender annotation obtained from different countries. The architecture is as follows: Character Embedding Layer, 1st LSTM Layer, 2nd LSTM Layer, Pooling Layer, and Fully Connected Layer. The fully connected layer outputs the probability that a name is a male name. TensorFlow is used to build a character-level multi-layer LSTM neural network for machine learning, and a Python program is written for scholars’ gender prediction. This model predicts gender by importing the names of scholars without surnames, returning the probability estimates of their genders, and classifying the genders. The recall and precision rates are 94.0/93.5% for men and 91.8/97.8% for women, resulting in an F1 score of 0.95 for men and 0.93 for women. Given the high F1 score, the threshold of ≥ 0.85 (equivalent to a Gender Probability Score ≥ 1.735) is used to infer gender (Elsevier, 2020 ).
In addition, the gender of these individuals is determined by associating each author’s first name with the probability of the name being held by a man versus by a woman, using the Genderize database. Footnote 5 Researchers evaluate four gender assignment algorithms, using a control sample of gender-matched forenames from a U.S. government office, and find that the Genderize algorithm provided the most accurate gender assignment results. Applying a 90% probability threshold to the Genderize algorithm’s gender designation yields the same determination with which gender can be predicted in our dataset for analysis (Lerchenmueller et al., 2019 ).
We conduct a random selection of 500 first authors to demonstrate the accuracy of our gender determination method. Using the authors’ e-mail addresses, we manually collect the gender of these 500 authors by visiting their websites (we show the screenshot of these websites in the supplementary materials). The results of this analysis are then compared with the results of gender prediction calculated using machine learning. The results show that the coefficient of Cohen’s Kappa is 0.881, indicating a good agreement (Zhu et al., 2020 ). This also confirms the reliability of the prediction approach.
Productivity and impact are the two most important indicators of research performance across institutions (Larivière & Costas, 2016 ). Usually, citation counts and the number of publications published in scholarly journals are used to evaluate the research performance (Ghiasi et al., 2015 ; Zhang et al., 2020 ). Research performance is often determined by the number of citations that are cited as a result of the findings being read, used, applied, built upon, and cited by other researchers (Harnad et al., 2008 ). We regard the number of citations to be a measure of research performance (Jiang et al., 2018 ; Zhu et al., 2021 ).
We quantify the positive writing style based on the words in titles, abstracts, and full papers. To ensure that all data are full and available, the corpus consists of 5,431 research articles dating with a total of 72,971,482 words (see the descriptive statistics in Table 1 ). Titles and abstracts represent some of the most important text in research papers, as readers often use these to screen articles to determine which ones deserve further attention (Lerchenmueller et al., 2019 ). We conduct the investigation on the full texts to gain a holistic understanding of academic writing, which yields more reliable and generalizable results than those studies analysing only abstracts (Yuan & Yao, 2022 ).
Considering the limitations of the small list of positive and negative words, many researchers adopt self-created dictionaries (Holtz et al., 2017 ), expand lists of positive and negative words (Bordignon et al., 2021 ), or use sentiment analysis with large lexicons in R (Wen & Lei, 2022a ) to triangulate the results based on the small list of positive and negative words (Vinkers et al., 2015 ). Besides, it is extremely difficult to map the trajectory of discrete emotions using traditional survey methods due to their intensity and transience (Barsade & Gibson, 2007 ). Due to the advancement of automated text mining technology, some recent studies have begun to use advanced sentiment analysis techniques (Min et al., 2021 ). Due to BERT’s exceptional understanding of the relationship between words and its ability to understand context, fine-tuning BERT is more accurate than traditional Linguistic Inquiry and Word Count based SVMs (EmoLex) (Min et al., 2021 ). To capture whether the articles’ writing style is positive, we deploy fine-tuned BERT algorithms (Kumar et al., 2020 ; Min et al., 2021 ). BERT is an open-source deep learning model that is designed to perform well in a variety of natural language processing tasks (Devlin & Billings, 2018 ).
We use deep learning-based classification models to predict each paper’s PWS. Formally, let \({x}_{i}\) be text content of article i, and \({f}^{e}({x}_{i})\) represents a binary classifier for PWS. Then, the predicted label of \({x}_{i}\) for the writing style e becomes:
The binary classifiers, \({f}^{e}\left({x}_{i}\right)\) are constructed by training the fine-tuned BERT models. The BERT base model has 12 layers of transformer blocks (see Fig. 2 ).
We train the fine-tuned BERT models with the open-sourced TensorFlow implementation for BERT Footnote 6 and the pre-trained weights from the PyTorch port built by Hugging Face. Footnote 7 We also open the complete code used in our study's data collection and processing framework at OSF: https://osf.io/bw8gx/ . The main components include code for training/inferencing the fine-tuned BERT models.
In addition, to eliminate other factors that may affect the author’s writing style and authorial impact of the essay, we control for factors related to the articles' writing style and research impact, including many factors at the author level, article level, journal level, and affiliate level, that may influence articles’ research performance. Specifically, on the author level, publication productivity is a primary criterion for tenure and promotion in academia (Rigg et al., 2012 ). A more published author will be less pressured to create new articles and be more confident in their writing abilities. We control the author’s preview publications in these top journals. The collaboration influences the research impact (Abramo et al., 2019a ; Liu et al., 2022 ), and writing style of a manuscript is not only dependent on or determined by its first author, but also most likely by other authors. We, therefore, control the presence of men in the author team due to the influence of male authors.
On the article level, the length of the text influences the research impact (Arkin et al., 2019 ; Huang et al., 2020 ). Furthermore, the length of the text may dilute its stylistic features (dilution effect). We control the length of the abstract as well as the full article was controlled (Zeina et al., 2020 ). Compared to male authors, women tend to use fewer positive terms when citing research findings from papers composed of the same gender (Dehdarirad & Yaghtin, 2022 ). In general, the more references that are used, the greater the impact on the overall writing style. So, we also control the number of references used.
We also control variables on the journal level (Fernández et al., 2020 ; Lerchenmueller et al., 2019 ; Zeina et al., 2020 ). An examination of the relationship between the impact of a journal and the citation of an article, with the impact of a journal varying from year to year. Accordingly, we use the journal’s impact factor for the corresponding year as a control variable. Moreover, different journals are positioned differently, and their articles are written differently. For example, Marketing Science focuses primarily on articles that answer important research questions in marketing using mathematical modeling. Footnote 8 The Journal of Consumer Research publishes scholarly research that describes and explains consumer behavior. Footnote 9 Finally, since journal style is difficult to quantify, as well as other characteristics of journals that may be overlooked, we add a journal fixed effect to the model.
We also control variables on the affiliate level (Fernández et al., 2020 ; Jiang et al., 2018 ; Liu et al., 2022 ). Research quality is affected by English language proficiency (Zhang et al., 2022a , 2022b ). In non-English-speaking countries, editorial services are becoming increasingly popular, which means that non-English-speaking authors are using these services more frequently. Editorial services obviously affect the language proficiency of the article, so we control the affiliate language.
Finally, since the dataset of this study covered a long period of time, and there has been a significant improvement in academic writing in the past 25 years (Yuan & Yao, 2022 ), it is necessary to add the year fixed effect. We summarize all variables used in Table 2 . Table 3 and Table 4 describe our samples with descriptive statistics and the correlation. It should be noted that because of the discrete lognormal distribution of data, we use the natural logarithms of some measurements as variables, including citations, publications, and so on.
This study examines gender inequalities in marketing between males and females. Referring to previous studies (Powell et al., 2022 ; Shang et al., 2022 ), we regard the number of female authors, the number of articles with female first authors, and the research performance of articles with female first authors.
Firstly, we calculate the percentage of female authors in all articles published in the top four journals for each year. In the period 1936–2021, there was a rise in the number of authors publishing papers in the top four journals. The percentage of women authors is just 0.10 in 1936, and there is only one female for every nine authors. The percentage of women in 2021 is 0.40, and four women out of every ten authors are women. Figure 3 reveals that female researchers are increasingly publishing articles in leading marketing journals. By comparing the trend of female authors in the top four marketing journals between 1936 and 2021, we find that the proportion of female authors has grown. But in general, the number of female authors published in the top four marketing journals each year is still less than that of male authors. Consistent with previous studies, our study proves that gender differences between men and women still exist in marketing.
Percentage of female authors
We look at the trend in the percentage of AFFA. As a result, for each year, we calculate the percentage of AFFA among all authors who published articles in the top four journals. There is an increase in the annual trend for the percentage of the article with a female first author (AFFA) in the top four journals between 1936 and 2021. There was only one AFFA in every 20 articles in 1936, and the percentage of AFFA was 0.10. By 2021, the percentage of AFFA increased to 0.50, and there were 92 AFFA in 184 articles. Results show that, in marketing, more and more AFFA are published in top journals, as illustrated in Fig. 4 . The number of articles published in the top four marketing journals per year is lower for AFFA than for male first authors. Our study confirms the existence of gender differences in marketing, consistent with previous research. While our results show an increase in the annual trend for the percentage of AFFA in top marketing journals, these results are only indicative of the increase in female researchers’ research performance. It is interesting to compare the quality of the articles written by female researchers and the contribution made by female researchers. We further compare the research performances of AFFA.
Percentage of AFFA
From 1936 to 2021, we compare the impact of AFFA in the four top marketing journals. We calculate the percentage of the citations of AFFA among the citations of all articles in the top four journals yearly. There has been an increase in the impact of AFFA papers published in the four top journals between 1936 and 2021. In 1936, there was a 0.00 percent of AFFA among the sum citations of all articles. Accordingly, the impact of AFFA in the sum citations of all articles increase to 0.32 in 2021. According to the results, the impact of AFFA published in top journals in marketing is increasing, see Fig. 5 . Qualitatively, this result indicates that the quality of the impact of AFFA is improving. This indicates that female researchers are performing better in their research. In the four top marketing journals, AFFAs receive fewer citations than articles with male first authors each year. Our study again demonstrates that gender differences still exist in marketing, consistent with previous research.
Percentage of AFFA citations
It is necessary to disambiguate the authors according to their names, affiliations, publications, etc. To better understand the observed gender differences in the research performance of AFFA, we use Ordinary Least Squares (OLS) regressions in STATA 17 to detect the differences in research performance after other variables are added to the models.
To explore the relationship between the articles’ impact and the author's gender, we estimate the following baseline model:
where i represents the article, and t represents the year. \({Impact}_{it}\) represents the research of article ( i ) in the year ( t ). \({Gender}_{it}\) is a dummy variable coded 1 for the female author and 0 for the male author. Our control variables are based on the variables we analyzed above. As the dependent variable in our data is compressed at 0 for some observations, we employ the Tobit model (Zhu et al., 2022 ).
To examine the mechanism for the articles’ impact, we use a modified version of Baron & Kenny’s ( 1986 ) three-step mediation test proposed by Zhao et al. ( 2010 ), in which the Sobel test is replaced by bootstrap (Zhu et al., 2022 ). To enhance the diversity of analytical methods, we also use the Monte Carlo method (Li et al., 2021 ; Selig & Preacher, 2008 ) with 50,000 bootstrapping samples. The mediation effect model consists of the following components:
where \({WS}_{it}\) is the writing style of the article of article ( i ) in the year ( t ).
Model 1, Model 2, and Model 3 in Table 4 report regression results where the dependent variable is impact. Model 1 includes only gender. Model 2 adds control variables at the author level, article level, journal level, and affiliate level. Model 3 adds all control variables and includes the year, journal publisher, and country fixed effects.
In Table 5 , the coefficient on gender is negative and significant across all three models, suggesting that gender is negatively associated with impact. For example, the coefficient on gender in Model 3 equals -0.0583 ( p =− 0.035). There is a significant negative correlation between full-text length, reference, and impact. However, there is no significant correlation between other control variables and impact. Baseline results supported H1.
In the next step, we analyse writing style of the articles in order to explain the reasons for the differences in impact between male and female authors.
The dependent t-test indicated that articles with a male first author had a more positive writing style than those with a female first author (M female = 0.52, SD = 0.63 vs. M male = 0.89, SD = 0.66, t (5430) = 3.693, p = 0.000). H2 is supported.
The next step will be directly verifying the mediating role of WS. The estimation results for anxiety are reported in Table 6 . Column (1) shows that the coefficient on gender is negative and significant across all three models, suggesting that gender is negatively associated with impact ( β = -0.0583, p = 0.035). H1 is supported again. Columns (2) indicate that the coefficient on gender is negative and significant across all three models, suggesting that gender is negatively associated with WS ( β = − 0.0294, p = 0.079), and H2 is supported again. In column (3), gender also has a significantly negative relationship with impact with less coefficient ( β = − 0.0313, p = 0.051), and WS has a positive effect ( β = 0.0787, p = 0.0016) on impact. The mediation effect of WS is significant for the articles’ impact. H3 is supported.
We used the Monte Carlo method (Li et al., 2021 ; Selig & Preacher, 2008 ) with 50,000 bootstrapping samples, and results supported the mediating effect of WS on the relationship between author gender and research impact (estimate = − 0.75, 95% CI [− 0.0280, − 0.0082]). Results supported the mediating effect of WS. H3 is supported again.
As a means of further enhancing the stability of this paper’s findings, we conduct a set of robustness checks.
Based on our hypothetical derivation, the percentage of female authors (0–100%) was used as a proxy measure of gender, taking into account the role of authors on other positions. We predicate that the lower the gender ratio (0–100%) in the author team, the greater the impact of the article.
We determine the percentage of female authors based on the count of all authors in each article, and the female percentage is calculated as follows:
where Female percentage is the index of the article i ’s female authors percentage, and Female authors is the count of female authors in the article i . Total authors is the total number of authors in the article i .
To explore the relationship between the articles’ impact and the author’s gender, we use the same models (1), but we use the female percentage as the independent variable.
Model 1, Model 2, and Model 3 in Table 7 report regression results where the dependent variable is impact. Model 1 includes only the female percentage. Model 2 adds control variables at the author level, article level, journal level, and affiliate level. Model 3 adds all control variables and includes the year, journal publisher, and country fixed effects.
In Table 7 , the coefficient on the female percentage is negative and significant across all three models, suggesting that the female percentage is negatively associated with the impact. For example, the coefficient on gender in Model 3 equals − 0.802 (p = 0.008). There is a significant negative correlation between the full-text length, the reference, and the impact. However, there is no significant correlation between other control variables and the impact. Baseline results support H1 again.
In the next step, we analyse the writing style of the articles in order to explain the reasons for the differences in impact between male and female authors.
The next step will be directly verifying the mediating role of WS. The estimation results for anxiety are reported in Table 8 . Column (1) shows that the coefficient on gender is negative and significant across all three models, suggesting that gender is negatively associated with impact ( β = − 0.802, p = 0.008). H1 is supported again. Columns (2) indicate that the coefficient on gender is negative and significant across all three models, suggesting that gender is negatively associated with WS ( β = − 0.494, p = 0.027), and H2 is supported again. In column (3), gender also has a significantly negative relationship with impact with less coefficient ( β = − 0.579, p = 0.011), and WS has a negative effect ( β =− 0.494, p = 0.027) on impact. The mediation effect of WS is significant for the articles’ impact. H3 is supported.
We used the Monte Carlo method (Li et al., 2021 ; Selig & Preacher, 2008 ) with 50,000 bootstrapping samples, and results supported the mediating effect of WS on the relationship between author gender and research impact (estimate = − 0.11, 95% CI [− 0.1201, − 0.0562]). Results supported the mediating effect of WS. H3 is supported again.
A lesser-known form of cultural bias called masculine defaults must be recognized to understand and remedy women’s underrepresentation in majority-male fields and occupations (Cheryan & Markus, 2020 ).
Masculinity and femininity oppose ego goals with social goals. While masculinity is characterized by competition, achievement, assertiveness, and success, femininity relates to cooperation, helping others, sharing, empathy, and solidarity. A feminist culture emphasizes modesty and subtlety, while a masculine culture emphasizes selfishness and competition (Hofstede, 2001 ). Regarded masculinity and femininity (Hofstede, 2001 ), we propose that masculinity and femininity influence the article’s impact. According to our conclusions, we predict that there is a significant difference between the impact of articles with different gender authors in the context of feminist culture and that of masculinist culture. The impact of articles with first authors from a feminine country is lower than that of articles with first authors from a masculine country.
Using a common approach to verification mediation through manipulation of conditioning in psychology and management (Fishbach et al., 2006 ; Huang et al., 2017 ; Salerno et al., 2019 ; Woolley & Risen, 2021 ; Yani-de-Soriano et al., 2019 ), people’s attitudes or behaviours are observed to change accordingly by affecting conditions related to psychological mechanisms using natural or experimental stimuli. A psychological mechanism is then indirectly validated. If our proposed psychological mechanism for writing style holds, then our prediction will be true. H1, H2, and H3 are supported.
Using the author’s e-mail address, we acquired each researcher’s affiliation list and extracted corresponding country information. To determine the researcher’s affiliation country of origin where the institution is located, we adopt the method used by (Boekhout et al., 2021 ; Shang et al., 2022 ). Three steps were taken: (1) For researchers with affiliations from only one country, the country is marked as the researcher's country of origin. (2) For a researcher with affiliations from more than one country, if the country most often associated with the researcher in their publications coincided with the country associated with the researcher in their first publication, then this country is considered the researcher's country of origin. Otherwise, we regard the evidence as insufficient to determine a single country of origin (Shang et al., 2022 ). (3) Referring to Hofstede Insight, Footnote 10 we calculate the masculinity score for each country.
To explore the relationship between articles’ impact and masculinity scores for affiliates, we use the same models (1), but we use the masculinity scores (masculinity scores for the country of the author’s masculinity) as the independent variable.
Model 1, Model 2, and Model 3 in Table 10 report regression results where the dependent variable is impact. Model 1 includes only masculinity scores. Model 2 adds control variables at the author level, article level, journal level, and affiliate level. Model 3 adds all control variables and includes the year, journal publisher, and country fixed effects.
In Table 9 , the coefficient on the masculinity score is positive and significant across all three models, suggesting that the masculinity score is positively associated with impact. For example, the coefficient on gender in Model 3 equals 0.101 ( p = 0.031). There is a significant negative correlation between abstract, full-text length, reference, and impact. However, there is no significant correlation between other control variables and impact. Baseline results support H1.
In the next step, we analyse the writing style of the articles in order to explain this effect.
The dependent t-test indicated that articles with a first author from high masculinity country (masculinity scores > 50) had a more impact than those with a first author from low masculinity country (masculinity scores < 50) (M low masculinity scores = 0.82, SD = 0.63 vs. M high masculinity scores = 0.89, SD = 0.66, t (5430) = 3.693, p = 0.000). H1 is supported. The dependent t-test indicated that articles with a first author from high masculinity country have a more positive writing style than those with first author from low masculinity country (M low masculinity scores = 1.42, SD = 0.63 vs. M high masculinity scores = 2.76, SD = 0.66, t (5430) = 5.693, p = 0.000). H2 is supported.
The next step will be directly verifying the mediating role of WS. The estimation results for anxiety are reported in Table 10 . Column (1) shows that the coefficient on masculinity scores is positive and significant across all three models, suggesting that masculinity scores are positively associated with impact ( β = 0.101, p = 0.031). H1 is supported again. Column (2) indicates that the coefficient on masculinity scores is negative and significant across all three models, suggesting that the masculinity score is positively associated with WS ( β = 0.117, p = 0.048), and H2 is supported again. In column (3), the masculinity score also has a significantly positive relationship with impact ( β = − 0.0831, p = 0.063), and WS has a positive effect ( β = 0.0747, p = 0.003) on impact. The mediation effect of WS is significant. H3 is supported.
We use the Monte Carlo method (Li et al., 2021 ; Selig & Preacher, 2008 ) with 50,000 bootstrapping samples, and results support the mediating effect of WS on the relationship between author masculinity scores and research impact (estimate = 0.14, 95% CI [1.0280, 1.7102]). Results support the mediating effect of WS. H3 is supported again.
Several studies use a small list of predefined positive/negative words to examine the linguistic positivity bias (Lerchenmueller et al., 2019 ; Vinkers et al., 2015 ; Weidmann et al., 2018 ). Following Lerchenmueller et al., ( 2019 ), we explore gender differences in the use of each of these 25 positive words that are used in life science (we show this all 25 positive words in the supplementary materials).
There is no doubt that titles and abstracts are among the most important text in research papers since readers often use this information to determine which articles deserve further investigation (Lerchenmueller et al., 2019 ). We focus on the frequency of these 25 positive words that are used in all papers’ abstracts or titles. To ensure that all data are full and available, the corpus consists of 5,431 research articles (see Table 5 for the descriptive statistics).
To determine whether men and women differ in the positive presentation of their research, we use the percentage of these 25 positive words ( Positive words ) based on the count of words in each article. Due to the right-skewed nature of the data, this research transforms the data by taking the logarithm. The Positive words are calculated as follows:
where Positive words is the index of the article i ’s percentage of these 25 positive words, and Positive words is the count of these 25 positive keywords in the abstract or title of the article i . Total Words is the total number of words in the abstract or the title of the article i .
To explore the relationship between the articles’ impact and the author's gender, we use the same models (2–4), but we use positive words as the mediator.
The dependent t-test indicated that articles with a male first author had a greater impact than those with a female first author (M female = 0.78, SD = 0.45 vs. M male = 0.91, SD = 0.71, t (5430) = 4.527, p = 0.000). H1 is supported. The dependent t-test indicated that articles with a male first author use more positive words than those with a female first author (M female = 1.26, SD = 0.69 vs. M male = 1.38, SD = 0.45, t (5430) = 3.693, p = 0.000). H2 is supported.
The next step will be directly verifying the mediating role of Positive words . The estimation results for anxiety are reported in Table 11 . Column (1) shows that the coefficient of gender is negative and significant across all three models, suggesting that gender is negatively associated with the impact ( β = − 0.0583, p = 0.035). H1 is supported again. Columns (2) indicate that the coefficient on gender is negative and significant across all three models, suggesting that gender is negatively associated with Positive words ( β = − 0.0798, p = 0.076), and H2 is supported again. In column (3), gender also has a significantly negative relationship with impact with less coefficient ( β = − 0.0598, p = 0.018), and Positive words has a positive effect ( β = 0.186, p = 0.000) on impact. The mediation effect of Positive words is significant for the articles’ impact. H3 is supported.
We used the Monte Carlo method (Li et al., 2021 ; Selig & Preacher, 2008 ) with 50,000 bootstrapping samples, and results supported the mediating effect of Positive words on the relationship between author gender and research impact (estimate = − 0.88, 95% CI [− 0.0280, − 0.0102]). Results supported the mediating effect of Positive words . H3 is supported again.
In order to address gender disparities and improve women’s status, the UN proposes promoting “gender equality” as one of the SDGs. This study is piqued by an aim to underpin current global efforts to promote gender diversity in studies, which matters for the achievement of gender equality in research and society.
An analysis of the 86 year 9820 articles from the top four leading journals in marketing from 1936 to 2021 is presented in this study. Our conclusions are as follows. We draw four main conclusions from our analysis. Firstly, we find that female authors have an increasing academic status in marketing, as evidenced by their number, publications, and influence. However, there are still gender differences between men and women, which is in line with previous research (Elsevier, 2017 ; Huang et al., 2020 ; Lariviere et al., 2013 ). Secondly, by combining the study of writing style and assertiveness, we find that articles with female first authors have a more negative language style than those with male first authors. In addition, the positive writing style of the articles explains the gender differences in research performance. Thirdly, in the robustness check, we find that masculinist and feminist cultural traits moderate the effect. Compared to the articles whose first authors originate from feminist culture emphasizing modesty, the articles whose first authors originate from masculinist culture emphasizing competition have a greater impact.
We make three contributions to the literature in this paper. Firstly, focusing on the top four marketing journals, we find that although female scholars are becoming more academically prominent, gender differences between men and women still exist. Previous studies have focused on STEM and medicine (Elsevier, 2017 ; Huang et al., 2020 ; van Arensbergen et al., 2012 ), and we complement the study of gender differences in research performance in marketing.
Furthermore, we explain the differences between male and female scholars on research performance by combining studies on confidence and writing style. On the one hand, these studies typically consider only descriptive variables such as age, country, institution, productivity, etc. (Lopez & Pereira, 2021 ; Myers et al., 2020 ; Restrepo et al., 2021 ). In this paper, we discuss the writing style and promote research in this area. On the other hand, previous studies have failed to investigate the underlying mechanisms of gender differences in research performance (Fox & Paine, 2019 ; Horbach et al., 2022 ), and we accounted for gender differences by examining the writing style.
Additionally, previous research on research performance differences has rarely focused on cultural differences (Cheryan & Markus, 2020 ; Khosrowjerdi & Bornmann, 2021 ). In a robustness check, we find the effects are moderated by the culture of masculinism and feminism. The authors’ articles have a greater impact in a masculinist culture than in a feminist culture. However, we find a correlation between Hofstede’s masculinity and femininity cultural dimension and research performance. We contribute to the study of research performance and cultural differences, but it needs to be further investigated.
Finally, we contribute to the method of analyzing writing style. The latest studies resort to larger dictionaries and lexicons to tackle the limitation of the small list of positive and negative words (Bordignon et al., 2021 ; Holtz et al., 2017 ; Vinkers et al., 2015 ; Wen & Lei, 2022a ). We use advanced sentiment analysis techniques (Min et al., 2021 ). Consistent with Min et al. ( 2021 ) in organizational behavior, we also find that fine-tuning BERT enhanced the extraordinary understanding of the relationship between words and BERT’s ability to understand the context of the original sentence in marketing. We share the data, code, and stimuli at OSF: https://osf.io/bw8gx/ . This article uses the latest deep learning algorithm to promote the research of big data analysis methods in marketing research and provides method guidelines and references for future research on the writing style of the article.
The findings of this study are practical in nature. To achieve gender equality, academics must put forth a concerted effort. We find that, despite the persisting gaps in performance between men and women, the academic status of women has significantly improved. Based on these results, we offer theoretical insights to reduce gender differences. Despite the gender differences that have been identified by studies, we propose a method to boost the research performance of women researchers. Women can be more confident and active in writing articles, which helps increase the article's impact.
But it should be more cautious about the managerial implications (Cao et al., 2021 ; Millar et al., 2019 ; Yuan & Yao, 2022 ). Research is based on scientific evidence and rigorous logic to seek truth and facts. The best way to publish a paper with high impact is to improve the quality of this research. Our findings encourage authors to collaborate and express more actively while maintaining scientific rigor and accuracy.
In spite of the fact that all of our research hypotheses are confirmed, there are still some limitations to our study with robustness. First, we use the gender of the first author to represent the gender attribute of a paper (Decullier & Maisonneuve, 2021 ; Jemielniak et al., 2022 ; Liu et al., 2022 ; Nguyen et al., 2021 ; Thelwall & Maflahi, 2022 ; Thelwall & Mas-Bleda, 2020 ; Thelwall et al., 2019 ; Thelwall, 2018 , 2020a , 2020b ), a set of robustness check improve the robustness of findings. But it should be noted that a manuscript has also been edited/revised by other authors before it is submitted and published. That is, the writing style of a manuscript is not only dependent on or determined by its first author, but also most likely by other authors. There is also a need to consider the contribution and the impact of the authors in other positions in the article, such as the last authors (Andersen et al., 2020 ; Lerchenmueller et al., 2019 ; Sebo & Clair, 2022 ), corresponding authors (Edwards et al., 2018 ; Fox & Paine, 2019 ), senior authors (Polanco et al., 2020 ; Powell et al., 2022 ), solo authors (Nunkoo et al., 2020 ), middle authors, and mentee authors (Lopez-Padilla et al., 2021 ), co-first, senior, and co-senior authors (DeFilippis et al., 2021 ). Based on the foregoing point, we suggest more research needs to pay attention to this point in future research.
Moreover, although our research demonstrates that a positive writing style can have a positive impact on an article's impact, we ignore its negative “backfire”. It is detrimental to incorporate language associated with self-promotion and aggrandization into scientific writing (Morris et al., 2021 ). Our study aims to explain gender differences in academic performance from the perspective of the writing style, and we do not examine this negative “backfire”. Future research should, however, explore the limits and possible inflection points of the effects of the positive writing style. This might help to rectify the problem.
Besides, the correlation between positive words and research performance may be affected by other factors, such as an individual’s race (Palomo et al., 2017 ). The article’s unstructured data, in addition to the positive words, gives us additional information, such as the topic, the methodology, the subject, etc. This study is not able to investigate these factors due to the length of the article and the scope of our research. We intend to combine our findings with other databases to investigate these factors in the future.
Finally, we focus exclusively on marketing. To generalize our findings to other scientific fields, future studies should examine more journals in different fields of study. Meanwhile, please note that the articles used for this study are those published in leading journals with high scientific quality. Further research can determine whether this effect applies to general journals.
These four journals were founded in 1936 (JM), 1964 (JMR), 1974 (JCR), 1982 (MS).
The data of articles in 2021 were collected on October 16, 2021, when data collection was completed. This issue will not be repeated below.
https://web.s.ebscohost.com .
https://www.crossref.org .
Genderize database containing 216 286 distinct names across 79 countries and 89 languages.
https://www.tensorflow.org/official_models/fine_tuning_bert .
https://huggingface.co/transformers .
https://pubsonline.informs.org/page/mksc/submission-guidelines .
https://academic.oup.com/jcr .
https://www.hofstede-insights.com/country-comparison/greece .
Abramo, G., Aksnes, D. W., & D’Angelo, C. A. (2021). Gender differences in research performance within and between countries: Italy vs Norway. Journal of Informetrics , 15 (2), 101144. https://doi.org/10.1016/j.joi.2021.101144
Article Google Scholar
Abramo, G., Cicero, T., & D’Angelo, C. A. (2015). Should the research performance of scientists be distinguished by gender? Journal of Informetrics, 9 (1), 25–38. https://doi.org/10.1016/j.joi.2014.11.002 .
Abramo, G., D’Angelo, C. A., & Di Costa, F. (2019a). The collaboration behavior of top scientists. Scientometrics, 118 (1), 215–232. https://doi.org/10.1007/s11192-018-2970-9 .
Abramo, G., D’Angelo, C. A., & Di Costa, F. (2019b). A gender analysis of top scientists’ collaboration behavior: Evidence from Italy. Scientometrics, 120 (2), 405–418. https://doi.org/10.1007/s11192-019-03136-6 .
Andersen, J. P., Nielsen, M. W., Simone, N. L., Lewiss, R. E., & Jagsi, R. (2020). COVID-19 medical papers have fewer women first authors than expected. Elife, 9 , e58807. https://doi.org/10.7554/eLife.58807 .
Arkin, N., Lai, C., Kiwakyou, L. M., Lochbaum, G. M., Shafer, A., Howard, S. K., Mariano, E. R., & Fassiotto, M. (2019). What’s in a Word? Qualitative and quantitative analysis of leadership language in anesthesiology resident feedback. Journal of Graduate Medical Education, 11 (1), 44–52. https://doi.org/10.4300/JGME-D-18-00377.1 .
Badar, K., Hite, J. M., & Badir, Y. F. (2014). The moderating roles of academic age and institutional sector on the relationship between co-authorship network centrality and academic research performance. Aslib Journal of Information Management, 66 (1), 38–53. https://doi.org/10.1108/Ajim-05-2013-0040 .
Baerlocher, M. O., Newton, M., Gautam, T., Tomlinson, G., & Detsky, A. S. (2007). The meaning of author order in medical research. Journal of Investigative Medicine, 55 (4), 174–180. https://doi.org/10.2310/6650.2007.06044 .
Baron, R. M., & Kenny, D. A. (1986). The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations. Journal of Personality and Social Psychology , 51 (6), 1173–1182. https://doi.org/10.1037/0022-3514.51.6.117365 .
Barsade, S. G., & Gibson, D. E. (2007). Why does affect matter in organizations? Academy of Management Perspectives, 21 (1), 36–59. https://doi.org/10.5465/amp.2007.24286163 .
Bauerly, R. J., & Johnson, D. T. (2005). An evaluation of journals used in doctoral marketing programs. Journal of the Academy of Marketing Science, 33 (3), 313–329. https://doi.org/10.1177/0092070304272052 .
Beyer, S., & Bowden, E. M. (1997). Gender differences in seff-perceptions: Convergent evidence from three measures of accuracy and bias. Personality and Social Psychology Bulletin, 23 (2), 157–172. https://doi.org/10.1177/0092070304272052 .
Boekhout, H., van der Weijden, I., & Waltman, L. (2021). Gender differences in scientific careers: A large-scale bibliometric analysis. arXiv preprint http://arxiv.org/abs/2106.12624 .
Bordignon, F., Ermakova, L., & Noel, M. (2021). Over-promotion and caution in abstracts of preprints during the COVID-19 crisis. Learned Publishing, 34 (4), 622–636. https://doi.org/10.1002/leap.1411
Cao, X., Lei, L., & Wen, J. (2021). Promoting science with linguistic devices: A large-scale study of positive and negative words in academic writing. Learned Publishing, 34 (2), 82–88. https://doi.org/10.1002/leap.1322
Cheng, C., Liu, Y., & Liu, Z. (2017). Empirical likelihood ratio under infinite second moment. Communications in Statistics-Theory and Methods, 46 (14), 6909–6915. https://doi.org/10.1080/03610926.2016.1139135
Article MathSciNet MATH Google Scholar
Cheryan, S., & Markus, H. R. (2020). Masculine defaults: Identifying and mitigating hidden cultural biases. Psychological Review, 127 (6), 1022–1052. https://doi.org/10.1037/rev0000209
Decullier, E., & Maisonneuve, H. (2021). Retraction according to gender: A descriptive study. Accountability in Research, 1–6. https://doi.org/10.1080/08989621.2021.1988576
DeFilippis, E. M., Sinnenberg, L., Mahmud, N., Wood, M. J., Hayes, S. N., Michos, E. D., & Reza, N. (2021). Gender differences in publication authorship during COVID-19: A bibliometric analysis of high-impact cardiology journals. Journal of the American Heart Association , 10 (5), e019005. https://doi.org/10.1161/jaha.120.019005
Dehdarirad, T., & Yaghtin, M. (2022). Gender differences in citation sentiment: A case study in life sciences and biomedicine. Journal of Information Science . https://doi.org/10.1177/01655515221074327
DeJesus, J. M., Umscheid, V. A., & Gelman, S. A. (2021). When gender matters in scientific communication: The role of generic language. Sex Roles, 85 (9–10), 577–586. https://doi.org/10.1007/s11199-021-01240-7
Devlin, M., & Billings, A. C. (2018). Examining confirmation biases: implications of sponsor congruency. International Journal of Sports Marketing & Sponsorship, 19 (1), 58–73. https://doi.org/10.1108/ijsms-10-2016-0078
Diezmann, C., & Grieshaber, S. (2019). Women Professors: Who Makes it and How? Springer.
Book Google Scholar
Dinu, N.-R. (2021). ¿Citan las mujeres investigadoras más a las otras mujeres que a los hombres? Telos Revista de Estudios Interdisciplinarios en Ciencias Sociales, 23 (3), 568–583. https://doi.org/10.36390/telos233.05
Dunn, E. W., Chen, L., Proulx, J. D. E., Ehrlinger, J., & Savalei, V. (2021). Can researchers’ personal characteristics shape their statistical inferences? Personality and Social Psychology Bulletin, 47 (6), 969–984. https://doi.org/10.1177/0146167220950522
Edwards, H. A., Schroeder, J., & Dugdale, H. L. (2018). Gender differences in authorships are not associated with publication bias in an evolutionary journal. Plos One, 13 (8), e0201725. https://doi.org/10.1371/journal.pone.0201725
Ehrlinger, J., & Dunning, D. (2003). How chronic self-views influence (and potentially mislead) estimates of performance. Journal of Personality and Social Psychology , 84 (1), 5–17. https://doi.org/10.1037/0022-3514.84.1.5
Ehrlinger, J., Plant, E. A., Hartwig, M. K., Vossen, J. J., Columb, C. J., & Brewer, L. E. (2018). Do gender differences in perceived prototypical computer scientists and engineers contribute to gender gaps in computer science and engineering? Sex Roles, 78 (1), 40–51. https://doi.org/10.1007/s11199-017-0763-x
Ellis, J., Fosdick, B. K., & Rasmussen, C. (2016). Women 1.5 times more likely to leave STEM pipeline after calculus compared to men: lack of mathematical confidence a potential culprit. Plos One., 11 (7), e0157447. https://doi.org/10.1371/journal.pone.0157447
Else-Quest, N. M., Hyde, J. S., & Linn, M. C. (2010). Cross-national patterns of gender differences in mathematics: A meta-analysis. Psychological Bulletin, 136 (1), 103–127. https://doi.org/10.1037/a0018053
Elsevier. (2017). Gender in the Global Research Landscape: Analysis of Research Performance Through a Gender Lens Across 20 Years, 12 Geographies, and 27 Subject Ares .
Elsevier. (2020). The researcher journey through a gender lens. Mendeley Data . https://doi.org/10.17632/ww6g4t2r32.2
Fernández, A., Ferrándiz, E., & León, M. D. (2020). Are organizational and economic proximity driving factors of scientific collaboration? Evidence from Spanish universities, 2001–2010. Scientometrics, 126 (1), 579–602. https://doi.org/10.1007/s11192-020-03748-3
Fishbach, A., Dhar, R., & Zhang, Y. (2006). Subgoals as substitutes or complements: The role of goal accessibility. Journal of Personality and Social Psychology, 91 (2), 232–242. https://doi.org/10.1037/0022-3514.91.2.232
Fox, C. W., & Paine, C. E. T. (2019). Gender differences in peer review outcomes and manuscript impact at six journals of ecology and evolution. Ecology and Evolution, 9 (6), 3599–3619. https://doi.org/10.1002/ece3.4993
Ghiasi, G., Lariviere, V., & Sugimoto, C. R. (2015). On the compliance of women engineers with a gendered scientific system. Plos One, 10 (12), e0145931. https://doi.org/10.1371/journal.pone.0145931
Gruber, J., Mendle, J., Lindquist, K. A., Schmader, T., Clark, L. A., Bliss-Moreau, E., Akinola, M., Atlas, L., Barch, D. M., Barrett, L. F., Borelli, J. L., Brannon, T. N., Bunge, S. A., Campos, B., Cantlon, J., Carter, R., Carter-Sowell, A. R., Chen, S., Craske, M. G., & Williams, L. A. (2021). The Future of women in psychological science. Perspect Psychol Sci, 16 (3), 483–516. https://doi.org/10.1177/1745691620952789
Ha, G. L., Lehrer, E. J., Wang, M., Holliday, E., Jagsi, R., & Zaorsky, N. G. (2021). Sex differences in academic productivity across academic ranks and specialties in academic medicine: A systematic review and meta-analysis. JAMA Netw Open, 4 (6), e2112404. https://doi.org/10.1001/jamanetworkopen.2021.12404
Halevi, G. (2019). Bibliometric Studies on Gender Disparities in Science . Springer.
Harnad, S., Brody, T., Vallieres, F., Carr, L., Hitchcock, S., Gingras, Y., Oppenheim, C., Hajjem, C., & Hilf, E. R. (2008). The access/impact problem and the green and gold roads to open access: An update. Serials Review, 34 (1), 36–40. https://doi.org/10.1016/j.serrev.2007.12.005
Heath, J. K., Alvarado, M. E., Clancy, C. B., Barton, T. D., Kogan, J. R., & Dine, C. J. (2022). The context of “Confidence”: Analyzing the term confidence in resident evaluations. Journal of General Internal Medicine, 37 (9), 2187–2193. https://doi.org/10.1007/s11606-022-07535-z
Hofstede, G. (2001). Culture’s consequences: Comparing values, behaviors, institutions and organizations across nations . Sage publications.
Google Scholar
Holtz, P., Deutschmann, E., & Dobewall, H. (2017). Cross-cultural psychology and the rise of academic capitalism: Linguistic changes in CCR and JCCP articles, 1970–2014. Journal of Cross-Cultural Psychology, 48 (9), 1410–1431. https://doi.org/10.1177/0022022117724902
Hoops, H., Heston, A., Dewey, E., Spight, D., Brasel, K., & Kiraly, L. (2019). Resident autonomy in the operating room: Does gender matter? The American Journal of Surgery, 217 (2), 301–305. https://doi.org/10.1016/j.amjsurg.2018.12.023
Horbach, S. P. J. M., Schneider, J. W., & Sainte-Marie, M. (2022). Ungendered writing: Writing styles are unlikely to account for gender differences in funding rates in the natural and technical sciences. Journal of Informetrics, 16 (4), 101332. https://doi.org/10.1016/j.joi.2022.101332
Huang, C. J. (2013). Gender differences in academic self-efficacy: A meta-analysis. European Journal of Psychology of Education, 28 (1), 1–35. https://doi.org/10.1007/s10212-011-0097-y
Huang, J., Gates, A. J., Sinatra, R., & Barabasi, A. L. (2020). Historical comparison of gender inequality in scientific careers across countries and disciplines. Proc Natl Acad Sci U S A, 117 (9), 4609–4616. https://doi.org/10.1073/pnas.1914221117
Huang, S. C., Etkin, J., & Jin, L. (2017). How winning changes motivation in multiphase competitions. Journal of Personality and Social Psychology, 112 (6), 813–837. https://doi.org/10.1037/pspa0000082
Hubble, C., & Zhao, J. (2016). Gender differences in marathon pacing and performance prediction. Journal of Sports Analytics, 2 , 19–36. https://doi.org/10.3233/JSA-150008
Instone, D., Major, B., & Bunker, B. B. (1983). Gender, self confidence, and social influence strategies: An organizational simulation. Journal of Personality and Social Psychology, 44 (2), 322–333. https://doi.org/10.1037/0022-3514.44.2.322
Jemielniak, D., Slawska, A., & Wilamowski, M. (2022). COVID-19 effect on the gender gap in academic publishing. Journal of Information Science . https://doi.org/10.1177/01655515211068168
Jiang, L., Zhu, N. B., Yang, Z. L., Xu, S., & Jun, M. (2018). The relationships between distance factors and international collaborative research outcomes: A bibliometric examination. Journal of Informetrics, 12 (3), 618–630. https://doi.org/10.1016/j.joi.2018.04.004
Joshi, P. D., Wakslak, C. J., Appel, G., & Huang, L. (2020). Gender differences in communicative abstraction. Journal of Personality and Social Psychology, 118 , 417–435. https://doi.org/10.1037/pspa0000177
Jung, H., Seo, I., Kim, J., & Kim, B. K. (2017). Factors affecting government-funded research quality. Asian Journal of Technology Innovation, 25 (3), 447–469. https://doi.org/10.1080/19761597.2018.1436411
Khosrowjerdi, M., & Bornmann, L. (2021). Is culture related to strong science? An empirical investigation. Journal of Informetrics, 15 (4), 101160. https://doi.org/10.1016/j.joi.2021.101160
Kolev, J., Fuentes-Medel, Y., & Murray, F. (2019). Is blinded review enough? How gendered outcomes arise under anonymous evaluation. Academy of Management Proceedings, 2019 (1), 15210. https://doi.org/10.5465/AMBPP.2019.15210abstract
Koseoglu, M. A., King, B., & Rahimi, R. (2019). Gender disparities and positioning in collaborative hospitality and tourism research. International Journal of Contemporary Hospitality Management, 32 (2), 535–559. https://doi.org/10.1108/Ijchm-09-2018-0747
Kou, M., Zhang, Y., Zhang, Y., Chen, K., Guan, J., & Xia, S. (2019). Does gender structure influence R&D efficiency? A Regional Perspective. Scientometrics, 122 (1), 477–501. https://doi.org/10.1007/s11192-019-03282-x
Kumar, A., Khan, S. U., & Kalra, A. (2020). COVID-19 pandemic: A sentiment analysis. European Heart Journal, 41 (39), 3782–3783. https://doi.org/10.1093/eurheartj/ehaa597
Kwiek, M., & Roszka, W. (2021). Gender-based homophily in research: A large-scale study of man-woman collaboration. Journal of Informetrics, 15 (3), 101171. https://doi.org/10.1016/j.joi.2021.101171
Larivière, V., & Costas, R. (2016). How many is too many? On the relationship between research productivity and impact. PLoS ONE, 11 (9), e0162709. https://doi.org/10.1371/journal.pone.0162709
Larivière, V., Desrochers, N., Macaluso, B., Mongeon, P., Paul-Hus, A., & Sugimoto, C. R. (2016). Contributorship and division of labor in knowledge production. Social Studies of Science, 46 (3), 417–435. https://doi.org/10.1177/0306312716650046
Lariviere, V., Ni, C., Gingras, Y., Cronin, B., & Sugimoto, C. R. (2013). Bibliometrics: Global gender disparities in science. Nature, 504 (7479), 211–213. https://doi.org/10.1038/504211a
Lerchenmueller, M. J., & Sorenson, O. (2018). The gender gap in early career transitions in the life sciences. Research Policy, 47 (6), 1007–1017. https://doi.org/10.1016/j.respol.2018.02.009
Lerchenmueller, M. J., Sorenson, O., & Jena, A. B. (2019). Gender differences in how scientists present the importance of their research: observational study. British Medical Journal, 367 , l6573. https://doi.org/10.1136/bmj.l6573
Li, A., Chiu, S. S., Kong, D. T., Cropanzano, R., & Ho, C. W. (2021). How CEOs respond to mortality salience during the COVID-19 pandemic: Integrating terror management theory with regulatory focus theory. Journal of Applied Psychology, 106 (8), 1188–1201. https://doi.org/10.1037/apl0000956
Liu, M., Zhang, N., Hu, X., Jaiswal, A., Xu, J., Chen, H., Ding, Y., & Bu, Y. (2022). Further divided gender gaps in research productivity and collaboration during the COVID-19 pandemic: Evidence from coronavirus-related literature. Journal of Informetrics, 16 (2), 101295. https://doi.org/10.1016/j.joi.2022.101295
Liu, W., & Ruths, D. (2013). What’s in a name? using first names as features for gender inference in twitter. AAAI Spring Symposium: Analyzing Microtext .
Lopez, A. J., & Pereira, D. (2021). The value of transfer of knowledge in bridging the gender gap in STEM. Sustainability , 13 (10), 5426. https://doi.org/10.3390/su13105426
Lopez-Padilla, D., Garcia-Rio, F., Alonso-Arroyo, A., Arenas Valls, N., Cerezo Lajas, A., Corral Blanco, M., Gallo Gonzalez, V., Llanos Flores, M., Martinez Redondo, M., Martos Gisbert, N., Ojeda Castillejo, E., Padilla Bernaldez, M., Perez Gallan, M., Prudencio Ribera, V., Puente Maestu, L., Recio Moreno, B., Rodriguez Jimeno, E., Sanchez Azofra, A., Segrelles-Calvo, G., & Ignacio de Granda-Orive, J. (2021). Gender differences in original archivos de bronconeumologia publications, 2001–2018. Archivos De Bronconeumologia, 57 (2), 107–114. https://doi.org/10.1016/j.arbres.2020.04.020
Mauleon, E., & Bordons, M. (2006). Productivity, impact and publication habits by gender in the area of Materials Science. Scientometrics, 66 (1), 199–218. https://doi.org/10.1007/s11192-006-0014-3
Mayer, S. J., & Rathmann, J. M. K. (2018). How does research productivity relate to gender? Analyzing gender differences for multiple publication dimensions. Scientometrics, 117 (3), 1663–1693. https://doi.org/10.1007/s11192-018-2933-1
Meisha, D. E., & Al-dabbagh, R. A. (2021). Self-confidence as a predictor of senior dental student academic success. Journal of Dental Education, 85 (9), 1497–1503. https://doi.org/10.1002/jdd.12617
Meyerson, S. L., Sternbach, J. M., Zwischenberger, J. B., & Bender, E. M. (2017). The effect of gender on resident autonomy in the operating room. Journal of Surgical Education, 74 (6), e111–e118. https://doi.org/10.1016/j.jsurg.2017.06.014
Micari, M., Pazos, P., & Hartmann, M. J. (2007). A matter of confidence: gender differences in attitudes toward engaging in lab and course work in undergraduate engineering. Journal of Women and Minorities in Science and Engineering, 13 (3), 279. https://doi.org/10.1615/JWomenMinorScienEng.v13.i3.50
Millar, N., Salager-Meyer, F., & Budgell, B. (2019). “It is important to reinforce the importance of…”:‘Hype’in reports of randomized controlled trials. English for Specific Purposes, 54 , 139–151. https://doi.org/10.1016/j.esp.2019.02.004
Min, H., Peng, Y., Shoss, M., & Yang, B. (2021). Using machine learning to investigate the public’s emotional responses to work from home during the COVID-19 pandemic. Journal of Applied Psychology, 106 (2), 214–229. https://doi.org/10.1037/apl0000886
Morris, D. W., MacGillivray, E., & Pither, E. N. (2021). Self-promotion and the need to be first in science. FACETS, 6 , 1881–1891. https://doi.org/10.1139/facets-2021-0100
Muric, G., Lerman, K., & Ferrara, E. (2021). Gender disparity in the authorship of biomedical research publications during the COVID-19 pandemic: retrospective observational study. Journal of Medical Internet Research, 4 , e25379. https://doi.org/10.2196/25379
Myers, K. R., Tham, W. Y., Yin, Y., Cohodes, N., Thursby, J. G., Thursby, M. C., Schiffer, P., Walsh, J. T., Lakhani, K. R., & Wang, D. (2020). Unequal effects of the COVID-19 pandemic on scientists. Nature Human Behaviour, 4 (9), 880–883. https://doi.org/10.1038/s41562-020-0921-y
Newman, M. L., Groom, C. J., Handelman, L. D., & Pennebaker, J. W. (2008). Gender differences in language use: An analysis of 14,000 text samples. Discourse Processes, 45 (3), 211–236. https://doi.org/10.1080/01638530802073712
Nguyen, E., Robinson, R., & Hoover, R. M. (2021). Women as first authors in key pharmacy journals: Analysis by publication type. Journal of the American Pharmacists Association, 61 (1), e26–e29. https://doi.org/10.1016/j.japh.2020.08.037
Nielsen, M. W. (2017). Gender consequences of a national performance-based funding model: New pieces in an old puzzle. Studies in Higher Education, 42 (6), 1033–1055. https://doi.org/10.1080/03075079.2015.1075197
Nunkoo, R., Thelwall, M., Ladsawut, J., & Goolaup, S. (2020). Three decades of tourism scholarship: Gender, collaboration and research methods. Tourism Management . 78 , 104056. https://doi.org/10.1016/j.tourman.2019.104056
Palomo, J., Figueroa-Domecq, C., & Laguna, P. (2017). Women, peace and security state-of-art: A bibliometric analysis in social sciences based on SCOPUS database. Scientometrics, 113 (1), 123–148. https://doi.org/10.1007/s11192-017-2484-x
Parsons, C. E., & Baglini, R. B. (2021). Peer review: The case for neutral language. Trends in Cognitive Sciences, 25 (8), 639–641. https://doi.org/10.1016/j.tics.2021.05.003
Paswan, J., & Singh, V. K. (2020). Gender and research publishing analyzed through the lenses of discipline, institution types, impact and international collaboration: A case study from India. Scientometrics, 123 (1), 497–515. https://doi.org/10.1007/s11192-020-03398-5
Polanco, N. A. P., McNally, B. B., Levy, C., Carey, E. J., Palomique, J., & Tran, T. T. (2020). Gender differences in hepatology medical literature. Digestive Diseases and Sciences, 65 (10), 3014–3022. https://doi.org/10.1007/s10620-019-06025-3
Powell, S. N., Hunting, J. C., Frazier, L. P., Keeling, L. E., & Janowski, J. (2022). Evolution and trends in male versus female authorship of articles in flagship orthopaedic journals from 1995 to 2020. Journal of the American academy of orthopaedic surgeons, 30 (12), E878–E885. https://doi.org/10.5435/JAAOS-D-21-01113
Restrepo, N., Unceta, A., & Barandiaran, X. (2021). Gender diversity in research and innovation projects: The proportion of women in the context of higher education. Sustainability, 13 (9), 5111. https://doi.org/10.3390/su13095111
Rigg, L. S., McCarragher, S., & Krmenec, A. (2012). Authorship, collaboration, and gender: 15 years of publication productivity in selected geography journals. Professional Geographer, 64 (4), 491–502. https://doi.org/10.1080/00330124.2011.611434
Salerno, A., Laran, J., & Janiszewski, C. (2019). The Bad Can Be Good: When Benign and Malicious Envy Motivate Goal Pursuit. Journal of Consumer Research, 46 (2), 388–405. https://doi.org/10.1093/jcr/ucy077
Santamaría, L., & Mihaljević, H. (2018). Comparison and benchmark of name-to-gender inference services. Peerj Computer Science, 4 , e156. https://doi.org/10.7717/peerj-cs.156
Sawdon, M., & Finn, G. (2014). The ‘unskilled and unaware’effect is linear in a real-world setting. Journal of Anatomy, 224 (3), 279–285. https://doi.org/10.1111/joa.12072
Scharff, C. (2015). Blowing your own Trumpet: Exploring the gendered dynamics of self-promotion in the classical music profession. The Sociological Review, 63 (1_suppl), 97–112. https://doi.org/10.1111/1467-954X.12243
Sebo, P., & Clair, C. (2023). Gender inequalities in citations of articles published in high-impact general medical journals: a cross-sectional study. Journal of General Internal Medicine , 38 , 661-666. https://doi.org/10.1007/s11606-022-07717-9
Selig, J. P., & Preacher, K. J. (2008). Monte Carlo method for assessing mediation: An interactive tool for creating confidence intervals for indirect effects. Computer software. http://quantpsy.org .
Shang, Y. Y., Sivertsen, G., Cao, Z., & Zhang, L. (2022). Gender differences among first authors in research focused on the Sustainable Development Goal of Gender Equality. Scientometrics, 127 (8), 4769–4796. https://doi.org/10.1007/s11192-022-04430-6
Shauman, K. A., & Xie, Y. (2003). Explaining sex differences in publication productivity among postsecondary faculty. In: Equal rites, unequal outcomes (pp. 175–208). Springer: Netherlands. https://doi.org/10.1007/978-94-010-0007-9_8
Skitka, L. J., Melton, Z. J., Mueller, A. B., & Wei, K. Y. (2021). The gender gap: Who is (and is not) included on graduate-level syllabi in social/personality psychology. Personality and Social Psychology Bulletin, 47 (6), 863–872. https://doi.org/10.1177/0146167220947326
Stankov, L., & Lee, J. (2014). Overconfidence Across World Regions. Journal of Cross-Cultural Psychology, 45 (5), 821–837. https://doi.org/10.1177/0022022114527345
Stremersch, S., & Verhoef, P. C. (2005). Globalization of authorship in the marketing discipline: Does it help or hinder the field? Marketing Science, 24 (4), 585–594. https://doi.org/10.1287/mksc.1050.0152
Tellis, G. J., Chandy, R. K., & Ackerman, D. S. (1999). In Search of Diversity: The Record of Major Marketing Journals. Journal of Marketing Research , 36 (1), 120-131. https://doi.org/10.1177/002224379903600110
Thelwall, M. (2018). Do gendered citation advantages influence field participation? Four unusual fields in the USA 1996–2017. Scientometrics, 117 (3), 2133–2144. https://doi.org/10.1007/s11192-018-2926-0
Thelwall, M. (2020a). Author gender differences in psychology citation impact 1996–2018. International Journal of Psychology, 55 (4), 684–694. https://doi.org/10.1002/ijop.12633
Thelwall, M. (2020b). Gender differences in citation impact for 27 fields and six English-speaking countries 1996–2014. Quantitative Science Studies, 1 (2), 599–617. https://doi.org/10.1162/qss_a_00038
Thelwall, M., Bailey, C., Makita, M., Sud, P., & Madalli, D. P. (2019). Gender and research publishing in India: Uniformly high inequality? Journal of Informetrics, 13 (1), 118–131. https://doi.org/10.1016/j.joi.2018.12.003
Thelwall, M., & Maflahi, N. (2022). Small female citation advantages for US journal articles in medicine. Journal of Information Science, 48 (1), 106–117. https://doi.org/10.1177/0165551520942729
Thelwall, M., & Mas-Bleda, A. (2020). A gender equality paradox in academic publishing: Countries with a higher proportion of female first-authored journal articles have larger first-author gender disparities between fields. Quantitative Science Studies, 1 (3), 1260–1282. https://doi.org/10.1162/qss_a_00050
UnitedNations. (2015). Transforming our world: the 2030 Agenda for Sustainable Development . Retrieved August, 7, 2022 from https://sdgs.un.org/2030agenda
Urquhart-Cronish, M., & Otto, S. P. (2019). Gender and language use in scientific grant writing. FACETS, 4 (1), 442–458. https://doi.org/10.1139/facets-2018-0039
van Arensbergen, P., van der Weijden, I., & van den Besselaar, P. (2012). Gender differences in scientific productivity: A persisting phenomenon? Scientometrics, 93 (3), 857–868. https://doi.org/10.1007/s11192-012-0712-y
van den Besselaar, P., & Sandstrom, U. (2016). Gender differences in research performance and its impact on careers: A longitudinal case study. Scientometrics, 106 (1), 143–162. https://doi.org/10.1007/s11192-015-1775-3
Vinkers, C. H., Tijdink, J. K., & Otte, W. M. (2015). Use of positive and negative words in scientific PubMed abstracts between 1974 and 2014: retrospective analysis. BMJ , 351 , h6467. https://doi.org/10.1136/bmj.h6467
Walker, K. A. (2020). Females are first authors, sole authors, and reviewers of entomology publications significantly less often than males. Annals of the Entomological Society of America, 113 (3), 193–201. https://doi.org/10.1093/aesa/saz066
Weidmann, N. B., Otto, S., & Kawerau, L. (2018). The use of positive words in political science language. PS: Political Science & Politics, 51 (3), 625–628. https://doi.org/10.1017/S1049096518000124
Wen, J., & Lei, L. (2022a). Adjectives and adverbs in life sciences across 50 years: Implications for emotions and readability in academic texts. Scientometrics, 127 (8), 4731–4749. https://doi.org/10.1007/s11192-022-04453-z
Wen, J., & Lei, L. (2022b). Linguistic positivity bias in academic writing: A large-scale diachronic study in life sciences across 50 years. Applied Linguistics, 43 (2), 340–364. https://doi.org/10.1093/applin/amab037
Woolley, K., & Risen, J. L. (2021). Hiding from the truth: When and how cover enables information avoidance. Journal of Consumer Research, 47 (5), 675–697. https://doi.org/10.1093/jcr/ucaa030
Yani-de-Soriano, M., Hanel, P. H. P., Vazquez-Carrasco, R., Cambra-Fierro, J., Wilson, A., & Centeno, E. (2019). Investigating the role of customers’ perceptions of employee effort and justice in service recovery: A cross-cultural perspective. European Journal of Marketing, 53 (4), 708–732. https://doi.org/10.1108/Ejm-09-2017-0570
Yoo, B. (2009). Developing an overall ranking of 79 marketing journals: An introduction of PRINQUAL to marketing. Australasian Marketing Journal , 17 (3), 160-174. https://doi.org/10.1016/j.ausmj.2009.05.014
Yuan, Z. M., & Yao, M. (2022). Is academic writing becoming more positive? A large-scale diachronic case study of Science research articles across 25 years. Scientometrics , 127 (11), 6191–6207. https://doi.org/10.1007/s11192-022-04515-2
Zeina, M., Balston, A., Banerjee, A., & Woolf, K. (2020). Gender and ethnic differences in publication of BMJ letters to the editor: an observational study using machine learning. Bmj Open, 10 (12), e037269. https://doi.org/10.1136/bmjopen-2020-037269
Zhang, G., Xu, S., Sun, Y., Jiang, C., & Wang, X. (2022). Understanding the peer review endeavor in scientific publishing. Journal of Informetrics, 16 (2), 101264. https://doi.org/10.1016/j.joi.2022.101264
Zhang, L., Shang, Y. Y., Huang, Y., & Sivertsen, G. (2021). Toward internationalization: A bibliometric analysis of the social sciences in Mainland China from 1979 to 2018. Quantitative Science Studies, 2 (1), 376–408. https://doi.org/10.1162/qss_a_00102
Zhang, L., Shang, Y. Y., Huang, Y., & Sivertsen, G. (2022b). Gender differences among active reviewers: An investigation based on publons. Scientometrics, 127 (1), 145–179. https://doi.org/10.1007/s11192-021-04209-1
Zhang, M. Y., Zhang, G. P., Liu, Y., Zhai, X. R., & Han, X. Y. (2020). Scientists’ genders and international academic collaboration: An empirical study of Chinese universities and research institutes. Journal of Informetrics , 14 (4), 101068. https://doi.org/10.1016/j.joi.2020.101068
Zhao, X., Lynch Jr, J. G., & Chen, Q. (2010). Reconsidering Baron and Kenny: Myths and truths about mediation analysis. Journal of Consumer Research , 37 , 197–206. https://doi.org/10.1086/651257 .
Zhu, N., Liu, C., & Yang, Z. (2021). Team Size, Research Variety, and Research Performance: Do Coauthors’ Coauthors Matter? Journal of Informetrics , 15 (4), 101205. https://doi.org/10.1016/j.joi.2021.101205
Zhu, B., Xu, C., Wang, P., & Zhang, L. (2022). How does internal carbon pricing affect corporate environmental performance? Journal of Business Research, 145 , 65–77. https://doi.org/10.1016/j.jbusres.2022.02.071
Zhu, D. H., Deng, Z. Z., & Chang, Y. P. (2020). Understanding the influence of submission devices on online consumer reviews: A comparison between smartphones and PCs. Journal of Retailing and Consumer Services, 54 , 102028. https://doi.org/10.1016/j.jretconser.2019.102028
Zippel, K. (2020). Women in global science . Stanford University Press.
Download references
The authors thank the editor, the editorial assistant, and anonymous reviewers for their insightful comments and suggestions. The authors thank Maikun Li, Nibing Zhu, Kexin Wu, and Shuai Jin for their assistance in this paper. The authors gratefully acknowledge the grants from the National Natural Science Foundation of China (projects 72202149, 71672063 and 72072065), the grant from the Major Program of the National Social Science Fund Projects (project 19ZDA104), and the Fundamental Research Funds for the Central Universities (project 2022ZY-SX004) for financial support. The computation is completed in the HPC Platform of Huazhong University of Science and Technology.
Authors and affiliations.
School of Management, Huazhong University of Science & Technology, Wuhan, Hubei, People’s Republic of China
Yongchao Ma
Department of Marketing, City University of Hong Kong, Hong Kong, People’s Republic of China
Yongchao Ma & Ying Teng
Business School, Sichuan University, Chengdu, Sichuan, People’s Republic of China
Zhongzhun Deng
College of Public Administration, Huazhong University of Science & Technology, Wuhan, Hubei, People’s Republic of China
Li Liu & Yi Zhang
You can also search for this author in PubMed Google Scholar
Correspondence to Zhongzhun Deng .
Conflicts of interest.
The authors declare no conflict of interest.
Below is the link to the electronic supplementary material.
Rights and permissions.
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Reprints and permissions
Ma, Y., Teng, Y., Deng, Z. et al. Does writing style affect gender differences in the research performance of articles?: An empirical study of BERT-based textual sentiment analysis. Scientometrics 128 , 2105–2143 (2023). https://doi.org/10.1007/s11192-023-04666-w
Download citation
Received : 30 November 2021
Accepted : 16 February 2023
Published : 08 March 2023
Issue Date : April 2023
DOI : https://doi.org/10.1007/s11192-023-04666-w
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
An official website of the United States government
The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .
Charlotte wrigley-asante.
1 Department of Geography and Resource Development/Centre for Gender Studies and Advocacy, University of Ghana, Legon, Ghana
2 Institute of Statistical, Social and Economic Research, University of Ghana, Legon, Ghana
3 Department of Geography and Earth Science, University of Environment and Sustainable Development, Somanya, Ghana
Data subject to third party restrictions.
Using a mixed-methods research design, this study compares academic performance of males and females studying STEM subjects or courses at the university level with that of the senior high school level performance. The factors contributing to the gender differences in academic performance at the two levels of the educational ladder were also explored. Overall, the results show that the academic performance of males was better than females at the senior high school level, whilst at the tertiary level, the academic performance of females appeared to have improved relative to that of males. Whilst gender stereotypes contributed greatly to differences in academic performance at the high school level, factors such as teaching methodologies and styles, motivation and support from parents, and advocacy campaigns on women’s empowerment accounted for the improved academic performance of females at the tertiary levels. On the other hand, males’ engagements in extra-curricular activities and other economic ventures, which are also linked to broader socio-economic influences such as economic hardship, financial constraints, and gendered ideologies tend to affect the academic performance of males at that level. We recommend that whilst emphasis is placed on getting more females in STEM disciplines and careers, it is equally important to focus on males. This requires continuous education and sensitisation of gender stereotypes and policy measures to sustain both males and females in STEM for overall national development.
Gender differences in academic performance have engaged the attention of scholars for some time now (see Hung et al. 2012 ; Jackman and Morrain-Webb 2019 ; Morita et al. 2016 ; Sparks-Wallace 2007 ). Indeed, males in the past have had a higher enrolment in STEM subjects at the tertiary levels of education compared to females, and their overall academic performance was rated higher than females (Ullah and Ullah 2019 ). This situation often translated into employment opportunities for males in science and engineering professions, whilst also allowing them to occupy high-ranking positions in these professions (Maltese and Cooper 2017 ).
According to Sparks-Wallace ( 2007 ), males’ comparative advantage when it comes to academic performance in time past reinforced the notion of males’ intellectual superiority over females and often disregarded the structural impediments and stereotypes that inhibited females’ academic abilities, especially in the sciences. Recent studies in the developed world have shown a reversal in academic performance between males and females, with females outperforming males in almost all disciplines at various levels of the educational ladder (see Grant and Behrman 2010 ; Tshabalala and Ncube 2016 ; Morita et al. 2016 ; Perez-Felkner et al. 2012 ; Workman and Heyder 2020 ). Workman and Heyder ( 2020 ) argue that females seem to do better than males in language and the arts, as well in the natural sciences, despite the latter being a traditional area of male dominance. You and Sharkey ( 2012 ) further note that females improved academic performance at elementary and higher levels of education is not because they are enrolling in easier classes or courses, but rather reflects the competencies they possess in all educational fields.
Recent studies in developing countries have also shown marked improvement in female academic performance (Ullah and Ullah 2019 ), and this is against the backdrop of persistent challenges with access to education, and under-representation in STEM programmes at the undergraduate and postgraduate levels. Further, the results from these studies challenge the notion that males perform better and are fit for science and maths subjects than females (Workman and Heyder 2020 ).
Contrary to recent studies from other parts of the world pointing to the improved academic performance of females, the majority of the studies conducted in Ghana show that males outperform females, especially in mathematics and science subjects (see Kyei and Benjamin 2011 ; Oppong 2011 ; Armah et al. 2021 ). Whilst the above studies from Ghana have been useful, and justify the need to pay attention to female students and provide them with the necessary support to enhance their studies in mathematics and science-related subjects, there are relatively unexplored areas that require research attention to better understand academic performance between males and females studying STEM educational programmes in Ghana. First, extant studies on academic performance between male and female students have focussed largely on performances at the high school levels, with little attention given to academic performance between male and female students at the university or tertiary levels. Focus on the tertiary level is important because at that stage there is a clear path towards a career that one may want to pursue and therefore it is expected that students will make much effort to excel academically at this level (Santiago et al. 2008 ). Second, comparative studies on the academic performance of male and female students at different stages of their education ladder are limited. Such a study is important given that academic performance does change as one progresses in their education (Walberg 2010 ; Johnson 2014 ).
Taking cognizance of these gaps, this study seeks to assess the academic performance of male and female university students studying STEM programmes and compare their performance to the STEM subjects or courses that they studied at the senior high school level. These subjects include core mathematics, integrated science, biology, physics, chemistry, and elective (advance) mathematics. The contributory factors explaining the gender differences, if there are, are also explored. Three research objectives are addressed in this study:
After the introduction, we present some theoretical explanations of gender and academic performance. We then examine the research context and the methodology, followed by a discussion of the research findings. The final section draws some conclusions and policy recommendations.
Academic performance of students gives educational and vocational institutions the opportunity to determine whether the educational curriculum is having the desired impact on students in terms of teaching and learning. It also gives some indication of how well teachers and students have accomplished their targeted educational goals (Arshad et al. 2015 ; Caballero et al. 2007 ). Narad and Abdullah ( 2016 ) define academic performance as an evaluation of the knowledge students have acquired at school over a certain period of time. According to Chilca ( 2017 ) it is the level of knowledge acquired in a subject area and it is usually measured by grades obtained after an assessment.
Measures of academic performance involve continuous assessment and examination results or outcomes. Other measures used in measuring academic performance include class exercise, mock tests, field examinations, and external examinations. Depending on the level in the educational ladder, a combination of these measures is often used to test students' academic performance (Noemy et al. 2017 ). In most higher or secondary level education, assessment of academic performance is often a way of determining whether students who undertake an examination are qualified to enter the tertiary level. These are typically referred to as external examinations, and in order to be admitted to tertiary institutions, students' examination results or grades must reach the threshold standards specified by those institutions (Kis 2005 ). At the tertiary level, academic performance measured by grade point averages determines the depth of subject knowledge attained and the likelihood that a student would graduate with honours. Academic performance, for all intents and purposes, provides a good indication of the educational objectives attained following a learning process (Arshad et al. 2015 ).
The structure of Ghana's educational system is 6 + 3 + 3 + 4/3, beginning with primary school and continuing through to undergraduate study (NUFFIC 2015 ). This is equivalent to six years of primary education, three years of junior high school, and three years of senior high school, followed by four years of undergraduate study or three years of other higher level of study (such as a Higher National Diploma (HND), a certificate in nursing or midwifery, or a certificate in teacher preparation). Postgraduate education normally includes 1 or 2 years of a master's degree programme and a minimum of three years of a Ph.D.
The high school structure in Ghana includes junior high and senior high school. Children completing junior high school and expected to move to senior high school are required to write a national examination known as the basic education certificate examination (BECE). Amongst academic subjects that students are examined on in the BECE include core subjects such as English, mathematics, social studies, and integrated science. Other additional subjects include basic design and technology, religious and moral education, French, and information communication technology (ICT). At the senior high school level, students are expected to choose a total of four (4) electives that fall in either the sciences, humanities, business, agriculture or technical education. The elective subjects for science students at the senior high school are biology, physics, chemistry, and elective (advanced) mathematics. In addition to the four electives, all students are expected to take four additional core subjects which include English, basic mathematics, integrated science, and social studies. At the end of the three-year senior high school programme, students are expected to sit for an external examination known West African Senior School Certificate (WASSCE).
Admission to higher education, specifically undergraduate study depends on the total aggregate score in the WASSCE examination. For most Universities such as the University of Ghana, prospective undergraduate students are expected to obtain a maximum aggregate score of 24, or grades A1 to C6 in at least three core subjects and three elective subjects whose aggregate score does not exceed 36. Table Table1 1 shows the WASSCE grades and their interpretation.
WASSCE grading system and description
WASSCE letter | Grade description | Interpretation |
---|---|---|
A1 | Excellent | Excellent |
B2 | Very good | Very good |
B3 | Good | Good |
C4 | Credit | Above average |
C5 | Credit | Above average |
C6 | Credit | Above average |
D7 | Pass | Satisfactory |
E8 | Pass | Satisfactory |
F9 | Fail | Fail |
(NUFFIC 2015 )
Students who gain admission to the University with WASSCE begin at the first year or level 100. For each year, students undergo two semesters of education or training in their areas of specialisation. At the end of each semester, students are required to undertake end of semester examination and are assessed based on the University assessment system. This, however, varies by University. At the University of Ghana, a grade point average (GPA) is used and ranges from 0 to 4. Table Table2 2 shows the interpretation of grades used for students' assessment.
University of Ghana grading system and description
Marks | Grade letter | Grade point | Description |
---|---|---|---|
80–100 | A | 4.0 | Outstanding |
75–79 | B + | 3.5 | Very good |
70–74 | B | 3.0 | Good |
65–69 | C + | 2.5 | Fairly good |
60–64 | C | 2.0 | Average |
55–59 | D + | 1.5 | Below Average |
50–54 | D | 1.0 | Marginal |
0–49 | F | 0 | Fail |
Source https://www.ug.edu.gh/aqau/sites/aqau/files/documents/DEFINITION%20OF%20GRADES%20AND%20GRADE%20POINTS-UG.pdf
With this background of academic assessment in Ghana at both the secondary and tertiary levels, the next section provides some theoretical basis on academic performances from the gender perspective.
The literature highlights several theoretical explanations in the differences in academic performance of males and females. This paper highlights some perspectives that provide the basis for understanding the factors influencing the performance levels of males and females studying STEM subjects in Ghana.
Studies on academic performance in higher education have shown that personality traits play an important role in academic performance outcomes (Conrad and Petry 2012 ; Murray et al. 2014 ; Furnham and Moutafi 2012 ; DiPrete and Jennings 2012 ). Personality traits involve attitudes, behaviours, and lifestyle that has become part of an individual. These traits are formed over time through the interaction of temperament, character, and environment (Roberts et al. 2013 ). A personality trait identified to have a strong influence on academic performance is conscientiousness (see Conrad and Petry 2012 ). Defining conscientiousness is difficult, however, it encapsulates features such as industriousness and orderliness (Keiser et al. 2016 ; DeYoung et al. 2007 ). Conscientious people are predisposed to being diligent, purposeful, and organised (Witt et al. 2002 ). Studies have shown that there are gender differences in conscientiousness (see Schmitt et al. 2008 ; Kling et al. 2013 ; Keiser et al. 2016 ), with female students found to be more conscientious than their male counterparts. Pryer et al. ( 2009 ) argue that this has translated into an edge for female students who seem to be performing better than male students in the arts and sciences. In their study, Pryer et al. ( 2009 ) found that females study for more hours than males, ask more questions than males in class and seek feedback on assignments than males. This is corroborated by Lam et al. ( 2012 ) who argues that females’ achievement in academic performance is attributed to the greater effort they put into their studies compared to males. In the opinion of DiPrete and Jennings ( 2012 ), exerting greater efforts increases the learning process, improves attitude towards education, and increases academic performance. Thus, being conscientious or exerting greater efforts is important in good habit formation and increases one’s expectation in school, and is a sin-qua-non for higher academic performance.
In addition to conscientiousness and effort, scholars have also put forward the argument of academic self-concept. Self-concept involves how people perceive themselves in terms of their behaviour, abilities, and unique characteristics (Huitt 2011 ). Academic self-concept involves peoples’ perception about their academic abilities or achievement in school (Erdogan and Sengul 2014 ; Kvedere 2014 ,). If students have a positive self-concept then it means they will approach their academic work with more seriousness, they are likely to make decisions that will positively influence their academic performance, and they are likely to increase their confidence in their abilities (Kvedere 2014 ). Self-concept is a continuous development, and therefore as people become aware of their abilities, they are likely to maintain or improve such abilities. In this respect, awareness of high academic self-concept will be a motivating factor to improve one’s academic performance. Studies have shown that academic self-concept does increase academic performance (see Ghazvini et al. 2011 ; Reyes 1984 ; Elbaum and Vaughn. 2001 ), but it is not clear whether there are differences between males and females on self-concept and how this translates into different academic performance.
Another issue of significance to academic performance and achievement is gender identity. According to Downey and Vogt Yuan ( 2005 ) certain traits and practices linked to or contrasted with masculinity or femininity can have a positive or negative impact on academic performance on both sexes. For instance, some studies have found that masculine stereotypes portray boys as dominant, competitive, and active, whilst girls are portrayed as conciliatory (Francis 2000 ; Legewie and DiPrete 2012 ). This attitude is likely to be an incentive for males to perform better knowing that they have an edge over females. On the other hand, it can be a challenge for females as they have to exert greater effort to bridge the gap between themselves and their male counterparts. Another stereotypical gender identity is that males are naturally gifted than females and as such females are expected to put in more effort in learning than males. These stereotypical gender identities if accepted and reinforced can negatively affect the academic performance of the two sexes.
Several studies show that association with peer groups has an influential role on the academic performance of male and female students (Lashbrook 2000 ; Steinberg 2005 ; Adeyemi et al. 2019 ). In the opinion of Burke and Sass ( 2013 ), the effects of peers on student academic achievement can be greater than their teachers and school. Legewie and DiPrete ( 2012 ) explain that peers' influence is a result of the desire to belong to and to fit in a group. Cheng ( 2020 ) argues that association with peers and peer groups shapes a person's attitude, perception, and motivation because one has to conform to the subculture mostly identified with these peer groups. Adeyemi et al. ( 2019 ) explains that the effect of peer groups can have either positive or negative effects on the academic performance of male and female students depending on the subculture of the group. According to Legewie and DiPrete ( 2012 ), one of the subcultures of male peer groups which often have a negative effect on academic performance is the prioritisation of other activities and conducts over academic work.
Another contextual factor identified to have a significant impact on the academic performance of male and female students is economic hardship or financial constraints. Khan ( 2014 ), argues that the cost of education increases as one progresses through the academic ladder and as such places a huge burden on parents and family members. Khan ( 2014 ) notes that the difficulties in meeting financial obligations by parents put emotional and psychological stress on their wards and often lead to low academic performance or in the worst-case discontinuance of the education of their wards. Crage and Fairchild ( 2007 ) also argue that some students often end up doing other economic activities to meet the cost of their education. In a study conducted by Jha and Kelleher ( 2006 ) they found that boys' underperformance in school was due to the socio-economic and occupational practices they were engaged in which consequently reduced the amount of time needed to engage in academic work. Similarly, observations have been reported in the Philippines and Thailand concerning male academic underperformance (Jha et al. 2012 ).
Data used for this paper came from a pilot study that explored the nexus between academic programme choices, academic performance, and career aspirations of male and female students reading STEM-related programmes at the University of Ghana. A sequential explanatory mixed-methods design was used. This involved the collection and analysis of quantitative data at an initial stage of the research, followed by the collection and analysis of qualitative data at the second stage (Creswell 2014 ). This research design was appropriate in the context of this study because the quantitative data allowed the researchers to first assess differences in academic performance between male and female students, whilst the qualitative data which was collected and analysed subsequently was used to explain the observations.
The survey data was collected in 2020, using an online-based questionnaire survey developed using Google forms. This was deemed appropriate due to the ease of reaching the target population (as a result of the Covid-19 pandemic), the cost-effectiveness of the method, tracking of responses in real time, and assessment of the pattern of responses. The University of Ghana was conveniently sampled as the study area for this study. Two reasons informed this. First, it is the first public university in the country and has long-established STEM programmes with high enrolment for both sexes over the years. Second, the intent was to unravel what pertains at the University of Ghana with regard to gender differences in academic performance in STEM programmes, and then based on that broaden the scope to include other public universities. In view of this, students from five STEM departments (engineering, mathematics, statistics and actuarial science, biological sciences, and computer science) at the University of Ghana formed the target population for the sample survey. Permission was sought from the Dean and heads of the respective departments before carrying out the online survey. In all, 252 students responded to the survey, out of which 54% were males and 46% were females. Further, 82% were between the ages of 20–25 years, with 18% below 20 years.
On the survey instrument, there was a statement explaining the aim of the study, background information about the study, and assurance of confidentiality of the survey respondents. For this study, questions extracted from the survey data included demographic background of respondents, reported grades obtained in six subjects in the WASSCE exams, assessment of academic performance since enrolling in the University of Ghana, and current cumulative grade point average (CGPA).
The dataset were analysed using non-parametric tests such as chi-square test and Mann–Whitney U test. Chi-square test was used to ascertain whether there was a significant difference between male and female students grades obtained in the senior high school final examination on six subjects (i.e. core mathematics, integrated science, biology, physics, chemistry, and elective mathematics). Further, a chi-square test was used to assess the perceived differences in academic performance between male and female students pursuing STEM programmes at the University of Ghana. A Mann–Whitney U test was also used to assess differences in cumulative grade point average (CGPA) scores between male and female students pursuing STEM programmes at the University of Ghana. The Mann–Whitney U test, which is a non-parametric test was used because the CGPA scores were not normally distributed. Indeed, a Kolmogorov–Smirnov test performed showed that the cases did not follow a normal distribution ( D (251) = 0.000, p > 0.005).
The qualitative data was gathered from in-depth interviews and focus group discussions (FDGs) with selected students who participated in the online survey. 1 Twenty in-depth interviews were conducted with students conveniently sampled from the pool of respondents from the online survey. Ten (10) were males and the other 10 were females. In addition, three focus group discussions were held: all male, all female, and one mixed group. The in-depth interviews and FGDs provided deeper insights into motivations in studying and pursuing STEM courses and careers, the factors that influence academic performance levels at both the secondary and tertiary levels, and the gender differences. The main themes that were identified whilst analysing the interview transcripts include the differences in performance levels in specific STEM-related subjects at the secondary level and the reasons for the differences, the gender differences in the performance at the tertiary level, and the factors that were likely to influence these, from the respondents' perspectives. Statements or quotes were categorised under the themes and used in the discussion of the results together with the survey data.
Performance assessment at senior secondary/high school level.
Table Table3 3 is a cross-tabulation between grades obtained in six science subjects in the WASSCE exams and the sex of respondents. Comparing results for core mathematics between male and female students, the results show that 35.6% of males had A1, whilst 29.7% of females had A1. Approximately 94.7% of males obtained grades between A1 and B3, whilst for females this was 87.33%. Even though the percentage share of males who obtained between A1 and B3 were more than the percentage share of females, the chi-square test shows that there is no statistically significant difference between males and females regarding grades obtained in core mathematics. Essentially, the difference in grades is not so wide to make a case that being a male or a female played a significant role in the grades obtained in core mathematics. Even though this finding does support previous studies that show that male students do perform better than females in mathematics (see Kyei and Benjamin 2011 ; Oppong 2011 ), the non-significance of this finding indicates that the difference in performance in core mathematics between males and female students is not wide and that with support to females, there are prospects in closing in on this gap.
Assessment of grades obtained in WASSCE exams between male and female respondents
WASSCE grades | Core mathematics (N = 252) | Integrated science (N = 252) | Biology (N = 252) | Physics (N = 252) | Chemistry (N = 252) | Elective mathematics (N = 252) | ||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Male | Female | Male | Female | Male | Female | Male | Female | Male | Female | Male | Female | |
A1 | 35.61 | 29.73 | 40.15 | 33.64 | 26.21 | 23.76 | 3.62 | 3.92 | 28.57 | 12.75 | 46.56 | 33.94 |
B2 | 35.61 | 24.32 | 40.15 | 34.55 | 36.89 | 30.69 | 28.83 | 15.69 | 24.11 | 19.61 | 19.08 | 24.77 |
B3 | 23.48 | 33.33 | 15.91 | 25.45 | 32.04 | 33.66 | 45.95 | 40.20 | 36.61 | 39.22 | 18.32 | 21.10 |
C4 | 2.27 | 6.31 | 2.27 | 5.45 | 3.88 | 7.92 | 12.61 | 29.41 | 8.93 | 16.67 | 9.16 | 10.09 |
C5 | 0.76 | 1.80 | 0.76 | 0.00 | 0.97 | 1.98 | 6.31 | 5.88 | 0.89 | 5.88 | 0.76 | 3.67 |
C6 | 2.27 | 4.50 | 0.76 | 0.91 | 0.0 | 0.99 | 2.70 | 1.96 | 0.89 | 5.88 | 5.34 | 1.83 |
D7 | – | – | – | – | 0.0 | 0.99 | 0.0 | 0.98 | – | – | 0.76 | 1.83 |
E8 | – | – | – | – | – | – | 0.0 | 0.98 | – | – | 0.0 | 1.83 |
F9 | – | – | – | – | – | – | 0.0 | 0.98 | – | – | 0.0 | 0.92 |
test | ( = 9.071, = 5, > .106) | ( = 6.370, = 5, > .272) | ( = 4.549, = 6, > .603) | ( = 15.162, = 8, > .054) | ( = 17.606, = 5, > .003) | ( = 12.015, = 8, > .151) |
Regarding integrated science, the results showed that 40.1% of males obtained A1, whilst 33.6% of females obtained A1. Male respondents who obtained between A1 and B3 for integrated science were about 96.3%, whilst in the case of females, about 93.6% obtained similar grades. The result shows that the difference is not wide. Unsurprisingly, the chi-square test shows no significant difference between male and female students regarding grades obtained in integrated science. Essentially, both male and female students were at par, and therefore attribution of difference in grades is not due to sex differences.
The findings show that about 95.2% of male respondents compared to 88.2% of female respondents obtained grades between A1 and B3 in biology. In the case of elective mathematics, 83.9% of males compared to 79.8% of females obtain grades between A1 and B3. The chi-square tests show that the difference in biology and elective mathematics grades between male and female respondents are not statistically significant. Like the other subjects (i.e. core mathematics and integrated science), the difference reported for these two subjects (i.e. biology and elective mathematics) is not very wide between the two sexes. Even though the percentage share of male respondents who obtained A1 to B3 was more than the female respondents in the four subjects (i.e. core mathematics, integrated science, biology, and elective mathematics), they were too close to call, and thus we cannot say that males outperformed females in these four subjects.
In the case of physics, 3.6% of male respondents obtained A1, whilst 3.9% of female respondents obtained A1. The proportion of respondents who obtained grades between A1 and B3 for male respondents was about 78.4%, whilst that of female respondents was about 59.8%. The chi-square test shows a significant difference between male and female respondents in terms of grades obtained in physics. In essence, the difference was not close and thus, we can say that male respondents did appreciably better than females. Similarly, in chemistry, 89.3% of male respondents compared to 71.7% of female respondents obtained grades between A1 and B3. The Chi-square test shows a significant difference between the two sexes regarding grades obtained in chemistry. This shows that female students are lagging behind their male counterparts in physics and chemistry in particular, and some of the factors explaining this trend have been discussed in subsequent sections.
This section of the paper presents results on differences in academic performance of male and female respondents at the tertiary level. Two statistical tests were used and include Mann Whitney U test (Table (Table4) 4 ) and chi-square test (Table (Table5). 5 ). Table Table4 4 shows differences in CGPA scores between male and female respondents. The test statistic reported in Table Table4 4 shows that the mean rank CGPA for male and female respondents were not significantly different. This can be interpreted to mean that even though in nominal terms the mean rank of CGPA for females was slightly higher than males, this is not a significant difference. In essence both groups are almost at par, despite the slight edge for females.
Mean rank differences in CGPA test scores for male and female students
Sex of respondents | Ranks | Test statistics | ||||
---|---|---|---|---|---|---|
Mean rank | Sum of ranks | Mann–Whitney U | Wilcoxon | Sig (2-tailed) | ||
Male | 133 | 120.76 | 16,061.50 | 7150.500 | 16,061.50 | 0.07 |
Female | 117 | 130.88 | 15,313.50 | |||
Total | 250 |
Perceived academic performance of male and female respondents
Academic performance assessment | Sex of respondents | ||
---|---|---|---|
Total | Male | Female | |
Declining | 24.50 | 29.55 | 18.97 |
(2.73) | (3.99) | (3.66) | |
Remaining same | 18.47 | 21.21 | 15.52 |
(2.46) | (3.57) | (3.38) | |
Improving | 57.03 | 49.24 | 65.52 |
(3.14) | (4.37) | (4.43) | |
Chi-square test | ( = 6.766, = 2, > .034) |
Table Table5 5 is a cross-tabulation of the sex of respondents and respondents’ subjective assessment of their academic performance since their enrolment in the University. The result shows that 29.5% of males compared to 18.9% of females opined that their academic performance was declining. Further, 21.2% of males as against 15.5% of females responded that their academic performance has remained the same. The result shows that 49.2% of males compared to 65.5% of females responded that their academic performance was improving. The result demonstrates that more females compared to males opined that their academic performance has improved. The chi-square test also shows a statistically significant difference between responses for males and that of females with regards to academic performance assessment. Essentially, the subjective assessment of academic performance suggests that females’ academic performance is improving compared to their male counterparts.
The results reported in Tables Tables4 4 and and5 5 are not contradictory. Rather, both results affirm in one way or the other that females are doing well. Female respondents’ subjective assessment of their academic performance indicates that they have improved, whilst their average CGPA score in nominal terms was higher than males, albeit the differences were not significant. Given that males have had the upper hand in academic performance at the high school level (see Table Table3), 3 ), reported results from the two assessments (i.e. subjective assessment of academic performance and CGPA) suggest that females’ academic performance has improved. This result resonates with recent findings that show that female students’ performance in STEM subjects has improved remarkably (Workman and Heyder 2020 ).
In line with some of the theoretical perspectives, the next section highlights some of the factors that explains the performance levels of males and females at both the senior high school and tertiary levels.
Several factors explain the performance levels of males and females at both the senior high school and tertiary levels. Amongst others, these include the socialisation process and gender norms, nature of subject and learning styles and individual-level factors.
Whilst not ruling out the influential role of personality traits such as attitudes and behaviours of an individual on performance in a subject area, we found that some of the factors contributing to better performance of males as compared to females in certain STEM subjects particularly physics and chemistry, were centred around gender stereotypes and perceptions about science subjects in general. These stereotypes arise as a result of the socialisation process. The general view was that the gendered nature of society and how society is structured with females generally socialised not to take up perceived challenging tasks including STEM subjects in schools tend to have some negative impact on females as compared to males. For instance, some female students explained that STEM subjects are still considered as ‘ masculine subjects ’ amongst their peers and this perception creates a situation where females perceive these courses to be challenging and thus reduce their interest in the subjects resulting in lower performance levels as explained by a 21 year old female student:
“…right from the basic level, girls lack total interest in these core subjects because of the perception that they are difficult subjects. It appears the majority of girls are brought up with the idea that science is difficult and not necessarily for girls. This pushes many girls to move into more “softer subjects compared to males’’ because they feel they would perform better... even when it comes to choosing science subjects, many girls would opt for biology rather than physics or chemistry…so it’s all about the perception that science is for boys (21-year old female student)
Thus, the perception that science is the preserve of males (Workman and Heyder 2020 ) indirectly affect females’ interest and performance in the subject, especially at the senior high school level. It was explained that most females would opt for reading courses in the Arts rather than the science-oriented subjects. But even within the sciences, females would prefer biology to physics and chemistry since there is more reading in biology. In effect, females tend to be attracted to subjects that are perceived as less difficult and easily comprehensible. Again, discussions with informants revealed that the patriarchal nature of society where males are socialised to take up challenging tasks including subjects studied at school is an additional contributory factor. Males are expected to behave competitively and so they are more inclined to push and challenge themselves to pursue science subjects, which in turn reinforces these patriarchal ideologies as explained by these male respondents:
‘‘…all originate from how society has been structured and how we’ve been brought up. Males are expected to do well in certain subjects and females are also expected otherwise. I realised that some males do not necessarily like science but because of the perception that boys are expected to be intelligent, they put in a lot of effort to do these courses and end up performing better because they want to prove to their peers and society that they can do it…’’ (21 year old male student).
Similarly another male respondent noted:
…because of the perception that boys must succeed, they put in a lot of effort especially when exams is approaching …they become more aggressive with their studies because many boys feel hurt when girls beat them in exams and because they don’t want that, it drives many males to outperform females particularly in STEM related subjects (22 year old male student).
These notions as expressed point to the fact that gender ideologies and identities about males portray them as dominant and competitive as compared to females (Legewie and DiPrete 2012 ). These are inculcated in males’ right from the basic level, and pushes them to take up challenging tasks including ‘ perceived difficult subjects and work themselves out to outperform females’ as explained by a male student.
In the bid to encourage more girls to study science and pursue science careers, teachers generally play a major role in motivating females’ in-class and support them to excel in these STEM subjects (Wrigley-Asante et al. 2022 ). Whilst females are somehow ‘ pampered ’ by way of attention given to them, males more often than not do not receive such attention. The discussions revealed that in some cases females were provided with in-class notes and other teaching aids whilst males were challenged to study textbooks and often encouraged to study on their own. This could also be linked to some of the gender ideologies and perceptions that “ males should be able to do things on their own because they are males” as noted by one male respondent which somehow reflects in the teaching styles often used by teachers. This attitude towards males somehow pushes them to go beyond their boundaries and to develop strategies such as peer-learning discussion groups, which in-turn results in better performance. A male respondent explained the situation as follows:
“…I think the teachers’ pay particular attention to girls than boys…most at times the girls are encouraged and supported but the boys are told to study on their own, and because most boys also feel that they are [men] and have to succeed, they push hard to study on their own. In fact, most often we form study groups to discuss amongst ourselves and we go to the teachers, if need be, to explain the areas that we don’t understand. I think the teachers rather push us hard to study to make better grades…” (21 year old male student).
In effect, whilst encouraging and stirring up interest of females to take up science courses, teachers may be unconsciously challenging the males to go beyond what is being taught in class and to study on their own. This however, seem to impact positively on their performance in STEM subjects in general as males tend to support themselves in this context. This supports Cheng ( 2020 ) and Adeyemi et al. ( 2019 ) argument that peer groups can have either positive or negative effect on the academic performance of male and female students depending on the subculture of the group. It also shapes a person's attitude and motivation and impact on learning and performance. In this context, the subculture of the male peer group is to come together to support themselves in learning and understanding of the subjects which impacts positively on their academic performance. It also support the gender identity theoretical argument that academic achievement intersects with the conception of masculinity since it is reinforced by teachers and ends up as an incentive (Willis 1981).
Further, the nature of some STEM subjects (notably physics and chemistry) vis-a-vis the learning styles of males and females also affect performance levels, especially at the high school level. For instance, it was understood that both physics and chemistry (in particular) have certain principles and formulae which are fundamental. Males seem to challenge themselves and go a step further to learn these principles off-hand or memorise them, whilst many females would want to have a better understanding of these principles and sometimes the rationale before applying them. Whilst this may be a better learning style, it sometimes affects females academic performance in one way or the other as explained by a female respondent:
“…physics and chemistry in particular have formulas that one has to apply. While our male colleagues would accept everything as it is, most females would want to understand aformulae before applying…sometimes we girls tend to even challenge some of these and I believe that also affect our performance’’ (20 year old female student).
Whilst this may imply that there are differences in learning styles between males and females which also impacts on their performance levels, it is also embedded in the stereotypes surrounding STEM courses particularly on the part of females. This is one area that may require further research and explanations.
The respondents highlighted the fact that individual-level factors do influence academic performance levels for both females and males in addition to gender stereotypes. For instance, it was explained that students who perform well in class would want to maintain the status quo and would therefore put in extra effort to maintain their performance levels. Such individuals, most often would not want both their peers (whether male or female) to out-perform them so “they will go at all length to maintain their scores all the time to always stay at the top” as explained by one male respondent.
As noted by Conrad and Petry, ( 2012 ) and Murray et al, ( 2014 ), such personality traits play an important role in academic performance outcomes as it involves attitudes, behaviours, and lifestyle of that individual. In this context, such persons, most often males as explained in the FGDs, would want to maintain the status quo of always out-performing their peers. Such individuals may also possess positive self-concept and will approach their academic work with more seriousness (Ayodele 2011 ; Erdogan and Sengul 2014 ). This will continue to increase their confidence in their abilities (Kvedere 2014 ) and overall performance.
As shown in Table Table5, 5 , females compared to males opined that their academic performance has improved. Discussions with the students revealed that several factors contribute to the differences in the observed responses of males and females academic performance. These include the teaching style and methodology, economic factors vis-à-vis gendered ideologies, motivation, and self-efficacy.
A key factor accounting for females’ improved academic performance at the tertiary level was due to the teaching methodologies used at this level. These teaching methodologies allow students to explore, ask critical questions, analyse problems, and come up with one’s own ideas. This allows room for many females to explore, understand key challenging issues and proffer solutions based on their own understanding as explained by a 21 year old female:
“…I think studying at the tertiary level is different from the senior high school level… at this level, both males and females are encouraged to study on their own and come out with solutions and so it allows us [females] to explore better. This teaching method favours many females and helps improve on our academic performance. Most of the courses are more applied so we develop our own ideas and respond to issues based on our understanding...” (21 year old female)
The above assertion seem to be in line with Pryer et al. ( 2009 ) argument that female students seem to perform better than male students in the arts and sciences at the tertiary level ask more questions than males in class and seek feedback on assignment than males. This obviously allows them to express themselves better as compared to the high school level and a positive effect on overall performance.
The high unemployment rate and uncertainties in the job market came up as a major factor affecting academic performance in the sense that it is also linked to the issue of time management. The FGDs revealed that whilst females tend to spend more of their time effectively on their studies males were more likely to spend their time on extra-curricular activities such as seeking economic ventures, and exploring other job-related opportunities. Some also tend to engage themselves with part-time productive activities with the hope of being absorbed by such organisations immediately after school as explained by this male respondent:
“… these days it’s not easy to secure a job after school, there’s the fear that one will not be able to get a job because employers are looking out for particular skills which one can learn on the job…some of us [males]do part time jobs while in school so we can learn that skill as we move on. In the process, we spend time juggling between work and our studies. We are not able to put in that much efforts as compared to when we were in high school and this somehow affects our performance…” (22 year old male).
This narrative resonates with many of the male students and in line with Jha and Kelleher ( 2006 ) findings that boys underperformance in school is due to their socio-economic and occupational practices that they engage in which consequently reduced the amount of time needed to engage in academic work. It also shows that females tend to study for more hours than males (Pryer et al. 2009 ), Whilst male informants accepted the fact that these extra-curricular activities tend to affect their academic performance, they explained that it is to support themselves financially and to meet the cost of their education, a point which is corroborated by Craig and Fairchild ( 2007 ).
Whilst these are linked to the broader contextual and political economy issues, the discussion also revealed that they are embedded in the social ideology of perceptions about males and females. For instance, it was explained that some male students perceive themselves to be “breadwinners” in future and therefore have to be responsible even to their college ‘partners’. 2 This forces them to seek for alternative jobs in order to be financially secured so they could ‘ support themselves and their partners even whilst in school’ as explained by one male respondent.
In effect, gender ideologies imbibed as a result of the socialisation processes contributes to this perceptions and attitude amongst males, which in turn affect the time spent on their studies and overall performance.
Further, some also explained that parents do support females financially better than they do for males, thus they need to secure part-time jobs to support themselves. Subsequently, the engagement in economic ventures affect for their academic performance.
Another major factor for the improved performance of females is the education and sensitisation programmes undertaken over the years. This includes female empowerment programmes, and the awareness that science is not only for males but also for females particularly when one makes it to the tertiary level. The support received from parents and other role models and the advocacy campaigns provides some motivation for females at the tertiary level to perform better as explained:
“… at this level many of us girls are aware of what we want to do in life and therefore we become more focused. We also receive a lot of support from our parents and other women’s associations such as Women in Engineering (WiNE). They support us and motivate us to aspire to higher heights…” (20 year old female).
This also implies that motivation and self-efficacy plays an important role in all of these. Females thus tend to be more conscientious than their male counterparts (Pryer et al. 2009 ) at the tertiary level.
The aim of this paper was in three folds. First to compare academic performance between males and females in the STEM subjects or courses that they pursued during their high school level; to assess the performance in the STEM subjects or courses that male and female students pursue at the tertiary level and to explore the factors accounting for the changes (if there are) in the academic performances of males and females at both levels. The study proceeded from the premise that such comparative analysis of academic performances for male and female students at the different educational levels were limited especially in the Ghanaian context.
The results show that there are gender differences in the academic performance at the senior high school level for all six subjects, with much of the differences occurring in Chemistry and Physics subjects, where males performed better than females. However, we found that the gender performance gap in favour of males switches in favour of females by the time they get to the tertiary level. The qualitative interviews revealed that different factors may account for these differences. These include individual-level factors such as the perceptions about STEM subjects in general vis-a-vis the stereotypes surrounding them. These contribute to the lower performance of females as compared to males, particularly in Physics and Chemistry subjects at the senior high school level. Generally, these subjects are perceived as ‘male subjects’ and tend to influence attitude and approach in the learning styles of males and females. Again, the teaching methods and styles often used by teachers are somewhat influenced by these gendered ideologies, which rather play to the benefit of males and subsequently affect their overall performance.
Interestingly, the mean rank of CGPA for females was found to be bit higher than that of males at the tertiary level though the differences in the rank scores were not statistically significant. The chi-square test also shows a statistically significant difference between males and with regards to subjective assessment of academic performance, with majority of females indicating improved academic performance as compared to their male counterparts. We found that the teaching approach at this level which allows students to explore was favourable for females. Motivation and support from parents and the advocacy campaigns on women’s empowerment, all do have positive effects on females. On the other hand, whilst females tend to spend more of their time on studies, males often engage in extra-curricular activities and other economic ventures, which are also linked to wider broader and contextual level influences such as economic hardship or financial constraints as well as gendered ideologies. At this higher level, it appears that female students are more conscientious than their male colleagues as argued by Furnham and Moutafi 2012 and DiPrete and Jennings 2012 ) in the sense, females are more disciplined and committed to their studies as compared to males.
Whilst it is imperative to continue to encourage females to study STEM courses and pursue STEM field, it is equally important to develop policy measures to address males’ performance at the tertiary level. This should include sociocultural interventions especially intended to stimulate the motivation and interest of males lagging behind. We recommend that whilst emphasis is placed on getting more females in STEM disciplines and careers, it is also important to focus on the males. The chief aim must be to address the attitude of males towards substituting economic interest for academic interest at the tertiary level. This requires continuous education and sensitisation of the gender stereotypes and policy measures to sustain both males and females in STEM for overall national development.
The authors gratefully acknowledge the financial support of the Women and Science Chair, a Paris Dauphine-PSL University and its Foundation Chair, in partnership with Fondation l’Oréal, La Poste, Generali France, Safran and Talan. We would also like to thank all students who participated in the study.
CWA, CGA: conceptualization. CWA, LKF: data input and curation. CGA, LKF: formal analysis. CWA, LKF: literature review. CWA, LKF: methodology. CWA, CGA: supervision. CWA, LKF: initial draft. CWA, CGA, LKF: writing—review and editing.
Women and Science Chair, Paris Dauphine-PSL University and its Foundation Chair, in partnership with Fondation l’Oréal, La Poste, Generali France, Safran and Talan.
Declarations.
The authors declare no conflict of interest.
This study is not verified or reviewed through any institutional review board.
Permission was sought from respondents as to whether they would like to participate in the study, and their participation was taken into consideration once they gave their consent.
Confirmation that all research was performed in accordance with relevant guidelines/regulations applicable when human participants are involved (e.g. Declaration of Helsinki, or similar); N/A.
1 Students were asked to provide personal details such as name, phone number, and email address at the end of the online survey form/questionnaire. The personal information provided was used to follow-up on respondents for further qualitative information.
2 Partners here refers to their colleagues of the opposite sex who they are dating or having amorous relationship with.
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
Scientific Reports volume 14 , Article number: 18309 ( 2024 ) Cite this article
174 Accesses
Metrics details
The research purpose is to evaluate the effectiveness of leadership in the process of distance learning from the perspective of the psychological theories of leadership, gender, and cross-cultural issues. The present research is based on such methods as surveys, testing, quantitative and qualitative analysis, and statistical data processing. The subjective (the experience of the respondents) and objective (machine calculation of clusters) assessments allowed the scholars to generate more arguments on leadership in the learning process. The sample consisted of 600 female and male students (300 from each sex, respectively) aged 18–20 years from Abu Dhabi University, American University in The Emirates, and the United Arab Emirates University. The research stated that the majority of students, regardless of gender, suppose that both sexes can develop similar leadership traits (80%). The research claims that female leaders have to be more dynamic and demonstrate higher intelligence (26% vs. 20%) and confidence (20% vs. 15%) than male leaders. Cultural and socio-demographic characteristics do not play a significant role in leadership development (10%). The main cause for the choice of a leader is behavioural and communication characteristics (50%) as well as personal qualities (35%). These results can be used for the online design of distance learning courses in universities (both group and individual), as well as for psychologists to study the aspect of individuals’ predisposition to leadership. It makes sense for further research to explore the issue of differences in the perception of educational leadership in Asian and Western European countries based on the cross-cultural aspect, that is, the influence of national culture on the choice of leader in the educational environment.
Introduction.
In the twenty-first century, leadership is subjected to global processes, including the international political environment, the development of innovative technologies, globalisation, etc. 1 . These processes have significantly influenced the understanding of how to develop well-structured organisational models in modern society. The procedures significantly influence how leadership is perceived, thereby heightening its significance in attaining individual success. The research admits that the focus on communication has changed to less individualistic. Modern society experiences a high level of digitalisation, which cannot but affect the worldview and the development of character traits 2 .
Leadership concepts and culture are the most popular topics in the scientific literature on management and education. As part of leadership, gender stereotypes play a significant role, but they do not include gender expectations, based on implicit beliefs. However, early works by European scientists discussed that these concepts were considered by researchers separately in the context of leadership 3 . The analysis of the key concepts may have caused an incomplete understanding of leadership, paying no attention to important questions such as the role of leadership in modern society and its effectiveness for the educational system 4 .
Recently, insufficient information was collected on how national culture influenced leadership styles. Therefore, additional research is needed to evaluate the relationship and role of gender and leadership in the national context. The experimental data should be collected on the global leadership processes and their role in distance education faced by society in the past 5 years.
On the global scale, open online learning and distance learning requires innovation and updated strategies at all levels due to paradigm shifts and global trends towards increasing digitalisation in all sectors of society 5 . Education should focus on new trends in executive leadership, paradigm shifts, and innovative approaches to popularise leadership and management practices 6 .
Globally, humankind must reconsider leadership as part of open, online and distance learning, making it an innovative, redefined and re-evaluated process. The main topics discussed by modern researchers are the new vision of digitalisation, the solutions to emerging social problems, global open, online, and distance learning, and leadership of open online learning available to students on a global scale 7 .
Modern leaders are the individuals who embrace and promote teaching, research, governance and society to move towards in-person global open online learning. The present research focuses on the issue of leadership behaviour because a leader’s style or behaviour theory is one of the main theories of leadership used in cross-cultural research. Future research is needed to evaluate the differences in leadership processes between male and female leaders. It should be considered in terms of What leaders do and how they act 8 .
The proposed experiment is additional cross-cultural research that focuses on issues of gender, social status and leadership. The research goal is to discuss these concepts and fill a gap in the scientific literature on gender studies from the cross-cultural perspective and distance education. The research is the synthesis and the generalised conceptual model that supports gender, educational, and cultural studies.
Conducting research on evaluating leadership development in distance education with a focus on gender, psychology, and cultural dimensions is quite important because understanding gender dynamics in leadership development can help address and mitigate gender biases and disparities. Research would help identify specific needs and best practices, ensuring that leadership development programs are culturally sensitive and relevant. Leadership development is deeply intertwined with psychological constructs such as motivation, self-efficacy, emotional intelligence, and resilience. Investigating these aspects can enhance the effectiveness of leadership training by tailoring approaches to individual psychological profiles.
Distance learning as a communicative educational tool.
Globally, distance education has entered many spheres of life influenced by open universities and the latest computer technologies. In modern Europe, many educational institutions offer distance learning programmes, including well-known universities such as Dublin City University, EU Business School, and the University of Turin 9 . Similar universities are present in North and South America, along with Asia. An interest in technology education programmes, including satellite communications services and networking, has increased significantly over the past decades.
Previously unpopular, distance learning has been introduced to many industries and has become a major initiative for both for-profit and non-profit colleges and universities. Different factors increased the demand for distance education, including the need to occupy a niche in the market, competition between higher education institutions, increasing popularity, the development of new trends in digital technologies, and changing priorities in higher education 10 . All these factors supported the introduction and implementation of distance education programmes in higher education.
Distance learning is a promising new technology designed to involve students in independent learning 11 . The system is based on electronic communication technology that supports interaction between students and teachers from different locations (geographic places), time zones, or both factors 12 .
At the beginning of the last century, a group of American scholars researched the influence of command forces on human behaviour patterns and paid special attention to the concept of leadership. The research claims that the group can control the leader’s behaviour, managing the dominant style using time constraints 13 .
In distance education, the leader is a mentor who manages students’ behaviour at the administrative level 14 . In some cases, a team of leaders is formed to find the best ways and methods of training (including distance learning). Leadership is a decision-making process. Thus, management is a process of making optimal decisions, even if these decisions may not satisfy the interests of the majority of participants involved in the learning process 15 . A good leader encourages his colleagues to take on challenges at work being limited by different factors, helps employees progress unlocks their inner strength, and makes them feel comfortable about getting the job done. The students’ interest in selflessness, a sense of responsibility and pride in their team is crucial for effective team management.
Management and leadership operate at different levels. Management helps scholars identify qualitative decisions and solve problems. Leadership stipulates what should be done for this. Leadership psychology views this process from the perspective of individual experience, focusing on factors such as gender, ability, potential, and social aspects 16 .
The Asian scholars analysed in detail the relations and co-dependence of gender and leadership qualities. Many scholars suppose that no direct relationships exist between the two concepts 17 . They pay special attention to the psychological theories of leadership. Researchers usually define leadership according to the goals of their research. Thus, many definitions of leadership exist and serve the needs of different research subjects. Leadership is a process in which an individual interacts with others to achieve goals 18 .
This research investigates the best environment for distance learning from the leadership theory perspective. Psychologists identified different styles of leadership. The theory of charismatic leadership defines a leader as an individual with unique personality traits. The leaders develop absolute trust in the group based on charisma and encourage others to follow 19 . Situational leadership suggests that leadership styles change in different circumstances. This theory successfully functions in the context of social and cultural factors that influenced the development of the team of students from different countries (when communication is based on different styles). The situational leader may demonstrate leadership qualities in one situation but omit them in another 18 . Situational leadership is flexible and allows a leader to use leadership qualities in turbulent times (i.e., educational, commercial, political, cultural, and gender spheres).
Relational leadership suggests that an individual focuses on unity and develops connections between group members 20 . This theory encompasses the configuration of the leader’s personality traits, duration of team engagement, and social-cultural dimensions. At the same time, leadership implies joint performance when each group member brings something new to the collective decision. Goals and objectives, rather than gender and cultural factors, are the key drivers of leadership. National researchers support this leadership theory.
Distance learning eliminates the need for teachers’ physical presence in the classroom. The development of a lesson, learning period, task completion, and assessment are separated in time. The scholars consider that the student is more focused and motivated as he learns the lesson in a convenient place and time 19 . The new approaches to learning allow teachers to track the progress through electronic systems that can be accessed from any electronic device 21 . Learning online, the student does not miss lessons but uses the free time more rationally to learn the main programme and concentrate on scientific areas of interest. The need to visit an educational institution each day limits the independence of students and prevents individuals from choosing free time (for example, the need to do homework). The student has limited opportunities to select a social environment and a way to interact with the outside world 17 . The physical distance between teachers and students reduces internal tension and communication barriers. Communication skills can be improved by both educators and students, including fewer stereotypes, social clichés and compliance.
The communication models described above allow students to recognise the stimuli and the new behaviour patterns but, at the same time, generate stereotypes. The experiment should focus on the attribution and distribution of social roles to overcome the contradictions between expectations based on stereotypes and the behaviour of individuals in a team 20 .
In distance learning, teamwork has acquired new features. For its successful functioning, a group needs a leader who can unite different types of individuals but overcome subjective factors of influence such as space, time and cultural differences. The experiment evaluated the role of the leader in this process and destroyed the social stereotypes that existed in society. The research purpose is to investigate the effectiveness of leadership in distance learning influenced by psychological theories of leadership, gender characteristics and cross-cultural factors. The following key tasks should be implemented within the framework of the experiment to achieve the research goal:
evaluate the influence of leadership qualities on teamwork;
evaluate the impact of leadership on distance learning;
determine the differences in the perception of leadership by males and females;
identify how cross-cultural factors influence leadership in distance learning.
Research design.
The research aims to achieve the research goal, which is the development of leadership in distance learning. The proposed models and characteristics associated with management and leadership were identified. The scholars evaluated the distance learning environment to determine the presence or absence of possible problems and limitations. Using the formula Proof by Contradiction the research proved the possibility of developing leadership in distance learning 22 .
This research uses mixed methods of the research such as surveys, testing , quantitative and qualitative analysis , and statistical data processing . The wide and in-depth qualitative analysis of interview answers (the examples are given in the “ Results ” section) 23 allows us to understand the general nature of the research. The methods of subjective (the experience of the respondents) and objective (machine calculation of clusters) assessment allowed the scholars to apply more arguments to leadership in the learning process.
Induction methods helped the scholars identify the specifics of the direct implementation of distance learning. The synthesis is used to determine how distance learning approaches affect an educational institution in a society.
The sample involved 600 female and male students aged 18–20 years from Abu Dhabi University, American University in The Emirates, and the United Arab Emirates University. These universities were chosen for the experiment because they educated foreign students (4/5 of the total number of students). In such a way it became possible to investigate cultural concepts in the field of leadership education. The presence of both male and female students minimized the aspect of gender inequality and provided the possibility of gender leadership investigation.
This form of education requires distance learning to provide all students with the required learning materials. As part of this process, students should periodically interact and complete team projects to demonstrate leadership qualities. Communication involves interaction based on different genders as well as cross-cultural aspects. A focus group of 600 students included 50% males and 50% females tested to collect more accurate data during the research (Table 1 ).
This section may be divided by subheadings. It should provide a concise and precise description of the experimental results, their interpretation, and the experimental conclusions that can be drawn.
The questionnaire on leadership stereotypes helped the scholars collect the data and determine the main leadership models in distance learning. The processing data are distributed according to t-correlation, which validates the processing data reliability. The questionnaire on leadership stereotypes includes two blocks of questions processed using the STATISTICS cluster analysis programme. Block 1 includes questions about leadership stereotypes, the role of a leader in a group, and the perception of leadership by females and males. Block 2 includes questions about social and cultural factors that influence interpersonal perceptions.
The main research limitation is that it collects data from one educational institution. The effectiveness of the experiment can be increased by using several private universities, proposing the distance learning format. If these universities are located in different countries (isolated geographical areas), the effectiveness of the survey will increase several times.
All research participants followed the main principles of the Declaration of Helsinki and acted with the permission of the educational institution. The students were informed about the research objectives and signed personal consent to participate in the research. The ethical issues cover how to collect and disseminate personal data. This research does not involve animal studies.
The study was conducted in accordance with the Declaration of Helsinki, and approved by the Institutional Review Board of United Arab Emirates University (protocol No. 003 of 12.08.2023).
Informed consent was obtained from all subjects involved in the study.
The experiment claimed that participants working in groups of up to 20 students comfortably attended an online lesson: the teacher sees each of them, and the students can also see each other. Thus, socialisation is not excluded but supported in student chats.
In an online lesson, students work directly with each other on some tasks. This practice allows the student to test different communication strategies and ensure the same rapport with peers. The student develops leadership qualities through communication with new partners.
The Distribution of key factors influencing the perception of a leader (developed by the author).
A student survey confirms that the leader exhibits identification factors. Most students pay special attention to how other students perceive them, which is consistent with the theory of charismatic leadership. The key factors influencing how exactly a group member perceives someone as a leader are the following: behaviour (50%), open-mindedness and intelligence (35%), appearance (i.e. physical attractiveness, personal style, etc.) (15%) (Fig. 1 ).
Respondent 1: I think there are several key features that are essential for a good leader. First and foremost, a leader should have strong communication skills. This means not only being able to clearly articulate their vision and expectations but also being a good listener. It’s important that a leader listens to their team and considers their feedback. Respondent 4: Another important feature is open-mindedness. A leader should be open to new ideas and different perspectives. This means being willing to consider alternative approaches and being adaptable when circumstances change. Open-mindedness also involves recognizing and valuing the diverse backgrounds and experiences of team members. Respondent 38: In terms of behaviour, I think a leader should demonstrate integrity and honesty. They need to lead by example and act ethically in all situations. Trust is built when leaders are consistent in their actions and transparent in their decision-making.
The respondents admit that a leader possesses different qualities (Fig. 2 ) that form the image of an ideal leader. In many ways, this image may have changed and transformed from the native culture and educational environment that these respondents share. The table illustrates the differences in most leadership qualities but not significant. The research states that a female leader should have more intelligence (26%) and self-confidence (20%) as well as better communication skills (24%) to gain favour as a group leader than males. Male leaders need more rigidity (5%) and punctuality (15%). The rest of the indicators showed no significant difference.
Respondent 21: I think intelligence is crucial for female leaders. They often have to prove their capabilities more than their male counterparts. For example, in my last team, our female leader was incredibly knowledgeable and always on top of things. This really helped her earn our respect and trust. Respondent 12: For male leaders, punctuality stands out. In my team, our male leader’s punctuality set a standard for the rest of us. It showed he was serious about our work and respected our time.
The Distribution of the leadership qualities depending on gender (developed by the author).
After cluster analysis of the questionnaires, the following groups are formed for the Indicator Leaders:
Absolute leaders or individuals who achieve and maintain the status of both formal and informal leaders (35% of cases).
Instrumental leaders or accepted as leaders only for their goal-oriented behaviour (20%).
Expressive leaders or individuals selected as leaders for their ability to develop emotional strength (45%) (Fig. 3 ).
The Classification of Leadership in distance learning (developed by the author).
The research revealed well-developed leadership acquisition models relevant to distance learning. The test of twenty statements, according to the recommendations of Locatelli, was adapted to the research task and contained an understanding of the question: What am I, if I am a leader? The proposed statement helped more accurately determine the leader’s characteristics perceived by the respondents.
The leaders’ indicators were reflected in the statements about the personal qualities of a leader and related role models:
Leadership role or socio-demographic characteristics (10%).
Personal leadership qualities (clever, kind, etc.) (35%).
Appearance (appearance and style of clothing) (15%).
Behavioural and communication characteristics (50%).
The testing revealed a high homogeneity of responses among respondents (with a standard sampling error of 2.2%). The majority of respondents (80.5%) possessed a high level of personal qualities. These characteristics are supported by communicative behaviour patterns as well as appearance and leadership role characteristics. Regardless of gender, the key personal qualities are responsibility, kindness, intelligence, honesty, strong character, and willpower.
The research underlines that these characteristics influence the image of a leader in the survey. This signifies that the subjective assessment by respondents, based solely on personal experience, aligns with the machine-generated calculation of potential leadership models. This can serve as evidence of both a well-developed research process and a homogeneous sample as well as the fact that the questionnaire did not go beyond the cultural template, thereby creating a cross-section of Eastern culture.
The research illustrates that disability, serious illness, or unremarkable appearance cannot prevent a student from gaining leadership qualities. Therefore, distance learning provides conditions for inspiring leadership and directly increases the opportunities for winning it in small and large groups since such a learning format reduces the pressure of society on the formation of a personality. Students of distance learning programmes develop leadership qualities and use them in everyday life. Moreover, students increase their social status and improve their motivation for learning (learning always means a person’s desire for self-improvement).
Distance learning stimulates leadership and works for students, teachers, methodologists and educational institutions 24 . The proposed approach is found in cultural and social aspects, including the low cost of education for both students and teachers (since there is no need to rent a room, spend money on the travel to the place of education, etc.) and reduced time on the road. Moreover, the approach suggests the independent planning of time, place and lessons as well as training for a large number of individuals at the same time. The research supposes that the new model will help educators improve the quality of education, using modern tools and electronic libraries, and a unified educational environment (important for corporate training) 21 .
In contrast to the standardised approach, the average approach of schooling applied to distance learning in higher education caused a significant advantage. Distance learning uses interactive and hybrid or blended courses that offer flexible learning for students of all ages, including individual and team learning 25 . Both learning perspectives can be adapted to the needs and expectations of a specific group. This approach is a framework for successful leadership development and understanding social relations regardless of gender and country 26 . The research claims that gender is not the main issue in becoming a leader in a learning group. The attention is paid not to appearance (15%) but to the behavioural factor (50%) and mental abilities (35%) of a leader.
In different circumstances, males and females demonstrate leadership qualities in different ways, depending on how they acquired these qualities (whether they were nurtured, learnt or life experiences) 27 . Some Asian scholars describe the style of female leaders as democratic and flexible. The research underlines that empathy, sociability, adaptability, and less aggression are traits that are rarely found in females 28 . German scientists emphasize that male leaders dominate large groups, are less open and expansive and prefer old-fashioned communication patterns with staff 29 . American scientists do not distinguish between male and female leaders, evaluating common characteristics such as competence, pomposity, efficiency, and creativity 30 . In their opinion, these leaders possess high self-esteem, a clear sense of personal goals, self-awareness, coolness, and independence. The research claims that to be successful, a female leader must demonstrate more intelligence (26% and 20%, respectively) and more confidence (20% and 15%, respectively) than male leaders.
The research on gender in leadership covers six issues related to the relationship between leadership and gender, namely the number of males and females in leadership positions; behaviour patterns; leadership effectiveness; gender distribution in the group; desire for leadership; and gender identity of the leaders 31 . At the same time, the main issue is that the effectiveness of education should not be questioned if the research relies heavily on the stereotypes of gender co-dependence.
The cultural characteristics of the leader’s country of origin influence the national organisation and depend on the ability of the leader to manage organisations in terms of informal and formal communication, introduce a unified communication system, and access reliable and complete sources of information 27 . The research confirms that socio-demographic characteristics are not in the first place (10%) in the survey. Competences to address this issue are effectively formed through distance education, which explains the need for educational institutions 32 .
The findings from research on evaluating leadership development in distance education, considering gender, psychology, and cultural dimensions can lead to the creation of leadership development programs that are customised to meet the specific needs of different genders, cultural backgrounds, and psychological profiles. Different educational programs can be designed to be more inclusive, addressing gender biases and ensuring equitable participation and engagement for all learners. Policymakers can use research findings to develop regulations and standards that ensure leadership development programs are inclusive, effective, and culturally sensitive. Educational institutions can leverage findings to strategically plan and implement leadership development initiatives that are aligned with the diverse needs of their student population.
The research evaluates the problem that a student becomes a leader if he develops strong personal qualities and has similar features to the image of a leader formed in the views of the social group. However, leadership behaviour is formed before the inclusion of an individual to the team. Distance learning can be seen as an effective means of the leaders’ development. Leadership has become a significant feature of the modern world. An individual strives for and can achieve social success as well as change the educational world, closely connected with information technologies, allowing many students to access information and acquire new skills, regardless of their geographical location. Distance learning is becoming an effective way to achieve leadership without the influence of social stereotypes, clichés, gender inequality and other social and cultural barriers.
The research underlines that the majority of students, regardless of gender, suppose that both sexes can be leaders (80%). Factors influencing how one or another group member perceives the leader of the organisational structure (formal or informal) of the social group depend on the emotional dimension of intra-group relations. The collected data demonstrates the absence of limitations in stereotypical perceptions of leadership. At the same time, the research finds that to achieve success, a female leader has to show more intelligence (26% and 20%, respectively) and more confidence (20% and 15%, respectively) than male leaders. Cultural and socio-demographic characteristics do not play a key role in the development of a leader (10%). The key issues for this selection are behavioural and communication characteristics (50%) as well as leadership and personal qualities (35%).
These results can be used by educators to develop online distance learning courses in universities (both group and individual) as well as by psychologists who evaluate the personal qualities of individuals and the social environment to develop leadership skills. Educators are encouraged to cultivate and integrate curriculum content that encompasses a wide array of cultural perspectives and leadership practices, incorporating case studies, examples, and readings drawn from diverse cultural contexts. The results of this research would help to use a mix of teaching methods such as discussions, role-plays, and simulations that cater to different learning styles and cultural backgrounds. Further research is needed to assess the difference in the perception of leadership in learning in Asian and Western European countries from a cross-cultural perspective.
All data generated or analysed during this study are included in this published article.
Yukl, G., Mahsud, R., Prussia, G. & Hassan, S. Effectiveness of broad and specific leadership behaviors. Pers. Rev. 48 (3), 774–783 (2019).
Article Google Scholar
Sousa, M. J. Entrepreneurship skills development in higher education courses for team leaders. Adm. Sci. 8 (2), 18 (2018).
Article ADS Google Scholar
Wong, H. C., Ramalu, S. S. & Chuah, F. An overview of leadership and the emerging of relational leadership. J. Hum. Resour. Leadersh. 4 (1), 32–43 (2019).
Google Scholar
Wowk, K. et al. Evolving academic culture to meet societal needs. Palgrave Commun. 3 (1), 35 (2017).
Vasconcelos, M. C. C. Education at home: Unschooling prospects or freedom of choice? Pro-Posições 28 (2), 122–140 (2017).
McKim, A. J. & Velez, J. J. Informing leadership education by connecting curricular experiences and leadership outcomes. J. Leadersh. Educ. 16 (1), 81–95 (2017).
Vázquez-Toledo, S., Latorre-Cosculluela, C. & Liesa Orús, M. Towards an educational leadership: Functions and failures perceived by teachers and management teams. J. Res. Leadersh. Educ. 19 (2), 147–166 (2024).
Raymer, S. D., Dobbs, J., Kelley, C. P. & Lindsay, D. R. Leadership education and development: Theory driven evolutions. J. Leadersh. Educ. 17 (2), 138–148 (2018).
Rosch, D. M., Collier, D. & Thompson, S. E. An exploration of students’ motivation to lead: An analysis by race, gender, and student leadership behaviors. J. Coll. Stud. Dev. 56 (3), 286–291 (2015).
Ossiannilsson, E. Leadership in global open, online, and distance learning. In Online Course Management: Concepts, Methodologies, Tools, and Applications (ed. Khosrow-Pour, M.) 2212–2224 (IGI Global, 2018).
Chapter Google Scholar
Simpson, O. Supporting Students in Online, Open and Distance Learning 10–12 (Routledge, 2018).
Book Google Scholar
Blose, S. & Khuzwayo, N. Q. Teacher leadership in action: An inquiry into the lived experiences of subject heads in secondary schools. Int. J. Leadersh. Educ. 26 (2), 244–259 (2023).
Seemiller, C. Leadership competency development: A higher education responsibility. New Dir. High. Educ. 2016 (174), 93–104 (2016).
Soria, K., Snyder, S. & Reinhard, A. P. Strengthening college students’ integrative leadership orientation by building a foundation for civic engagement and multicultural competence. J. Leadersh. Educ. 14 (1), 55–71 (2015).
Nicholson, J. & Kurucz, E. Relational leadership for sustainability: Building an ethical framework from the moral theory of ‘ethics of care’. J. Bus. Ethics 156 , 25–43 (2019).
Salihu, M. J. A conceptual analysis of the leadership theories and proposed leadership framework in higher education. Asian J. Educ. Soc. Stud. 5 (4), 1–6 (2019).
Kownacki, A., Barker, D. & Arghode, V. A grounded theory approach for exploring shared leadership: Evidence from urban primary schools in Pennsylvania. Int. J. Leadersh. Educ. 26 (2), 201–222 (2023).
Kromydas, T. Rethinking higher education and its relationship with social inequalities: Past knowledge, present state and future potential. Palgrave Commun. 3 (1), 1 (2017).
Lash, C. L., Burton, A. S. & Usinger, J. Developing learning leaders: How modeling PLC culture in principal preparation can shift the leadership views of aspiring administrators. J. Res. Leadersh. Educ. 19 (2), 167–195 (2023).
Kosse, F., Deckers, T., Pinger, P., Schildberg-Hörisch, H. & Falk, A. The formation of prosociality: Causal evidence on the role of social environment. J. Pol. Econ. 128 (2), 434–467 (2020).
Kim, J. J., Waldman, D. A., Balthazard, P. A. & Ames, J. B. Leader self-projection and collective role performance: A consideration of visionary leadership. Leadersh. Q. 34 (2), 101623 (2023).
Gómez Gutiérrez, S. Contradictions and rationality: An analysis of two Biblical cases. In Beyond Faith and Rationality: Essays on Logic, Religion and Philosophy (ed. Silvestre, R. S.) 153–169 (Springer, 2020).
Cheung, K. K. C. & Tai, K. W. The use of intercoder reliability in qualitative interview data analysis in science education. Res. Sci. Technol. Educ. 41 (3), 1155–1175 (2023).
Daniëls, E., Hondeghem, A. & Dochy, F. A review on leadership and leadership development in educational settings. Educ. Res. Rev. 27 , 110–125 (2019).
Eacott, S. Educational Leadership Relationally: A Theory and Methodology for Educational Leadership 9–19 (Management and Administration. Springer, 2015).
Gilbride, N., James, C. & Carr, S. The ways school headteachers/principals in England at different stages of adult ego development work with organisational complexity. Educ. Manag. Adm. Leadersh. https://doi.org/10.1177/17411432231170581 (2023).
Haber-Curran, P. & Tillapaugh, D. Gender and student leadership: A critical examination. New Dir. High. Educ. 2017 (154), 11–22 (2017).
Aymoldanovna, A. A., Zhetpisbaeva, B. A., Kozybaevna, K. U. & Kadirovna, S. M. Leadership development university students in the activities of student government. Procedia Soc. Behav. Sci. 197 , 2131–2136 (2015).
Bradbury, B. A mixed methods study of distance learning graduate students’ motivation and program satisfaction at an American State University. In VII International Scientific and Technical Internet Conference “Information Technologies in Education, Science and Production”, November 16–17, 2019 (ed. Kondratyonok, E.V.) 11–15 (Belarusian National Technical University, 2019).
James Jacob, W. Interdisciplinary trends in higher education. Palgrave Commun. 1 (1), 15001 (2015).
Hammad, W., Sawalhi, R., Salim Al-Harthi, A., Alamri, F. & Morad, H. Perceptions of teacher leadership in the Arab region: A comparative analysis of three countries. Educ. Manag. Adm. Leadersh. https://doi.org/10.1177/17411432231166888 (2023).
Crisp, G. & Alvarado-Young, K. The role of mentoring in leadership development. New Dir. Stud. Leadersh. 2018 (158), 37–47 (2018).
Article PubMed Google Scholar
Download references
Authors and affiliations.
Sharjah Education Academy, Sharjah, United Arab Emirates
Asma Khaleel Abdallah
You can also search for this author in PubMed Google Scholar
Conceptualization, methodology, software, validation, formal analysis, investigation, re-sources, data curation, writing—original draft preparation, writing—review and editing, visualization, supervision, project administration, A.K.A. The author has read and agreed to the published version of the manuscript.
Correspondence to Asma Khaleel Abdallah .
Competing interests.
The author declares no competing interests.
Publisher's note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/ .
Reprints and permissions
Cite this article.
Abdallah, A.K. The impact of gender, psychology, and cultural dimensions on leadership development in distance education. Sci Rep 14 , 18309 (2024). https://doi.org/10.1038/s41598-024-68495-4
Download citation
Received : 05 February 2024
Accepted : 24 July 2024
Published : 07 August 2024
DOI : https://doi.org/10.1038/s41598-024-68495-4
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
By submitting a comment you agree to abide by our Terms and Community Guidelines . If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.
Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.
Linking gender differences with gender equality: a systematic-narrative literature review of basic skills and personality.
There is controversy regarding whether gender differences are smaller or larger in societies that promote gender equality highlighting the need for an integrated analysis. This review examines literature correlating, on a national level, gender differences in basic skills—mathematics, science (including attitudes and anxiety), and reading—as well as personality, to gender equality indicators. The aim is to assess the cross-national pattern of these differences when linked to measures of gender equality and explore new explanatory variables that can shed light on this linkage. The review was based on quantitative research relating country-level measures of gender differences to gender equality composite indices and specific indicators. The findings show that the mathematics gender gap from the PISA and TIMMS assessments, is not linked to composite indices and specific indicators, but gender differences are larger in gender-equal countries for reading, mathematics attitudes, and personality (Big Five, HEXACO, Basic Human Values, and Vocational Interests). Research on science and overall scores (mathematics, science, and reading considered together) is inconclusive. It is proposed that the paradox in reading results from the interrelation between basic skills and the attempt to increase girls’ mathematics abilities both acting simultaneously while the paradox in mathematics attitudes might be explained by girls being less exposed to mathematics than boys. On the other hand, a more nuanced understanding of the gender equality paradox in personality is advanced, in which a gene–environment-cultural interplay accounts for the phenomenon. Challenges for future cross-national research are discussed.
Despite Western countries having considerably advanced in gender equality, gender horizontal segregation remains among the main drivers of economic gender inequality ( Cech, 2013 ). Women have entered the labor market at increasingly high rates since the 70s, nevertheless, they often still work in specific sectors with substantial effects on their income ( Cortes and Pan, 2018 ). Gender segregation is already visible at the educational level where girls are overrepresented in disciplines such as Social Sciences and Humanities; these subjects are characterized by lower labor market prospects and income ( van de Werfhorst, 2017 ). On the other hand, boys prefer STEM fields which offer high-salaried and more status-related careers ( Barone and Assirelli, 2020 ). To explain the phenomenon, scholars in sociology and psychology have been particularly interested in basic skills and personality gender variances due to their influence on gendered career choices and outcomes ( Rosenbloom et al., 2008 ; Dekhtyar et al., 2018 ; Stoet and Geary, 2018 ).
Regardless of doubts about their magnitude ( Hyde, 2005 ; Archer, 2019 ; Hirnstein et al., 2022 ), gender differences in basic skills and personality are well-established in the literature ( Halpern, 2000 ; Halpern et al., 2007 ; Geary, 2010 ; Weisberg et al., 2011 ). The gender gaps favoring boys in mathematics and science are close to zero on average but observable at the upper and lower tails of the distribution ( Halpern et al., 2007 ; Wai et al., 2018 ). Conversely, differences in reading skills (women > men) are more pronounced and already noticeable when comparing men’s and women’s statistical means ( Halpern, 2000 ; Moè et al., 2021 ). Regarding personality (Big Five, HEXACO, Basic Human Values, and Vocational Interests), gender variances, although small to medium, occur across models and share a similar pattern. On the one hand, women score higher in negative emotions and reciprocity as well as prefer to “work with people.” On the other hand, men have more realistic preferences and regard status-related values more ( Schwartz and Rubel, 2005 ; Schmitt et al., 2008 ; Su et al., 2009 ; Lee and Ashton, 2018 ). On a national level, however, the link between these gender differences and gender equality, measured using conventional indicators such as the World Economic Forum’s Global Gender Gap Index (GGI), remains unclear with scholars making contrasting predictions.
Numerous social-role theories of gender differences expect that the gaps between men and women will decrease as equality between them is achieved ( Eagly and Mitchell, 2004 ; Else-Quest et al., 2010 ). These theories argue that cognitive and personality gender differences are derived from socially constructed gender identities based on erroneous essential beliefs (stereotypes) that men and women are intrinsically different ( Wood and Eagly, 2013 ). Gender stereotypes originate from the division of labor in ancient hunter-gatherer societies, in which greater strength allowed men to engage in more power-related activities, while women were tasked with nurturing duties because of their ability to breastfeed ( Eagly and Wood, 1999 ). Stereotypes would emerge early in life, with elementary school children already consistently engaging in gender essentialism, gender stereotyping, and implicit gender associations ( Meyer and Gelman, 2016 ). Parents, teachers, and friends are responsible for reinforcing them, rewarding children for behaving according to gendered expectations ( Gunderson et al., 2012 ), thereby making gender a “primary framing device for social relations” ( Ridgeway, 2006 ). As a result, boys and girls grow up into adults who have gender-specific roles in society and experience gender-conforming environments that shape their distinct skills and personalities ( Diekman and Schneider, 2010 ). The common assumption underlying these theories predicts that essentialist beliefs decrease in countries with higher gender equality. If this is true, empirical research will find smaller gender differences in more gender-equal nations.
Other studies have theorized an opposite trend, with men and women becoming increasingly dissimilar in gender-equal countries ( Charles and Bradley, 2009 ; Kaiser, 2019 ). Recently, Stoet and Geary (2018) labeled this phenomenon “the gender equality paradox.” Some have proposed that this paradox results from an emphasis on individualism and a societal system designed to accommodate women in what is perceived to be their gendered role ( Charles and Grusky, 2018 ). Others have applied an evolutionary approach and argued that in less unequal environments, men and women freely express their intrinsic differences as the privileged access to resources in “more prosperous and more egalitarian” societies favors the emergence of specific gender-evolved behaviors ( Schmitt et al., 2008 ).
Although the topic of gender difference has been widely discussed, whether men and women become progressively similar or different when greater equality between them has been achieved remains uncertain. This paper reviews several theories hypothesizing contrasting patterns, and then turns to the recent scientific debate on gender differences in basic skills from the PISA and TIMMS assessments, as well as personality (Big Five, HEXACO, Basic Human Values, and Vocational Interests) to consider how they relate to measures of gender equality on a national level. Several challenges for future cross-national research are also highlighted. Specifically, the present review indicates that the correlation between gender differences in mathematics and gender equality may derive from the lack of country-level effects in the models, while ecological stress (food consumption and historical levels of pathogen prevalence) may confound the results for personality. In addition, the paper examines explanations of the paradox in different domains and proposes a novel theory to explain the gender equality paradox in personality, where a “feedback-loop” effect (gene–environment-culture interplay) might account for the phenomenon.
The narrative approach was assessed to be the most suitable method for this study. Compared to more analytical methods, it allows for deeper insights into the ongoing debate ( Graham, 1995 ). However, issues may arise with this method due to bias in paper selection and interpretation ( Dijkers, 2009 ). To avoid these issues, the author implemented a systematic approach based on PRISMA guidelines together with the narrative method.
To be eligible for inclusion, papers had to have been published between 2009 and 2022, and they had to describe quantitative cross-national research analyzing gender differences associated with measures of gender equality (composite indices or specific indicators) utilizing international data. The selected studies were divided into two groups—basic skills and personality—then further divided into multiple subgroups: mathematics, science (including attitudes and anxiety), reading, and overall scores for basic skills, as well as the Big Five, the HEXACO model, basic human values and vocational interests for personality factors. Since they had fewer available papers, the Big Five and HEXACO, as well as basic human values and vocational interests categories were combined.
Published studies were selected from Scopus, Web of Science, Social Science Database, and Google Scholar. The final search was conducted on all databases in November 2022.
The research focused on gender differences in basic skills and personality due to their strong relationships with horizontal gender segregation. Thus, the main search words were “gender/sex differences in mathematics/reading/science,” “gender/sex differences in personality,” “gender/sex differences in basic human values” and “gender/sex differences in vocational interests.” The search was then refined using “gender equality/egalitarianism/inequality” as parameters.
Only papers published in English were considered, and they were selected based on their titles, abstracts, and keywords. This study’s author was primarily responsible for the selection, although two other scholars supervised the process and ensured systematic application of the selection criteria.
Ninety-one papers were preselected; 35 were excluded after deeper screening because they did not match the selection criteria. An additional 25 studies were excluded because they studied gender differences outside the domains of interest. Consequently, 31 papers were included in the study.
3.1. gender differences.
On a national level, gender differences in basic skills and personality have been repeatedly described. Research has shown that boys slightly outperform girls in complex mathematical riddles ( Reilly et al., 2019 ); this difference has been associated with men’s overrepresentation in STEM fields ( Dekhtyar et al., 2018 ). Although the difference approaches zero, gaps are especially visible among the top and lower performers because of the higher variability in boys ( Lindberg et al., 2010 ; Wai et al., 2018 ). Stated otherwise, while there are barely any differences on average, the men’s distribution has a flatter curve, yielding higher values at both the lowest and highest ends. Similarly, men appear to have a small advantage over women in science, with differences particularly visible at the top end of the distribution; however, men are also overrepresented among the lowest performers ( Halpern, 2000 ).
Mathematics and science achievement is influenced not only by skills, but also by mathematics and science attitudes, test anxiety, and self-efficacy ( Ashcraft and Moore, 2009 ; Geary et al., 2019 ). These dimensions are believed to be strong determinants of STEM careers and contribute to the underrepresentation of women in these fields ( Moakler and Kim, 2014 ; Sax et al., 2015 ). Research has shown that men generally report more enjoyment and positive attitudes than women when engaging in mathematical activities ( Ganley and Vasilyeva, 2011 ; Devine et al., 2012 ).
By contrast, women perform substantially better than men on verbal tasks ( Moè et al., 2021 ), with girls using a broader vocabulary than boys on average by age two ( Halpern, 2000 ; van der Slik et al., 2015 ). Verbal abilities comprise various skills, and gender differences are most prominent in the reading dimension, where the girls’ advantage is three times wider than the boys’ advantage in mathematics ( Stoet and Geary, 2013 ). Nevertheless, Hirnstein et al. (2022) have cast some doubts on the magnitude of gender differences in verbal abilities claiming that publication bias might have influenced the results.
Cognitive abilities are largely interrelated. For example, high math skills predict higher reading scores and vice versa ( Bos et al., 2012 ; Reilly, 2012 ). Women’s mean overall scores considerably outperform men’s, even though the latter appears to be better positioned at the top and lower tails of the distribution, a finding that supports the higher men variability hypothesis ( Halpern et al., 2007 ; Bergold et al., 2017 ).
Turning to personality, gender differences are reported across the Big Five traits (openness, conscientiousness, extraversion, agreeableness, and neuroticism) and the HEXACO model (honesty–humility, emotionality, extraversion, agreeableness, conscientiousness, and openness), suggesting small to moderate gaps depending on the test and dimension analyzed. Specifically, women score higher in both neuroticism and agreeableness ( Costa et al., 2001 ; Schmitt et al., 2008 ; Murphy et al., 2021 ), although findings have been inconclusive for openness, extraversion, and conscientiousness, with some studies showing women and others showing a men’s advantage ( Goodwin and Gotlib, 2004 ; Shokri et al., 2007 ). The HEXACO model displays a similar pattern, with emotionality and honesty–humility both substantially higher in women than men ( Lee and Ashton, 2004 , 2018 ).
Men and women also differ in value priority and vocational interests. According to Schwartz’s theory ( Schwartz, 1999 ), values define the motivations behind behaviors that regulate attraction in diverse fields. Although the variations are small to medium, research has consistently shown gender gaps, with men scoring higher in power, stimulation, hedonism, achievement, and self-direction and women scoring higher in universalism and benevolence ( Schwartz and Rubel, 2005 ). On the other hand, vocational interests ( Holland, 1997 ) describe how personality interacts with career environments and are important determinants of gender-typed career trajectories ( Kuhn and Wolter, 2022 ). Previous studies have shown that men prefer to be employed in realistic fields, while women favor working with people ( Lippa, 2010a ), suggesting that men have more realistic and investigative interests, preferring careers in engineering, science, and mathematics. By contrast, women prefer “working with people” as they have more artistic, social, and conventional tendencies, which facilitate social science careers ( Su et al., 2009 ).
The social role theory ( Eagly and Wood, 1999 ) posits that variations between men and women derive from the interaction, reinforced by socio-psychological processes, between evolved gender differences in physicality and the socio-cultural context in which these differences are expressed. Eagly and Wood (2012) have argued that, historically, men’s greater strength, endurance, and speed allowed them to conduct physically challenging duties. Conversely, women developed the ability to breastfeed, making them better suited for nurturing tasks. These evolved physical predispositions for specific activities shaped the domestic division of labor between men and women in ancient hunter-gatherer societies ( Eagly and Wood, 2012 ).
As societies developed, the division of labor began to be influenced by physical gender differences in interaction with the social environment ( Eagly and Wood, 1999 ). In modern countries, the socioeconomic setting dictates the relevance of those activities for which men and women have evolved peculiar physical predispositions. In this context, division of labor no longer relates solely to the domestic sphere but also encompasses paid labor, with men and women being segregated into different occupations. This gender segregation “derives in part from male and female biology—that is, mainly their evolved physical attributes, especially women’s reproductive activities and men’s size and strength, which can allow some activities to be more efficiently performed by one sex or the other depending on the socioeconomic and ecological context” ( Wood and Eagly, 2013 ). Thus, the interaction of evolved physical gender differences with the social environment in which they are expressed is likely to be the main process shaping gender segregation.
Within societies, social-psychological processes reinforce gender segregation and make it appear “natural and sensible” ( Wood and Eagly, 2013 ). Most people, when observing differential behaviors, assume that men and women are intrinsically dissimilar and construct specific “multifaceted” gender roles that include either essentially masculine or essentially feminine features ( Beckwith, 2005 ; Wood and Eagly, 2012 ). Individuals then internalize these roles through societal mechanisms that reward people who comply and penalize those who deviate, leading both men and women to develop specific skills and personality ( Friedman and Downey, 2002 ; Eagly and Wood, 2012 ). Consequently, gender differences in basic skills and personality are derived from the great effort that societies have undertaken to perpetuate gender segregation and comply with constructed gender roles ( Wood and Eagly, 2013 ). It follows that in countries where gender roles are relaxed, gender segregation and, as a result, gender differences in basic skills and personality will be smaller ( Eagly and Mitchell, 2004 ).
The gender stratification hypothesis ( Baker and Jones, 1993 ) is consistent with the theory presented above. Although originally formulated to explain gender gaps in mathematics, it has also been applied in other spheres. The theory suggests that essentialist gender beliefs interact with individual goals, thereby generating gender differences. These differences emerge because men in patriarchal societies can connect their skills with career outcomes, whereas women cannot do so due to unequal opportunities ( Else-Quest et al., 2010 ). In sum, societies that exhibit more gender stratification offer fewer opportunities for women to experience and develop the same skills and personalities as men.
Drawing from expectancy-value theory ( Wigfield, 1994 ) and cognitive social learning theory ( Bussey and Bandura, 1999 ), the gender stratification hypothesis argues that people undertake a task only if they value it and expect success. Perceptions of a task’s value are shaped by socio-cultural stereotypes about characteristics assumed to be gender-essential. Thus, women, due to gender stereotypes, would not find it valuable to invest in domains perceived as “masculine” because they would not expect to succeed in them. Instead, they would prefer to develop more “feminine” skills, and this predilection generates gender variances ( Frome and Eccles, 1998 ).
The above process is ostensibly reinforced by environmental processes that highlight those behaviors that are generally linked to gender in a given cultural setting. In this context, environment relates to the social influences that could be imposed, selected, or contracted according to “levels of personal agency,” that is, the extent to which people feel they are in charge of their decisions ( Bandura and Walters, 1977 ). According to this perspective, the immediate environment provides gender-essentialist information through parents, friends, and the media. Individuals regulate their behaviors according to the social expectations conveyed by this information and, through “direct tuition,” inform others about how different behaviors are linked to gender ( Bussey and Bandura, 1999 ).
According to the above theories, gender differences derive from false essentialist beliefs that diminish opportunities for subjective growth, making differences the result of unequal social treatment ( Figure 1 ). Gender essentialism is conceived as a “powerful ideological” force that legitimates gendered choices and limits personal development ( West and Zimmerman, 1987 ). Stated otherwise, gender not only represents the lens through which people see the world, but it also constitutes the basis for categorizing individuals ( Bussey and Bandura, 1999 ). However, as the above theories emphasize, any visible variation between men and women results not from innate biological differences but from social impositions. If men and women were treated alike, gender stereotypes would fade, exposing them to similar stimuli and, consequently, eliminating gender differences in both basic skills and personality ( Baker and Jones, 1993 ; Eagly and Wood, 1999 ). Thus, gender equality is likely to be associated with reduced gender variation. As Else-Quest et al. (2010) claimed, “where there is greater gender equity, gender similarities … will be evident.” Eagly et al. (2004) argued in the same vein, maintaining that “the demise of many sex differences with increasing gender equality is a prediction of social role theory.”
Figure 1 . Overview of social-role theories of gender differences. Gender differences are generated by essentialist beliefs that men and women are intrinsically different which are in turn influenced by social norms in tandem with the division of labor derived from gender physical specialization.
Drawing on gender essentialism, Charles and Bradley (2009) theorized an opposite effect—that gaps might increase with greater gender equality. They posited that, even if societies are gender equal, gender stereotypes endure because of the emphasis on individualism and self-expression in these societies. Specifically, gender equality stresses the expression of subjective preferences; however, it does not question how that preference emerges—an emergence that, Charles and Bradley (2009) ascribe to societal mechanisms influencing individuals based on their gender. These mechanisms strengthen essentialist beliefs about differences between men and women, in turn reinforcing gender-related roles ( Levanon and Grusky, 2018 ).
According to the foregoing analysis, societal systems are characterized by internal structural diversification that is conceptualized to accommodate individual “expressive choices” but functions, instead, to increase stereotypes as people act out their internalized gender identities rather than their subjective preferences ( Rawlings, 2007 ; Charles et al., 2014 ). In addition, long periods of care leave and advanced family policies, which are generally found in gender-equal countries, tend to influence horizontal gender segregation and compel women to enter into roles typically considered more gender-appropriate ( Freiberg, 2019 ), widening even further the prevailing gender gaps. Thus, even when a society becomes more gender equal, “a preponderance of gender-typical choices” and an increase in gender variances can be expected ( Charles and Bradley, 2009 ). Supporting this statement, some scholars have argued that gender stereotypes increase in more gender-equal nations ( Breda et al., 2020 ; Napp and Breda, 2022 ). Others have stated that “cultural individualism” is often the strongest predictor of gender gaps in equal societies ( Bleidorn et al., 2016 ; Kaiser, 2019 ).
Evolutionary theorists claim that differences between men and women are magnified in more gender-equal environments because privileged access to resources allows them to freely express specific gender “ambitions and desires” ( Schmitt et al., 2008 ; Stoet and Geary, 2018 ). These theorists argue that from an evolutionary perspective, the possibility that men and women evolved with identical characteristics is a “theoretical impossibility” and maintain that gender differences are derived, in part, from innate predispositions ( Vandermassen, 2011 ). Specifically, variations are expected to be visible in those domains in which the evolutionary pressure, mainly sexual selection, has influenced men and women differently ( Schmitt, 2015 ). According to this view, the interplay between “sex-linked” genes and environmental stressors is responsible for the more pronounced gender dimorphism in modern nations ( Schmitt et al., 2008 ).
In ancient hunter-gatherer societies, men and women evolved specific, intrinsic differences as a result of evolutionary adaptation ( Mealey, 2000 ). Nevertheless, environmental conditions suppressed these innate differences that have subsequently re-emerged in developed societies characterized by reduced ecological pressure stemming from favorable economic circumstances. Gender differences in sensitivity to environmental change have played a key role in explaining this re-emergence. Generally, in the animal kingdom, the larger animal between the two sexes shows sharper fluctuations in behavior when ecological settings vary. The same appears to be true among humans, where men are more influenced by environmental changes ( Teder and Tammaru, 2005 ). It follows that both men and women, but especially men, are less affected by environmental components in resource-rich countries, where they are free to follow their intrinsic characteristics ( Schmitt et al., 2017 ). Conversely, in countries that offer fewer economic opportunities, choices are constrained, and reduced gender differences might be evident ( Stoet and Geary, 2018 ).
Thus, according to the evolutionary hypothesis, increased gender variations in more gender-equal societies are mainly a product of the sexual selection that men and women have undergone during evolution together with gender differences in sensitivity to environmental changes ( Schmitt et al., 2008 ). This interplay of gender-linked genes and environmental influences is relevant for some gender variances, such as height, since men in more developed societies are reported to be more sensitive to environmental changes ( Sohn, 2015 ).
Most studies on gender differences in basic skills have focused on the Trends in International Mathematics and Science Study (TIMMS) and the Program for International Student Assessment (PISA). TIMMS targets fourth- and eighth-grade students worldwide and reports their academic achievements every 4 years. Similarly, PISA is a triennial test of mathematics and science administered to 15-year-old adolescents in several countries. The PISA and TIMMS tests have been related to only a few gender equality indices; the most commonly used are the World Economic Forum’s Gender Gap Index (GGI) and the United Nations’ Gender Empowerment Measure (GEM). Both indicators are based on sub-indices that assess gender equality in numerous domains, such as educational attainment, political empowerment, and health.
As Table 1 shows, the math gender gap does not usually relate to gender equality when analyzing TIMMS data; in the PISA data, however, the findings appear to be more divergent.
Table 1 . Correlations between mathematics gender differences (men > women) and both composite indices and specific indicators of gender equality.
Else-Quest et al. (2010) found that higher gender equality leads to slightly smaller differences between men and women in mathematics, although with variation across indices ( r = 0.09–0.14). Similarly, Hyde and Mertz (2009) showed that more equitable index scores result in more women being among the top performers; however, their analysis used a small country sample and excluded Scandinavian nations (more on this below). Moreover, Gevrek et al. (2018) argued that moving toward gender equality predicts a reduced gender gap in mathematics in the part that cannot be explained by “observable characteristics,” that is, explained by elements that can be controlled for in statistical analyses.
However, the results appear to depend on the years that were considered in the analysis. For example, Stoet and Geary (2013 , 2015) found that only the 2003 PISA assessment was consistent with theories hypothesizing that gender equality is linked with smaller gender differences. For other years, gender-equal practices were unrelated to a mathematics gap. Additionally, the results are sensitive to the inclusion of Scandinavian and gender-segregated, Muslim countries as well as gender-equal nations in which boys considerably underperform girls ( Fryer and Levitt, 2010 ; Kane and Mertz, 2012 ; Stoet and Geary, 2015 ). However, some have raised doubts about including Muslim countries in the sample ( Kane and Mertz, 2012 ). Other scholars have proposed that the positive findings derive from a spurious correlation between the GGI and country-specific unobserved variances ( Anghel et al., 2019 ). Finally, as reported in Table 1 , Gevrek et al. (2020) recently reversed their findings, strengthening the evidence that gender equality, measured by composite indicators, is not linked to gender differences in mathematics achievement.
However, composite indices may fail to account for explicit factors influencing the mathematics gender gap while specific indicators may be more suitable for measuring how gender differences vary in relation to gender equality. As Table 1 shows, having more women in research, higher levels of female participation in economic activities, a higher ratio of women to men holding parliamentary seats, and greater educational equality seem to predict reduced gender variation ( Else-Quest et al., 2010 ; Penner and Cadwallader Olsker, 2012 ). More recently, Gevrek et al. (2020) extended their research by decomposing the mathematics gender gap into that which could be explained by “observable characteristics” and that which could not. Their finding suggests that the men-to-women ratio in tertiary education and the lower gender wage gap are not related to the explainable part of the gender gap, although they predicted a reduction in the unexplained part.
As mentioned earlier, also the findings for specific indicators depend on the year and countries considered. For instance, the results for the “women in research” indicator are unreliable because they sharply fluctuate across PISA assessments ( r = −0.16, r = −0.68; Reilly, 2012 ; Stoet and Geary, 2015 ). The relation is mainly driven by countries that are, on average, less gender-equal but display lower gender discrepancies, such as Latvia, Serbia, Tunisia, and Thailand, as well as non-OECD nations ( Reilly, 2012 ; Stoet and Geary, 2015 ).
Regarding “women’s economic activity,” Stoet and Geary (2015) analyzed four PISA assessments (2000, 2003, 2006, and 2009) and concluded that only the 2000 and 2003 results were consistent with theories predicting that gender equality is linked to smaller gender differences. In addition, “females in parliamentary seats” never reached statistical significance; only in the 2003 assessment did a link appear by excluding either non-OECD or Nordic countries from the sample ( Stoet and Geary, 2015 ). Further, while Penner and Cadwallader Olsker (2012) showed that countries with more women participation in the labor force tended to have higher mathematics gender differences, the gender gap was not linked to gender equality in their analysis, contrary to the predictions. In sum, only “women in research” demonstrated a significant negative relationship with the gender gap in mathematics, although the magnitude of this relationship is in doubt. Additionally, the gender equality paradox had no empirical support when analyzing mathematics abilities. Girls outperformed boys in diverse socio-cultural environments, such as Finland and Qatar, demonstrating that egalitarian attitudes do not explain gender discrepancies in this dimension ( Stoet and Geary, 2015 ). However, more gender equality had a positive effect on individuals, with both men and women increasing their mathematics scores in this context, without any specific advantages for either group ( Kane and Mertz, 2012 ).
In line with the gender equality paradox, mathematics attitudes and anxiety gender gaps are higher in gender-equal countries ( Else-Quest et al., 2010 ; Stoet et al., 2016 ). Else-Quest et al. (2010) explained this phenomenon by arguing that mathematics anxiety is “a luxury, most often experienced by individuals who are not preoccupied with meeting more basic needs.” However, at the national level, both men and women tend to be less anxious about mathematics in equal societies, even though men benefit more from this lack of anxiety, enhancing gender differences as a consequence ( Stoet et al., 2016 ). Only Goldman and Penner (2016) showed contrary results to that of the above research, arguing that gender differences in mathematics attitudes remain stable, even in gender-equal countries. Recently, Marsh et al. (2021) proposed that the gender equality paradox in these dimensions is “illusory” as it vanishes when accounting for country-level academic achievements and socioeconomic status; however, further studies are needed to support their argument. According to the women’s political representation index, gender-equal nations also have wider self-efficacy and motivation gaps. By contrast, other specific indicators, such as “equality in wages” and “parity in secondary and tertiary education,” predict smaller gaps ( Else-Quest et al., 2010 ; Gevrek et al., 2020 ). Similarly, anxiety differences decline when there is equal political representation between men and women because women gain more than men in politically equal environments ( Else-Quest et al., 2010 ; Gevrek et al., 2020 ).
In conclusion, gender equality is negatively related to gender differences in mathematics attitudes when analyzing composite indices; however, specific indicators are either inversely or directly related. It appears that pursuing equal political representation counteracts the results achieved by parity in wages and education, putting the overall advantage into question. Moreover, although self-efficacy and motivational gender gaps increase as equality is achieved in political representation, parity in tertiary education and wages shows an opposite trend.
Table 2 shows the science gender gap’s mixed results for composite indicators. Analyzing the GGI, Reilly (2012) concluded that the gender gap in science achievement decreases as gender equality increases ( r = 0.29); nevertheless, men are better represented among the top scorers. By contrast, Ireson (2017) failed to replicate any meaningful relationships. However, a recent meta-analysis reported that gender-equal societies are characterized by “a pattern of higher male achievement, while for nations with lower gender equality, we see a pattern of higher female achievement” ( Reilly et al., 2019 ).
Table 2 . Correlations between gender differences in science, reading, and overall scores (men–women) and both composite indices and specific indicators of gender equality.
As reported in Table 2 , also the specific indicators provide mixed results. No connection with the science gender gap is established for the “relative status of women,” whereas “women in research” is linked with increased gender differences ( r = −0.39; Reilly, 2012 ).
These studies were based on inter-group comparisons, which may not have been appropriate for analyzing the relationship in question given the small mean gender gap in science. However, analyzing intra-individual strengths could move the debate forward because these are strongly related to career choices ( Wang and Degol, 2017 ). Studies have shown that men are more likely to have higher abilities in mathematics or science than in reading, generating a “math tilt,” whereas women generate a “verbal tilt,” with differences more visible at the distribution’s right tail ( Wai et al., 2018 ). In other words, although the mean gender variation in science approaches zero, an increasing number of men as compared to women have their top skill in science as opposed to reading, whereas the opposite trend holds true for women (see below). Analyzing 67 nations, Stoet and Geary (2018) pointed out that gender variances in science (and mathematics) intra-individual strength are higher in favor of boys in gender-equal nations. This trend among men could facilitate their preference for scientific careers because they would have the highest likelihood of success and especially so in gender-equal environments ( Dekhtyar et al., 2018 ).
Regarding attitudes, “almost everywhere” girls display a lower science self-concept than boys, even when their academic skills are equal to those of their male peers ( Sikora and Pokropek, 2012 ). Supporting the gender equality paradox, research has noted that gender differences in science self-efficacy, science enjoyment, and interest tend to be larger in gender-equal nations ( Stoet and Geary, 2018 ; Liou et al., 2022 ).
Table 2 shows that studies on reading differences, although few, have substantially converged, demonstrating an increased gender gap in favor of women when there is more equality between genders. Although no correlation is found for the GGI, gender equality results in higher women representation among top-performing students ( Reilly, 2012 ). Notably, the GGI has recently been linked to an increased reading gender gap in advanced societies ( Gevrek et al., 2020 ). Analyzing specific indicators, Reilly (2012) showed that “women in research” directly relates to gender differences in reading achievement, thus predicting progressively higher variations. Gevrek et al. (2020) reached similar conclusions, arguing that the reading gender gap is wider in favor of girls in countries where there is more gender equality in the labor market. Furthermore, studies on intra-individual strengths have also been consistent, showing that girls’ tilt in reading skills is larger than that of boys in gender-equal societies ( Stoet and Geary, 2018 ).
Few studies have focused on gender differences at the aggregate skills level, and those that exist have shown mixed results (see Table 2 ). Similar to the results for mathematics ability, Stoet and Geary (2015) found a significant increase in aggregate skill differences between boys and girls in nations with higher gender equality (GGI), although only in the 2003 PISA assessment. However, excluding either Iceland or Finland from the sample significantly weakened the link, and it disappeared when considering other years ( Stoet and Geary, 2015 ; Ireson, 2017 ). Recently, inspired by research on gender differences in gray and white matter, Stoet and Geary (2020) argued that the basic skills pattern should be considered as a whole to understand the full magnitude of gender variation. Assessing the overall pattern in mathematics, science, and reading performance, it appears that the gap is greater than previously measured, corresponding to a large statistical difference, and it widens in more gender-equal environments.
Some researchers have proposed that egalitarian values, have a “more pervasive influence” and might offer a better understanding of the topic ( Eriksson et al., 2020 ). An examination of these values suggests that “one standard deviation higher in gender equal values is on average 5.2 points more beneficial for boys” ( Eriksson et al., 2020 ). This observation holds true for the GGI.
Contrary to theories predicting that gender equality is linked with smaller gender differences, “male/female enrollment in tertiary education” is inversely related to gender differences in overall achievement in countries with gender-neutral enrollment rates that also have more men among the top performers ( r = 0.19; Bergold et al., 2017 ). Conversely, “women’s labor market participation,” “women’s share of research positions,” and “the ratio of women to men with at least a secondary education” have medium-size negative correlations (from r = 0.33–0.42), which may account for 28.7% of the gender variation ( Bergold et al., 2017 ).
In sum, few studies have examined the link between gender equality and gender differences in science, reading, and overall scores, making it difficult to draw any firm conclusions. The findings for science and overall scores are contradictory, while for reading, there is substantial agreement about there being a gender equality paradox favoring women. Furthermore, due to their interrelatedness, a communal pattern between these skills emerges when examining intra-individual strengths. This pattern is characterized by increasingly wider science/mathematics and reading tilts for boys and girls, respectively. The tilt for girls shows that when girls have a science or mathematics score similar to boys, they tend to have better grades in reading, a trend that is especially observed in gender-equal nations ( Stoet and Geary, 2018 ). However, scholars have only recently begun to consider intra-individual strengths, which represent a great opportunity for future studies on gender segregation.
5.1. the big five and the hexaco model.
Evidence supporting a paradox emerged as early as 2001 when Costa et al. (2001) concluded that men’s and women’s personalities differ more in gender-equal countries. Schmitt et al. (2008) replicated these findings across 55 nations, again suggesting a positive correlation between gender differences and gender equality. More recently, larger gender differences in agreeableness favoring women have been found in gender-equal nations (see Table 3 ), mainly because of lower agreeableness in men in these nations with gender being the strongest predictor of individual levels ( Lippa, 2010b ). Conversely, the gender gap in neuroticism (women > men) has not been found to be affected by gender equality, even though the UN’s gender development and empowerment index predicts a decrease in negative emotions in both men and women ( Lippa, 2010b ).
Table 3 . Correlations between gender differences in personality (men–women) and composite indices of gender equality.
While these findings are illuminating, looking only at single dimensions may lead to counterintuitive results because personality is multifaceted ( Vianello et al., 2013 ). Although the average gender gap for a given personality trait is small, the overall variance is conventionally regarded as large, implying a significant difference between men and women ( Del Giudice, 2009 ). Based on the latter premise, Mac Giolla and Kajonius (2019) noted a strong relationship between gender personality differences and gender equality, with overall differences being broader in “gender-friendly” countries ( r = 0.69). Other studies have supported these results, observing the same widening pattern ( Kaiser, 2019 ). Similarly, the emotionality gap from the HEXACO model displays a direct relationship with the GGI ( r = 0.56), with women having an increasingly higher level than men in more gender-equal countries. However, honesty–humility fails to display any association with gender equality ( Lee and Ashton, 2020 ).
Further evidence for a gender equality paradox in personality emerges from the study by Falk and Hermle (2018) that, building upon the above personality models, related gender differences in economic preferences – positive reciprocity, patience, altruism, trust, risk-taking (higher in women), and negative reciprocity (higher in men) – to gender equality measures. They concluded that the differences are characterized by sharp increases in more gender-equal countries ( r = 0.67).
Basic human values (see Table 3 ) of power, achievement and stimulation are generally considered more important for men, whereas benevolence and universalism are valued among women. Past research has found that these gender differences are broader when men and women are treated equally, even though both genders regard masculine values to be less significant ( Schwartz and Rubel-Lifschitz, 2009 ). More recently, Fors Connolly et al. (2020) extended the research on human values by adding a temporal dimension. Their analysis replicated the results cross-nationally, although temporal examination displayed a convergence between men and women in benevolence (over time, Cohen’s d −15%), with universalism and stimulation gaps remaining constant ( Fors Connolly et al., 2020 ). However, as the authors noted, this convergence resulted from factors not linked to gender equality, indicating that the correlation might be spurious and caused by confounding factors related to both gender equality and personality. This additional finding suggests that gender equality could not cause gender differences in values and that the gender equality paradox needs further exploration.
For vocational interests, few studies have examined how gender differences change with gender equality. Using the Brinkman Model of Interests, one study found that ‘gender differences in musical and persuasive interests decreased in countries with high gender egalitarianism; nevertheless, clerical and scientific interests were higher when gender egalitarianism was high’ ( Ott-Holland et al., 2013 ). However, most differences did not show any variance. More recently Tao et al. (2022) offered a more comprehensive overview highlighting that across all dimensions of vocational interest analyzed, increased gender equality was associated with wider gender differences. As Table 3 shows, gender personality differences generally increase in gender-equal countries. This finding is consistent across models and it appears to be valid also for dimensions not analyzed in this review (see Discussion for a more in-depth analysis).
The systematic narrative literature review investigated recent studies on gender differences in basic skills and personality to determine whether cross-national relationships can be found with gender equality. The goal was to assess whether theories predicting that gender equality is linked with smaller gender differences have empirical support or whether a gender equality paradox has emerged in recent years. The general trend considers gender equality as either being connected to an increase in gender variations or having no relation with them, with a gender equality paradox occurring for gender gaps in some cognitive domains (attitudes toward mathematics, mathematics self-efficacy, mathematics anxiety, and reading) and personality.
Based on the foregoing literature review, it can be seen that research supporting reduced gender differences in more gender-equal countries is scarce and inconsistent. A negative correlation is generally detected when analyzing gender differences in mathematics skills utilizing PISA data, although the correlation is influenced by either the year considered in the study or the sample country (see below). Moreover, “women in research” is the only specific indicator consistently negatively linked to the mathematics gender gap, albeit with disagreement about the strength of the association. Lastly, no connection between gender differences in mathematics and gender equality indicators is found when analyzing the TIMMS assessment. However, many studies have focused solely on mean differences in mathematics abilities, which are small or non-existent. Only Bergold et al. (2017) and Hyde and Mertz (2009) assessed the right tail of the distribution, where gender differences are more pronounced. This lack of studies on top performers highlights a gap in the research that needs to be filled. Also important is analyzing intra-individual strengths when studying the mathematics gender gap, as Stoet and Geary (2018) have emphasized.
Research supporting a positive link between gender variances and gender equality measures appears to be more robust and consistent. The literature on mathematics attitudes and anxiety shows that composite indicators predict a widening gender gap as equality between men and women advances. In addition, scholars agree that gender equality is connected with a larger advantage for women in reading and evidence further shows that gender personality differences are larger in more gender-equal nations. Men and women are less alike, especially in personality traits and basic human values, in countries that have invested the most in gender equality. Further support for a gender equality paradox in personality also emerges when examining other personality domains not included in this review. For example, wider gender gaps in self-esteem and narcissism (higher in men) exist in more gender-equal nations where women have more reproductive control, more executive positions, and their education is either similar to or higher than that of men ( Bleidorn et al., 2016 ; Jonason et al., 2020 ).
Specific indicators are either directly or inversely related to the mathematics gender gap, raising doubt about them being related to a general advantage ( Table 4 ). In addition, findings on science and overall scores are uncertain, even though both science anxiety and science intra-individual strengths follow a trend opposite to that anticipated by theories predicting a link between gender equality and smaller gender differences. Interestingly, other skills, such as episodic memory and visuospatial ability, show the same widening tendency, strengthening the case for a possible paradox in this area ( Lippa et al., 2010 ; Asperholm et al., 2019 ).
Table 4 . Summary of the papers included in the review.
Understanding the possible reasons for the increase in gender differences in countries that promote gender equality is important and relevant since these countries may be leading men and women toward gendered trajectories, a path that is already observable in higher education. Charles and Bradley (2009) noted that the most advanced societies demonstrate more pronounced gender segregation in education. Stoet and Geary (2018) also observed that more gender-equal nations (measured by the GGI) have the widest gender gap among STEM graduates. Supporting these results, research has shown that gender differences using “interest in math careers” as a predictor of future major subjects are greater in countries with higher gender equality, with both men and women being, on average, less interested in mathematics than those in other countries ( Goldman and Penner, 2016 ; Charles, 2017 ; Breda et al., 2020 ). The same pattern is observed in the job market, where horizontal segregation is more pronounced in more gender-equal environments ( Blackburn and Jarman, 2006 ; Wong and Charles, 2020 ). Several investigations have documented this phenomenon and concluded that “Scandinavian countries are notable for their exceptionally high degrees of segregation” despite their advancement in gender equality ( Jarman et al., 2012 ). However, more recent findings have also detected desegregation patterns in more gender-equal nations ( Hustad et al., 2020 ).
The question of why gender differences are sometimes higher in more gender-equal countries remains. Some have proposed that the paradox in mathematics anxiety and attitudes might originate from the better economic conditions needed for these emotions to emerge. In countries where women are highly oppressed, these are more concerned about meeting more basic needs. Conversely, where economic, political, and educational circumstances are more favorable for women, anxiety toward mathematics activities is more likely to emerge ( Else-Quest et al., 2010 ). However, at the national level, both men and women are less anxious about mathematics in developed, gender-equal countries, indicating that alternative explanations are needed ( Stoet et al., 2016 ). In fact, others have suggested that, in gender-equal nations, men and women set aside financial drives and follow more intrinsic career interests because of easier access to economic resources. Hence, women are less exposed than men to STEM activities, “giving them less opportunity to reduce their negative feelings about mathematics” ( Stoet et al., 2016 ).
With respect to reading abilities, the paradox might result from the interaction of two factors: the interrelation between basic skills and Western societies’ strong efforts to equalize boys’ and girls’ mathematics performance that has instead, paradoxically, increased reading skills in girls. Notably, where mathematics gender differences are reduced, the reduction is mainly due to an improvement in women’s reading ( Guiso et al., 2008 ). It follows that countries with smaller mathematics gender differences have the largest reading gaps ( Stoet and Geary, 2013 ). As mathematics is promoted in girls, their reading skills appear to benefit. However, because boys’ disadvantage in reading is, on average, less of a concern among policymakers, gender variations in this dimension have widened.
Some researchers have explained the gender equality paradox in personality by arguing that only differences in self-reported domains are increased ( Eagly and Wood, 2012 ). Here, the reference-group effect ( Heine et al., 2002 ) might conceal variances in less gender-equal countries, where men and women compare themselves with others of their own gender ( Guimond et al., 2007 ). If this explanation holds true, the gap in gender-equal nations would be a better estimate of personality differences between the genders because in these nations both women and men have a more accurate comparative term that includes the whole population rather than just a subset ( Schmitt et al., 2017 ).
Another explanation may be that personality is strongly culturally influenced. According to this view, individualism and self-expressive values act in tandem with gender stereotypes, promoting gender variance as individuals act out their “gendered self” ( Charles and Bradley, 2009 ; Breda et al., 2020 ). This explanation of the gender equality paradox corresponds to the findings in gender-equal nations that cultural mechanisms are at play accommodating women-typical roles, such as job flexibility and high parental care—roles that encourage women to embark on gendered paths and experience more communal traits ( Levanon and Grusky, 2018 ). Thus, it should not be surprising that, in gender-equal countries, men and women appear to differ more than in non-gender-equal countries and that this difference is expanding as women-typical roles are becoming more prevalent. Rather than expressing intrinsic gender differences, in these nations, there is a reinforcement of gender essentialist beliefs, which constitute an artifact of social expectations about how men and women should comply with gender stereotypes ( England, 2010 ).
While this argument is somewhat persuasive, research aiming at linking gender stereotypes with gender equality suffers from several theoretical and methodological limitations. Often scholars apply broad assumptions and rely on a limited, as well as unreliable, set of items to capture latent dimensions of implicit stereotypes hidden in survey data. For instance, in their recent article Napp and Breda (2022) used solely one item to grasp an alleged stereotype that girls lack talent by arguing that systematic gender difference in answering the question would highlight “the magnitude of the (internalized) stereotype associating talent with boys rather than girls.” In addition, several studies have argued that stereotypes about group features, when measured reliably, appear to be accurate ( Jussim et al., 2015 ; Moè et al., 2021 ). Löckenhoff et al. (2014) observed that perceived gender differences in personality substantially match those found in self- and observer-rated personality tests. The authors concluded that gender stereotypes constitute “valid social judgments about the size and direction of sex differences” that are more relevant than socialization processes and ascribed cultural gender roles ( Löckenhoff et al., 2014 ). This is not to say that culture plays no role in the emergence of gender differences, but that the social mechanisms amplifying gender variances—mechanisms that social-role theorists have identified—also capture intrinsic gender differences.
Evolutionary theorists propose a different explanation for the gender equality paradox. As they argue, some gender variations are sensitive to context-related fluctuations, demonstrating a gene–environment interplay. In societies in which conditions are favorable, gender-specific genes flourish due to a lower prevalence of diseases, lower ecological stressors, and lower starvation rates. Per this view, wider gender gaps in gender-equal nations most likely “reflect a more general biological trend toward greater dimorphism in resource-rich environments” ( Schmitt et al., 2008 ). If this explanation holds true, then heritability estimates will be higher in developed societies than in less-advanced cultures. Some evidence in this direction has recently emerged ( Selita and Kovas, 2019 ); however, the “WEIRD” gene problem—that nearly all twin studies have been conducted among Western, educated, industrialized, rich, and democratic societies—represents an obstacle for generalizing results and making inferences about cross-cultural heritability differences ( Henrich et al., 2010 ).
The present review proposes that the evolutionary explanation for the gender equality paradox might be more complex than it appears due to the presence of socio-cultural elements in the evolutionary process. As previously noted, genetic effects depend on the environmental conditions (diseases and ecological stress) under which they occur, yet the environment is embedded into society. Thus, the gene–environment interplay is enclosed within a cultural context with specific social norms and, by itself, cannot encompass all involved elements ( Figure 2 ). Stated otherwise, the gene–environment interplay is a function of culture ( Uchiyama et al., 2022 ). Therefore, gender-specific genes can be expected to be emphasized in societies embracing cultural values that would favor the expression of these genes. Consider, for example, individualism and self-expression. It is unsurprising that these values are related to the gender equality paradox, as Charles and Bradley (2009) have highlighted. In resource-rich environments that also value individualism and self-expression, intrinsic gender differences are more likely to emerge. This thesis points toward interpretation of Kaiser (2019) , which states that both cultural individualism and pathogen levels confound the gender equality paradox in personality (see below). Also, Murphy et al. (2021) reached similar conclusions. A coherent, yet opposite, prediction might see gender differences as remaining stable or even decreasing in those resource-rich environments that culturally constrain self-expression. Accordingly, favorable cultural values would trump social mechanisms that amplify gender-based genes to emerge via a feedback-loop effect or “reciprocal causation” ( Dickens and Flynn, 2001 ) according to which social structures adjust to distinct gender traits and vice versa, thus increasing gender differences.
Figure 2 . Socio-cultural evolutionary explanation of the gender equality paradox. The gears show the interrelations between gender-specific genes, social structures, and environmental components mediated by cultural values.
While searching and analyzing the literature, this review also highlighted some challenges that researchers might face when conducting cross-national studies relating gender differences to gender equality measures. For mathematics ability, results could depend on outlier countries such as Scandinavian and gender-segregated, Muslim countries. In addition, the restricted country samples in international student assessments might be problematic. Despite the strong effort of PISA and TIMMS to be more inclusive, wealthy countries have traditionally been overrepresented, although the latest rounds have had very high coverage, including over 75 participating nations worldwide. Nevertheless, researchers, when assessing gender differences in mathematics abilities, should pay close attention to the countries included in their study because either the inclusion of outliers or a lack of heterogeneity might lead to biased estimations.
Another possible source of bias in research linking gender differences to gender equality on a cultural level is participant sample sizes, with some nations being overrepresented in comparison to others. How countries are clustered may also be problematic since countries are not independent data points and, “as such, they are like members of the same family or pupils of the same classroom” ( Kuppens and Pollet, 2015 ). Therefore, appropriate statistical methods, multilevel modeling, for example, should be utilized to account for both unbalanced sample sizes and data structure.
Correlations between mathematics gender differences and gender equality might originate from a lack of country-level effects in the models. Anghel et al. (2019) argued that when time-invariant country unobserved heterogeneity is controlled for, no association between the two variables is found. Moreover, the link between gender equality and the gender gap in mathematics attitudes might be confounded by country-level academic achievements and socioeconomic status ( Marsh et al., 2021 ).
Further, the gender equality paradox could be due to measurement error. Given that many international assessments and personality models have been developed in WEIRD countries, it is plausible that measurement error could be higher in non-WEIRD nations generating an illusory gender equality paradox. However, international assessments have been constructed to prevent such bias. For instance, PISA computes each student’s score based on a set of 5/10 plausible values designed to prevent measurement error and simplify secondary data analysis ( Marsh et al., 2021 ). Also, the gender equality paradox in personality appears to hold even after correcting for measurement error ( Kaiser, 2019 ; Fors Connolly et al., 2020 ; Tao et al., 2022 ). Nevertheless, when analyzing the link between gender differences in personality and gender equality, statistical procedures that control for measurement error should be applied (see for example Schmidt and Hunter, 2015 ).
Fors Connolly et al. (2020) highlighted the need for more temporal analyses of personality because an observed cross-national pattern may result from “a spurious relationship between gender equality and differences in personality” due to different country-level elements. Kaiser (2019) identified these elements as cultural individualism, food consumption, and historical pathogen prevalence levels. Other research has also agreed that cultural individualism could be a possible confounding factor as gender differences in personality are more pronounced in nations that highly regard individual self-expression ( Costa et al., 2001 ; Schmitt et al., 2008 ; Tao et al., 2022 ).
Some scholars have called attention to the misuse of composite indicators of gender equality, raising several concerns thereof and arguing that they might not be suitable for empirical research ( Else-Quest et al., 2010 ; Hyde, 2012 ). One concern is that these indicators, which encompass various domains from politics to economics, do not measure opportunities ( Richardson et al., 2020 ). Another concern is that they are not interchangeable since they are differentially constructed. Thus, comparisons between research relying on different measures of gender equality might not be suitable. Some of the disparate findings concerning math ability might be driven by computational differences in the indices included in the analysis. Nevertheless, the gender equality composite indicators most commonly utilized (GGI, GEI, and GEM) show very high correlation coefficients ( r ≥ 0.84), while other indicators substantially relate to one another, suggesting that, although some differences occur, these indices are similar in their ability to capture the general dimension of gender equality ( Else-Quest et al., 2010 ; van Staveren, 2013 ; Stoet and Geary, 2015 ). Lastly, composite indicators may present a biased view of society due to the way gender equality is understood in the models. Often, disadvantages pertaining mostly to men are not taken into account when computing the indicators ( Benatar, 2012 ). As an example of this bias, the GGI from the World Economic Forum assumes perfect gender equality in areas where women have an advantage over men. Specifically, values higher than 1, which would assume a men’s disadvantage, in each sub-index are capped. Thus, a more simplified approach to measuring national gender inequality is preferred ( Stoet and Geary, 2019 ).
In addition, methodological issues also arise when using these indices. Some scholars have pointed out that correlations between gender gaps and the indices of gender equality could be driven by the strong economic component in these indices ( Fors Connolly et al., 2020 ). Therefore, it is important to control for appropriate economic indicators, such as GDP per capita and the Human Development Index, when linking gender differences with gender equality ( Kuppens and Pollet, 2015 ). Another difficulty may arise when contrasting results between composite indices and specific indicators occur. For mathematics attitudes, for instance, although composite indices suggest a gender equality paradox, specific indicators are either positively or negatively related to the gender gap. This may suggest that composite indices either capture an overall influence of gender equality or are unsuitable for evaluating gender differences. However, evaluation may lie outside the scope of models using these indices. Research linking gender differences with gender equality indicators has not tried to explain the paradox emerging from the analysis on the basis of gender equality per se ; instead, it has just highlighted a paradoxical pattern that would otherwise have remained concealed. Since no theory has been put forward that fully unravels the paradox, further studies are needed.
Theories considered in this review that predict that gender equality is linked with smaller gender differences do not offer a valid explanation of gender differences in basic skills and personality. In addition, for some dimensions, the gender equality paradox raises further questions about how gender variation emerges, which calls for a new approach. Based on these premises, this review explored both social-role and evolutionary hypotheses and suggested new insights that combine these views, while also highlighting explanatory variables that might cause bias in the results. Thus, specific research that more closely examines the explanations proposed is needed, especially studies with an interdisciplinary focus. Notably, Fors Connolly et al. (2020) highlighted the importance of cross-temporal analyses of the gender equality paradox because these may reveal a different path. Since country comparisons may be insufficient for fully grasping the evolution of the paradox, future research should include a thorough cross-temporal examination for a more comprehensive understanding.
Lastly, the gender equality paradox is an emerging phenomenon that has gained substantial scientific support across subjects ( Falk and Hermle, 2018 ; Campbell et al., 2021 ; Block et al., 2022 ; Vishkin, 2022 ). It requires attention from both the scientific community and the public because attempting to close gender gaps following traditional social-role theories and applying conventional methods, might end up exacerbating gender variations. In addition, the general pattern of increased gender differences in more gender-equal countries might inform that achieving equal opportunities does not go hand in hand with a reduction of gender gaps. Thus, policymakers should consider this trend when justifying interventions attempting to achieve equality of outcome between men and women.
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
The author confirms being the sole contributor of this work and has approved it for publication.
The work was supported by the Finnish National Board for Education through a working grant.
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Anghel, B., Rodríguez-Planas, N., and Sanz-de-Galdeano, A. (2019). Culture, Gender, and Math: A Revisitation.
Google Scholar
Archer, J. (2019). The reality and evolutionary significance of human psychological sex differences. Biol. Rev. 94, 1381–1415. doi: 10.1111/brv.12507
PubMed Abstract | CrossRef Full Text | Google Scholar
Ashcraft, M. H., and Moore, A. M. (2009). Mathematics anxiety and the affective drop in performance. J. Psychoeduc. Assess. 27, 197–205. doi: 10.1177/0734282908330580
CrossRef Full Text | Google Scholar
Asperholm, M., Nagar, S., Dekhtyar, S., and Herlitz, A. (2019). The magnitude of sex differences in verbal episodic memory increases with social progress: data from 54 countries across 40 years. PLoS One 14:e0214945. doi: 10.1371/journal.pone.0214945
Baker, D. P., and Jones, D. P. (1993). Creating gender equality: cross-national gender stratification and mathematical performance. Sociol. Educ. 66, 91–103. doi: 10.2307/2112795
Bandura, A., and Walters, R. H. (1977). Social Learning Theory. Englewood Cliffs: Prentice Hall.
Barone, C., and Assirelli, G. (2020). Gender segregation in higher education: an empirical test of seven explanations. High. Educ. 79, 55–78. doi: 10.1007/s10734-019-00396-2
Beckwith, K. (2005). A common language of gender? Pol Gender 1, 128–137. doi: 10.1017/S1743923X05211017
Benatar, D. (2012). The Second Sexism: Discrimination Against Men and Boys. Oxford: John Wiley & Sons.
Bergold, S., Wendt, H., Kasper, D., and Steinmayr, R. (2017). Academic competencies: their interrelatedness and gender differences at their high end. J. Educ. Psychol. 109, 439–449. doi: 10.1037/edu0000140
Blackburn, R. M., and Jarman, J. (2006). Gendered occupations: exploring the relationship between gender segregation and inequality. Int. Sociol. 21, 289–315. doi: 10.1177/0268580906061380
Bleidorn, W., Arslan, R. C., Denissen, J. J., Rentfrow, P. J., Gebauer, J. E., Potter, J., et al. (2016). Age and gender differences in self-esteem—a cross-cultural window. J. Pers. Soc. Psychol. 111, 396–410. doi: 10.1037/pspp0000078
Block, K., Olsson, M. I. T., Schmader, T., Van Laar, C., Martiny, S. E., Schuster, C., et al. (2022). The gender gap in the care economy is larger in highly developed countries: socio-cultural explanations for paradoxical findings. PsyArXiv . Preprint. doi: 10.31234/osf.io/k6g5d.
Bos, W., Wendt, H., Ünlü, A., Valtin, R., Euen, B., Kasper, D., et al. (2012). “Leistungsprofile von Viertklässlerinnen und Viertklässlern in Deutschland [Proficiency profiles of fourth grade students in Germany]” in IGLU 2011. Lesekompetenzen von Grundschulkindern in Deutschland im internationalen Vergleich [IGLU 2011. International Comparison of Reading Competencies of Primary School Children in Germany]. eds. W. Bos, I. Tarelli, A. Bremerich-Vos and K. Schwippert (Münster: Waxmann), 227–259.
Breda, T., Jouini, E., Napp, C., and Thebault, G. (2020). Gender stereotypes can explain the gender-equality paradox. Proc. Natl. Acad. Sci. U. S. A. 117, 31063–31069. doi: 10.1073/pnas.2008704117
Bussey, K., and Bandura, A. (1999). Social cognitive theory of gender development and differentiation. Psychol. Rev. 106, 676–713. doi: 10.1037/0033-295X.106.4.676
Campbell, O. L. K., Bann, D., and Patalay, P. (2021). The gender gap in adolescent mental health: a cross-national investigation of 566, 829 adolescents across 73 countries. SSM Popul. Health 13:100742. doi: 10.1016/j.ssmph.2021.100742
Cech, E. A. (2013). The self-expressive edge of occupational sex segregation. Am. J. Sociol. 119, 747–789. doi: 10.1086/673969
Charles, M. (2017). Venus, mars, and math: gender, societal affluence, and eighth graders’ aspirations for STEM. Socius 3:237802311769717. doi: 10.1177/2378023117697179
Charles, M., and Bradley, K. (2009). Indulging our gendered selves? Sex segregation by field of study in 44 countries. Am. J. Sociol. 114, 924–976. doi: 10.1086/595942
Charles, M., and Grusky, D. B. (2018). “Egalitarianism and gender inequality” in The Inequality Reader: Contemporary and Foundational Readings in Race, Class, and Gender (New York: Routledge), 389–404.
Charles, M., Harr, B., Cech, E., and Hendley, A. (2014). Who likes math where? Gender differences in eighth-graders’ attitudes around the world. Int. Stud. Sociol. Educ. 24, 85–112. doi: 10.1080/09620214.2014.895140
Cortes, P., and Pan, J. (2018). “Occupation and gender” in The Oxford Handbook of Women and the Economy . eds. S. L. Averett, L. M. Argys and S. D. Hoffman. (Oxford Oxford: University Press), 425–452.
Costa, P. T., Terracciano, A., and McCrae, R. R. (2001). Gender differences in personality traits across cultures: robust and surprising findings. J. Pers. Soc. Psychol. 81, 322–331. doi: 10.1037/0022-3514.81.2.322
Dekhtyar, S., Weber, D., Helgertz, J., and Herlitz, A. (2018). Sex differences in academic strengths contribute to gender segregation in education and occupation: a longitudinal examination of 167,776 individuals. Intelligence 67, 84–92. doi: 10.1016/j.intell.2017.11.007
Del Giudice, M. (2009). On the real magnitude of psychological sex differences. Evol. Psychol. 7:147470490900700. doi: 10.1177/147470490900700209
Devine, A., Fawcett, K., Szűcs, D., and Dowker, A. (2012). Gender differences in mathematics anxiety and the relation to mathematics performance while controlling for test anxiety. Behav. Brain Funct. 8:33. doi: 10.1186/1744-9081-8-33
Dickens, W. T., and Flynn, J. R. (2001). Heritability estimates versus large environmental effects: the IQ paradox resolved. Psychol. Rev. 108, 346–369. doi: 10.1037/0033-295X.108.2.346
Diekman, A. B., and Schneider, M. C. (2010). A social role theory perspective on gender gaps in political attitudes. Psychol. Women Q. 34, 486–497. doi: 10.1111/j.1471-6402.2010.01598.x
Dijkers, M. P. J. M. (2009). The value of “traditional” reviews in the era of systematic reviewing. Am. J. Phys. Med. Rehabil. 88, 423–430. doi: 10.1097/PHM.0b013e31819c59c6
Eagly, A. H., and Mitchell, A. A. (2004). Social role theory of sex differences and similarities: implications for the sociopolitical attitudes of women and men.
Eagly, A. H., and Wood, W. (1999). The origins of sex differences in human behavior: evolved dispositions versus social roles. Am. Psychol. 54, 408–423. doi: 10.1037/0003-066X.54.6.408
Eagly, A. H., and Wood, W. (2012). “Social role theory” in Handbook of Theories of Social Psychology (London: SAGE Publications Ltd), 458–476.
Eagly, A. H., Wood, W., and Johannesen-Schmidt, M. C. (2004). “Social role theory of sex differences and similarities: implications for the partner preferences of women and men” in The Psychology of Gender . 2nd ed (New York, NY: The Guilford Press), 269–295.
Else-Quest, N. M., Hyde, J. S., and Linn, M. C. (2010). Cross-national patterns of gender differences in mathematics: a meta-analysis. Psychol. Bull. 136, 103–127. doi: 10.1037/a0018053
England, P. (2010). The gender revolution: uneven and stalled. Gend. Soc. 24, 149–166. doi: 10.1177/0891243210361475
Eriksson, K., Björnstjerna, M., and Vartanova, I. (2020). The relation between gender egalitarian values and gender differences in academic achievement. Front. Psychol. 11:236. doi: 10.3389/fpsyg.2020.00236
Falk, A., and Hermle, J. (2018). Relationship of gender differences in preferences to economic development and gender equality. Science 362:eaas9899. doi: 10.1126/science.aas9899
Fors Connolly, F., Goossen, M., and Hjerm, M. (2020). Does gender equality cause gender differences in values? Reassessing the gender-equality-personality paradox. Sex Roles 83, 101–113. doi: 10.1007/s11199-019-01097-x
Freiberg, T. (2019). Effects of care leave and family social policy: spotlight on the United States. Am. J. Econ. Sociol. 78, 1009–1037. doi: 10.1111/ajes.12293
Friedman, R. C., and Downey, J. I. (2002). Sexual orientation and psychoanalysis (pp. 225–263).
Frome, P. M., and Eccles, J. S. (1998). Parents’ influence on children’s achievement-related perceptions. J. Pers. Soc. Psychol. 74, 435–452. doi: 10.1037/0022-3514.74.2.435
Fryer, R. G., and Levitt, S. D. (2010). An empirical analysis of the gender gap in mathematics. Am. Econ. J. Appl. Econ. 2, 210–240. doi: 10.1257/app.2.2.210
Ganley, C. M., and Vasilyeva, M. (2011). Sex differences in the relation between math performance, spatial skills, and attitudes. J. Appl. Dev. Psychol. 32, 235–242. doi: 10.1016/j.appdev.2011.04.001
Geary, D. C. (2010). Male, Female: The Evolution of Human Sex Differences . 2nd Edn. Washington: American Psychological Association.
Geary, D. C., Hoard, M. K., Nugent, L., Chu, F., Scofield, J. E., and Ferguson Hibbard, D. (2019). Sex differences in mathematics anxiety and attitudes: concurrent and longitudinal relations to mathematical competence. J. Educ. Psychol. 111, 1447–1461. doi: 10.1037/edu0000355
Gevrek, Z. E., Gevrek, D., and Neumeier, C. (2020). Explaining the gender gaps in mathematics achievement and attitudes: the role of societal gender equality. Econ. Educ. Rev. 76:101978. doi: 10.1016/j.econedurev.2020.101978
Gevrek, Z. E., Neumeier, C., and Gevrek, D. (2018). Explaining the gender test score gap in mathematics: the role of gender inequality. Available at SSRN 3111114.
Ghasemi, E., Burley, H., and Safadel, P. (2019). Gender differences in general achievement in mathematics: an international study. New Waves Educ. Res. Dev. J. 22, 27–54.
Goldman, A. D., and Penner, A. M. (2016). Exploring international gender differences in mathematics self-concept. Int. J. Adolesc. Youth 21, 403–418. doi: 10.1080/02673843.2013.847850
Goodwin, R. D., and Gotlib, I. H. (2004). Gender differences in depression: the role of personality factors. Psychiatry Res. 126, 135–142. doi: 10.1016/j.psychres.2003.12.024
Graham, S. (1995). Narrative versus meta-analytic reviews of race differences in motivation: a comment on Cooper and Dorr. Rev. Educ. Res. 65, 509–514. doi: 10.3102/00346543065004509
Guimond, S., Branscombe, N. R., Brunot, S., Buunk, A. P., Chatard, A., Désert, M., et al. (2007). Culture, gender, and the self: variations and impact of social comparison processes. J. Pers. Soc. Psychol. 92, 1118–1134. doi: 10.1037/0022-3514.92.6.1118
Guiso, L., Monte, F., Sapienza, P., and Zingales, L. (2008). Culture, gender, and math. Science 320, 1164–1165. doi: 10.1126/science.1154094
Gunderson, E. A., Ramirez, G., Levine, S. C., and Beilock, S. L. (2012). The role of parents and teachers in the development of gender-related math attitudes. Sex Roles 66, 153–166. doi: 10.1007/s11199-011-9996-2
Halpern, D. F. (2000). Sex Differences in Cognitive Abilities . New York: Psychology Press.
Halpern, D. F., Benbow, C. P., Geary, D. C., Gur, R. C., Hyde, J. S., and Gernsbacher, M. A. (2007). The science of sex differences in science and mathematics. Psychol. Sci. Public Interest 8, 1–51. doi: 10.1111/j.1529-1006.2007.00032.x
Heine, S. J., Lehman, D. R., Peng, K., and Greenholtz, J. (2002). What’s wrong with cross-cultural comparisons of subjective Likert scales?: the reference-group effect. J. Pers. Soc. Psychol. 82, 903–918. doi: 10.1037/0022-3514.82.6.903
Henrich, J., Heine, S. J., and Norenzayan, A. (2010). The weirdest people in the world? Behav. Brain Sci. 33, 61–83. doi: 10.1017/S0140525X0999152X
Hirnstein, M., Stuebs, J., Moè, A., and Hausmann, M. (2022). Sex/gender differences in verbal fluency and verbal-episodic memory: a meta-analysis. Perspect. Psychol. Sci. 18, 67–90.
Holland, J. L. (1997). Making Vocational Choices: A Theory of Vocational Personalities and Work Environments. Odessa: Psychological Assessment Resources.
Hustad, I. B., Bandholtz, J., Herlitz, A., and Dekhtyar, S. (2020). Occupational attributes and occupational gender segregation in Sweden: does it change over time? Front. Psychol. 11:554. doi: 10.3389/fpsyg.2020.00554
Hyde, J. S. (2005). The gender similarities hypothesis. Am. Psychol. 60, 581–592. doi: 10.1037/0003-066X.60.6.581
Hyde, J. S. (2012). Nation-level indicators of gender equity in psychological research: theoretical and methodological issues. Psychol. Women Q. 36, 145–148. doi: 10.1177/0361684312441448
Hyde, J. S., and Mertz, J. E. (2009). Gender, culture, and mathematics performance. Proc. Natl. Acad. Sci. U. S. A. 106, 8801–8807. doi: 10.1073/pnas.0901265106
Ireson, G. (2017). Gender achievement and social, political and economic equality: a European perspective. Educ. Stud. 43, 40–50. doi: 10.1080/03055698.2016.1237868
Jarman, J., Blackburn, R. M., and Racko, G. (2012). The dimensions of occupational gender segregation in industrial countries. Sociology 46, 1003–1019. doi: 10.1177/0038038511435063
Jonason, P. K., Żemojtel-Piotrowska, M., Piotrowski, J., Sedikides, C., Campbell, W. K., Gebauer, J. E., et al. (2020). Country-level correlates of the dark triad traits in 49 countries. J. Pers. 88, 1252–1267. doi: 10.1111/jopy.12569
Jussim, L., Crawford, J. T., and Rubinstein, R. S. (2015). Stereotype (in) accuracy in perceptions of groups and individuals. Curr. Dir. Psychol. Sci. 24, 490–497. doi: 10.1177/0963721415605257
Kaiser, T. (2019). Nature and evoked culture: sex differences in personality are uniquely correlated with ecological stress. Personal. Individ. Differ. 148, 67–72. doi: 10.1016/j.paid.2019.05.011
Kane, J. M., and Mertz, J. E. (2012). Debunking myths about gender and mathematics performance. N. Am. Math. Soc. 59:10. doi: 10.1090/noti790
Kuhn, A., and Wolter, S. C. (2022). Things versus people: gender differences in vocational interests and in occupational preferences. J. Econ. Behav. Organ. 203, 210–234. doi: 10.1016/j.jebo.2022.09.003
Kuppens, T., and Pollet, T. V. (2015). Gender equality probably does not affect performance at the Olympic games: a comment on Berdahl, Uhlmann, and Bai (2015). J. Exp. Soc. Psychol. 61, 144–147. doi: 10.1016/j.jesp.2015.06.002
Lee, K., and Ashton, M. C. (2004). Psychometric properties of the HEXACO personality inventory. Multivar. Behav. Res. 39, 329–358. doi: 10.1207/s15327906mbr3902_8
Lee, K., and Ashton, M. C. (2018). Psychometric properties of the HEXACO-100. Assessment 25, 543–556. doi: 10.1177/1073191116659134
Lee, K., and Ashton, M. C. (2020). Sex differences in HEXACO personality characteristics across countries and ethnicities. J. Pers. 88, 1075–1090. doi: 10.1111/jopy.12551
Levanon, A., and Grusky, D. B. (2018). “Why is there still so much gender segregation?” in Inequality in the 21st Century (New York: Routledge), 371–379.
Lindberg, S. M., Hyde, J. S., Petersen, J. L., and Linn, M. C. (2010). New trends in gender and mathematics performance: a meta-analysis. Psychol. Bull. 136, 1123–1135. doi: 10.1037/a0021276
Liou, P.-Y., Lin, Y.-M., Huang, S.-C., and Chen, S. (2022). Gender differences in science motivational beliefs and their relations with achievement over grades 4 and 8: a multinational perspective. Int. J. Sci. Math Educ. 21, 233–249. doi: 10.1007/s10763-021-10243-5
Lippa, R. A. (2010a). Gender differences in personality and interests: when, where, and why? Soc. Personal. Psychol. Compass 4, 1098–1110. doi: 10.1111/j.1751-9004.2010.00320.x
Lippa, R. A. (2010b). Sex differences in personality traits and gender-related occupational preferences across 53 nations: testing evolutionary and social-environmental theories. Arch. Sex. Behav. 39, 619–636. doi: 10.1007/s10508-008-9380-7
Lippa, R. A., Collaer, M. L., and Peters, M. (2010). Sex differences in mental rotation and line angle judgments are positively associated with gender equality and economic development across 53 nations. Arch. Sex. Behav. 39, 990–997. doi: 10.1007/s10508-008-9460-8
Löckenhoff, C. E., Chan, W., McCrae, R. R., De Fruyt, F., Jussim, L., De Bolle, M., et al. (2014). Gender stereotypes of personality: universal and accurate? J. Cross-Cult. Psychol. 45, 675–694. doi: 10.1177/0022022113520075
Mac Giolla, E., and Kajonius, P. J. (2019). Sex differences in personality are larger in gender equal countries: replicating and extending a surprising finding. Int. J. Psychol. 54, 705–711. doi: 10.1002/ijop.12529
Marsh, H. W., Parker, P. D., Guo, J., Basarkod, G., Niepel, C., and Van Zanden, B. (2021). Illusory gender-equality paradox, math self-concept, and frame-of-reference effects: new integrative explanations for multiple paradoxes. J. Pers. Soc. Psychol. 121, 168–183. doi: 10.1037/pspp0000306
Mealey, L. (2000). Sex Differences: Developmental and Evolutionary Strategies . San Diego: Academic Press.
Meyer, M., and Gelman, S. A. (2016). Gender essentialism in children and parents: implications for the development of gender stereotyping and gender-typed preferences. Sex Roles 75, 409–421. doi: 10.1007/s11199-016-0646-6
Moakler, M. W., and Kim, M. M. (2014). College major choice in STEM: revisiting confidence and demographic factors. Career Dev. Q. 62, 128–142. doi: 10.1002/j.2161-0045.2014.00075.x
Moè, A., Hausmann, M., and Hirnstein, M. (2021). Gender stereotypes and incremental beliefs in STEM and non-STEM students in three countries: relationships with performance in cognitive tasks. Psychol. Res. 85, 554–567. doi: 10.1007/s00426-019-01285-0
Murphy, S. A., Fisher, P. A., and Robie, C. (2021). International comparison of gender differences in the five-factor model of personality: an investigation across 105 countries. J. Res. Pers. 90:104047. doi: 10.1016/j.jrp.2020.104047
Napp, C., and Breda, T. (2022). The stereotype that girls lack talent: a worldwide investigation. Sci. Adv. 8:eabm3689. doi: 10.1126/sciadv.abm3689
Ott-Holland, C. J., Huang, J. L., Ryan, A. M., Elizondo, F., and Wadlington, P. L. (2013). Culture and vocational interests: the moderating role of collectivism and gender egalitarianism. J. Couns. Psychol. 60, 569–581. doi: 10.1037/a0033587
Penner, A. M., and Cadwallader Olsker, T. (2012). “Gender differences in mathematics and science achievement across the distribution: what international variation can tell us about the role of biology and society” in Towards Equity in Mathematics Education (New York: Springer), 441–468.
Prescott-Allen, R. (2001). The Wellbeing of Nations: A Country-by-Country Index of Quality of Life and the Environment. Washington: Island press.
Rawlings, C. M. (2007). Higher Education as a Prism: The Role of Organizational Structures in the Gender Segregation and Stratification of American Undergraduates, 1970–1995. Working paper. University of California, Santa Barbara, Department of Sociology.
Reilly, D. (2012). Gender, culture, and sex-typed cognitive abilities. PLoS One 7:e39904. doi: 10.1371/journal.pone.0039904
Reilly, D., Neumann, D. L., and Andrews, G. (2019). Investigating gender differences in mathematics and science: results from the 2011 trends in mathematics and science survey. Res. Sci. Educ. 49, 25–50. doi: 10.1007/s11165-017-9630-6
Richardson, S. S., Reiches, M. W., Bruch, J., Boulicault, M., Noll, N. E., and Shattuck-Heidorn, H. (2020). Is there a gender-equality paradox in science, technology, engineering, and math (STEM)? Commentary on the study by Stoet and Geary (2018). Psychol. Sci. 31, 338–341. doi: 10.1177/0956797619872762
Ridgeway, C. L. (2006). “Gender as an organizing force in social relations: implications for the future of inequality” in The Declining Significance of Gender (New York: Russell Sage), 265–287.
Rosenbloom, J. L., Ash, R. A., Dupont, B., and Coder, L. (2008). Why are there so few women in information technology? Assessing the role of personality in career choices. J. Econ. Psychol. 29, 543–554. doi: 10.1016/j.joep.2007.09.005
Sax, L. J., Kanny, M. A., Riggers-Piehl, T. A., Whang, H., and Paulson, L. N. (2015). “But I’m not good at math”: the changing salience of mathematical self-concept in shaping Women’s and Men’s STEM aspirations. Res. High. Educ. 56, 813–842. doi: 10.1007/s11162-015-9375-x
Schmidt, F. L., and Hunter, J. E. (2015). Methods of Meta-Analysis: Correcting Error and Bias in Research Findings London: SAGE Publications, Ltd.
Schmitt, D. P. (2015). “The evolution of culturally-variable sex differences: men and women are not always different, but when they are …it appears not to result from patriarchy or sex role socialization” in The Evolution of Sexuality Evolutionary Psychology . eds. T. K. Shackelford and R. D. Hansen (Cham: Springer International Publishing), 221–256.
Schmitt, D. P., Long, A. E., McPhearson, A., O’Brien, K., Remmert, B., and Shah, S. H. (2017). Personality and gender differences in global perspective. Int. J. Psychol. 52, 45–56. doi: 10.1002/ijop.12265
Schmitt, D. P., Realo, A., Voracek, M., and Allik, J. (2008). Why can’t a man be more like a woman? Sex differences in big five personality traits across 55 cultures. J. Pers. Soc. Psychol. 94, 168–182. doi: 10.1037/0022-3514.94.1.168
Schwartz, S. H. (1999). A theory of cultural values and some implications for work. Appl. Psychol. 48, 23–47. doi: 10.1111/j.1464-0597.1999.tb00047.x
Schwartz, S. H., and Rubel, T. (2005). Sex differences in value priorities: cross-cultural and multimethod studies. J. Pers. Soc. Psychol. 89, 1010–1028. doi: 10.1037/0022-3514.89.6.1010
Schwartz, S. H., and Rubel-Lifschitz, T. (2009). Cross-national variation in the size of sex differences in values: effects of gender equality. J. Pers. Soc. Psychol. 97, 171–185. doi: 10.1037/a0015546
Selita, F., and Kovas, Y. (2019). Genes and gini: what inequality means for heritability. J. Biosoc. Sci. 51, 18–47. doi: 10.1017/S0021932017000645
Shokri, O., Kadivar, P., and Daneshvar Pour, Z. (2007). Gender differences in subjective well-being: role of personality traits. Iran. J. Psychiatry Clin. Psychol. 13, 280–289.
Sikora, J., and Pokropek, A. (2012). Gender segregation of adolescent science career plans in 50 countries. Sci. Ed. 96, 234–264. doi: 10.1002/sce.20479
Sohn, K. (2015). The influence of birth season on height: evidence from Indonesia: the influence of birth season on height. Am. J. Phys. Anthropol. 157, 659–665. doi: 10.1002/ajpa.22763
Stoet, G., Bailey, D. H., Moore, A. M., and Geary, D. C. (2016). Countries with higher levels of gender equality show larger national sex differences in mathematics anxiety and relatively lower parental mathematics valuation for girls. PLoS One 11:e0153857. doi: 10.1371/journal.pone.0153857
Stoet, G., and Geary, D. C. (2013). Sex differences in mathematics and reading achievement are inversely related: within- and across-nation assessment of 10 years of PISA data. PLoS One 8:e57988. doi: 10.1371/journal.pone.0057988
Stoet, G., and Geary, D. C. (2015). Sex differences in academic achievement are not related to political, economic, or social equality. Intelligence 48, 137–151. doi: 10.1016/j.intell.2014.11.006
Stoet, G., and Geary, D. C. (2018). The gender-equality paradox in science, technology, engineering, and mathematics education. Psychol. Sci. 29, 581–593. doi: 10.1177/0956797617741719
Stoet, G., and Geary, D. C. (2019). A simplified approach to measuring national gender inequality. PLoS One 14:e0205349. doi: 10.1371/journal.pone.0205349
Stoet, G., and Geary, D. C. (2020). Sex-specific academic ability and attitude patterns in students across developed countries. Intelligence 81:101453. doi: 10.1016/j.intell.2020.101453
Su, R., Rounds, J., and Armstrong, P. I. (2009). Men and things, women and people: a meta-analysis of sex differences in interests. Psychol. Bull. 135, 859–884. doi: 10.1037/a0017364
Tao, C., Glosenberg, A., Tracey, T. J. G., Blustein, D. L., and Foster, L. L. (2022). Are gender differences in vocational interests universal?: moderating effects of cultural dimensions. Sex Roles 87, 327–349. doi: 10.1007/s11199-022-01318-w
Teder, T., and Tammaru, T. (2005). Sexual size dimorphism within species increases with body size in insects. Oikos 108, 321–334. doi: 10.1111/j.0030-1299.2005.13609.x
Uchiyama, R., Spicer, R., and Muthukrishna, M. (2022). Cultural evolution of genetic heritability. Behav. Brain Sci. 45:e152. doi: 10.1017/S0140525X21000893
van de Werfhorst, H. G. (2017). Gender segregation across fields of study in post-secondary education: trends and social differentials. Eur. Sociol. Rev. 33, 449–464. doi: 10.1093/esr/jcx040
van der Slik, F. W. P., van Hout, R. W. N. M., and Schepens, J. J. (2015). The gender gap in second language acquisition: gender differences in the Acquisition of Dutch among immigrants from 88 countries with 49 mother tongues. PLoS One 10:e0142056. doi: 10.1371/journal.pone.0142056
van Staveren, I. (2013). To measure is to know? A comparative analysis of gender indices. Rev. Soc. Econ. 71, 339–372. doi: 10.1080/00346764.2012.707398
Vandermassen, G. (2011). Evolution and rape: a feminist Darwinian perspective. Sex Roles 64, 732–747. doi: 10.1007/s11199-010-9895-y
Vianello, M., Schnabel, K., Sriram, N., and Nosek, B. (2013). Gender differences in implicit and explicit personality traits. Personal. Individ. Differ. 55, 994–999. doi: 10.1016/j.paid.2013.08.008
Vishkin, A. (2022). Queen’s gambit declined: the gender-equality paradox in chess participation across 160 countries. Psychol. Sci. 33, 276–284. doi: 10.1177/09567976211034806
Wai, J., Hodges, J., and Makel, M. C. (2018). Sex differences in ability tilt in the right tail of cognitive abilities: a 35-year examination. Intelligence 67, 76–83. doi: 10.1016/j.intell.2018.02.003
Wang, M.-T., and Degol, J. L. (2017). Gender gap in science, technology, engineering, and mathematics (STEM): current knowledge, implications for practice, policy, and future directions. Educ. Psychol. Rev. 29, 119–140. doi: 10.1007/s10648-015-9355-x
Weisberg, Y. J., DeYoung, C. G., and Hirsh, J. B. (2011). Gender differences in personality across the ten aspects of the big five. Front. Psychol. 2:178. doi: 10.3389/fpsyg.2011.00178
West, C., and Zimmerman, D. H. (1987). Doing gender. Gend. Soc. 1, 125–151. doi: 10.1177/0891243287001002002
Wigfield, A. (1994). Expectancy-value theory of achievement motivation: a developmental perspective. Educ. Psychol. Rev. 6, 49–78. doi: 10.1007/BF02209024
Wong, Y. L. A., and Charles, M. (2020). “Gender and occupational segregation” in Companion to Women’s and Gender Studies . ed. N. A. Naples (Oxford: Wiley), 303–325.
Wood, W., and Eagly, A. H. (2012). “Biosocial construction of sex differences and similarities in behavior” in Advances in Experimental Social Psychology. Vol. 46. eds. J. M. Olson and M. P. Zanna (Burlington: Academic Press), 55–123.
Wood, W., and Eagly, A. H. (2013). Biology or culture alone cannot account for human sex differences and similarities. Psychol. Inq. 24, 241–247. doi: 10.1080/1047840X.2013.815034
Keywords: gender equality paradox, gender equality, gender differences, basic skills, personality
Citation: Balducci M (2023) Linking gender differences with gender equality: A systematic-narrative literature review of basic skills and personality. Front. Psychol . 14:1105234. doi: 10.3389/fpsyg.2023.1105234
Received: 22 November 2022; Accepted: 27 January 2023; Published: 16 February 2023.
Reviewed by:
Copyright © 2023 Balducci. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Marco Balducci, ✉ [email protected]
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Student research journal, gender differences and sentencing: a critical literature review.
This review focuses on various pieces of literature that surrounds the perceived differences in sentencing gender. Also, literature examining the reasons why these differences are taking place between genders, and theories that could be applied when explaining these differences, will be scrutinised in order to give an indication as to whether a reason for gender differences in sentencing has been identified. The two theories that will be focused are the Chivalry theory (including Selective Chivalry) and the Double Deviance/ Evil Woman theory. Some other factors effects on sentencing, and the literature surrounding them, were also looked at as it would be negligent to say that only one factor could cause the perceived disparity between male and female sentencing. This review mainly focused on bodies of work based in the United States of America. This is because a large amount of research has been done in this area in the United States. Therefore, any questions answered will mostly be only applicable to that country due to cultural and legal differences in other parts of the world. Throughout the review a lot of bodies of research can be seen to be relatively supportive of the ideas that Double Deviance and Selective Chivalry has on the sentencing process, less so for regular chivalry. This is because various other factors seem to have some sort of effect as well as gender. Therefore, it is perhaps inaccurate to point to gender being the factor that decisively affects the sentencing outcomes. More research should be done in this area to fully grasp the relationship between gender and sentencing outcomes, while taking into account a larger number of relevant factors (legal and extra-legal) in order to not over attribute the outcomes to gender.
Author: Kieran Malon, April 2020
BA (Hons) Criminology with Psychology
A question that has long been discussed in various forms of academic literature is why there seems to be a difference in how genders are treated during the sentencing phase of trials. Within the United States the male population in prisons massively outnumber the female population. This may suggest a difference in how genders are treated at some stage during the criminal justice process in the United States. The stage that will be focused on within this review will be the sentencing phase of the system. A focus will also be made on two theories that have been looked at in various pieces of academic literature as well as other factors that may be contributing to this disparity in treatment. The literature will then be reviewed, and its validity will be looked at in relevancy and ability to test and explain the differences between genders.
The sentencing phase of the criminal justice process was chosen because there is evidence to suggest that this area is where the most difference in how the differing genders are treated in relation to what they are sentenced to do. This is suggested by academic literature like authors such as Steffensmeier et, al (1998), who used the gender effect as one of the factors that affect how an individual is sentenced, and Gelsthorpe (2013) who also looked at this factor as well as whether that number is justified in the crimes the crimes they commit. The justifiability may come in the form of whether the different genders get the same treatment when it comes to being sentenced to a crime that is similar in nature. If after looking at that justifiability differences do appear, we will then begin the process of looking at why that may be. Some theories that have been hypothesised will be looked at in relation to any differences in treatment found and the literature surrounding these theories will be reviewed and scrutinised in order to find if they have any relevant effects on modern sentencing outcomes.
The two theories that will be focused on when it comes to this topic are the Double Deviance theory and the Chivalry theory. These two are theories that have been discussed frequently in literature when it comes to this area of the criminal justice system. The literature surrounding these theories will be further discussed later in the text and their relevance, or lack of, will also be discussed further on. Beyond these two theories however, I will also be briefly looking at further factors that have been hypothesised to affect this stage as well as the gender of a defendant. This may include popular factors in research in this area such as race and age of a defendant. Also, as these factors may interrelate with each other it is important to discuss how these may be advantageous to some groups of people and disadvantageous to other groups. This may lead to different sentencing being given to different groups of people depending on characteristics that may be out of their control.
Within this review, three studies have been chosen as the focus for each of the theories (Chivalry and Double Deviance/ Evil Woman). These studies will be analysed as well as various other forms literature on these theories and relevant studies will be mentioned in reference to whether they support the studies in focus. An attempt at looking at a variety of different crimes has been made to investigate whether the theories apply or not across a spectrum of different sentencing events. In order to take into account other factors that may contribute to any differences, other than gender, in sentencing, two legal and two extra-legal factors have been chosen for further discussion. However, it should be noted that there are a range of other factors that will not be discussed in as much detail due to the large amount. But it is noted that these other factors exist that may influence sentencing decisions. This will be finished with a discussion on future studies that could be conducted as well as the limitations of this review and a conclusion on what can be found from the reviewed literature.
This is an idea that was put forward by Otto Pollak (1950) to suggest that women within the criminal justice system are treated much more leniently than men due to the idea of chivalry towards women. Later Paternalism would be identified as something that frequently follows the chivalry aspect. It is suggested under this theory that law officials/judges see women as child-like and defenceless in their behaviour (Herzog and Oreg, 2008) and therefore are in need of protection, this leads to said leniency I favour of women. It can be said that Pollak’s research could be seen as outdated, we look at whether elements of the theory can be seen in today’s criminal justice system when looking at sentencing. Paternalism has led to modifications to the Chivalry theory however, this has been called Selective Chivalry. It is suggested by Farnworth and Teske (1995) that the leniency that comes with this chivalry idea is only open to white women and those who have wealthy backgrounds (Jeffries and Bond, 2013). There have been studies conducted on a range of different crimes which then have looked at how chivalry could possibly influence how sentencing decisions turn out the way in which they do.
Holland and Prohaska (2018) conducted a study in which they looked at whether females were more likely than males to receive shorter sentences while also controlling for relevant factors that could possibly affect the sentences also. One factor that they did want to account for and investigate further, in addition to gender, was race. Racial effects will be discussed in more depth further in the review, however they did want to see, as well as if there are differences between men and women, whether there are differences in sentences between women of different racial groups. This would allow a view into whether just gender could possibly effect sentencing between the males and females, or if other factors also need to be present to effect sentencing. Therefore, a second hypothesis was that white women would receive more lenient sentences than women of colour, which would support the work of Farnworth and Teske (2008) who suggested chivalry would only apply to white women. They also took into account geography when making various assumptions about what the results may show in their hypotheses. They hypothesised that women in the south of the country will be sentenced differently from women in other regions. The data collected looks at all federal cases that spanned the year between the 1 st of October 2014 and 30 th of September 2015. Also, due to the database containing information on a range of controls for legal and extra-legal factors, it means that an in depth analysis can be done to measure the various factors influence on the sentencing process and seeing how they could interact with the gender factor to lead to a sentencing outcome.
From the results, they found that their first hypothesis was proven correct. Women in general did receive shorter sentence lengths in comparison to males. This is also with legal factors considered and supports the chivalry theory. This supports the various bodies of work that have shown similar results through various experiments they have conducted (Doerner and Demuth, 2010; Rodriguez et al., 2006). This supports the idea that women are not seen by law officials as being as culpable for the crimes they committed as men are. However, the results contradicted the selective chivalry claims that would suggest that white women would mostly benefit from the leniency hypothesised in the chivalry theory. Hispanic and black women got shorter sentences when sentenced for federal drug crimes. This is surprising as the reverse has been shown in respect to selective chivalry as white women have benefitted the least from the suggested leniency shown in the results of this study. However, that may be because they have been adjudged to have been more out of line with traditional gender roles leading them to punished as doubly deviant rather then being viewed as needing protection.
There are some limitations on this study even if it does cover a population across a large area (Across multiple states). This is a problem in itself as guidelines differ across different states. This means that some judges will have more discretion than others, allowing them more freedom in decision making on sentences. Therefore, that has to be taken into account when looking at the results. Therefore, this may need to be built upon by more studies looking on a state by state basis accounting for those guidelines and taking them into account. There are also multiple other variables that could be explored. These may explain any contradiction with other forms of research that show support for selective chivalry as it does not have any detail on if these other factors could have had a effect on the sentencing process.
Embry and Lyons (2012) focused their study on the discrepancies in the way that male and female sex offenders are sentenced and how chivalry theory could possibly have an influence in these sentencing decisions. They give an initial idea of what they expect to find, which is based upon previous literature (Jeffries, Fletcher and Newbold, 2003; Curry, Lee and Rodriguez, 2004), that females receive more lenience than males do when it comes to sentencing. For their study, they collected data from the National Corrections Reporting program in order to do secondary data analysis. The sample they used was spread over ten years but they originally had more than this. They used the most recent ten years in order to get the most relevant results. This gave them a more modernised picture of what is affecting the current sentencing process as values and views can change over time. Also, using a large time frame allowed them to offset another problem which is the low number of female offenders who have been sentenced on being a sex offender. Therefore, looking at a data set over a larger period allows them to have a large sample of female offenders in the data to look at and analyse.
From the study, the evidence showed that although there is no difference in sentencing rates between men and women who commit sexual offences, men do tend to get harsher sentences. This shows that although judges can see that women should be charged (because they have committed a crime), they may not believe that they are as perhaps dangerous as male sex offenders are. Embry and Lyons (2012) earlier talk about this stereotypical image of a sex offender, which is usually a male offender with victim coming to mind as a young female according to them. This could be proven to have some accuracy if you look at the perceived leniency that could be inferred from the results of this study. Due to women not fitting the stereotypical image of a sex offender, even if they have in fact committed the crime, they may still be deemed as less dangerous as a male sex offender. This shows that although the decision that all genders need to be punished for criminal offences is equal, the severity of the punishment across genders is not equal.
There are areas that could be developed in this study in order to perhaps improve its scope and relevancy to a broader population as well as limitations that can be identified in this study. Although the number of women that were included in these studies was a fairly even split, this may not always be a study that can be compared to real life. This is because although it was a fairly even numbers, compared to a lot of studies, the offending rates of women are way below the offending rates of males. Therefore, even if they did get a better idea how the factors and theories may affect a trial. Therefore, if you did want to investigate these discrepancies women will based on even figures, women will be sampled far more than they generally offend. These results may simply seem to point to one answer when it is simply just a question of numbers in terms of offending rates. There is also an issue with the fact that the study they conducted was built from basic figures taken from the data base they were sourced. Although we can assume from the studies that support the chivalry theory that sentence lengths were affected by the gender of the individual being sentenced, we can be certain due to the lack of specificity within the statistics. Therefore, we cannot rule out the fact that there was a higher percentage of males who had committed a more severe form of the crime that they had committed in comparison to the females who were being sentenced. A final limitation would be that although they would class this as a cautious generalization, they could only possibly say that it is a cautious generalization of the population in the United States, where it was based. The sample population that had been sentenced that they were looking at was entirely from the United States, if they broadened their sample to include statistics from various other countries a much larger cautious observation could be made.
Therefore, if they were going to look at doing a further study with this as the basis, a few steps should be taken to expand on this body of research. A more detailed data set would be needed in order to see more information about crimes committed or perhaps so we can find out more about the defendant being sentenced. We would hopefully be able to see whether Chivalry was in fact taking affect in the sentencing phase, if there were legitimate reasons for sentencing for one group being to harsher degree or if other theories and factors may be able to be more relevant in affecting the process. Finally, a sample of offenders that are from a range of different countries in order to give it the best chance of it being more generalizable to the rest of the world. Different countries have different views, values and offending rates. It would therefore be interesting to see if these theories can be applied across more than just one country.
A final study that looks at the Chivalry theory is a study conducted by Spivak et al. (2014), who looked at an area that is unique from the other pieces of literature that were focussed upon. They looked at status offences committed by juvenile offenders. Status offences committed by juveniles include but not limited to, truancy, consumption of alcohol or tobacco or running away. This is an interesting area as previous literature done on this suggests that status offences are the only area in the juvenile system in which female offenders outnumber male offenders (Tracey et al., 2009). It is generally assumed that males do commit more crime than females (Messerschmidt, 2007) and so to find a category in which males do not outnumber females, and in fact females outnumber males, makes it an area for further study. The study was conducted in Oklahoma and the data was collected by a local agency that collects basic data on juvenile cases. They then cut down the cases to only look at the relevant cases in relation to the type of crime they were looking at (Status offences). In relation to this review, two of the hypotheses included looking at the cases as to whether the chivalry theory could apply to these cases. More specifically, they wanted to see if girls’ cases were filed for review, in comparison to boys. They suggest that if proven this may show a want to make sure girls’ cases are scrutinised to make sure they get a correct judgement.
From the results of the study conducted, both hypotheses relating to chivalry seem to be supported. The results show in this that girls were more likely then boys to have their cases further reviewed. An idea of why this could be explained by the chivalry theory is they want to try and protect girls from being guilty through further looking at their cases and the circumstances behind them. This perhaps leading to mitigating circumstances being shown on their behalf which could result in them receiving lesser sentences. Therefore, if chivalrous and paternalistic attitudes can be found even when it comes to looking at cases involving juveniles, it suggests that the want to protect females may start from juvenile court and be seen through most age groups once moved to be judged and sentenced as an adult. However, in this case it must be noted that although it was shown that girls did tend to get lesser sentences than boys, the relationship between gender and lesser sentences was very weak. Meaning that more studies must be done in this area as it is inconclusive when it comes to whether sentencing may differentiate between female and male juveniles, even if there is a slight relationship in favour leniency towards girls.
As is a regular problem when it comes to a lot of research in this area, a lot focus on one state for their research. This means that it cannot be generalisable as there are many differences in population and justice processes across the world and even in the United States. Therefore, more research in this area would be needed specially to help the more inconclusive aspects of this study. The database used is also quite dated for this study as they admit. This means that changes might be viewed if data was collected for juvenile cases now. If policies have been brought more recently, this may lead to a difference in results and lead to different hypotheses being drawn.
Double Deviance is theory based on the point of view that certain women are punished under the view of doing two things wrong. They are viewed as having broken societal norms and expectations of how a woman behaves, as well as breaking the law. They are then judged upon the basis that they have done doubly wrong. Murphy and Brown (2000) suggest that under this theory it creates a situation where women can either be demonised or can be shown more leniency depending on if they broke these societal norms on what is expected of women. Double Deviance theory which is sometimes referred to as the Evil Woman theory. Although this idea may seem like selective chivalry in the way in which some women may be treated more leniently. It differs greatly in the notion that women who break these societal ideas of gender norms are punished even greater than men do when they commit the same crime. Women who fit into this theory and are seen as doubly deviant are seen as more blame worthy in this case which leads to harsher sentences, even if the crime they have committed is the same (Herzog and Oreg, 2008; Tillyer, Hartley and Ward, 2015).
Tillyer et, al (2015) based their study on looking at the perceived unfairness that exists within the court systems. Although they noted various factors may contribute to these differing sentencing outcomes between various groups, they chose to focus this study on gender and the different theories surrounding the gender factor. The crime they chose to investigate was narcotic cases taken from a federal data base. Narcotic cases have been chosen because it is a crime that will be viewed as breaking traditional gender roles. Therefore, if the theory is to be accurate the results will show that women get a harsher punishment then men who commit the same crime. Some factors may effect this as well however such as criminal history as well as evidence showing that race may have an effect as well (Spohn and Holleran, 2006) therefore that also has to be taken into account when looking at the gender and sentencing differences. The dates taken from the database in order to be analysed has been specifically chosen due to it have the highest amount of women to have committed crime within this time frame, with females being most populous being sentenced for this crime than any other in the dates analysed. This should allow for a good comparison to be made between male and female sentencing cases as it gives more cases that will be analysed then other crimes and dates. According to the authors they look to answer two hypotheses. They want to test whether women with limited criminal history will get more lenient sentences and those with extensive criminal history get a harsher sentence then their male counterparts.
The results in this case showed support for both the hypotheses in their assumptions. The first hypothesis which suggested more lenient sentences for women who had limited criminal histories was correct. This can be assumed under the Double Deviance theory that it has led to those who are deemed to have acted in accordance with the image of how an idealised woman should act, which has led to them getting a more lenient sentence overall. The results gained from the database also suggested that the second hypothesis was also correct in it assumption that women who have been deemed to have broken the law and the norms of societal conduct for a women have been punished more severely than men who committed a similar offense. This assumption supports the hypothesis of Bontrager et, al (2013) and Herzog and Oreg (2008) that women are punished further for factors and occurrences that allow them to be viewed as doubly deviant. However, this could also show why there is a gap between the genders within the prison numbers. Although some women are overly punished for breaking a conceived notion of how a woman should act, some are also being given lesser sentences then men for committing similar crimes, which may explain some huge gaps in numbers between the genders in prisons. This study also shows the importance of how criminal history interacts with gender to influence the sentencing phase. That is a unique aspect about this study and has allowed it identify a key factor that has effected women who are being sentenced and may allow law officials to become more aware of factors that may be unconsciously effecting their decision making process. This sort of identification, if it increased awareness, could lead to more equal sentences in regards to these factors.
The limitations that have been noted does increasingly affect the ability to make this study generalizable. A large limitation that seems plague this study is the effect that new policy changes have on the sentencing phase. These policy changes may give judges more or less discretion when it comes to sentencing. For example, if a policy came in after this study was released that limited a judge’s discretion on sentencing for crimes like drug crimes, it may lead to a more equal distribution regardless of gender. This would be because a judge would then have less freedom when they are passing a sentence and may have to stick to more rigid guidelines when sentencing. If a policy such as this did come into law, then another study would have to be held in order to investigate how these changes may have affected sentencing and may lead to theories like the double deviance theory losing its validity. A further limitation is absence of a few different factors that may provide more information on the complexities when it comes to examining gender and sentencing. For example, they only limitedly consider factors such as family status and whether they are depended upon by others, which may include children. This is identified as a factor that could influence a sentencing decision and would be helpful if a more in depth study was done to look at how factors such as this could impact the findings and if they would give them a different look to lead to different findings.
The study conducted by Koons-Witt et, al (2012) had a smaller focus when it comes to population they focused on. They conducted their study on the state of South Carolina, whereas the previous study looked at sentences across the country. However, within this study they looked at various crimes to see how the effect of gender changes across the various crimes that are committed. As well as looking at how gender has an effect when it comes to sentencing, they consider various other factors and how they interact with interact with gender sentencing processes. These include race and gender which will be discussed later in the review. Within the study they expect to find that women are treated more leniently when it comes to sentencing, which is in line with chivalry theory. However, as mentioned within this study the environment they a basing this study in must be taken into account. According to Koons-Witt et, al (2012) the state of South Carolina historically has rather conservative views of women which means that there may be stronger view, than in most places, in the traditional gender role of women. Traditional crimes for women are often non-violent crimes such as fraud according to Rodriguez et, al (2006). Therefore, when women commit crimes that are not viewed as traditionally crimes women commit (violent crimes), the evil woman hypothesis may have a stronger effect then it may do in states or countries with less conservative views. The data for this study was collected from a now disbanded commission within the state. They focused on the latest data set that was made by the commission which was from 2001. This was due to various reforms that made the data collected before problematic to use. Therefore, they had 12 months of data to study and analyse.
The results for this study again showed support for the double deviance hypothesis. Similarly Tillyer et, al (2015), shows that only women with very limited criminal history are shown leniency in this case. Koons-Witt et, al (2012) mentions how the effect of women having an extensive criminal history background makes the leniency that is shown to women completely vanishes which then leads to them being sentenced on the same level as men. A woman with an extensive criminal history seems to then lose the protection that is often shown when it comes to sentencing. This could be as the law officials view that as the criminal history builds up, they are viewed as increasingly culpable for their actions and lose leniency that they are perceived to have. As a result, they receive greater sentences than females with lesser criminal histories might have. The support for the evil woman theory is further shown by this study because it again shows that women are shown leniency up until a certain point within the criminal justice system. Although this study focuses just on one state, it still supports a pattern that has been shown in many different pieces of literature on this topic (Belknap, 2007; Franklin & Fearn, 2008). This can also explain why there is a clear difference in how women and men are treated in this study. They note that in the sample they looked at the average for women with no criminal history was lower than the average for males with no criminal history. Therefore, this would then lead to a lack of cases where the double deviance hypothesis would take place which would, if women did have a higher average criminal history, theoretically lead to more women being sentenced on a similar or more harsh level then the males.
There some limitations to this study and suggestions on how to further this line of research in the future, if it has not already been conducted. The focus of this study was on, as the author says, a rather conservative state in relation to how women are supposed to behave. Due to this being on juts the single state, it would be interesting to see how this would compare to other states that are viewed as conservative in a similar way to South Carolina. This would give a better idea of how the ideological in the environment may change how different genders are sentenced. If similar results are seen, then it can be ruled out as an outlier and helps with generalizability. It would also be interesting to see how a more liberal state would sentence with a focus on differences between genders. This would give information on whether the conservativeness of the environment may have an effect of sentencing between genders and if it does, how much of an effect does it have. Due to it being secondary data they are working from as well, they cannot control what was recorded and what was not recorded. This has led to various factors being excluded from the recorded data that may have had an influence on the sentencing process. Although this may not have necessarily affected what was learnt from the results, some information that may have allowed a further, more detailed analysis from taking place. This would have looked at how these factors interacted with gender and sentencing outcomes, due to some of these factors often being stereotypically thought of as female roles, such as whether there are dependent children or the role of the individual with a family. These may have an effect due to women often being thought of as the primary care giver when children are involved compared to males. Daly (1987) suggested that women are more likely to receive leniency due to them holding important familial roles. This therefore that accounts for some disparity in sentencing outcomes between genders. However, without data recorded on this factor we cannot tell if this influenced sentencing outcomes in this case.
Tasca et, al (2018) based their study upon analysing how parental status and sentence length interact with each other. The study was based in the state of Arizona with data collected through self-report and official data. The questions for their study were based around whether the parenthood factor, whether they have children that they care for, get affected by gender when sentencing. They also wanted to look at how sentencing lengths vary between offenders without children as well as parents who are involved with their children and those who are not involved. Various studies have already been done in this area on how familial status may affect sentencing outcomes (Daly, 1987; Freiburger, 2011). However, the unique aspect of this study is that it also considers how involved the offender may be with their children. This will give a more complex look into how the parenting dynamic affects sentencing outcome, while considering how gender plays into the dynamic. Within this study, they also look at various smaller factors within the parenting dynamic, such as how women incarcerated often tend to be single mothers with reliant children. Therefore, with factors like these in mind it may be easier to interpret the results as well as make initial predictions. Although within this study they are not necessarily looking into how the evil woman theory plays into this study, through the results it gives you an idea of how the theory’s ideas may have had a role in affecting the sentencing outcomes.
The results, if split between men and women parents, without breaking them down into the further categories could already suggest that women might be viewed as doubly deviant in the crimes they have committed. This is because the results show that female parents received longer sentences than fathers. Under the evil woman hypothesis, you could suggest that this may be because they have broken the traditional gender role of committing a crime when they are supposedly the primary caregiver in comparison males who are traditionally the economic support for the family. These initial results contradict previous research and hypothesis (Daly, 1987; Daly, 1989), however there may be other factors to be revealed that explain this further in the results section. Once the results were then further broken down, although it did show that women who lived with their children prior to arrest received more lenient sentences compared to mothers uninvolved with their children. However, it still shows that males still received less harsh sentences then the women. The fact that the results did show less lenient sentences on women who were uninvolved with their children could suggest that double deviance has an effect. It could be argued that the women who are uninvolved with their children are being more severely punished for not performing their motherly duties and are therefore being punished for more than just the crime they committed. It is also mentioned that the sample was drawn from offenders who are not new to being involved with the criminal justice system. Therefore, the criminal history factor also has to be taken into account as it has been earlier suggested that it can negate any leniency they may have been granted under the chivalry theory (Tillyer et al, 2015; Koons-Witt et al, 2012). Therefore, it could be argued that because this study was mainly populated by a sample of women who have a criminal history, most would lose the leniency effect that would be applied to those without a criminal history. Meaning that if the evil woman theory was applied here it could explain why the women had on average a higher length of sentence then the males did. As under the theory they would be treated more harshly for breaking traditional gender roles and committing the crime.
There are couple of limitations and suggestions for future studies that could be made based from this study. One limitation of this is the need for a larger sample. This is because varying court systems throughout the United States, as well as throughout the world, may show different results. It may have been more generalizable if they at least looked at various court judgements in different states within the country. Due to the different court systems between states giving judges varying degrees of discretion, that may prove to be a key factor in how sentencing patterns show in the results. Therefore, a larger more varied sample should be looked at in further research to see if the results and hypotheses drawn from the results are applicable to more than just Arizona. As the authors mentioned also, the majority of the sample had criminal backgrounds and that may have had a potential effect on the results. Therefore, a more varied sample in terms of whether they have a vast criminal history or not should be explored. This will allow a more balanced view on how much effect certain factors have on the sentencing process as it hard to measure the effect of factors such as criminal history when many of them have criminal histories.
Although a lot of literature that has been analysed show a pattern that supports the hypotheses. There is going to be some literature that contradicts the pattern that is seen. This can be for various reasons and these reasons can be discussed later when looking at the literature. One piece of literature that was reviewed did seem to contradict the pattern of support and although there may be more, only one will be the focus here.
A study that contradicts the findings in support of the theory is one that looks at chivalry in relation to nonviolent offending. Specifically, the study undertaken by Koeppel (2012), looks at nonviolent property crime in rural areas in the state of Iowa, in the United States. This study is interesting as Rodriguez et, al (2006) mentioned that there are varying degrees of leniency in the sentencing process depending on the crime, but males do often receive harsher sentences. Rodriguez et, al (2006) and their body of work suggested that males were more likely to receive harsher sentence outcomes when it comes to property crime. This, if backed up by continuous research as well as the study of focus here, would look like it would be following a pattern on which females receive more lenient sentences across multiple types of crime. The author of this study (Koeppel, 2012) suggests that we may see differences because of the setting also, as the they are focussing on a rural setting. Steffensmeier et al. (1998) suggests that because of the volume of cases that an urban judge must get through, they succumb to using stereotypes and generalizations in order to make their decisions. Therefore, if this theory is correct, we should see a decline in the difference in sentences between male and female who commit the same crime. This could then show evidence against chivalry and suggest other factors may yet have a larger influence on sentencing.
The data was collected from five small rural counties that have similar demographics; this could then decrease the effect that different environments may have had on the process. The sample size in this case is a 507 split between 188 females and 319 males. Worth noting also was that 95% of the sample were white which will be a point brought up later when analysing this study. The presumed result for this study was based from previous research and that a clear pattern of leniency in favour of females would be present. However, the results of the study contradicted previous research and found that gender did not have a significant effect on the sentencing outcomes. This is surprising, if the Chivalry theory is to be believed, considering the nature of the crime itself. The crimes looked at were nonviolent crimes that would, in theory, be seen as crimes a woman would get more leniency under the Chivalry theory. Herzog and Oreg (2008) discuss how the judges would think of women as defenceless and weak under chivalry and paternalism. Therefore, because it is a white, female, nonviolent offender they fill that criteria in which chivalry/paternalism is suppose have an effect and lead to leniency in sentencing them. This may also give support to the idea that sentencing may vary between urban and rural with the idea of chivalry also having a different degree of effect. It also goes against Rodriguez et, al (2006) and their idea that chivalry can be seen across the entire spectrum of crime sentences and even if it does have an effect, that effect is negligible enough to not affect the results as much as it would compared to other crimes perhaps because of the non-violent nature of it. Also, due to there being no specific gender effect notable, the double deviance theory cannot be applied either as measures of traditional sex roles did not have any effect on sentencing length. Therefore, in this case the results suggest that no matter if they have broken traditional gender norms, they sentenced on equal basis with women who might not be viewed to have broken traditional gender norms.
There are some developments that could be made to this study that would allow researchers to develop a clearer and more real to life picture of this area of study. One such development would be to look at basing the study in more diverse areas or look at similar data from rural areas in different countries if they want to focus on the rural aspect of the study. This is because it has been suggested (Herzog and Oreg, 2008; Jeffries and Bond, 2013) that chivalry only really effectively applies to certain women who fit the ideal image of “womanhood” which according to Steffensmeier and Demuth (2006) is white and middle class. Therefore, because this study was done in an environment where there is a miniscule amount of individuals, compared to the sample population size, where from an ethnically diverse background, it makes it hard to be generalizable except from maybe to small counties with little ethnic diversity like the ones that have been researched in this study. Also, although it could show some partial evidence against general chivalry’s effects on sentencing for this crime, it would struggle with some other theories. It would struggle to show evidence against selective chivalry as it has very little diversity to show that certain women would get better treatment then others due to its lack of a diverse sample population. As previously referred, it would only be able to show evidence to support counties with similar populations to itself and it would struggle to provide evidence in other states and counties as the diversity varies around the country. Support for the generalization of the conclusions of these studies is very low as it would only apply to a very small group. They state themselves that the results may differ from rural counties in different states. A further limitation was the issue that for some cases they could not access their criminal background. Therefore, they were unable to take the factor of whether some may have a criminal history which would then affect the outcome. The resulting sentence could then be different because of this and would then have to be considered when looking at the results of this study. For example, the reason why in some of the cases women look like their sentence is just as long or harsh as their male counterparts is because they have a criminal history. If this was then it may be that women in this case may have only gotten similar judgements due to having a criminal history. Although, it may be unlikely that enough women had criminal histories that affected their sentence length to then affect the results and conclusion of this study, we cannot rule it out due there being no proof against this idea.
So far, the focus has been on how certain theories help explain the differences that may be seen during the sentencing phase of the criminal justice system. Various other factors have been mentioned when looking at the theories and how they may moderate and interact with gender and the theories surrounding differences in sentencing in relation to gender. However, this chapter will look more specifically at studies surrounding factors and how they may have an influence on the sentencing process. Four factors, legal and extra-legal, shall be looked at within this section, however there are many more factors that have been looked at in various research and are can be said to have some sort of effect. Each factor will look at how they interact with gender and sentencing, looking at whether they have a significant impact on how defendants are sentenced.
Within this section the race of the defendant being sentenced will be looked at in reference to seeing if there are any differences to be seen between gender and race. There are substantial bodies of research that detail how race can affect how individuals may be sentenced, with white defendants often getting the more lenient sentence (Brennan & Spohn, 2008; Spohn, 2000). The focus for this however will be how race and gender can lead to differences in sentencing with a discussion on what those differences are and what they show about the sentencing system.
What is found through looking through various pieces of literature is that black women do not receive the same level of lenience from the court system as white women do (Koons-Witt et al., 2012; Brennan, 2006; Steffensmeier et al., 1993). Therefore, you can assume the leniency that is supposed to apply to women when it comes to sentencing only applies to certain women, which supports the theory of selective chivalry. This shows that there may be some king of racial stereotype in causing this and allows them to fall out of the archetype image of women that need this protection because they believe they are have a low risk to society. However, there are also studies that contend this theory that black females are treated less leniently then white females. Many of these studies still find differences in how people of different races are sentenced, however they find that the difference mostly occurs when it comes to black and white males as earlier mentioned (Steffensmeier & Demuth, 2006; Spohn & Spears, 1997). It can also be suggested that the area that is picked to base the study may affect the outcome and results of these studies. It is suggested by Zatz (2000) that studies based in the United States seem to see a pattern of where race appears to be a larger factor in the sentencing phase and these states where you see it are in the south of the country. Therefore, you may assume that the study may vary from area to area depending on how strong traditional gender roles and views on women are valued. For example, the Koons-Witt et, al (2012) study is based in South Carolina which is in the south of the country. They mention how conservative views on women are held within the state and the results showed that black women and men appear to be on the receiving end of less lenient sentences. A lot of studies also seem to only consider black and white sentencing disparities. There are studies however, that show Hispanic defendants can often face even harsher punishment then both black and white defendants (Brennan & Spohn, 2008). Therefore, although race does seem to have a definite effect in relation to men on a general basis, it is less certain in its effect when women are the defendant. A future study could look at the effects of gender and race and compare them between various states or countries in order to determine whether there is any connection between the factors.
The factor of age has not yet been discussed in this review though it is often included as control within most literature. Most of these studies often focus on the age of male defendants rather than across the gender divide. This maybe perhaps due to it having more of an influence over male sentencing then female leading to a lack of need to test how the factor may influence sentencing of women.
Steffensmeier et al. (1998) hypothesised this when looking at how a range of different factors interact when sentencing. They found that younger males are often sentenced more harshly than older individuals. They also found that as the defendant got older, the race and gender differences diminished. This may be because they are deemed too old to be deemed as much of a threat to the general public. This was then further built upon by Spohn and Holleran (2000) who narrowed down the age in which an individual was most likely to be punished harshly. It also must be borne in mind that they looked at which ethnic groups were mostly likely to receive harsher punishment as well as the age range in which those groups would likely receive this punishment. They identified that black and Hispanic male defendants between the ages of 21 and 29 while being unemployed were most likely to receive the harshest punishments out of the different groups. Therefore, it must be noted that age can show various patterns, when interacting with other factors, that can lead to various other avenues of research and groups that can be focused on that lead to much groups being identified for further examination.
How serious an offence is may seem like a more obvious indicator into how a defendant may be sentenced with the idea that the more serious the crime, the harsher the sentence. However, this may be a key factor to explain the why there is a lot more males serving out more severe sentences. According to literature women are a lot less likely to commit serious crimes and violent crimes (Belkanp, 2007; Rodriguez et al., 2006). Non-violent crimes, as previously discussed, like fraud and theft are seen as crimes that traditionally women commit. However, this may result in women who commit violent crimes being affected by the views that are brought up in the theories above.
This is possibly the most key feature that can lessen the disparity between sentencing leniency. It is shown in research that a lot of the disparity disappears when it comes to violent crimes (Rodriguez el al., 2006; Boritch, 1992). This is because under the selective chivalry theory, this would fall out of its purview as it is crime that outside of the gender role expectations (Koons-Witt et al., 2012). They, therefore, lose the leniency that they would get under that paradigm as well as being judged as doubly deviant because of that breaking of the gender norms. Warren et al. (2011) suggest that as the seriousness of the offence increases the less discretion the judge then has when giving a sentence. This would then lead to this factor outweighing other key extra-legal factors, such as gender, because the seriousness of the crime has more influence over the sentence than any possible influence these extra-legal factors could possibly have on the outcome of the sentencing. This does mean that when the crime is less severe, the extra-legal factors can seem to have some measure of effect. Rodriguez et al. (2006) can support this view as they saw differences when it came to property crime in the way in which leniency was shown in favour of women when it came to sentence them.
Criminal history is factor that was discussed in one of the studies that was focused on earlier in this review. The results for that study suggested, in the case of narcotics cases at least, that women do tend to get more lenient sentences if they have a lack of criminal history in comparison to males with a lack of criminal history (Tillyer et al., 2015). This can be further backed by Spohn (2000) and Koons-Witt et al. (2012) in relation to chance of being incarcerated.
From the various pieces of literature that have reviewed (Tillyer et al., 2015; Spohn, 2000; Koons-Witt et al., 2012; Daly & Tonry, 1997), there is a large amount that suggest that criminal history is very useful factor to use when predicting how a defendant may be sentenced. Daly and Tonry (1997) suggest that when judges are deciding on how to sentence an individual, when they have some discretion on how sentence a defendant, they look at criminal history as means of predicting future behaviour. They suggest that they use it as a means of testing how much they respect the law. A large criminal history would suggest a lack of respect and give the impression that they have no reservations about breaking the law. If the judges have doubt that that the defendant will not commit another crime, then they will be a lot less likely to give sentence that will be lenient. They may have a lack of belief that the offender would learn from committing their crime and that it would expose the wider public to harm. All these different issues stemming from a larger criminal record would leave a law official less inclined to be lenient to what in their view may be a veteran offender.
In terms of volume of research in the area of sentencing disparities between genders, there is large amount, as shown above. However, there are a few interesting areas of future research that could be partaken in to further understanding with more specific factors. For example, an underlying factor in the United States may be what region the sentencing is taking place in (Zatz, 2000; Myers and Talarico, 1986). As mentioned earlier certain states seem to have conservative views regarding gender roles as well as Zatz (2000) mentioning that race effects can generally be seen to be stronger. A more recent investigation into the differences that can be seen across a variety of states would be an interesting area. As a detailed comparison between states that have historically been seen either conservative or liberal in their views would be able to give an idea how much of an effect the historical views of the state they are based in, is having. More studies in this area conducted outside of the United States would also be interesting. There is a very large body of work existing in this this area in the United States and not as many in other countries. Therefore, more studies done in other parts of the world may show if cultural differences still allow the ideas of the above theories to have a basis there. As well as how much the other extra-legal and legal factors have in relation to sentencing. This is a large limitation within this review as the most popular studies in this area are mostly conducted in the United States, making it harder to generalize it on continental or worldwide basis due to lack of a large body of research in other countries. Also due to there being so many different extra-legal factors that exist and could influence the sentencing process, more studies at looking into precisely how much each factor may influence a sentencing procedure. The current studies that have been conducted use databases that only give so much information on different factors. Therefore, a future study on the subject should try to address this issue which can be done by a range of measures. One such way could be creating a database that is more qualitive in its approach and includes more extra-legal factors with more information on the offender and their offences. This can allow the analysis of more than on factors, in order to establish their influence, they may have on decision, on the same sentencing cases. More research into race and gender sentencing outcomes may also be an avenue of further research as although there is research in this area a lot of them only investigate the black and white race comparison. This approach risks missing out on how other ethnic groups experience the sentencing process and if there can be witnessed differences in how they are treated in that phase of the justice system
Based from the evidence shown in the studies above and the results given. It can be suggested that Selective Chivalry and the Evil Woman/Double Deviance theories have some relevance in showing how sentencing decisions are made. Gender does appear to be a factor when determining how an offender is sentenced. It can be shown that a large body of work, in which I reviewed, shows a pattern that supports the idea that women are shown more leniency when it comes to sentencing. Yet are judged more harshly sometimes then men when other factors contribute that could break the traditional gender roles and traditional gender stereotypes such as committing violent crime, as opposed to non-violent crime. However, they also show that other factors may need to be taken into consideration in order for it to have a significant effect. Further research into how extra-legal may affect sentencing, due to the sheer number of factors that there is, and how they may be assisting the gender disparity when it comes to sentencing outcomes. Overall, the literature reviewed does generally point to the idea that gender does play a role in sentencing outcomes, but also that other factors are also considered in order to give sentence and not just gender. Chivalry, Selective Chivalry and Evil Woman/Double Deviance theories all cannot be discounted in having elements of the ideas within them being relevant to sentencing within the United States justice system.
Belknap, J. (2007). The invisible woman: Gender, crime, and justice (3rd ed.). Belmont, CA: Thomson Wadsworth Publishing Company
Bontrager, S., Barrick, K., & Stupi, E. (2013). Gender and sentencing: A meta-analysis of contemporary research. The Journal of Gender, Race, & Justice, [online] 16, 349-635. Available at: https://heinonline.org/HOL/Page?handle=hein.journals/jgrj16&id=371&collection=journals&index= [Accessed 3 April 2020]
Boritch, H., (1992). Gender and Criminal Court Outcomes: An Historical Analysis. Criminology , [online] 30(3), pp.293-326. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1745-9125.1992.tb01106.x [Accessed 3 April 2020].
Brennan, P. and Spohn, C., (2008). Race/Ethnicity and Sentencing Outcomes Among Drug Offenders in North Carolina. Journal of Contemporary Criminal Justice , [online] 24(4), pp.371-398. Available at: https://journals.sagepub.com/doi/abs/10.1177/1043986208322712 [Accessed 3 April 2020].
Curry, T., Lee, G. and Rodriguez, S., (2004). Does Victim Gender Increase Sentence Severity? Further Explorations of Gender Dynamics and Sentencing Outcomes. Crime & Delinquency , [online] 50(3), pp.319-343. Available at: https://journals.sagepub.com/doi/abs/10.1177/0011128703256265 [Accessed 3 April 2020].
Daly, K., (1987). Discrimination in the Criminal Courts: Family, Gender, and the Problem of Equal Treatment. Social Forces , [online] 66(1), p.152. Available at: https://academic.oup.com/sf/article-abstract/66/1/152/2231816 [Accessed 3 April 2020].
Daly, K., (1989). Rethinking Judicial Paternalism: Gender, Work-Family Relations, and Sentencing. Gender & Society , [online] 3(1), pp.9-36. Available at: https://journals.sagepub.com/doi/abs/10.1177/089124389003001002 [Accessed 3 April 2020].
Daly, K. and Tonry, M., (1997). Gender, Race, and Sentencing. Crime and Justice , [online] 22, pp.201-252. Available at: https://www.journals.uchicago.edu/doi/abs/10.1086/449263 [Accessed 3 April 2020].
Doerner, J. and Demuth, S., (2010). The Independent and Joint Effects of Race/Ethnicity, Gender, and Age on Sentencing Outcomes in U.S. Federal Courts. Justice Quarterly , [online] 27(1), pp.1-27. Available at: https://www.tandfonline.com/doi/full/10.1080/07418820902926197 [Accessed 3 April 2020].
Embry, R. and Lyons, P., (2012). Sex-Based Sentencing: Sentencing Discrepancies Between Male and Female Sex Offenders. Feminist Criminology , [online] 7(2), pp.146-162. Available at: https://journals.sagepub.com/doi/abs/10.1177/1557085111430214 [Accessed 3 April 2020].
Farnworth, M. and Teske, Jr., R., (1995). Gender Differences in Felony Court Processing: Women & Criminal Justice , [online] 6(2), pp.23-44. Available at: https://www.tandfonline.com/doi/abs/10.1300/J012v06n02_02 [Accessed 3 April 2020].
Fernando Rodriguez, S., Curry, T. and Lee, G., (2006). Gender Differences in Criminal Sentencing: Do Effects Vary Across Violent, Property, and Drug Offenses? Social Science Quarterly 87(2), pp.318-339. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1540-6237.2006.00383.x [Accessed 3 April 2020].
Franklin, C. and Fearn, N., (2008). Gender, race, and formal court decision-making outcomes: Chivalry/paternalism, conflict theory or gender conflict? Journal of Criminal Justice 36(3), pp.279-290. Available at: https://www.sciencedirect.com/science/article/pii/S0047235208000500 [Accessed 3 April 2020].
Freiburger, T., (2011). The Impact of Gender, Offense Type, and Familial Role on the Decision to Incarcerate. Social Justice Research , [online] 24(2), pp.143-167. Available at: https://link.springer.com/article/10.1007/s11211-011-0133-8 [Accessed 3 April 2020].
Gelsthorpe, L. (2013). ‘Sentencing and gender’ in Sheehan, R., McIvor, G., Trotter, C. (ed.) What Works with Women Offenders . Hoboken: Taylor and Francis.
Herzog, S. and Oreg, S., (2008). Chivalry and the Moderating Effect of Ambivalent Sexism: Individual Differences in Crime Seriousness Judgments. Law & Society Review , [online] 42(1), pp.45-74. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1540-5893.2008.00334.x [Accessed 3 April 2020].
Holland, M. and Prohaska, A., (2018). Gender Effects Across Place. Race and Justice , [online] p.215336871876749. Available at: https://journals.sagepub.com/doi/full/10.1177/2153368718767495 [Accessed 3 April 2020].
Jeffries, S. and Bond, C., (2013). Gender, Indigeneity, and the Criminal Courts: A Narrative Exploration of Women’s Sentencing in Western Australia. Women & Criminal Justice 23(1), pp.19-42. Available at: https://research-repository.griffith.edu.au/bitstream/handle/10072/55598/86997_1.pdf?sequence=1 [Accessed 3 April 2020].
JEFFRIES, S., FLETCHER, G. and NEWBOLD, G., (2003). PATHWAYS TO SEX-BASED DIFFERENTIATION IN CRIMINAL COURT SENTENCING. Criminology , [online] 41(2), pp.329-354. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1745-9125.2003.tb00990.x [Accessed 3 April 2020].
Koons-Witt, B., Sevigny, E., Burrow, J. and Hester, R., (2012). Gender and Sentencing Outcomes in South Carolina. Criminal Justice Policy Review , [online] 25(3), pp.299-324. Available at: https://journals.sagepub.com/doi/full/10.1177/0887403412468884 [Accessed 3 April 2020].
Messerschmidt, J., (2007). Masculinities, Crime and. The Blackwell Encyclopaedia of Sociology , Available at: https://onlinelibrary.wiley.com/doi/abs/10.1002/9781405165518.wbeosm039 [Accessed 3 April 2020].
Murphy, E. and Brown, J., (2000). Exploring gender role identity, value orientation of occupation and sex of respondent in influencing attitudes towards male and female offenders. Legal and Criminological Psychology , [online] 5(2), pp.285-290. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1348/135532500168047 [Accessed 3 April 2020].
Myers, M. and Talarico, S., (1986). The Social Contexts of Racial Discrimination in Sentencing. Social Problems 33(3), pp.236-251. Available at: https://academic.oup.com/socpro/article-abstract/33/3/236/1733372 [Accessed 3 April 2020].
Pollock, O. (1950). The criminality of women . Philadelphia: University of Pennsylvania Press
Spivak, A., Wagner, B., Whitmer, J. and Charish, C., (2014). Gender and Status Offending. Feminist Criminology , [online] 9(3), pp.224-248. Available at: https://journals.sagepub.com/doi/full/10.1177/1557085114531318 [Accessed 3 April 2020].
Spohn, C. (2000). Thirty years of sentencing reform: The quest for a racially-neutral sentencing process. In J. Horney (Ed.), Policies, processes, and decisions of the criminal justice system : Vol. 3 (pp. 427-501). Washington, DC: National Institute of Justice.
SPOHN, C. and HOLLERAN, D., (2000). THE IMPRISONMENT PENALTY PAID BY YOUNG, UNEMPLOYED BLACK AND HISPANIC MALE OFFENDERS. Criminology , [online] 38(1), pp.281-306. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1745-9125.2000.tb00891.x [Accessed 3 April 2020].
Spohn, C. and Spears, J., (1997). Gender and Case Processing Decisions. Women & Criminal Justice 8(3), pp.29-59. Available at: https://www.tandfonline.com/doi/abs/10.1300/J012v08n03_02 [Accessed 3 April 2020].
Steffensmeier, D. and Demuth, S., (2006). Does Gender Modify the Effects of Race–ethnicity on Criminal Sanctioning? Sentences for Male and Female White, Black, and Hispanic Defendants. Journal of Quantitative Criminology, [online] 22(3), pp.241-261. Available at: https://link.springer.com/article/10.1007/s10940-006-9010-2 [Accessed 3 April 2020].
STEFFENSMEIER, D., KRAMER, J. and STREIFEL, C., (1993). GENDER AND IMPRISONMENT DECISIONS. Criminology , [online] 31(3), pp.411-446. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1745-9125.1993.tb01136.x [Accessed 3 April 2020].
STEFFENSMEIER, D., ULMER, J. and KRAMER, J., (1998). THE INTERACTION OF RACE, GENDER, AND AGE IN CRIMINAL SENTENCING: THE PUNISHMENT COST OF BEING YOUNG, BLACK, AND MALE. Criminology , [online] 36(4), pp.763-798. Available at: https://onlinelibrary.wiley.com/doi/abs/10.1111/j.1745-9125.1998.tb01265.x [Accessed 3 April 2020].
Tasca, M., Cho, A., Spohn, C. and Rodriguez, N., (2018). The Role of Parental Status and Involvement in Sentence Length Decisions: A Comparison of Men and Women Sentenced to Prison. Crime & Delinquency , [online] 65(14), pp.1899-1924. Available at: https://journals.sagepub.com/doi/full/10.1177/0011128718811929 [Accessed 3 April 2020].
Tillyer, R., Hartley, R. and Ward, J., (2015). Differential Treatment of Female Defendants. Criminal Justice and Behaviour , [online] 42(7), pp.703-721. Available at: https://journals.sagepub.com/doi/full/10.1177/0093854814560624 [Accessed 3 April 2020].
Tracy, P., Kempf-Leonard, K. and Abramoske-James, S., (2009). Gender Differences in Delinquency and Juvenile Justice Processing. Crime & Delinquency , [online] 55(2), pp.171-215. Available at: https://journals.sagepub.com/doi/abs/10.1177/0011128708330628 [Accessed 3 April 2020].
Warren, P., Chiricos, T. and Bales, W., (2011). The Imprisonment Penalty for Young Black and Hispanic Males. Journal of Research in Crime and Delinquency , [online] 49(1), pp.56-80. Available at: https://journals.sagepub.com/doi/full/10.1177/0022427810397945 [Accessed 3 April 2020].
Zatz, M. S. (2000). The convergence of race, ethnicity, gender, and class on court decision-making: Looking toward the 21st century. In J. Horney (Ed.), Policies, processes, and decisions of the criminal justice system : Vol. 3 (pp. 503-552). Washington, DC: National Institute of Justice.
Your email address will not be published. Required fields are marked *
Save my name, email, and website in this browser for the next time I comment.
Gender research topics are very popular across the world. Students in different academic disciplines are often asked to write papers and essays about these topics. Some of the disciplines that require learners to write about gender topics include:
Sociology Psychology Gender studies Business studies
When pursuing higher education in these disciplines, learners can choose what to write about from a wide range of gender issues topics. However, the wide range of issues that learners can research and write about when it comes to gender makes choosing what to write about difficult. Here is a list of the top 100 gender and sexuality topics that students can consider.
Do you like the idea of writing about something controversial? If yes, this category has some of the best gender topics to write about. They touch on issues like gender stereotypes and issues that are generally associated with members of a specific gender. Here are some of the best controversial gender topics that you can write about.
This category also has some of the best gender debate topics. However, learners should be keen to pick topics they are interested in. This will enable them to ensure that they enjoy the research and writing process.
Gender-based inequality is witnessed almost every day. As such, most learners are conversant with gender inequality research paper topics. However, it’s crucial to pick topics that are devoid of discrimination of members of a specific gender. Here are examples of gender inequality essay topics.
After choosing the gender inequality essay topics they like, students should research, brainstorm ideas, and come up with an outline before they start writing. This will ensure that their essays have engaging introductions and convincing bodies, as well as, strong conclusions.
This category has ideas that slightly differ from gender equality topics. That’s because equality or lack of it can be measured by considering the representation of both genders in different roles. As such, some gender roles essay topics might not require tiresome and extensive research to write about. Nevertheless, learners should take time to gather the necessary information required to write about these topics. Here are some of the best gender topics for discussion when it comes to the roles played by men and women in society.
The list of gender essay topics that are based on the roles of each sex can be quite extensive. Nevertheless, students should be keen to pick interesting gender topics in this category.
If you want to write a paper or essay on an important gender issue, this category has the best ideas for you. Students can write about different issues that affect individuals of different genders. For instance, this category can include gender wage gap essay topics. Wage variation is a common issue that affects women in different countries. Some of the best gender research paper topics in this category include:
This category has some of the most interesting women’s and gender studies paper topics. However, most of them require extensive research to come up with hard facts and figures that will make academic papers or essays more interesting.
Students in high schools and colleges can pick what to write about from a wide range of gender studies research topics. However, some gender studies topics might not be ideal for some learners based on the given essay prompt. Therefore, make sure that you have understood what the educator wants you to write about before you pick a topic. Our experts can help you choose a good thesis topic . Choosing the right gender studies topics enables learners to answer the asked questions properly. This impresses educators to award them top grades.
Your email address will not be published. Required fields are marked *
Comment * Error message
Name * Error message
Email * Error message
Save my name, email, and website in this browser for the next time I comment.
As Putin continues killing civilians, bombing kindergartens, and threatening WWIII, Ukraine fights for the world's peaceful future.
Ukraine Live Updates
🏆 best gender differences topic ideas & essay examples, 💡 most interesting gender differences topics to write about, 📌 simple & easy gender differences essay titles, 👍 good essay topics on gender differences.
IvyPanda. (2024, February 24). 101 Gender Differences Essay Topic Ideas & Examples. https://ivypanda.com/essays/topic/gender-differences-essay-topics/
"101 Gender Differences Essay Topic Ideas & Examples." IvyPanda , 24 Feb. 2024, ivypanda.com/essays/topic/gender-differences-essay-topics/.
IvyPanda . (2024) '101 Gender Differences Essay Topic Ideas & Examples'. 24 February.
IvyPanda . 2024. "101 Gender Differences Essay Topic Ideas & Examples." February 24, 2024. https://ivypanda.com/essays/topic/gender-differences-essay-topics/.
1. IvyPanda . "101 Gender Differences Essay Topic Ideas & Examples." February 24, 2024. https://ivypanda.com/essays/topic/gender-differences-essay-topics/.
Bibliography
IvyPanda . "101 Gender Differences Essay Topic Ideas & Examples." February 24, 2024. https://ivypanda.com/essays/topic/gender-differences-essay-topics/.
IMAGES
COMMENTS
Gender Differences in Leadership 2 I. Why Women and Leadership is a Hot Topic Today There are many historical events that have set the stage to analyze gender differences between men and women in the workplace. Whether these gender differences exist in the way in which they communicate, influence, or lead, men and women have
This thesis will examine gender differences in communication styles and their influence on workplace communication and the practice of public relations in the United States, specifically ...
This thesis points toward interpretation of Kaiser (2019), which states that both cultural individualism and pathogen levels confound the gender equality paradox in personality ... Gender differences in mathematics anxiety and the relation to mathematics performance while controlling for test anxiety. Behav. Brain Funct. 8:33. doi: ...
definitions of prejudice (Hughes & Tuch, 2003) and that overall there is more evidence for gender similarities than differences (Hyde, 1984). This thesis therefore begins with the report of a meta-analysis (Study 1) that provides a comprehensive review of the literature assessing whether gender differences exist in the
Gender differences and biases based upon these have been and are an indivisible facet of the global cultural ethos. Patriarchal social design and traditional practices titled in disfavour of women ...
Cross-border students' academic performance draws people's attention, whereas perceived cultural distance might influence their performance with gender difference. Based on role theory, men and women present different roles in society, and women are good at perceptual, cognitive aspects, making them more sensitive to cultural distance. Finding shows that the negative moderation role of ...
thesis is re-using some of this data. My thesis addresses teachers coping skills with disruptive behavior along with a focus on whether or not there is a gender difference between female and male students when it comes to disruptive behavior in the classroom context. Problem statement My master's thesis addresses the following problem statement:
CMC Senior Theses. 513. This paper lays the historical background for why women and leadership is an important topic today in order to discuss gender differences in communication styles, influence tactics, and leadership styles. This paper also outlines barriers women face when trying to attain and succeed in leadership positions.
For many years the aphorism that 'men die quicker but women are sicker' was presumed to encapsulate gender differences in health. The first paper presented in the thesis challenged this dominant paradigm. First, an analysis of morbidity in two British data sets showed more similarity than difference between men and women.
psychological resilience levels of individuals in terms of gender differences. This study includes a review of the. relevant literature, the stages of meta-analysis research, analysis findings got ...
There is a particular dearth of true gender-difference studies; in fact, literature searches on "gender differences" largely turn up studies on sex differences that have used the term "gender" to refer to biologic sex. The historic neglect of women in clinical studies and the sex of animals and cells in basic research should be kept in ...
"Achieve gender equality and empower all women and girls" is essential to reduce gender disparity and improve the status of women. But it remains a challenge to narrow gender differences and improve gender equality in academic research. In this paper, we propose that the impact of articles is lower and writing style of articles is less positive when the article's first author is female ...
Gender differences in general intelligence are negligible, although men are typically found to show more variance in scores than women (Deary et al., 2007; van der Sluis et al., 2008). However, our findings are consistent with the finding that men show higher self-estimates of intelligence than women, across cultures (von Stumm et al., 2009 ...
Bornstein, a trans woman who finds gender deeply problematic, sums up this resistance nicely in her 1995 book title, Gender Outlaw: On Men, Women and the Rest of Us1. It is commonly argued that biological differences between males and females determine gender by causing enduring differences in capabilities and dispositions.
Introduction. Gender differences in academic performance have engaged the attention of scholars for some time now (see Hung et al. 2012; Jackman and Morrain-Webb 2019; Morita et al. 2016; Sparks-Wallace 2007).Indeed, males in the past have had a higher enrolment in STEM subjects at the tertiary levels of education compared to females, and their overall academic performance was rated higher ...
and special education teacher‟s earnings. The difference between male and female earnings in teaching elementary and high school is significant at the 0.001 level. Elementary school teachers. have a pay-gap between male and female teachers of 91.1% with 79.8% of elementary school.
The research on gender in leadership covers six issues related to the relationship between leadership and gender, namely the number of males and females in leadership positions; behaviour patterns ...
reported in Table 2, did not indicate that there was a significant difference between gender on. test anxiety, with F (1, 288)=0.586, p>0.05. Due to previous indication that age is a possible variable related to test anxiety, a multiple. regression was performed to account for this suspected variability.
chivalry thesis dates to the 1970s and is premised on cultural stereotypes about gender, while the more recent focal concerns theory looks specifically at the dynamics of judicial decision making. The chivalry thesis posits that gendered stereotypes about both women and men influence sentencing outcomes according to the sex of offenders.
3.3. Theories predicting that gender equality is linked with wider gender differences. Drawing on gender essentialism, Charles and Bradley (2009) theorized an opposite effect—that gaps might increase with greater gender equality. They posited that, even if societies are gender equal, gender stereotypes endure because of the emphasis on individualism and self-expression in these societies.
Abstract. This review focuses on various pieces of literature that surrounds the perceived differences in sentencing gender. Also, literature examining the reasons why these differences are taking place between genders, and theories that could be applied when explaining these differences, will be scrutinised in order to give an indication as to whether a reason for gender differences in ...
100 Gender Research Topics For Academic Papers. Gender research topics are very popular across the world. Students in different academic disciplines are often asked to write papers and essays about these topics. Some of the disciplines that require learners to write about gender topics include: Sociology. Psychology.
Get a custom essay on Gender difference. The main difference comes from the understanding and thinking of each gender. For the most part, to say generally, men are more power hungry and demand to be in a position of authority. This has come from a long history of male domination and men have gotten used to being in control and charge.
Evolutionary Explanation for Sex and Gender Differences. The evolutionary explanation indicates that men's uncertainty on their offspring is disappointing. The theory claims that men's disappointment emanates from sexual disloyalty advanced by women. Person's Individuality, Gender Differences and the Triune Brain.