Qualitative vs. Quantitative Research: Comparing the Methods and Strategies for Education Research

A woman sits at a library table with stacks of books and a laptop.

No matter the field of study, all research can be divided into two distinct methodologies: qualitative and quantitative research. Both methodologies offer education researchers important insights.

Education research assesses problems in policy, practices, and curriculum design, and it helps administrators identify solutions. Researchers can conduct small-scale studies to learn more about topics related to instruction or larger-scale ones to gain insight into school systems and investigate how to improve student outcomes.

Education research often relies on the quantitative methodology. Quantitative research in education provides numerical data that can prove or disprove a theory, and administrators can easily share the number-based results with other schools and districts. And while the research may speak to a relatively small sample size, educators and researchers can scale the results from quantifiable data to predict outcomes in larger student populations and groups.

Qualitative vs. Quantitative Research in Education: Definitions

Although there are many overlaps in the objectives of qualitative and quantitative research in education, researchers must understand the fundamental functions of each methodology in order to design and carry out an impactful research study. In addition, they must understand the differences that set qualitative and quantitative research apart in order to determine which methodology is better suited to specific education research topics.

Generate Hypotheses with Qualitative Research

Qualitative research focuses on thoughts, concepts, or experiences. The data collected often comes in narrative form and concentrates on unearthing insights that can lead to testable hypotheses. Educators use qualitative research in a study’s exploratory stages to uncover patterns or new angles.

Form Strong Conclusions with Quantitative Research

Quantitative research in education and other fields of inquiry is expressed in numbers and measurements. This type of research aims to find data to confirm or test a hypothesis.

Differences in Data Collection Methods

Keeping in mind the main distinction in qualitative vs. quantitative research—gathering descriptive information as opposed to numerical data—it stands to reason that there are different ways to acquire data for each research methodology. While certain approaches do overlap, the way researchers apply these collection techniques depends on their goal.

Interviews, for example, are common in both modes of research. An interview with students that features open-ended questions intended to reveal ideas and beliefs around attendance will provide qualitative data. This data may reveal a problem among students, such as a lack of access to transportation, that schools can help address.

An interview can also include questions posed to receive numerical answers. A case in point: how many days a week do students have trouble getting to school, and of those days, how often is a transportation-related issue the cause? In this example, qualitative and quantitative methodologies can lead to similar conclusions, but the research will differ in intent, design, and form.

Taking a look at behavioral observation, another common method used for both qualitative and quantitative research, qualitative data may consider a variety of factors, such as facial expressions, verbal responses, and body language.

On the other hand, a quantitative approach will create a coding scheme for certain predetermined behaviors and observe these in a quantifiable manner.

Qualitative Research Methods

  • Case Studies : Researchers conduct in-depth investigations into an individual, group, event, or community, typically gathering data through observation and interviews.
  • Focus Groups : A moderator (or researcher) guides conversation around a specific topic among a group of participants.
  • Ethnography : Researchers interact with and observe a specific societal or ethnic group in their real-life environment.
  • Interviews : Researchers ask participants questions to learn about their perspectives on a particular subject.

Quantitative Research Methods

  • Questionnaires and Surveys : Participants receive a list of questions, either closed-ended or multiple choice, which are directed around a particular topic.
  • Experiments : Researchers control and test variables to demonstrate cause-and-effect relationships.
  • Observations : Researchers look at quantifiable patterns and behavior.
  • Structured Interviews : Using a predetermined structure, researchers ask participants a fixed set of questions to acquire numerical data.

Choosing a Research Strategy

When choosing which research strategy to employ for a project or study, a number of considerations apply. One key piece of information to help determine whether to use a qualitative vs. quantitative research method is which phase of development the study is in.

For example, if a project is in its early stages and requires more research to find a testable hypothesis, qualitative research methods might prove most helpful. On the other hand, if the research team has already established a hypothesis or theory, quantitative research methods will provide data that can validate the theory or refine it for further testing.

It’s also important to understand a project’s research goals. For instance, do researchers aim to produce findings that reveal how to best encourage student engagement in math? Or is the goal to determine how many students are passing geometry? These two scenarios require distinct sets of data, which will determine the best methodology to employ.

In some situations, studies will benefit from a mixed-methods approach. Using the goals in the above example, one set of data could find the percentage of students passing geometry, which would be quantitative. The research team could also lead a focus group with the students achieving success to discuss which techniques and teaching practices they find most helpful, which would produce qualitative data.

Learn How to Put Education Research into Action

Those with an interest in learning how to harness research to develop innovative ideas to improve education systems may want to consider pursuing a doctoral degree. American University’s School of Education online offers a Doctor of Education (EdD) in Education Policy and Leadership that prepares future educators, school administrators, and other education professionals to become leaders who effect positive changes in schools. Courses such as Applied Research Methods I: Enacting Critical Research provides students with the techniques and research skills needed to begin conducting research exploring new ways to enhance education. Learn more about American’ University’s EdD in Education Policy and Leadership .

What’s the Difference Between Educational Equity and Equality?

EdD vs. PhD in Education: Requirements, Career Outlook, and Salary

Top Education Technology Jobs for Doctorate in Education Graduates

American University, EdD in Education Policy and Leadership

Edutopia, “2019 Education Research Highlights”

Formplus, “Qualitative vs. Quantitative Data: 15 Key Differences and Similarities”

iMotion, “Qualitative vs. Quantitative Research: What Is What?”

Scribbr, “Qualitative vs. Quantitative Research”

Simply Psychology, “What’s the Difference Between Quantitative and Qualitative Research?”

Typeform, “A Simple Guide to Qualitative and Quantitative Research”

Request Information

Chatbot avatar

AU Program Helper

This AI chatbot provides automated responses, which may not always be accurate. By continuing with this conversation, you agree that the contents of this chat session may be transcribed and retained. You also consent that this chat session and your interactions, including cookie usage, are subject to our  privacy policy .

Qualitative vs Quantitative Research Methods & Data Analysis

Saul McLeod, PhD

Editor-in-Chief for Simply Psychology

BSc (Hons) Psychology, MRes, PhD, University of Manchester

Saul McLeod, PhD., is a qualified psychology teacher with over 18 years of experience in further and higher education. He has been published in peer-reviewed journals, including the Journal of Clinical Psychology.

Learn about our Editorial Process

Olivia Guy-Evans, MSc

Associate Editor for Simply Psychology

BSc (Hons) Psychology, MSc Psychology of Education

Olivia Guy-Evans is a writer and associate editor for Simply Psychology. She has previously worked in healthcare and educational sectors.

The main difference between quantitative and qualitative research is the type of data they collect and analyze.

Quantitative data is information about quantities, and therefore numbers, and qualitative data is descriptive, and regards phenomenon which can be observed but not measured, such as language.
  • Quantitative research collects numerical data and analyzes it using statistical methods. The aim is to produce objective, empirical data that can be measured and expressed numerically. Quantitative research is often used to test hypotheses, identify patterns, and make predictions.
  • Qualitative research gathers non-numerical data (words, images, sounds) to explore subjective experiences and attitudes, often via observation and interviews. It aims to produce detailed descriptions and uncover new insights about the studied phenomenon.

On This Page:

What Is Qualitative Research?

Qualitative research is the process of collecting, analyzing, and interpreting non-numerical data, such as language. Qualitative research can be used to understand how an individual subjectively perceives and gives meaning to their social reality.

Qualitative data is non-numerical data, such as text, video, photographs, or audio recordings. This type of data can be collected using diary accounts or in-depth interviews and analyzed using grounded theory or thematic analysis.

Qualitative research is multimethod in focus, involving an interpretive, naturalistic approach to its subject matter. This means that qualitative researchers study things in their natural settings, attempting to make sense of, or interpret, phenomena in terms of the meanings people bring to them. Denzin and Lincoln (1994, p. 2)

Interest in qualitative data came about as the result of the dissatisfaction of some psychologists (e.g., Carl Rogers) with the scientific study of psychologists such as behaviorists (e.g., Skinner ).

Since psychologists study people, the traditional approach to science is not seen as an appropriate way of carrying out research since it fails to capture the totality of human experience and the essence of being human.  Exploring participants’ experiences is known as a phenomenological approach (re: Humanism ).

Qualitative research is primarily concerned with meaning, subjectivity, and lived experience. The goal is to understand the quality and texture of people’s experiences, how they make sense of them, and the implications for their lives.

Qualitative research aims to understand the social reality of individuals, groups, and cultures as nearly as possible as participants feel or live it. Thus, people and groups are studied in their natural setting.

Some examples of qualitative research questions are provided, such as what an experience feels like, how people talk about something, how they make sense of an experience, and how events unfold for people.

Research following a qualitative approach is exploratory and seeks to explain ‘how’ and ‘why’ a particular phenomenon, or behavior, operates as it does in a particular context. It can be used to generate hypotheses and theories from the data.

Qualitative Methods

There are different types of qualitative research methods, including diary accounts, in-depth interviews , documents, focus groups , case study research , and ethnography .

The results of qualitative methods provide a deep understanding of how people perceive their social realities and in consequence, how they act within the social world.

The researcher has several methods for collecting empirical materials, ranging from the interview to direct observation, to the analysis of artifacts, documents, and cultural records, to the use of visual materials or personal experience. Denzin and Lincoln (1994, p. 14)

Here are some examples of qualitative data:

Interview transcripts : Verbatim records of what participants said during an interview or focus group. They allow researchers to identify common themes and patterns, and draw conclusions based on the data. Interview transcripts can also be useful in providing direct quotes and examples to support research findings.

Observations : The researcher typically takes detailed notes on what they observe, including any contextual information, nonverbal cues, or other relevant details. The resulting observational data can be analyzed to gain insights into social phenomena, such as human behavior, social interactions, and cultural practices.

Unstructured interviews : generate qualitative data through the use of open questions.  This allows the respondent to talk in some depth, choosing their own words.  This helps the researcher develop a real sense of a person’s understanding of a situation.

Diaries or journals : Written accounts of personal experiences or reflections.

Notice that qualitative data could be much more than just words or text. Photographs, videos, sound recordings, and so on, can be considered qualitative data. Visual data can be used to understand behaviors, environments, and social interactions.

Qualitative Data Analysis

Qualitative research is endlessly creative and interpretive. The researcher does not just leave the field with mountains of empirical data and then easily write up his or her findings.

Qualitative interpretations are constructed, and various techniques can be used to make sense of the data, such as content analysis, grounded theory (Glaser & Strauss, 1967), thematic analysis (Braun & Clarke, 2006), or discourse analysis .

For example, thematic analysis is a qualitative approach that involves identifying implicit or explicit ideas within the data. Themes will often emerge once the data has been coded .

RESEARCH THEMATICANALYSISMETHOD

Key Features

  • Events can be understood adequately only if they are seen in context. Therefore, a qualitative researcher immerses her/himself in the field, in natural surroundings. The contexts of inquiry are not contrived; they are natural. Nothing is predefined or taken for granted.
  • Qualitative researchers want those who are studied to speak for themselves, to provide their perspectives in words and other actions. Therefore, qualitative research is an interactive process in which the persons studied teach the researcher about their lives.
  • The qualitative researcher is an integral part of the data; without the active participation of the researcher, no data exists.
  • The study’s design evolves during the research and can be adjusted or changed as it progresses. For the qualitative researcher, there is no single reality. It is subjective and exists only in reference to the observer.
  • The theory is data-driven and emerges as part of the research process, evolving from the data as they are collected.

Limitations of Qualitative Research

  • Because of the time and costs involved, qualitative designs do not generally draw samples from large-scale data sets.
  • The problem of adequate validity or reliability is a major criticism. Because of the subjective nature of qualitative data and its origin in single contexts, it is difficult to apply conventional standards of reliability and validity. For example, because of the central role played by the researcher in the generation of data, it is not possible to replicate qualitative studies.
  • Also, contexts, situations, events, conditions, and interactions cannot be replicated to any extent, nor can generalizations be made to a wider context than the one studied with confidence.
  • The time required for data collection, analysis, and interpretation is lengthy. Analysis of qualitative data is difficult, and expert knowledge of an area is necessary to interpret qualitative data. Great care must be taken when doing so, for example, looking for mental illness symptoms.

Advantages of Qualitative Research

  • Because of close researcher involvement, the researcher gains an insider’s view of the field. This allows the researcher to find issues that are often missed (such as subtleties and complexities) by the scientific, more positivistic inquiries.
  • Qualitative descriptions can be important in suggesting possible relationships, causes, effects, and dynamic processes.
  • Qualitative analysis allows for ambiguities/contradictions in the data, which reflect social reality (Denscombe, 2010).
  • Qualitative research uses a descriptive, narrative style; this research might be of particular benefit to the practitioner as she or he could turn to qualitative reports to examine forms of knowledge that might otherwise be unavailable, thereby gaining new insight.

What Is Quantitative Research?

Quantitative research involves the process of objectively collecting and analyzing numerical data to describe, predict, or control variables of interest.

The goals of quantitative research are to test causal relationships between variables , make predictions, and generalize results to wider populations.

Quantitative researchers aim to establish general laws of behavior and phenomenon across different settings/contexts. Research is used to test a theory and ultimately support or reject it.

Quantitative Methods

Experiments typically yield quantitative data, as they are concerned with measuring things.  However, other research methods, such as controlled observations and questionnaires , can produce both quantitative information.

For example, a rating scale or closed questions on a questionnaire would generate quantitative data as these produce either numerical data or data that can be put into categories (e.g., “yes,” “no” answers).

Experimental methods limit how research participants react to and express appropriate social behavior.

Findings are, therefore, likely to be context-bound and simply a reflection of the assumptions that the researcher brings to the investigation.

There are numerous examples of quantitative data in psychological research, including mental health. Here are a few examples:

Another example is the Experience in Close Relationships Scale (ECR), a self-report questionnaire widely used to assess adult attachment styles .

The ECR provides quantitative data that can be used to assess attachment styles and predict relationship outcomes.

Neuroimaging data : Neuroimaging techniques, such as MRI and fMRI, provide quantitative data on brain structure and function.

This data can be analyzed to identify brain regions involved in specific mental processes or disorders.

For example, the Beck Depression Inventory (BDI) is a clinician-administered questionnaire widely used to assess the severity of depressive symptoms in individuals.

The BDI consists of 21 questions, each scored on a scale of 0 to 3, with higher scores indicating more severe depressive symptoms. 

Quantitative Data Analysis

Statistics help us turn quantitative data into useful information to help with decision-making. We can use statistics to summarize our data, describing patterns, relationships, and connections. Statistics can be descriptive or inferential.

Descriptive statistics help us to summarize our data. In contrast, inferential statistics are used to identify statistically significant differences between groups of data (such as intervention and control groups in a randomized control study).

  • Quantitative researchers try to control extraneous variables by conducting their studies in the lab.
  • The research aims for objectivity (i.e., without bias) and is separated from the data.
  • The design of the study is determined before it begins.
  • For the quantitative researcher, the reality is objective, exists separately from the researcher, and can be seen by anyone.
  • Research is used to test a theory and ultimately support or reject it.

Limitations of Quantitative Research

  • Context: Quantitative experiments do not take place in natural settings. In addition, they do not allow participants to explain their choices or the meaning of the questions they may have for those participants (Carr, 1994).
  • Researcher expertise: Poor knowledge of the application of statistical analysis may negatively affect analysis and subsequent interpretation (Black, 1999).
  • Variability of data quantity: Large sample sizes are needed for more accurate analysis. Small-scale quantitative studies may be less reliable because of the low quantity of data (Denscombe, 2010). This also affects the ability to generalize study findings to wider populations.
  • Confirmation bias: The researcher might miss observing phenomena because of focus on theory or hypothesis testing rather than on the theory of hypothesis generation.

Advantages of Quantitative Research

  • Scientific objectivity: Quantitative data can be interpreted with statistical analysis, and since statistics are based on the principles of mathematics, the quantitative approach is viewed as scientifically objective and rational (Carr, 1994; Denscombe, 2010).
  • Useful for testing and validating already constructed theories.
  • Rapid analysis: Sophisticated software removes much of the need for prolonged data analysis, especially with large volumes of data involved (Antonius, 2003).
  • Replication: Quantitative data is based on measured values and can be checked by others because numerical data is less open to ambiguities of interpretation.
  • Hypotheses can also be tested because of statistical analysis (Antonius, 2003).

Antonius, R. (2003). Interpreting quantitative data with SPSS . Sage.

Black, T. R. (1999). Doing quantitative research in the social sciences: An integrated approach to research design, measurement and statistics . Sage.

Braun, V. & Clarke, V. (2006). Using thematic analysis in psychology . Qualitative Research in Psychology , 3, 77–101.

Carr, L. T. (1994). The strengths and weaknesses of quantitative and qualitative research : what method for nursing? Journal of advanced nursing, 20(4) , 716-721.

Denscombe, M. (2010). The Good Research Guide: for small-scale social research. McGraw Hill.

Denzin, N., & Lincoln. Y. (1994). Handbook of Qualitative Research. Thousand Oaks, CA, US: Sage Publications Inc.

Glaser, B. G., Strauss, A. L., & Strutzel, E. (1968). The discovery of grounded theory; strategies for qualitative research. Nursing research, 17(4) , 364.

Minichiello, V. (1990). In-Depth Interviewing: Researching People. Longman Cheshire.

Punch, K. (1998). Introduction to Social Research: Quantitative and Qualitative Approaches. London: Sage

Further Information

  • Mixed methods research
  • Designing qualitative research
  • Methods of data collection and analysis
  • Introduction to quantitative and qualitative research
  • Checklists for improving rigour in qualitative research: a case of the tail wagging the dog?
  • Qualitative research in health care: Analysing qualitative data
  • Qualitative data analysis: the framework approach
  • Using the framework method for the analysis of
  • Qualitative data in multi-disciplinary health research
  • Content Analysis
  • Grounded Theory
  • Thematic Analysis

Print Friendly, PDF & Email

Have a language expert improve your writing

Run a free plagiarism check in 10 minutes, generate accurate citations for free.

  • Knowledge Base

Methodology

  • Qualitative vs. Quantitative Research | Differences, Examples & Methods

Qualitative vs. Quantitative Research | Differences, Examples & Methods

Published on April 12, 2019 by Raimo Streefkerk . Revised on June 22, 2023.

When collecting and analyzing data, quantitative research deals with numbers and statistics, while qualitative research deals with words and meanings. Both are important for gaining different kinds of knowledge.

Common quantitative methods include experiments, observations recorded as numbers, and surveys with closed-ended questions.

Quantitative research is at risk for research biases including information bias , omitted variable bias , sampling bias , or selection bias . Qualitative research Qualitative research is expressed in words . It is used to understand concepts, thoughts or experiences. This type of research enables you to gather in-depth insights on topics that are not well understood.

Common qualitative methods include interviews with open-ended questions, observations described in words, and literature reviews that explore concepts and theories.

Table of contents

The differences between quantitative and qualitative research, data collection methods, when to use qualitative vs. quantitative research, how to analyze qualitative and quantitative data, other interesting articles, frequently asked questions about qualitative and quantitative research.

Quantitative and qualitative research use different research methods to collect and analyze data, and they allow you to answer different kinds of research questions.

Qualitative vs. quantitative research

Quantitative and qualitative data can be collected using various methods. It is important to use a data collection method that will help answer your research question(s).

Many data collection methods can be either qualitative or quantitative. For example, in surveys, observational studies or case studies , your data can be represented as numbers (e.g., using rating scales or counting frequencies) or as words (e.g., with open-ended questions or descriptions of what you observe).

However, some methods are more commonly used in one type or the other.

Quantitative data collection methods

  • Surveys :  List of closed or multiple choice questions that is distributed to a sample (online, in person, or over the phone).
  • Experiments : Situation in which different types of variables are controlled and manipulated to establish cause-and-effect relationships.
  • Observations : Observing subjects in a natural environment where variables can’t be controlled.

Qualitative data collection methods

  • Interviews : Asking open-ended questions verbally to respondents.
  • Focus groups : Discussion among a group of people about a topic to gather opinions that can be used for further research.
  • Ethnography : Participating in a community or organization for an extended period of time to closely observe culture and behavior.
  • Literature review : Survey of published works by other authors.

A rule of thumb for deciding whether to use qualitative or quantitative data is:

  • Use quantitative research if you want to confirm or test something (a theory or hypothesis )
  • Use qualitative research if you want to understand something (concepts, thoughts, experiences)

For most research topics you can choose a qualitative, quantitative or mixed methods approach . Which type you choose depends on, among other things, whether you’re taking an inductive vs. deductive research approach ; your research question(s) ; whether you’re doing experimental , correlational , or descriptive research ; and practical considerations such as time, money, availability of data, and access to respondents.

Quantitative research approach

You survey 300 students at your university and ask them questions such as: “on a scale from 1-5, how satisfied are your with your professors?”

You can perform statistical analysis on the data and draw conclusions such as: “on average students rated their professors 4.4”.

Qualitative research approach

You conduct in-depth interviews with 15 students and ask them open-ended questions such as: “How satisfied are you with your studies?”, “What is the most positive aspect of your study program?” and “What can be done to improve the study program?”

Based on the answers you get you can ask follow-up questions to clarify things. You transcribe all interviews using transcription software and try to find commonalities and patterns.

Mixed methods approach

You conduct interviews to find out how satisfied students are with their studies. Through open-ended questions you learn things you never thought about before and gain new insights. Later, you use a survey to test these insights on a larger scale.

It’s also possible to start with a survey to find out the overall trends, followed by interviews to better understand the reasons behind the trends.

Qualitative or quantitative data by itself can’t prove or demonstrate anything, but has to be analyzed to show its meaning in relation to the research questions. The method of analysis differs for each type of data.

Analyzing quantitative data

Quantitative data is based on numbers. Simple math or more advanced statistical analysis is used to discover commonalities or patterns in the data. The results are often reported in graphs and tables.

Applications such as Excel, SPSS, or R can be used to calculate things like:

  • Average scores ( means )
  • The number of times a particular answer was given
  • The correlation or causation between two or more variables
  • The reliability and validity of the results

Analyzing qualitative data

Qualitative data is more difficult to analyze than quantitative data. It consists of text, images or videos instead of numbers.

Some common approaches to analyzing qualitative data include:

  • Qualitative content analysis : Tracking the occurrence, position and meaning of words or phrases
  • Thematic analysis : Closely examining the data to identify the main themes and patterns
  • Discourse analysis : Studying how communication works in social contexts

If you want to know more about statistics , methodology , or research bias , make sure to check out some of our other articles with explanations and examples.

  • Chi square goodness of fit test
  • Degrees of freedom
  • Null hypothesis
  • Discourse analysis
  • Control groups
  • Mixed methods research
  • Non-probability sampling
  • Quantitative research
  • Inclusion and exclusion criteria

Research bias

  • Rosenthal effect
  • Implicit bias
  • Cognitive bias
  • Selection bias
  • Negativity bias
  • Status quo bias

Quantitative research deals with numbers and statistics, while qualitative research deals with words and meanings.

Quantitative methods allow you to systematically measure variables and test hypotheses . Qualitative methods allow you to explore concepts and experiences in more detail.

In mixed methods research , you use both qualitative and quantitative data collection and analysis methods to answer your research question .

The research methods you use depend on the type of data you need to answer your research question .

  • If you want to measure something or test a hypothesis , use quantitative methods . If you want to explore ideas, thoughts and meanings, use qualitative methods .
  • If you want to analyze a large amount of readily-available data, use secondary data. If you want data specific to your purposes with control over how it is generated, collect primary data.
  • If you want to establish cause-and-effect relationships between variables , use experimental methods. If you want to understand the characteristics of a research subject, use descriptive methods.

Data collection is the systematic process by which observations or measurements are gathered in research. It is used in many different contexts by academics, governments, businesses, and other organizations.

There are various approaches to qualitative data analysis , but they all share five steps in common:

  • Prepare and organize your data.
  • Review and explore your data.
  • Develop a data coding system.
  • Assign codes to the data.
  • Identify recurring themes.

The specifics of each step depend on the focus of the analysis. Some common approaches include textual analysis , thematic analysis , and discourse analysis .

A research project is an academic, scientific, or professional undertaking to answer a research question . Research projects can take many forms, such as qualitative or quantitative , descriptive , longitudinal , experimental , or correlational . What kind of research approach you choose will depend on your topic.

Cite this Scribbr article

If you want to cite this source, you can copy and paste the citation or click the “Cite this Scribbr article” button to automatically add the citation to our free Citation Generator.

Streefkerk, R. (2023, June 22). Qualitative vs. Quantitative Research | Differences, Examples & Methods. Scribbr. Retrieved August 29, 2024, from https://www.scribbr.com/methodology/qualitative-quantitative-research/

Is this article helpful?

Raimo Streefkerk

Raimo Streefkerk

Other students also liked, what is quantitative research | definition, uses & methods, what is qualitative research | methods & examples, mixed methods research | definition, guide & examples, get unlimited documents corrected.

✔ Free APA citation check included ✔ Unlimited document corrections ✔ Specialized in correcting academic texts

  • MS in the Learning Sciences
  • Tuition & Financial Aid

SMU Simmons School of Education & Human Development

Qualitative vs. quantitative data analysis: How do they differ?

Educator presenting data to colleagues

Learning analytics have become the cornerstone for personalizing student experiences and enhancing learning outcomes. In this data-informed approach to education there are two distinct methodologies: qualitative and quantitative analytics. These methods, which are typical to data analytics in general, are crucial to the interpretation of learning behaviors and outcomes. This blog will explore the nuances that distinguish qualitative and quantitative research, while uncovering their shared roles in learning analytics, program design and instruction.

What is qualitative data?

Qualitative data is descriptive and includes information that is non numerical. Qualitative research is used to gather in-depth insights that can't be easily measured on a scale like opinions, anecdotes and emotions. In learning analytics qualitative data could include in depth interviews, text responses to a prompt, or a video of a class period. 1

What is quantitative data?

Quantitative data is information that has a numerical value. Quantitative research is conducted to gather measurable data used in statistical analysis. Researchers can use quantitative studies to identify patterns and trends. In learning analytics quantitative data could include test scores, student demographics, or amount of time spent in a lesson. 2

Key difference between qualitative and quantitative data

It's important to understand the differences between qualitative and quantitative data to both determine the appropriate research methods for studies and to gain insights that you can be confident in sharing.

Data Types and Nature

Examples of qualitative data types in learning analytics:

  • Observational data of human behavior from classroom settings such as student engagement, teacher-student interactions, and classroom dynamics
  • Textual data from open-ended survey responses, reflective journals, and written assignments
  • Feedback and discussions from focus groups or interviews
  • Content analysis from various media

Examples of quantitative data types:

  • Standardized test, assessment, and quiz scores
  • Grades and grade point averages
  • Attendance records
  • Time spent on learning tasks
  • Data gathered from learning management systems (LMS), including login frequency, online participation, and completion rates of assignments

Methods of Collection

Qualitative and quantitative research methods for data collection can occasionally seem similar so it's important to note the differences to make sure you're creating a consistent data set and will be able to reliably draw conclusions from your data.

Qualitative research methods

Because of the nature of qualitative data (complex, detailed information), the research methods used to collect it are more involved. Qualitative researchers might do the following to collect data:

  • Conduct interviews to learn about subjective experiences
  • Host focus groups to gather feedback and personal accounts
  • Observe in-person or use audio or video recordings to record nuances of human behavior in a natural setting
  • Distribute surveys with open-ended questions

Quantitative research methods

Quantitative data collection methods are more diverse and more likely to be automated because of the objective nature of the data. A quantitative researcher could employ methods such as:

  • Surveys with close-ended questions that gather numerical data like birthdates or preferences
  • Observational research and record measurable information like the number of students in a classroom
  • Automated numerical data collection like information collected on the backend of a computer system like button clicks and page views

Analysis techniques

Qualitative and quantitative data can both be very informative. However, research studies require critical thinking for productive analysis.

Qualitative data analysis methods

Analyzing qualitative data takes a number of steps. When you first get all your data in one place you can do a review and take notes of trends you think you're seeing or your initial reactions. Next, you'll want to organize all the qualitative data you've collected by assigning it categories. Your central research question will guide your data categorization whether it's by date, location, type of collection method (interview vs focus group, etc), the specific question asked or something else. Next, you'll code your data. Whereas categorizing data is focused on the method of collection, coding is the process of identifying and labeling themes within the data collected to get closer to answering your research questions. Finally comes data interpretation. To interpret the data you'll take a look at the information gathered including your coding labels and see what results are occurring frequently or what other conclusions you can make. 3

Quantitative analysis techniques

The process to analyze quantitative data can be time-consuming due to the large volume of data possible to collect. When approaching a quantitative data set, start by focusing in on the purpose of your evaluation. Without making a conclusion, determine how you will use the information gained from analysis; for example: The answers of this survey about study habits will help determine what type of exam review session will be most useful to a class. 4

Next, you need to decide who is analyzing the data and set parameters for analysis. For example, if two different researchers are evaluating survey responses that rank preferences on a scale from 1 to 5, they need to be operating with the same understanding of the rankings. You wouldn't want one researcher to classify the value of 3 to be a positive preference while the other considers it a negative preference. It's also ideal to have some type of data management system to store and organize your data, such as a spreadsheet or database. Within the database, or via an export to data analysis software, the collected data needs to be cleaned of things like responses left blank, duplicate answers from respondents, and questions that are no longer considered relevant. Finally, you can use statistical software to analyze data (or complete a manual analysis) to find patterns and summarize your findings. 4

Qualitative and quantitative research tools

From the nuanced, thematic exploration enabled by tools like NVivo and ATLAS.ti, to the statistical precision of SPSS and R for quantitative analysis, each suite of data analysis tools offers tailored functionalities that cater to the distinct natures of different data types.

Qualitative research software:

NVivo: NVivo is qualitative data analysis software that can do everything from transcribe recordings to create word clouds and evaluate uploads for different sentiments and themes. NVivo is just one tool from the company Lumivero, which offers whole suites of data processing software. 5

ATLAS.ti: Similar to NVivo, ATLAS.ti allows researchers to upload and import data from a variety of sources to be tagged and refined using machine learning and presented with visualizations and ready for insert into reports. 6

SPSS: SPSS is a statistical analysis tool for quantitative research, appreciated for its user-friendly interface and comprehensive statistical tests, which makes it ideal for educators and researchers. With SPSS researchers can manage and analyze large quantitative data sets, use advanced statistical procedures and modeling techniques, predict customer behaviors, forecast market trends and more. 7

R: R is a versatile and dynamic open-source tool for quantitative analysis. With a vast repository of packages tailored to specific statistical methods, researchers can perform anything from basic descriptive statistics to complex predictive modeling. R is especially useful for its ability to handle large datasets, making it ideal for educational institutions that generate substantial amounts of data. The programming language offers flexibility in customizing analysis and creating publication-quality visualizations to effectively communicate results. 8

Applications in Educational Research

Both quantitative and qualitative data can be employed in learning analytics to drive informed decision-making and pedagogical enhancements. In the classroom, quantitative data like standardized test scores and online course analytics create a foundation for assessing and benchmarking student performance and engagement. Qualitative insights gathered from surveys, focus group discussions, and reflective student journals offer a more nuanced understanding of learners' experiences and contextual factors influencing their education. Additionally feedback and practical engagement metrics blend these data types, providing a holistic view that informs curriculum development, instructional strategies, and personalized learning pathways. Through these varied data sets and uses, educators can piece together a more complete narrative of student success and the impacts of educational interventions.

Master Data Analysis with an M.S. in Learning Sciences From SMU

Whether it is the detailed narratives unearthed through qualitative data or the informative patterns derived from quantitative analysis, both qualitative and quantitative data can provide crucial information for educators and researchers to better understand and improve learning. Dive deeper into the art and science of learning analytics with SMU's online Master of Science in the Learning Sciences program . At SMU, innovation and inquiry converge to empower the next generation of educators and researchers. Choose the Learning Analytics Specialization to learn how to harness the power of data science to illuminate learning trends, devise impactful strategies, and drive educational innovation. You could also find out how advanced technologies like augmented reality (AR), virtual reality (VR), and artificial intelligence (AI) can revolutionize education, and develop the insight to apply embodied cognition principles to enhance learning experiences in the Learning and Technology Design Specialization , or choose your own electives to build a specialization unique to your interests and career goals.

For more information on our curriculum and to become part of a community where data drives discovery, visit SMU's MSLS program website or schedule a call with our admissions outreach advisors for any queries or further discussion. Take the first step towards transforming education with data today.

  • Retrieved on August 8, 2024, from nnlm.gov/guides/data-glossary/qualitative-data
  • Retrieved on August 8, 2024, from nnlm.gov/guides/data-glossary/quantitative-data
  • Retrieved on August 8, 2024, from cdc.gov/healthyyouth/evaluation/pdf/brief19.pdf
  • Retrieved on August 8, 2024, from cdc.gov/healthyyouth/evaluation/pdf/brief20.pdf
  • Retrieved on August 8, 2024, from lumivero.com/solutions/
  • Retrieved on August 8, 2024, from atlasti.com/
  • Retrieved on August 8, 2024, from ibm.com/products/spss-statistics
  • Retrieved on August 8, 2024, from cran.r-project.org/doc/manuals/r-release/R-intro.html#Introduction-and-preliminaries

Return to SMU Online Learning Sciences Blog

Southern Methodist University has engaged Everspring , a leading provider of education and technology services, to support select aspects of program delivery.

This will only take a moment

Quantitative vs. Qualitative Research Design: Understanding the Differences

qualitative vs quantitative research in education

As a future professional in the social and education landscape, research design is one of the most critical strategies that you will master to identify challenges, ask questions and form data-driven solutions to address problems specific to your industry. 

Many approaches to research design exist, and not all work in every circumstance. While all data-focused research methods are valid in their own right, certain research design methods are more appropriate for specific study objectives.

Unlock our resource to learn more about jump starting a career in research design — Research Design and Data Analysis for the Social Good .

We will discuss the differences between quantitative (numerical and statistics-focused) and qualitative (non-numerical and human-focused) research design methods so that you can determine which approach is most strategic given your specific area of graduate-level study. 

Understanding Social Phenomena: Qualitative Research Design

Qualitative research focuses on understanding a phenomenon based on human experience and individual perception. It is a non-numerical methodology relying on interpreting a process or result. Qualitative research also paves the way for uncovering other hypotheses related to social phenomena. 

In its most basic form, qualitative research is exploratory in nature and seeks to understand the subjective experience of individuals based on social reality.

Qualitative data is…

  • often used in fields related to education, sociology and anthropology; 
  • designed to arrive at conclusions regarding social phenomena; 
  • focused on data-gathering techniques like interviews, focus groups or case studies; 
  • dedicated to perpetuating a flexible, adaptive approach to data gathering;
  • known to lead professionals to deeper insights within the overall research study.

You want to use qualitative data research design if:

  • you work in a field concerned with enhancing humankind through the lens of social change;
  • your research focuses on understanding complex social trends and individual perceptions of those trends;
  • you have interests related to human development and interpersonal relationships.

Examples of Qualitative Research Design in Education

Here are just a few examples of how qualitative research design methods can impact education:

Example 1: Former educators participate in in-depth interviews to help determine why a specific school is experiencing a higher-than-average turnover rate compared to other schools in the region. These interviews help determine the types of resources that will make a difference in teacher retention. 

Example 2: Focus group discussions occur to understand the challenges that neurodivergent students experience in the classroom daily. These discussions prepare administrators, staff, teachers and parents to understand the kinds of support that will augment and improve student outcomes.

Example 3: Case studies examine the impacts of a new education policy that limits the number of teacher aids required in a special needs classroom. These findings help policymakers determine whether the new policy affects the learning outcomes of a particular class of students.

Interpreting the Numbers: Quantitative Research Design

Quantitative research tests hypotheses and measures connections between variables. It relies on insights derived from numbers — countable, measurable and statistically sound data. Quantitative research is a strategic research design used when basing critical decisions on statistical conclusions and quantifiable data.

Quantitative research provides numerical-backed quantifiable data that may approve or discount a theory or hypothesis.

Quantitative data is…

  • often used in fields related to education, data analysis and healthcare; 
  • designed to arrive at numerical, statistical conclusions based on objective facts;
  • focused on data-gathering techniques like experiments, surveys or observations;
  • dedicated to using mathematical principles to arrive at conclusions;
  • known to lead professionals to indisputable observations within the overall research study.

You want to use quantitative data research design if:

  • you work in a field concerned with analyzing data to inform decisions;
  • your research focuses on studying relationships between variables to form data-driven conclusions;
  • you have interests related to mathematics, statistical analysis and data science.

Examples of Quantitative Research Design in Education

Here are just a few examples of how quantitative research design methods may impact education:

Example 1: Researchers compile data to understand the connection between class sizes and standardized test scores. Researchers can determine if and what the relationship is between smaller, intimate class sizes and higher test scores for grade-school children using statistical and data analysis.

Example 2: Professionals conduct an experiment in which a group of high school students must complete a certain number of community service hours before graduation. Researchers compare those students to another group of students who did not complete service hours — using statistical analysis to determine if the requirement increased college acceptance rates.

Example 3: Teachers take a survey to examine an education policy that restricts the number of extracurricular activities offered at a particular academic institution. The findings help better understand the far-reaching impacts of extracurricular opportunities on academic performance.

Making the Most of Research Design Methods for Good: Vanderbilt University’s Peabody College

Vanderbilt University's Peabody College of Education and Human Development offers a variety of respected, nationally-recognized graduate programs designed with future agents of social change in mind. We foster a culture of excellence and compassion and guide you to become the best you can be — both in the classroom and beyond.

At Peabody College, you will experience

  • an inclusive, welcoming community of like-minded professionals;
  • the guidance of expert faculty with real-world industry experience;
  • opportunities for valuable, hands-on learning experiences,
  • the option of specializing depending on your specific area of interest.

Explore our monthly publication — Ideas in Action — for an inside look at how Peabody College translates discoveries into action.

Please click below to explore a few of the graduate degrees offered at Peabody College:

  • Child Studies M.Ed. — a rigorous Master of Education degree that prepares students to examine the developmental, learning and social issues concerning children and that allows students to choose from one of two tracks (the Clinical and Developmental Research Track or the Applied Professional Track).
  • Cognitive Psychology in Context M.S. — an impactful Master of Science program that emphasizes research design and statistical analysis to understand cognitive processes and real-world applications best, making it perfect for those interested in pursuing doctoral studies in cognitive science.
  • Education Policy M.P.P — an analysis-focused Master of Public Policy program designed for future leaders in education policy and practice, allowing students to specialize in either K-12 Education Policy, Higher Education Policy or Quantitative Methods in Education Policy. 
  • Quantitative Methods M.Ed. — a data-driven Master of Education degree that teaches the theory and application of quantitative analysis in behavioral, social and educational sciences.

Connect with the Community of Professionals Seeking to Enhance Humankind at Peabody College

At Peabody College, we equip you with the marketable, transferable skills needed to secure a valuable career in education and beyond. You will emerge from the graduate program of your choice ready to enhance humankind in more meaningful ways than you could have imagined.

If you want to develop the sought-after skills needed to be a force for change in the social and educational spaces, you are in the right place .

We invite you to request more information ; we will connect you with an admissions professional who can answer all your questions about choosing one of these transformative graduate degrees at Peabody College. You may also take this opportunity to review our admissions requirements and start your online application today. 

A young boy is sitting at a table with his parents.

Subscribe to the Blog

← All Posts

Media Inquiries

615-322-6397 Email

Latest Stories

7 quantitative data careers in education that can make a difference, 6 trends shaping the future of education, the rewarding outcomes of being a special education teacher, keep reading.

qualitative vs quantitative research in education

Explore Stories by Topic

  • M.Ed. Degrees
  • Research Design
  • 1-800-NAT-UNIV (628-8648)
  • Bachelor of Arts Degree in Early Childhood Education (BAECE)
  • Bachelor of Arts in Early Childhood Development with an Inspired Teaching and Learning Preliminary Multiple Subject Teaching Credential (California)
  • Bachelor of Arts in English
  • Bachelor of Arts in History
  • Master of Arts in Social Emotional Learning
  • Master of Education in Inspired Teaching and Learning with a Preliminary Multiple and Single Subject Teaching Credential and Intern Option (CA)
  • Master of Arts in Education
  • Master of Early Childhood Education
  • Education Specialist
  • Doctor of Education
  • Doctor of Philosophy in Education
  • Doctor of Education in Educational Leadership
  • Ed.D. in Organizational Innovation
  • Certificate in Online Teaching (COT) Program
  • Online Medical Coding Program
  • Building Our Team Through Community Policing
  • Inspired Teaching and Learning with a Preliminary Single Subject Teaching Credential
  • Inspired Teaching and Learning with a Preliminary Multiple Subject Teaching Credential and Internship Option (California)
  • Preliminary Administrative Services Credential (CA Option)
  • Preliminary Education Specialist Credential: Mild/Moderate with Internship Option (CA)
  • All Teaching & Education
  • Associate of Science in Business
  • Bachelor of Business Administration
  • Bachelor of Science in Healthcare Administration
  • Bachelor of Arts in Management
  • Master of Business Administration (MBA)
  • Master of Public Health (MPH)
  • Master of Science in Data Science
  • Master of Public Administration
  • Doctor of Criminal Justice
  • Doctor of Philosophy in Organizational Leadership
  • Doctor of Business Administration
  • Doctor of Philosophy in Business Administration
  • Post-Baccalaureate Certificate in Business
  • Post-Master's Certificate in Business
  • Graduate Certificate in Banking
  • Certificate in Agile Project Management
  • All Business & Marketing
  • Bachelor of Science in Nursing (BSN) (California)
  • Bachelor of Science in Nursing (BSN) Second Bachelor Degree (California)
  • Bachelor of Science in Clinical Laboratory Science
  • Bachelor of Science in Public Health
  • Master of Science in Nursing
  • Master of Science in Health Informatics
  • Master of Healthcare Administration
  • Doctor of Nurse Anesthesia Practice (DNAP)
  • Doctor of Health Administration
  • Doctor of Nursing Practice in Executive Leadership
  • LVN to RN 30 Unit Option Certificate
  • Psychiatric Mental Health Nurse Practitioner Certificate
  • Family Nurse Practitioner Certificate
  • Emergency Medical Technician Certificate
  • All Healthcare & Nursing
  • Bachelor of Arts in Psychology
  • Bachelor of Arts in Integrative Psychology
  • Bachelor of Science in Criminal Justice Administration
  • Bachelor of Arts in Sociology
  • Master of Science in Applied Behavioral Analysis Degree
  • Master of Arts Degree in Counseling Psychology
  • Master of Arts in Consciousness, Psychology, and Transformation
  • Doctor of Clinical Psychology (PsyD) Program
  • Doctor of Philosophy in Marriage and Family Therapy
  • Doctor of Philosophy in Psychology
  • Doctorate of Marriage and Family Therapy
  • Graduate Certificate in Trauma Studies
  • Post-Master's Certificate in Psychology
  • Post-Baccalaureate Certificate in Applied Behavior Analysis
  • Pupil Personnel Services Credential School Counseling (PPSC)
  • University Internship Credential Program for Pupil Personnel Services School Counseling (California Only)
  • All Social Sciences & Psychology
  • Bachelor of Science in Cybersecurity
  • Bachelor of Science in Electrical and Computer Engineering
  • Bachelor of Science in Computer Science
  • Bachelor of Science in Construction Management
  • Master of Science in Cybersecurity
  • Master of Science in Computer Science
  • Master of Science in Engineering Management
  • Doctor of Philosophy in Data Science
  • Doctor of Philosophy in Computer Science
  • Doctor of Philosophy in Technology Management
  • Doctor of Philosophy in Cybersecurity
  • All Engineering & Technology
  • Associate of Arts in General Education
  • Bachelor of Arts in Digital Media Design
  • Bachelor of Arts in General Studies
  • Master of Arts in English
  • Master of Arts in Strategic Communication
  • Foreign Credential Bridge Program
  • All Arts & Humanities
  • Graduate Certificate in Forensic and Crime Scene Investigations
  • Bachelor of Public Administration
  • Bachelor of Science in Homeland Security and Emergency Management
  • Minor in Business Law
  • Master of Criminal Justice Leadership
  • Master of Forensic Sciences
  • Master of Science in Homeland Security and Emergency Management
  • Doctor of Public Administration
  • College of Law and Public Service
  • All Criminal Justice & Public Service
  • Paralegal Specialist Certificate Corporations
  • Paralegal Specialist Certificate Criminal Law
  • Paralegal Specialist Certificate Litigation
  • Associate of Science in Paralegal Studies
  • Bachelor of Arts in Pre-Law Studies
  • Bachelor of Science in Paralegal Studies
  • Juris Doctor
  • Associate of Science in Human Biology
  • Associate of Science in General Education
  • Bachelor of Science in Biology
  • Bachelor of Science in Mathematics
  • All Science & Math
  • Program Finder
  • Undergraduate Admissions
  • Graduate Program Admissions
  • Military Admissions
  • Early College
  • Credential & Certificate Programs
  • Transfer Information
  • Speak to an Advisor
  • How to Pay for College
  • Financial Aid
  • Scholarships
  • Tuition & Fees
  • NU offers a variety of scholarships to help students reduce their financial burden while focusing on achieving their goals. Explore Scholarships
  • Colleges/Schools
  • University Leadership
  • Office of the President
  • Academies at NU
  • Course Catalog
  • Accreditation
  • Workforce and Community Education
  • President’s Circle
  • Board of Trustees
  • NU Foundation
  • Military & Veterans
  • Coast Guard
  • Space Force
  • National Guard & Reservist
  • Military Spouses & Dependents
  • Military Resources
  • NU proudly serves active duty and Veteran students from all branches of the military — at home, on base, and abroad. Military Admissions
  • Online Degrees & Programs
  • Consumer Information
  • Student Login
  • Graduation Events
  • Student Portal
  • Student Bookstore
  • Student Resources
  • Dissertation Boot Camp
  • Show your NU pride and shop our online store for the latest and greatest NU apparel and accessories! Shop Now
  • Request Info
  • Our Programs

What Is Qualitative vs. Quantitative Study?

Bachelor of Science in Clinical Laboratory Science Program Page

Qualitative research focuses on understanding phenomena through detailed, narrative data. It explores the “how” and “why” of human behavior, using methods like interviews, observations, and content analysis. In contrast, quantitative research is numeric and objective, aiming to quantify variables and analyze statistical relationships. It addresses the “when” and “where,” utilizing tools like surveys, experiments, and statistical models to collect and analyze numerical data.

In This Article:

What is qualitative research, what is quantitative research.

  • How Do Qualitative and Quantitative Research Differ?

What’s the Difference Between a Qualitative and Quantitative Study?

Analyzing qualitative and quantitative data, when to use qualitative or quantitative research, develop your research skills at national university.

Qualitative and quantitative data are broad categories covering many research approaches and methods. While both share the primary aim of knowledge acquisition, quantitative research is numeric and objective, seeking to answer questions like when or where. On the other hand, qualitative research is concerned with subjective phenomena that can’t be numerically measured, like how different people experience grief.

Having a firm grounding in qualitative and quantitative research methodologies will become especially important once you begin work on your dissertation or thesis toward the end of your academic program. At that point, you’ll need to decide which approach best aligns with your research question, a process that involves working closely with your Dissertation Chair.

Keep reading to learn more about the difference between quantitative vs. qualitative research, including what research techniques they involve, how they approach the task of data analysis, and some strengths — and limitations — of each approach. We’ll also briefly examine mixed-method research, which incorporates elements of both methodologies.

Qualitative research differs from quantitative research in its objectives, techniques, and design. Qualitative research aims to gain insights into phenomena, groups, or experiences that cannot be objectively measured or quantified using mathematics. Instead of seeking to uncover precise answers or statistics in a controlled environment like quantitative research, qualitative research is more exploratory, drawing upon data sources such as photographs, journal entries, video footage, and interviews.

These features stand in stark contrast to quantitative research, as we’ll see throughout the remainder of this article.

Quantitative research tackles questions from different angles compared to qualitative research. Instead of probing for subjective meaning by asking exploratory “how?” and “why?” questions, quantitative research provides precise causal explanations that can be measured and communicated mathematically. While qualitative researchers might visit subjects in their homes or otherwise in the field, quantitative research is usually conducted in a controlled environment. Instead of gaining insight or understanding into a subjective, context-dependent issue, as is the case with qualitative research, the goal is instead to obtain objective information, such as determining the best time to undergo a specific medical procedure.

qualitative vs quantitative research in education

How Does Qualitative and Quantitative Research Differ?

How are the approaches of quantitative and qualitative research different?

In qualitative studies, data is usually gathered in the field from smaller sample sizes, which means researchers might personally visit participants in their own homes or other environments. Once the research is completed, the researcher must evaluate and make sense of the data in its context, looking for trends or patterns from which new theories, concepts, narratives, or hypotheses can be generated.

Quantitative research is typically carried out via tools (such as questionnaires) instead of by people (such as a researcher asking interview questions). Another significant difference is that, in qualitative studies, researchers must interpret the data to build hypotheses. In a quantitative analysis, the researcher sets out to test a hypothesis.

Bachelor of Science in Allied Health Program Page

Both qualitative and quantitative studies are subject to rigorous quality standards. However, the research techniques utilized in each type of study differ, as do the questions and issues they hope to address or resolve. In quantitative studies, researchers tend to follow more rigid structures to test the links or relationships between different variables, ideally based on a random sample. On the other hand, in a qualitative study, not only are the samples typically smaller and narrower (such as using convenience samples), the study’s design is generally more flexible and less structured to accommodate the open-ended nature of the research.

Below are a few examples of qualitative and quantitative research techniques to help illustrate these differences further.

Sources of Quantitative Research

Some example methods of quantitative research methods or sources include, but are not limited to, the following:

  • Conducting polls, surveys, and experiments
  • Compiling databases of records and information
  • Observing the topic of the research, such as a specific reaction
  • Performing a meta-analysis, which involves analyzing multiple prior studies in order to identify statistical trends or patterns
  • Supplying online or paper questionnaires to participants

The following section will cover some examples of qualitative research methods for comparison, followed by an overview of mixed research methods that blend components of both approaches.

Sources of Qualitative Research

Researchers can use numerous qualitative methods to explore a topic or gain insight into an issue. Some sources of, or approaches to, qualitative research include the following examples:

  • Conducting ethnographic studies, which are studies that seek to explore different phenomena through a cultural or group-specific lens
  • Conducting focus groups
  • Examining various types of records, including but not limited to diary entries, personal letters, official documents, medical or hospital records, photographs, video or audio recordings, and even minutes from meetings
  • Holding one-on-one interviews
  • Obtaining personal accounts and recollections of events or experiences

Examples of Research Questions Best Suited for Qualitative vs. Quantitative Methods

Qualitative research questions:.

  • How do patients experience the process of recovering from surgery?
  • Why do some employees feel more motivated in remote work environments?
  • What are the cultural influences on dietary habits among teenagers?

Quantitative Research Questions:

  • What is the average recovery time for patients after surgery?
  • How does remote work impact employee productivity levels?
  • What percentage of teenagers adhere to recommended dietary guidelines?

These examples illustrate how qualitative research delves into the depth and context of human experiences, while quantitative research focuses on measurable data and statistical analysis.

Mixed Methods Research

In addition to the purely qualitative and quantitative research methods outlined above, such as conducting focus groups or performing meta-analyses, it’s also possible to take a hybrid approach that merges qualitative and quantitative research aspects. According to an article published by LinkedIn , “Mixed methods research avoids many [of the] criticisms” that have historically been directed at qualitative and quantitative research, such as the former’s vulnerability to bias, by “canceling the effects of one methodology by including the other methodology.” In other words, this mixed approach provides the best of both worlds. “Mixed methods research also triangulates results that offer higher validity and reliability.”

If you’re enrolled as a National University student, you can watch a video introduction to mixed-method research by logging in with your student ID. Our resource library also covers qualitative and quantitative research methodologies and a video breakdown of when to use which approach.

When it comes to quantitative and qualitative research, methods of collecting data differ, as do the methods of organizing and analyzing it. So what are some best practices for analyzing qualitative and quantitative data sets, and how do they call for different approaches by researchers?

How to Analyze Qualitative Data

Below is a step-by-step overview of how to analyze qualitative data.

  • Make sure all of your data is finished being compiled before you begin any analysis.
  • Organize and connect your data for consistency using computer-assisted qualitative data analysis software (CAQDAS).
  • Code your data, which can be partially automated using a feedback analytics platform.
  • Start digging deep into analysis, potentially using augmented intelligence to get more accurate results.
  • Report on your findings, ideally using engaging aids to help tell the story.

How to Analyze Quantitative Data

There are numerous approaches to analyzing quantitative data. Some examples include cross-tabulation, conjoint analysis, gap analysis, trend analysis, and SWOT analysis, which refers to Strengths, Weaknesses, Opportunities, and Threats.

Whichever system or systems you use, there are specific steps you should take to ensure that you’ve organized your data and analyzed it as accurately as possible. Here’s a brief four-step overview.

  • Connect measurement scales to study variables, which helps ensure that your data will be organized in the appropriate order before you proceed.
  • Link data with descriptive statistics, such as mean, median, mode, or frequency.
  • Determine what measurement scale you’ll use for your analysis.
  • Organize the data into tables and conduct an analysis using methods like cross-tabulation or Total Unduplicated Reach and Frequency (TURF) analysis.

people talking in front of whiteboard with notes written on it

Simply knowing the difference between quantitative and qualitative research isn’t enough — you also need an understanding of when each approach should be used and under what circumstances. For that, you’ll need to consider all of the comparisons we’ve made throughout this article and weigh some potential pros and cons of each methodology.

Pros and Cons of Qualitative Research

Qualitative research has numerous strengths, but the research methodology is only more appropriate for some projects or dissertations. Here are some strengths and weaknesses of qualitative research to help guide your decision:

  • Pro — More flex room for creativity and interpretation of results
  • Pro — Greater freedom to utilize different research techniques as the study evolves
  • Con — Potentially more vulnerable to bias due to their subjective nature
  • Con — Sample sizes tend to be smaller and non-randomized

Pros and Cons of Quantitative Research

Quantitative research also comes with drawbacks and benefits, depending on what information you aim to uncover. Here are a few pros and cons to consider when designing your study.

  • Pro — Large, random samples help ensure that the broader population is more realistically reflected
  • Pro — Specific, precise results can be easily communicated using numbers
  • Con — Data can suffer from a lack of context or personal detail around participant answers
  • Con — Numerous participants are needed, driving up costs while posing logistical challenges

If you dream of making a scientific breakthrough and contributing new knowledge that revolutionizes your field, you’ll need a strong foundation in research, from how it’s conducted and analyzed to a clear understanding of professional ethics and standards. By pursuing your degree at National University, you build stronger research skills and countless other in-demand job skills.

With flexible course schedules, convenient online classes , scholarships and financial aid , and an inclusive military-friendly culture, higher education has never been more achievable or accessible. At National University, you’ll find opportunities to challenge and hone your research skills in more than 75 accredited graduate and undergraduate programs and fast-paced credential and certificate programs in healthcare, business, engineering, computer science, criminal justice, sociology, accounting, and more.

Contact our admissions office to request program information, or apply to National University online today .

Learn More About Our University and Scholarships

Join our email list!

  • First Name *
  • Form Email Field
  • Consent * I agree to the terms and conditions below. *

Recent Resources

  • National University’s Online Ph.D. of Business Administration in Strategic Marketing Named One of the Best in the Nation August 20, 2024
  • How to Become an Investment Banker July 26, 2024
  • What Can You Do With an Economics Degree? July 19, 2024

Your passion. Our Programs.

Choose an area of study, select a degree level.

Search the site

Modal window with site-search and helpful links

Featured Programs

  • Business and Management
  • Computer Science
  • Teaching and Credentials

Helpful Links

  • Admissions & Application Information
  • Online College Degrees & Programs
  • Student Services
  • Request Your Transcripts

Terms & Conditions

By submitting your information to National University as my electronic signature and submitting this form by clicking the Request Info button above, I provide my express written consent to representatives of National University and National University affiliates (including City University of Seattle) to contact me about educational opportunities. This includes the use of automated technology, such as an automatic dialing system and pre-recorded or artificial voice messages, text messages, and mail, both electronic and physical, to the phone numbers (including cellular) and e-mail address(es) I have provided. I confirm that the information provided on this form is accurate and complete. I also understand that certain degree programs may not be available in all states. Message and data rates may apply. Message frequency may vary.

I understand that consent is not a condition to purchase any goods, services or property, and that I may withdraw my consent at any time by sending an email to [email protected] . I understand that if I am submitting my personal data from outside of the United States, I am consenting to the transfer of my personal data to, and its storage in, the United States, and I understand that my personal data will be subject to processing in accordance with U.S. laws, unless stated otherwise in our privacy policy . Please review our privacy policy for more details or contact us at [email protected] .

By submitting my information, I acknowledge that I have read and reviewed the Accessibility Statement . 

By submitting my information, I acknowledge that I have read and reviewed the Student Code of Conduct located in the Catalog .

National University

Chat Options

qualitative vs quantitative research in education

Quantitative and Qualitative Research

  • Quantitative vs. Qualitative Research
  • Find quantitative or qualitative research in CINAHL
  • Find quantitative or qualitative research in PsycINFO
  • Relevant book titles

Mixed Methods Research

As its name suggests, mixed methods research involves using elements of both quantitative and qualitative research methods. Using mixed methods, a researcher can more fully explore a research question and provide greater insight. 

What is Empirical Research?

Empirical research is based on observed  and measured phenomena. Knowledge is extracted from real lived experience rather than from theory or belief. 

IMRaD: Scholarly journals sometimes use the "IMRaD" format to communicate empirical research findings.

Introduction:  explains why this research is important or necessary. Provides context ("literature review").

Methodology:  explains how the research was conducted ("research design").

Results: presents what was learned through the study ("findings").

Discussion:  explains or comments upon the findings including why the study is important and connecting to other research ("conclusion").

What is Quantitative Research?

Quantitative research gathers data that can be measured numerically and analyzed mathematically. Quantitative research attempts to answer research questions through the quantification of data. 

Indicators of quantitative research include:

contains statistical analysis 

large sample size 

objective - little room to argue with the numbers 

types of research: descriptive studies, exploratory studies, experimental studies, explanatory studies, predictive studies, clinical trials 

What is Qualitative Research?

Qualitative research is based upon data that is gathered by observation. Qualitative research articles will attempt to answer questions that cannot be measured by numbers but rather by perceived meaning. Qualitative research will likely include interviews, case studies, ethnography, or focus groups. 

Indicators of qualitative research include:

interviews or focus groups 

small sample size 

subjective - researchers are often interpreting meaning 

methods used: phenomenology, ethnography, grounded theory, historical method, case study 

Video: Empirical Studies: Qualitative vs. Quantitative

This video from usu libraries walks you through the differences between quantitative and qualitative research methods. (5:51 minutes) creative commons attribution license (reuse allowed)  https://youtu.be/rzcfma1l6ce.

  • << Previous: Home
  • Next: Find quantitative or qualitative research in CINAHL >>
  • Last Updated: Mar 25, 2024 12:23 PM
  • URL: https://libguides.hofstra.edu/quantitative-and-qualitative-research

This site is compliant with the W3C-WAI Web Content Accessibility Guidelines HOFSTRA UNIVERSITY Hempstead, NY 11549-1000 (516) 463-6600 © 2000-2009 Hofstra University

Have a language expert improve your writing

Run a free plagiarism check in 10 minutes, automatically generate references for free.

  • Knowledge Base
  • Methodology
  • Qualitative vs Quantitative Research | Examples & Methods

Qualitative vs Quantitative Research | Examples & Methods

Published on 4 April 2022 by Raimo Streefkerk . Revised on 8 May 2023.

When collecting and analysing data, quantitative research deals with numbers and statistics, while qualitative research  deals with words and meanings. Both are important for gaining different kinds of knowledge.

Common quantitative methods include experiments, observations recorded as numbers, and surveys with closed-ended questions. Qualitative research Qualitative research is expressed in words . It is used to understand concepts, thoughts or experiences. This type of research enables you to gather in-depth insights on topics that are not well understood.

Table of contents

The differences between quantitative and qualitative research, data collection methods, when to use qualitative vs quantitative research, how to analyse qualitative and quantitative data, frequently asked questions about qualitative and quantitative research.

Quantitative and qualitative research use different research methods to collect and analyse data, and they allow you to answer different kinds of research questions.

Qualitative vs quantitative research

Prevent plagiarism, run a free check.

Quantitative and qualitative data can be collected using various methods. It is important to use a data collection method that will help answer your research question(s).

Many data collection methods can be either qualitative or quantitative. For example, in surveys, observations or case studies , your data can be represented as numbers (e.g. using rating scales or counting frequencies) or as words (e.g. with open-ended questions or descriptions of what you observe).

However, some methods are more commonly used in one type or the other.

Quantitative data collection methods

  • Surveys :  List of closed or multiple choice questions that is distributed to a sample (online, in person, or over the phone).
  • Experiments : Situation in which variables are controlled and manipulated to establish cause-and-effect relationships.
  • Observations: Observing subjects in a natural environment where variables can’t be controlled.

Qualitative data collection methods

  • Interviews : Asking open-ended questions verbally to respondents.
  • Focus groups: Discussion among a group of people about a topic to gather opinions that can be used for further research.
  • Ethnography : Participating in a community or organisation for an extended period of time to closely observe culture and behavior.
  • Literature review : Survey of published works by other authors.

A rule of thumb for deciding whether to use qualitative or quantitative data is:

  • Use quantitative research if you want to confirm or test something (a theory or hypothesis)
  • Use qualitative research if you want to understand something (concepts, thoughts, experiences)

For most research topics you can choose a qualitative, quantitative or mixed methods approach . Which type you choose depends on, among other things, whether you’re taking an inductive vs deductive research approach ; your research question(s) ; whether you’re doing experimental , correlational , or descriptive research ; and practical considerations such as time, money, availability of data, and access to respondents.

Quantitative research approach

You survey 300 students at your university and ask them questions such as: ‘on a scale from 1-5, how satisfied are your with your professors?’

You can perform statistical analysis on the data and draw conclusions such as: ‘on average students rated their professors 4.4’.

Qualitative research approach

You conduct in-depth interviews with 15 students and ask them open-ended questions such as: ‘How satisfied are you with your studies?’, ‘What is the most positive aspect of your study program?’ and ‘What can be done to improve the study program?’

Based on the answers you get you can ask follow-up questions to clarify things. You transcribe all interviews using transcription software and try to find commonalities and patterns.

Mixed methods approach

You conduct interviews to find out how satisfied students are with their studies. Through open-ended questions you learn things you never thought about before and gain new insights. Later, you use a survey to test these insights on a larger scale.

It’s also possible to start with a survey to find out the overall trends, followed by interviews to better understand the reasons behind the trends.

Qualitative or quantitative data by itself can’t prove or demonstrate anything, but has to be analysed to show its meaning in relation to the research questions. The method of analysis differs for each type of data.

Analysing quantitative data

Quantitative data is based on numbers. Simple maths or more advanced statistical analysis is used to discover commonalities or patterns in the data. The results are often reported in graphs and tables.

Applications such as Excel, SPSS, or R can be used to calculate things like:

  • Average scores
  • The number of times a particular answer was given
  • The correlation or causation between two or more variables
  • The reliability and validity of the results

Analysing qualitative data

Qualitative data is more difficult to analyse than quantitative data. It consists of text, images or videos instead of numbers.

Some common approaches to analysing qualitative data include:

  • Qualitative content analysis : Tracking the occurrence, position and meaning of words or phrases
  • Thematic analysis : Closely examining the data to identify the main themes and patterns
  • Discourse analysis : Studying how communication works in social contexts

Quantitative research deals with numbers and statistics, while qualitative research deals with words and meanings.

Quantitative methods allow you to test a hypothesis by systematically collecting and analysing data, while qualitative methods allow you to explore ideas and experiences in depth.

In mixed methods research , you use both qualitative and quantitative data collection and analysis methods to answer your research question .

The research methods you use depend on the type of data you need to answer your research question .

  • If you want to measure something or test a hypothesis , use quantitative methods . If you want to explore ideas, thoughts, and meanings, use qualitative methods .
  • If you want to analyse a large amount of readily available data, use secondary data. If you want data specific to your purposes with control over how they are generated, collect primary data.
  • If you want to establish cause-and-effect relationships between variables , use experimental methods. If you want to understand the characteristics of a research subject, use descriptive methods.

Data collection is the systematic process by which observations or measurements are gathered in research. It is used in many different contexts by academics, governments, businesses, and other organisations.

There are various approaches to qualitative data analysis , but they all share five steps in common:

  • Prepare and organise your data.
  • Review and explore your data.
  • Develop a data coding system.
  • Assign codes to the data.
  • Identify recurring themes.

The specifics of each step depend on the focus of the analysis. Some common approaches include textual analysis , thematic analysis , and discourse analysis .

Cite this Scribbr article

If you want to cite this source, you can copy and paste the citation or click the ‘Cite this Scribbr article’ button to automatically add the citation to our free Reference Generator.

Streefkerk, R. (2023, May 08). Qualitative vs Quantitative Research | Examples & Methods. Scribbr. Retrieved 30 August 2024, from https://www.scribbr.co.uk/research-methods/quantitative-qualitative-research/

Is this article helpful?

Raimo Streefkerk

Raimo Streefkerk

link to library home page

Contact Us!

qualitative vs quantitative research in education

Qualitative and Quantitative Research

In general, quantitative research seeks to understand the causal or correlational relationship between variables through testing hypotheses, whereas qualitative research seeks to understand a phenomenon within a real-world context through the use of interviews and observation. Both types of research are valid, and certain research topics are better suited to one approach or the other. However, it is important to understand the differences between qualitative and quantitative research so that you will be able to conduct an informed critique and analysis of any articles that you read, because you will understand the different advantages, disadvantages, and influencing factors for each approach. 

The table below illustrates the main differences between qualitative and quantitative research. Be aware that these are generalizations, and that not every research study or article will fit neatly into these categories. 

 

Complexity, contextual, inductive logic, discovery, exploration

Experiment, random assignment, independent/dependent variable, causal/correlational, validity, deductive logic

Understand a phenomenon

Discover causal relationships or describe a phenomenon

Purposive sample, small

Random sample, large

Focus groups, interviews, field observation

Tests, surveys, questionnaires

Phenomenological, grounded theory, ethnographic, case study, historical/narrative research, participatory research, clinical research

Experimental, quasi-experimental, descriptive, methodological, exploratory, comparative, correlational, developmental (cross-sectional, longitudinal/prospective/cohort, retrospective/ex post facto/case control)

Systematic reviews, meta-analyses, and integrative reviews are not exactly designs, but they synthesize, analyze, and compare the results from many research studies and are somewhat quantitative in nature. However, they are not truly quantitative or qualitative studies.

References:

LoBiondo-Wood, G., & Haber, J. (2010). Nursing research: Methods and critical appraisal for evidence-based practice (7 th ed.). St. Louis, MO: Mosby Elsevier

Mertens, D. M. (2010). Research and evaluation in education and psychology (3 rd ed.). Los Angeles: SAGE

Quick Overview

This 2-minute video provides a simplified overview of the primary distinctions between quantitative and qualitative research.

It's Not Always One or the Other!

It's important to keep in mind that research studies and articles are not always 100% qualitative or 100% quantitative. A mixed methods study involves both qualitative and quantitative approaches. If you need to find articles that are purely qualitative or purely quanititative, be sure to look carefully at the methodology sections to make sure the studies did not utilize both methods. 

  • Next: Nursing >>
  • Last Updated: May 28, 2024 4:52 PM
  • URL: https://stevenson.libguides.com/c.php?g=236343

Christopher Dwyer Ph.D.

Critically Thinking About Qualitative Versus Quantitative Research

What should we do regarding our research questions and methodology.

Posted January 26, 2022 | Reviewed by Davia Sills

  • Neither a quantitative nor a qualitative methodology is the right way to approach every scientific question.
  • Rather, the nature of the question determines which methodology is best suited to address it.
  • Often, researchers benefit from a mixed approach that incorporates both quantitative and qualitative methodologies.

As a researcher who has used a wide variety of methodologies, I understand the importance of acknowledging that we, as researchers, do not pick the methodology; rather, the research question dictates it. So, you can only imagine how annoyed I get when I hear of undergraduates designing their research projects based on preconceived notions, like "quantitative is more straightforward," or "qualitative is easier." Apart from the fact that neither of these assertions is actually the case, these young researchers are blatantly missing one of the foundational steps of good research: If you are interested in researching a particular area, you must get to know the area (i.e., through reading) and then develop a question based on that reading.

The nature of the question will dictate the most appropriate methodological approach.

I’ve debated with researchers in the past who are "exclusively" qualitative or "exclusively" quantitative. Depending on the rationale for their exclusivity, I might question a little deeper, learn something, and move on, or I might debate further. Sometimes, I throw some contentious statements out to see what the responses are like. For example, "Qualitative research, in isolation, is nothing but glorified journalism . " This one might not be new to you. Yes, qualitative is flawed, but so, too, is quantitative.

Let's try this one: "Numbers don’t lie, just the researchers who interpret them." If researchers are going to have a pop at qual for subjectivity, why don’t they recognize the same issues in quant? The numbers in a results section may be objectively correct, but their meaningfulness is only made clear through the interpretation of the human reporting them. This is not a criticism but is an important observation for those who believe in the absolute objectivity of quantitative reporting. The subjectivity associated with this interpretation may miss something crucial in the interpretation of the numbers because, hey, we’re only human.

With that, I love quantitative research, but I’m not unreasonable about it. Let’s say we’ve evaluated a three-arm RCT—the new therapeutic intervention is significantly efficacious, with a large effect, for enhancing "x" in people living with "y." One might conclude that this intervention works and that we must conduct further research on it to further support its efficacy—this is, of course, a fine suggestion, consistent with good research practice and epistemological understanding.

However, blindly recommending the intervention based on the interpretation of numbers alone might be suspect—think of all the variables that could be involved in a 4-, 8-, 12-, or 52-week intervention with human participants. It would be foolish to believe that all variables were considered—so, here is a fantastic example of where a qualitative methodology might be useful. At the end of the intervention, a researcher might decide to interview a random 20 percent of the cohort who participated in the intervention group about their experience and the program’s strengths and weaknesses. The findings from this qualitative element might help further explain the effects, aid the initial interpretation, and bring to life new ideas and concepts that had been missing from the initial interpretation. In this respect, infusing a qualitative approach at the end of quantitative analysis has shown its benefits—a mixed approach to intervention evaluation is very useful.

What about before that? Well, let’s say I want to develop another intervention to enhance "z," but there’s little research on it, and that which has been conducted isn’t of the highest quality; furthermore, we don’t know about people’s experiences with "z" or even other variables associated with it.

To design an intervention around "z" would be ‘jumping the gun’ at best (and a waste of funds). It seems that an exploration of some sort is necessary. This is where qualitative again shines—giving us an opportunity to explore what "z" is from the perspective of a relevant cohort(s).

Of course, we cannot generalize the findings; we cannot draw a definitive conclusion as to what "z" is. But what the findings facilitate is providing a foundation from which to work; for example, we still cannot say that "z" is this, that, or the other, but it appears that it might be associated with "a," "b" and "c." Thus, future research should investigate the nature of "z" as a particular concept, in relation to "a," "b" and "c." Again, a qualitative methodology shows its worth. In the previous examples, a qualitative method was used because the research questions warranted it.

Through considering the potentially controversial statements about qual and quant above, we are pushed into examining the strengths and weaknesses of research methodologies (regardless of our exclusivity with a particular approach). This is useful if we’re going to think critically about finding answers to our research questions. But simply considering these does not let poor research practice off the hook.

For example, credible qualitative researchers acknowledge that generalizability is not the point of their research; however, that doesn’t stop some less-than-credible researchers from presenting their "findings" as generalizable as possible, without actually using the word. Such practices should be frowned upon—so should making a career out of strictly using qualitative methodology in an attempt to find answers core to the human condition. All these researchers are really doing is spending a career exploring, yet never really finding anything (despite arguing to the contrary, albeit avoiding the word "generalize").

qualitative vs quantitative research in education

The solution to this problem, again, is to truly listen to what your research question is telling you. Eventually, it’s going to recommend a quantitative approach. Likewise, a "numbers person" will be recommended a qualitative approach from time to time—flip around the example above, and there’s a similar criticism. Again, embrace a mixed approach.

What's the point of this argument?

I conduct both research methodologies. Which do I prefer? Simple—whichever one helps me most appropriately answer my research question.

Do I have problems with qualitative methodologies? Absolutely—but I have issues with quantitative methods as well. Having these issues is good—it means that you recognize the limitations of your tools, which increases the chances of you "fixing," "sharpening" or "changing out" your tools when necessary.

So, the next time someone speaks with you about labeling researchers as one type or another, ask them why they think that way, ask them which they think you are, and then reflect on the responses alongside your own views of methodology and epistemology. It might just help you become a better researcher.

Christopher Dwyer Ph.D.

Christopher Dwyer, Ph.D., is a lecturer at the Technological University of the Shannon in Athlone, Ireland.

  • Find a Therapist
  • Find a Treatment Center
  • Find a Psychiatrist
  • Find a Support Group
  • Find Online Therapy
  • United States
  • Brooklyn, NY
  • Chicago, IL
  • Houston, TX
  • Los Angeles, CA
  • New York, NY
  • Portland, OR
  • San Diego, CA
  • San Francisco, CA
  • Seattle, WA
  • Washington, DC
  • Asperger's
  • Bipolar Disorder
  • Chronic Pain
  • Eating Disorders
  • Passive Aggression
  • Personality
  • Goal Setting
  • Positive Psychology
  • Stopping Smoking
  • Low Sexual Desire
  • Relationships
  • Child Development
  • Self Tests NEW
  • Therapy Center
  • Diagnosis Dictionary
  • Types of Therapy

July 2024 magazine cover

Sticking up for yourself is no easy task. But there are concrete skills you can use to hone your assertiveness and advocate for yourself.

  • Emotional Intelligence
  • Gaslighting
  • Affective Forecasting
  • Neuroscience

Banner

New Scholar

  • Tips for Publishing
  • Publishing Process Overview
  • Selecting a Journal Overview
  • Conference Presentation or Poster Session Overview
  • Predatory Publishers

Qualitative vs Quantitative Research

Types of empirical research, list of journals friendly to qualitative work.

  • Understanding Research Impact This link opens in a new window

Reproducibility Librarian

Profile Photo

Make an Appointment with Me

Do you want to meet with me in-person or online?

Quantitative Research is used to quantify the problem by way of generating numerical data or data that can be transformed into useable statistics. It is used to quantify attitudes, opinions, behaviors, and other defined variables – and generalize results from a larger sample population.

Qualitative Research is primarily exploratory research.  It is used to gain an understanding of underlying reasons, opinions, and motivations. This data is usually gathered using conversational methods such as interviews or focus groups.

Some journals and even some disciplines may have a preference as to what type of empirical research they wish to publish.  Some authors who have written an article that is primarily qualitative in nature, may seek out journals that are "qualitative research friendly." We have listed a few such journals below. 

qualitative vs quantitative research in education

  • International Journal of Qualitative Methods
  • Journal of Mixed Methods Research
  • Quality & Quantity
  • List of qualitative research journals compiled by Saint Louis University

You can also look through the last few issues of a journal to see if the articles they publish tend to be more qualitative or quantitative in nature.

  • << Previous: Predatory Publishers
  • Next: Understanding Research Impact >>
  • Last Updated: Aug 29, 2024 12:36 PM
  • URL: https://libguides.lib.rochester.edu/newscholar
  • Bipolar Disorder
  • Therapy Center
  • When To See a Therapist
  • Types of Therapy
  • Best Online Therapy
  • Best Couples Therapy
  • Managing Stress
  • Sleep and Dreaming
  • Understanding Emotions
  • Self-Improvement
  • Healthy Relationships
  • Student Resources
  • Personality Types
  • Sweepstakes
  • Guided Meditations
  • Verywell Mind Insights
  • 2024 Verywell Mind 25
  • Mental Health in the Classroom
  • Editorial Process
  • Meet Our Review Board
  • Crisis Support

Quantitative vs. Qualitative Research in Psychology

  • Key Differences

Quantitative Research Methods

Qualitative research methods.

  • How They Relate

In psychology and other social sciences, researchers are faced with an unresolved question: Can we measure concepts like love or racism the same way we can measure temperature or the weight of a star? Social phenomena⁠—things that happen because of and through human behavior⁠—are especially difficult to grasp with typical scientific models.

At a Glance

Psychologists rely on quantitative and quantitative research to better understand human thought and behavior.

  • Qualitative research involves collecting and evaluating non-numerical data in order to understand concepts or subjective opinions.
  • Quantitative research involves collecting and evaluating numerical data. 

This article discusses what qualitative and quantitative research are, how they are different, and how they are used in psychology research.

Qualitative Research vs. Quantitative Research

In order to understand qualitative and quantitative psychology research, it can be helpful to look at the methods that are used and when each type is most appropriate.

Psychologists rely on a few methods to measure behavior, attitudes, and feelings. These include:

  • Self-reports , like surveys or questionnaires
  • Observation (often used in experiments or fieldwork)
  • Implicit attitude tests that measure timing in responding to prompts

Most of these are quantitative methods. The result is a number that can be used to assess differences between groups.

However, most of these methods are static, inflexible (you can't change a question because a participant doesn't understand it), and provide a "what" answer rather than a "why" answer.

Sometimes, researchers are more interested in the "why" and the "how." That's where qualitative methods come in.

Qualitative research is about speaking to people directly and hearing their words. It is grounded in the philosophy that the social world is ultimately unmeasurable, that no measure is truly ever "objective," and that how humans make meaning is just as important as how much they score on a standardized test.

Used to develop theories

Takes a broad, complex approach

Answers "why" and "how" questions

Explores patterns and themes

Used to test theories

Takes a narrow, specific approach

Answers "what" questions

Explores statistical relationships

Quantitative methods have existed ever since people have been able to count things. But it is only with the positivist philosophy of Auguste Comte (which maintains that factual knowledge obtained by observation is trustworthy) that it became a "scientific method."

The scientific method follows this general process. A researcher must:

  • Generate a theory or hypothesis (i.e., predict what might happen in an experiment) and determine the variables needed to answer their question
  • Develop instruments to measure the phenomenon (such as a survey, a thermometer, etc.)
  • Develop experiments to manipulate the variables
  • Collect empirical (measured) data
  • Analyze data

Quantitative methods are about measuring phenomena, not explaining them.

Quantitative research compares two groups of people. There are all sorts of variables you could measure, and many kinds of experiments to run using quantitative methods.

These comparisons are generally explained using graphs, pie charts, and other visual representations that give the researcher a sense of how the various data points relate to one another.

Basic Assumptions

Quantitative methods assume:

  • That the world is measurable
  • That humans can observe objectively
  • That we can know things for certain about the world from observation

In some fields, these assumptions hold true. Whether you measure the size of the sun 2000 years ago or now, it will always be the same. But when it comes to human behavior, it is not so simple.

As decades of cultural and social research have shown, people behave differently (and even think differently) based on historical context, cultural context, social context, and even identity-based contexts like gender , social class, or sexual orientation .

Therefore, quantitative methods applied to human behavior (as used in psychology and some areas of sociology) should always be rooted in their particular context. In other words: there are no, or very few, human universals.

Statistical information is the primary form of quantitative data used in human and social quantitative research. Statistics provide lots of information about tendencies across large groups of people, but they can never describe every case or every experience. In other words, there are always outliers.

Correlation and Causation

A basic principle of statistics is that correlation is not causation. Researchers can only claim a cause-and-effect relationship under certain conditions:

  • The study was a true experiment.
  • The independent variable can be manipulated (for example, researchers cannot manipulate gender, but they can change the primer a study subject sees, such as a picture of nature or of a building).
  • The dependent variable can be measured through a ratio or a scale.

So when you read a report that "gender was linked to" something (like a behavior or an attitude), remember that gender is NOT a cause of the behavior or attitude. There is an apparent relationship, but the true cause of the difference is hidden.

Pitfalls of Quantitative Research

Quantitative methods are one way to approach the measurement and understanding of human and social phenomena. But what's missing from this picture?

As noted above, statistics do not tell us about personal, individual experiences and meanings. While surveys can give a general idea, respondents have to choose between only a few responses. This can make it difficult to understand the subtleties of different experiences.

Quantitative methods can be helpful when making objective comparisons between groups or when looking for relationships between variables. They can be analyzed statistically, which can be helpful when looking for patterns and relationships.

Qualitative data are not made out of numbers but rather of descriptions, metaphors, symbols, quotes, analysis, concepts, and characteristics. This approach uses interviews, written texts, art, photos, and other materials to make sense of human experiences and to understand what these experiences mean to people.

While quantitative methods ask "what" and "how much," qualitative methods ask "why" and "how."

Qualitative methods are about describing and analyzing phenomena from a human perspective. There are many different philosophical views on qualitative methods, but in general, they agree that some questions are too complex or impossible to answer with standardized instruments.

These methods also accept that it is impossible to be completely objective in observing phenomena. Researchers have their own thoughts, attitudes, experiences, and beliefs, and these always color how people interpret results.

Qualitative Approaches

There are many different approaches to qualitative research, with their own philosophical bases. Different approaches are best for different kinds of projects. For example:

  • Case studies and narrative studies are best for single individuals. These involve studying every aspect of a person's life in great depth.
  • Phenomenology aims to explain experiences. This type of work aims to describe and explore different events as they are consciously and subjectively experienced.
  • Grounded theory develops models and describes processes. This approach allows researchers to construct a theory based on data that is collected, analyzed, and compared to reach new discoveries.
  • Ethnography describes cultural groups. In this approach, researchers immerse themselves in a community or group in order to observe behavior.

Qualitative researchers must be aware of several different methods and know each thoroughly enough to produce valuable research.

Some researchers specialize in a single method, but others specialize in a topic or content area and use many different methods to explore the topic, providing different information and a variety of points of view.

There is not a single model or method that can be used for every qualitative project. Depending on the research question, the people participating, and the kind of information they want to produce, researchers will choose the appropriate approach.

Interpretation

Qualitative research does not look into causal relationships between variables, but rather into themes, values, interpretations, and meanings. As a rule, then, qualitative research is not generalizable (cannot be applied to people outside the research participants).

The insights gained from qualitative research can extend to other groups with proper attention to specific historical and social contexts.

Relationship Between Qualitative and Quantitative Research

It might sound like quantitative and qualitative research do not play well together. They have different philosophies, different data, and different outputs. However, this could not be further from the truth.

These two general methods complement each other. By using both, researchers can gain a fuller, more comprehensive understanding of a phenomenon.

For example, a psychologist wanting to develop a new survey instrument about sexuality might and ask a few dozen people questions about their sexual experiences (this is qualitative research). This gives the researcher some information to begin developing questions for their survey (which is a quantitative method).

After the survey, the same or other researchers might want to dig deeper into issues brought up by its data. Follow-up questions like "how does it feel when...?" or "what does this mean to you?" or "how did you experience this?" can only be answered by qualitative research.

By using both quantitative and qualitative data, researchers have a more holistic, well-rounded understanding of a particular topic or phenomenon.

Qualitative and quantitative methods both play an important role in psychology. Where quantitative methods can help answer questions about what is happening in a group and to what degree, qualitative methods can dig deeper into the reasons behind why it is happening. By using both strategies, psychology researchers can learn more about human thought and behavior.

Gough B, Madill A. Subjectivity in psychological science: From problem to prospect . Psychol Methods . 2012;17(3):374-384. doi:10.1037/a0029313

Pearce T. “Science organized”: Positivism and the metaphysical club, 1865–1875 . J Hist Ideas . 2015;76(3):441-465.

Adams G. Context in person, person in context: A cultural psychology approach to social-personality psychology . In: Deaux K, Snyder M, eds. The Oxford Handbook of Personality and Social Psychology . Oxford University Press; 2012:182-208.

Brady HE. Causation and explanation in social science . In: Goodin RE, ed. The Oxford Handbook of Political Science. Oxford University Press; 2011. doi:10.1093/oxfordhb/9780199604456.013.0049

Chun Tie Y, Birks M, Francis K. Grounded theory research: A design framework for novice researchers .  SAGE Open Med . 2019;7:2050312118822927. doi:10.1177/2050312118822927

Reeves S, Peller J, Goldman J, Kitto S. Ethnography in qualitative educational research: AMEE Guide No. 80 . Medical Teacher . 2013;35(8):e1365-e1379. doi:10.3109/0142159X.2013.804977

Salkind NJ, ed. Encyclopedia of Research Design . Sage Publishing.

Shaughnessy JJ, Zechmeister EB, Zechmeister JS.  Research Methods in Psychology . McGraw Hill Education.

By Anabelle Bernard Fournier Anabelle Bernard Fournier is a researcher of sexual and reproductive health at the University of Victoria as well as a freelance writer on various health topics.

  • Enroll & Pay

Qualitative vs. Quantitative Data: What’s the difference?

Wed, 08/07/7619.

Lauren Coleman-Tempel

qualitative vs quantitative research in education

In the world of program evaluation, terms like methods, data sources, and analysis are thrown around like candy in a parade. For those of us who do not regularly participate in the evaluation end of programming, the frequent use of these terms can seem daunting and even downright annoying. As staff and directors of grant-funded programs, you will be advised to collect a variety of data on your student outcomes, and often these data sources are discussed in the aforementioned annoying terms. I hope to give you a little background on the difference between the most commonly used data descriptors: qualitative and quantitative data. Here we go….

Quantitative Data

Beans. Yes, I said beans. I like to think of quantitative data as something that you can count, or QUANTify, like a handful of dried beans. In our programs, these quantitative data commonly look like district assessments, student attendance, activity participation, student GPA, etc. 

qualitative vs quantitative research in education

For us to demonstrate that our programs have an impact on the student outcomes that our funders care about, these quantitative data sources must be collected. These are hard numbers. Don’t get me wrong; hard numbers measure things like GPAs and attendance but they can also measure “soft constructs” such as social-emotional skills, feelings of belonging, and academic motivation. This is done through a thoughtful survey collection plan. When seeking to use quantitative data, get creative! It does not and should not always be about our traditional success indicators.

Qualitative Data

Now, we all know that numbers are important and are the backbone of what we report to our funders, but now I am going to shift into the more colorful of our data sources: qualitative data. Our programs, schools and students are more than their test scores, attendance data, and activity counts; they have stories and complexities that numbers cannot always demonstrate. If collected properly, qualitative data can serve as a meaningful backdrop to contextualize what we are seeing in our “bean counts”. In the spirit of qualitative data collection, let me tell you a story.

Program A serves a cohort of high school students. In order to better understand their experiences with college visit events, program staff distribute a survey. The survey results (quantitative data) show that students rated their trip to Brainiac University very low. If program staff stop here, they would just determine that the students are not interested in that university and would not schedule any more visits there in the future. Thinking that there must be more to the story, Site Coordinator Wanda suggests that their evaluators conduct a focus group with students to ask some follow-up open-ended questions about the college visit series. Through these focus groups, evaluators learn that on their visit to Brainiac University, there was a single tour guide who made an offensive comment, which colored this group of students’ experience in a negative light. Without the follow-up focus group, program staff would not have known the deeper experience of their students regarding their college visit.

Qualitative data can be collected in a variety of ways including focus groups, interviews, social media comments, and short answer survey itesm.

qualitative vs quantitative research in education

Qualitative data can provide you with rich background information that allows you to get to know your students on a deeper level. Although it is rarely asked about in program APRs (Annual Progress Reports) and institutional reports, it can be an extremely useful tool to both increase our awareness of our students’ experiences, as well as demonstrate our impact on a human level.

In conclusion, when encountered with the question, “What kind of data should I use?” remember to think outside the box! In order to answer questions about program impact, we can utilize both qualitative and quantitative data. I hope this post helped make these two types of data a bit clearer and also gave you ideas of how these data can be used to both demonstrate program efficacy as well as inform program improvement. Learning about data and program evaluation can feel overwhelming at times, but we are here to help!

qualitative vs quantitative research in education

Contributed By Lauren Coleman-Tempel

Lauren Coleman-Tempel, Ph.D. is the assistant director of  Research, Evaluation & Dissemination  for the University of Kansas Center for Educational Opportunity Programs (CEOP). She oversees multiple federally funded equity-based program evaluations including GEAR UP and TRIO and assists with the supervision of research and evaluation projects.

Follow  @CEOPmedia  on Twitter to learn more about how our Research, Evaluation, and Dissemination team leverages data and strategic dissemination to improve program outcomes while improving the visibility of college access programs. 

Advertisement

Advertisement

Bridging the Gap Between Qualitative and Quantitative Assessment in Science Education Research with Machine Learning — A Case for Pretrained Language Models-Based Clustering

  • Open access
  • Published: 01 June 2022
  • Volume 31 , pages 490–513, ( 2022 )

Cite this article

You have full access to this open access article

qualitative vs quantitative research in education

  • Peter Wulff   ORCID: orcid.org/0000-0002-5471-7977 1 ,
  • David Buschhüter 2 ,
  • Andrea Westphal 3 ,
  • Lukas Mientus 2 ,
  • Anna Nowak 2 &
  • Andreas Borowski 2  

5692 Accesses

22 Citations

Explore all metrics

"You can have data without information, but you cannot have information without data." (Daniel Keys Moran)

Science education researchers typically face a trade-off between more quantitatively oriented confirmatory testing of hypotheses, or more qualitatively oriented exploration of novel hypotheses. More recently, open-ended, constructed response items were used to combine both approaches and advance assessment of complex science-related skills and competencies. For example, research in assessing science teachers’ noticing and attention to classroom events benefitted from more open-ended response formats because teachers can present their own accounts. Then, open-ended responses are typically analyzed with some form of content analysis. However, language is noisy, ambiguous, and unsegmented and thus open-ended, constructed responses are complex to analyze. Uncovering patterns in these responses would benefit from more principled and systematic analysis tools. Consequently, computer-based methods with the help of machine learning and natural language processing were argued to be promising means to enhance assessment of noticing skills with constructed response formats. In particular, pretrained language models recently advanced the study of linguistic phenomena and thus could well advance assessment of complex constructs through constructed response items. This study examines potentials and challenges of a pretrained language model-based clustering approach to assess preservice physics teachers’ attention to classroom events as elicited through open-ended written descriptions. It was examined to what extent the clustering approach could identify meaningful patterns in the constructed responses, and in what ways textual organization of the responses could be analyzed with the clusters. Preservice physics teachers ( N  = 75) were instructed to describe a standardized, video-recorded teaching situation in physics. The clustering approach was used to group related sentences. Results indicate that the pretrained language model-based clustering approach yields well-interpretable, specific, and robust clusters, which could be mapped to physics-specific and more general contents. Furthermore, the clusters facilitate advanced analysis of the textual organization of the constructed responses. Hence, we argue that machine learning and natural language processing provide science education researchers means to combine exploratory capabilities of qualitative research methods with the systematicity of quantitative methods.

Similar content being viewed by others

qualitative vs quantitative research in education

The Application of LSA to the Evaluation of Questionnaire Responses

qualitative vs quantitative research in education

Enhancing Instructors’ Capability to Assess Open-Response Using Natural Language Processing and Learning Analytics

qualitative vs quantitative research in education

Coding with Storyline Approach: Recommendations to Cope with Challenges of Qualitative Data Analysis

Explore related subjects.

  • Artificial Intelligence
  • Digital Education and Educational Technology

Avoid common mistakes on your manuscript.

Research methods in science education are commonly differentiated into quantitative and qualitative methods (Krüger et al., 2014 ). The former allow for the confirmatory testing of statistical hypotheses, whereas the latter allow for more exploratory generation of novel hypotheses. This division is artificial and attributes to the imperfect capabilities of modeling complex systems that involve learning processes of humans. It would be desirable to better integrate both methods and conserve the predictive capabilities of quantitative methods and the exploratory capabilities of qualitative methods. It has been suggested that complex algorithmic approaches such as machine learning can better model assessment in science education (Breiman, 2001 ; Zhai, 2021 ) and eventually provide a new methods paradigm. “Machine learning is about inductively solving problems by machines, i.e., computers.” (Rauf, 2021 , p.8). Inductive learning requires appropriate data for the machines to improve on relevant tasks. Given advances in data storage and accessibility, machine learning (ML) models dramatically improved their performance on many tasks such as image classification, or spoken and written language analytics (Goodfellow et al., 2016 ; Goldberg, 2017 ). Scholars in the fields of education and discipline-based educational research also argued that ML methods can advance educational research (Singer, 2019 ; Baig et al., 2020 ), even “revolutionize” assessments (Zhai et al., 2020 ). Among others, the education sector presents a field where datasets of unprecedented size become available (Baig et al., 2020 ).

ML methods have been utilized in science education research in different contexts. Mostly, science education researchers employed supervised ML methods where a model is trained to map responses to predefined outputs (Zhai et al., 2020 ). However, oftentimes problems in science education research are less well defined and only small datasets can be collected with reasonable effort. For example, in research on university-based teacher education such as noticing and attention to classroom events, typically small samples are available (Chan et al., 2021 ; Wilson et al., 2019 ). Noticing, among others, comprises the careful observation of events in a teaching situation. In science education research it has been highlighted that preservice science and mathematics teachers attend to many different events and contents in a teaching situation (Talanquer et al., 2015 ). To capture the complexity of noticing, science education researchers therefore used open-ended, constructed response formats to assess noticing and attention to classroom events (Barth-Cohen et al., 2018 ; Luna et al., 2018 ; Chan et al., 2021 ). The responses are then analyzed with some form of content analysis. However, not only do the differences in attention between the teachers yield to the complexity of assessing noticing and attention processes, but also the teachers’ use of language in constructed-response items. Language use was characterized to be “noisy, ambiguous, und unsegmented” (Jurafsky, 2003 , p.39). Hence, probabilistic approaches are required to analyze language-related processes and products. A probabilistic approach that also captures complexity is ML. The application of ML-based modeling could provide researchers means to gain novel insights into these complex constructs (Zhai et al., 2020 ). Yet, it is not clear in what ways ML-based approaches can be utilized to identify meaningful patterns in teachers’ constructed responses with respect to noticing and attention to classroom events.

In the present study we therefore evaluate potentials and challenges of using a pretrained language model-based clustering approach to analyze preservice physics teachers’ open-ended, constructed responses in the context of describing a standardized teaching situation. We critically examine to what extent the application of ML in our research context can bridge the divide between quantitative and qualitative methods and provide a more integrative approach.

Utilizing NLP and ML to Model Complex Dataset

Applications with ML and natural language processing (NLP) attracted a lot of interest in the field of science education research (Zhai et al., 2020 ). ML refers to computers’ inductive problem solving based on data (Zhai, 2021 ; Rauf, 2021 ). Two major types of ML are supervised and unsupervised ML (Jordan & Mitchell, 2015 ). In supervised ML, human-annotated data are provided for the models to learn a mapping from input to output in order to classify or predict unseen data (Marsland, 2015 ). Unsupervised ML, on the other hand, encompasses algorithms to reduce complex datasets and extract patterns in them. Both types of ML can be used to analyze natural language. The study of natural language by means of computers is called NLP. NLP refers to the systematic and structured processing of natural language data. Natural language can be contrasted with artificial language such as programming languages or mathematics which are more aligned with formal logic. The attribute “natural” relates to the fact that this form of language can be characterized to be “noisy, ambiguous, and unsegmented” (Jurafsky, 2003 ). It has been argued that it is not possible to specify clear-cut rules for natural language (i.e., a grammar) that explain phenomena of language comprehension and production: “we can’t reduce what we want to say to the free combination of a few abstract primitives” (Halevy et al., 2009 , p. 9). Hence, probabilistic approaches such as ML methods are increasingly incorporated into NLP research in addition to rule-based approaches, given the capacity of probabilistic approaches such as ML to systematically process complex language data, extract patterns in it and classify instances of language use (Goldberg, 2017 ).

ML research experienced a new spring with the successful application of deep neural networks to learn input-output mappings that outperformed more simple (shallow) ML models in most tasks in image and language analysis (Goodfellow et al., 2016 ). A heuristic in ML research states that problems which are easy for humans are difficult for machines to solve such as character recognition or speech perception (Goodfellow et al., 2016 ). Simple ML algorithms like logistic regression excelled in problems where the input representation through features is particularly informative, e.g., the age of a student. The selection and engineering of inputs typically requires efforts for the human researcher, because data are not typically represented in this aggregated form in real-world contexts. Simple ML models would lose performance when more complex data such as images or language form the input (Goodfellow et al., 2016 ). Deep neural networks have been found to be capable of representing the input as part of the modeling, which allowed ML and NLP researchers to apply these models to problems where complex data has to be represented in the first place. Thus, human feature selection and engineering is partly replaced by automated feature representation in the deep neural network approaches oftentimes with the loss of interpretability of the model decisions.

A major facilitator for the deep learning revolution in the last decades was the availability of annotated data. For once, researchers spend tremendous efforts to annotate data manually in order to train deep neural networks that are capable of language comprehension and production, or image classification. For the now famous ImageNet competition, researchers manually labeled over three million images in two years with the help of crowdsourcing (Mitchell, 2020 ). Similar efforts have been undertaken in NLP. To advance language translation, ML researchers were fortunate to find annotated datasets from the cold war where translations were important for intelligence or from the European Parliament that consists of many different nations (Mitchell, 2020 ). However, curating and annotating these datasets captures resources that are not widely available such as money and compute time. Consequently, for most researchers in domains like science education no such well-developed datasets will be available for their specific research questions.

However, the ML paradigm of transfer learning that became important with increasingly complex deep neural networks (Devlin et al., 2018 ) might solve this problem. Transfer learning enables sharing of previously trained ML models for different tasks (Ruder, 2019 ). Much as humans learn language from experiences, feedback and reinforcement (Bruner, 1985 ) and build on learned structures (Rumelhart et al., 1986 ), the paradigm of transfer learning posits that prior trained weights in a given context can be further used to improve model performance in different contexts/domains and with different tasks (Ruder, 2019 ). NLP researchers used transfer learning in the context of language modeling. While in image processing models are oftentimes pretrained on the ImageNet dataset to improve downstream performance (Devlin et al., 2018 ), language models in NLP research can be trained on corpora such as the Internet or Wikipedia (Devlin et al., 2018 ; Ruder, 2019 ). NLP researchers then pretrain language models that are capable of representing language in a way that researchers can use in downstream tasks (Mikolov et al., 2013 ). Typically, these language models are trained with the objective to simply predict context words. The pretrained language models can then be used to generate an informative representation of language to enhance task performance (Mikolov et al., 2013 ; Mikolov et al., 2013 ; Devlin et al., 2018 ).

Modeling Unstructured Data in Science Education Research with NLP and ML

“Perhaps when it comes to natural language processing and related fields [that model human behavior], we’re doomed to complex theories that will never have the elegance of physics equations” (Halevy et al., 2009 , p.8). The “unreasonable effectiveness” (Wigner, 1960 ) of mathematics has been recognized for physics; however, educational sciences are far from having theories in this elegant formulation—given the complexity of the involved problems. In this context, NLP and ML have probably much to offer for these fields where complex theories prevail. Yet, especially more sophisticated NLP and ML applications such as deep neural networks might pose unfulfillable requirements on required size of training datasets and model implementation to be useful for science education research. The size of the training dataset should be judged against the complexity of the task and the complexity of the ML model. While the review by Zhai et al. ( 2020 ) shows that typical applications of NLP and ML in science education research comprise fewer than 30k training samples, the reviewed studies exclusively focus on simpler ML models such as logistic regression, support-vector machines, or naive Bayes. More generally, data collection in domains such as science education is costly and time-consuming, because large coordination efforts are necessary to recruit enough subjects. There are literally no studies in science education where millions of subjects have been collected that comprise datasets that seem to be required to train more general-purpose deep learning models. Does this imply that particularly the more complex ML methods are not applicable for science education researchers?

For supervised ML this hypothesis has been refuted in some science education research contexts. Wulff et al. ( 2022 ) could show that pretrained language models improve classification performance for discourse elements with preservice physics teachers’ written reflections. The findings in this study suggest that complex ML models that are trained from scratch can reach classification accuracy of simpler ML models. Furthermore, the authors show that utilizing pretrained weights for the complex models enhances classification accuracy and generalizability further. Carpenter et al. ( 2020 ) showed that deep contextualized embeddings from pretrained language models could improve prediction of students’ reflective depth in a biology learning context. These findings buttress the applicability of complex ML models such as deep neural networks as facilitators for supervised ML. These studies, however, do not suggest that training the more performant deep learning models from scratch is possible with the available science education datasets. Furthermore, it is not clear from these studies to what extent pretrained language models could be used to extract patterns in the datasets.

Prior research on pattern extraction from unstructured data with simpler unsupervised ML models and larger datasets in education and science education contexts focused on standardized documents such as dissertation or conference abstracts. Munoz-Najar Galvez et al. ( 2020 ) established a data-driven way to systematically analyze the field of education research. They identified paradigm shifts in education research on the basis of 137,024 dissertation abstracts, reconstructing a shift from an outcome-oriented paradigm to an interpretative paradigm. In science education research, Odden et al. ( 2020 ) used latent Dirichlet allocation (LDA), a generative probabilistic topic model, to analyze all papers that were extracted from the Physics Education Research Conference Proceedings from 2001 to 2018 (overall 1,302 papers). They outline shifts in the paper’s topics in the conference over time. Despite the potentials of LDA to summarize occurring research topics and trends over time, the authors recognize some shortcomings with this algorithm. For example, the LDA model groups together segments that use similar vocabulary. However, the segments might differ in meaning anyways (see also: Odden et al., 2021 ). Other researchers could show that simpler unsupervised ML methods could also be used to explore patterns in comparably smaller datasets in science education. Sherin ( 2013 ) used a vector space model and a hierarchical agglomerative clustering algorithm to identify students’ science explanations in interview transcripts. He showed the general applicability of these NLP-based methods in this context, but contends that the algorithms could not account for word ordering effects. He also mentions the desire to more systematically extract the number of topics that are likely present in the data (see also: Xing et al., 2020 ). Also Rosenberg and Krist ( 2020 ) successfully applied an unsupervised clustering algorithm to assess students’ considerations of generality in science (see also: Xing et al., 2020 ; Zehner et al., 2016 ).

A domain of research in science education where NLP and ML in unsupervised contexts has not yet been applied widely is university-based science teacher education. In fact, no reviewed study in Zhai et al. ( 2020 ) engaged in university-based educational research. Besides supervised ML approaches in university-based science teacher education that have been occasionally applied (Wulff et al., 2020 ), unsupervised approaches could facilitate researchers and instructors novel insights into relevant constructs because they can explore patterns in unstructured data (Halevy et al., 2009 ; Hao, 2019 ).

Science Teachers’ Noticing of Classroom Events

Teachers face the challenge to professionally act in uncertain situations (Clifton & Roberts, 1993 ; von Aufschnaiter et al., 2019 ; Chan et al., 2021 ). Learning to professionally act in uncertain situations requires teachers to develop the capacity to reflect on their teaching experiences (Korthagen, 1999 ). An important part of reflective competencies are noticing skills that relate to perceptual and cognitive thinking processes (Chan et al., 2021 ). In particular, noticing comprises observation, interpretation, and reasoning about learning-relevant events in classrooms (Sherin & van Es, 2009 ; van Es & Sherin, 2002a ; Chan et al., 2021 ; Furtak, 2012 ). Van Es & Sherin ( 2002 ) define noticing with regard to three key aspects: “(a) identifying what is important or noteworthy about a classroom situation; (b) making connections between the specifics of classroom interactions and the broader principles of teaching and learning they represent; and (c) using what one knows about the context to reason about classroom interactions.” (p. 573) Noticing research has documented the difficulties that novice and even expert teachers have to direct their attention and notice relevant classroom events (Sherin & Han, 2004 ; Chan et al., 2021 ; Talanquer et al., 2015 ; Levin et al., 2009 ; Roth et al., 2011 ). For example, novice science and mathematics teachers struggle to attend to student thinking and the substance of what they are saying (Sherin & Han, 2004 ; Hammer & van Zee, 2006 ), and tend to strive for quick and conclusive inferences that are right or wrong, rather than tentative interpretations (Crespo, 2000 ). This strand of research also showed that science teachers provide more general evaluations as compared to more specific accounts of student understanding (Hammer & van Zee, 2006 ). Mathematics and science education scholars generally highlighted the complexity of the noticing construct (Chan et al., 2021 ; Talanquer et al., 2015 ). Talanquer et al. ( 2015 ) summarize the noticing foci of teachers as: “the object of noticing (e.g., student actions, student thinking), the noticing stance (e.g., evaluative, interpretive), the specificity of noticing (e.g., specific student, whole class), and the noticing focus (e.g., specific concept, general topic)” (p. 587). To design authentic learning opportunities for mathematics and science teachers to enhance noticing skills, valid, reliable, and scalable assessment of attention to classroom events and noticing is necessary.

To assess noticing and attention to classroom events, science education researchers increasingly embraced constructed response items, e.g., open-ended, free-recall written responses (Barth-Cohen et al., 2018 ; Luna et al., 2018 ; Talanquer et al., 2015 ; Chan et al., 2021 ). Open response items have been argued to allow a more authentic examination of teachers’ professional competencies as compared to more closed-form questions (Nehm et al., 2012 ; Zhai, 2021 ). Many of the noticing research then seeks to analyze inductively what teachers are noticing (Chan et al., 2021 ). However, the mere linguistic complexity of the constructed responses (noisy, ambiguous, and unsegmented) and the complexity of the noticing construct make it challenging to integrate all information in the responses and infer the noticing skills. From their review on teacher noticing research in science education, Chan et al. ( 2021 ) conclude that “methodological trade-offs between different ways of investigating teacher noticing need to be better explored” (p. 37). We suggest that ML-based methods can provide novel means to analyze teachers’ responses inductively “to understand what teachers notice” (Chan et al., 2021 , p.34). Thus, ML methods potentially help researchers to gather ‘knowledge of teachers’ (Fenstermacher, 1994 ). We also concur with Lamb et al. ( 2021 ) that ML models are powerful tools to advance algorithmic understanding of relevant underlying cognitive processes that can explain the process and products of writing. Zhai et al. ( 2020 ) argued that ML models can particularly advance understanding and assessment of complex constructs such as noticing and provide means to automate assessment and feedback. Consequently, this study examines potentials and challenges of an ML-based clustering approach when applied in the context of assessing noticing of classroom events for preservice science teachers.

Research Questions

Noticing or directing attention to relevant classroom events is highly relevant for mathematics and science teachers and, thus, plays an important role in science education research. Attention to classroom events and contents played a particularly important role in mathematics and science education research. Star and Strickland ( 2008 ) suggested that noticing research should focus particularly on what catches teachers’ attention and what is missed. 25 of the 26 science education studies reviewed by Chan et al. ( 2021 ) considered attention to classroom events as an essential aspect of noticing; 11 studies even restricted noticing to attention. Attention to classroom events has often been studied through video clips that present teachers with a standardized teaching situation and are typically followed by some form of eliciting teachers’ observations (Zhai, 2021 ; van Es & Sherin, 2002a ; Seidel & Stürmer, 2014 ; Putnam & Borko, 2000 ; Darling-Hammond, 2000 ; Kleinknecht & Gröschner, 2016 ; Sherin & van Es, 2009 ).

Noticing research can be characterized as a context where it seems to be notoriously difficult to recruit large sample sizes, rendering quantitative research methods difficult to apply. Reviews suggest that studies typically comprise small samples of up to 241 teachers (Wilson et al., 2019 ; Chan et al., 2021 ). This restricts researchers to using mostly qualitative methods with some form of content analysis (Wilson et al., 2019 ; Chan et al., 2021 ; Talanquer et al., 2015 ). As such, it is important to examine to what extent ML-based approaches can be utilized in this context as a means to advance quantifiable hypotheses. Particularly, pretrained language models can improve the ML methods to be more robust with small samples. Hence, we ask the following overarching research question: To what extent and in what ways can a pretrained language model-based clustering approach extract meaningful patterns in preservice physics teachers’ written descriptions of a teaching situation?

In the context of RQ1, we analyze the validity of the extracted clusters:

RQ1: To what extend can a pretrained language model-based clustering approach extract interpretable (RQ1a), specific (RQ1b), and robust (RQ1c) clusters in the preservice physics teachers’ written descriptions of a teaching situation?

We then examined ways in which these clusters provide insights into the composition of the written descriptions. van Es and Sherin ( 2002 ) used the concept of analytical chunks in their noticing research, referring to experts’ tendency to organize their essays more coherently in reference to teaching and learning principles. Based on this concept of analytical chunks, we hypothesize that the analysis of interconnections between the clusters in the teachers’ written descriptions provides tools to develop a more quantitative understanding of chunks in the writing. To analyze the organization of the teachers’ written descriptions based on the extracted clusters, we explored dependencies among clusters:

RQ2: What kinds of dependencies with respect to textual organization can be analyzed based on the extracted clusters?

Written Descriptions of a Video-Recorded Teaching Situation in Physics

In the present study preservice physics teachers’ were given the instruction to describe, evaluate and reason about a video-recorded lesson which presented the teachers an authentic teaching situation in a 9th grade physics classroom held by an in-service physics teacher. Overall, the teaching goal of the observed lesson was to introduce influencing factors on the movement of falling objects and the definition of free fall. Table 2 outlines the chronological order of events in the teaching situation. The teaching situation can be broadly divided into two phases. In the first phase, the teacher performed several experiments with falling objects (two masses, and a vacuum tube with screw and feather). The students posed hypotheses on the outcome of the experiments (e.g., which of the two masses of different weight will hit the floor first. In the second phase, the teacher provided the definition of free fall and students devised experiments to investigate what type of movement free fall is. This video-recorded teaching situation was chosen because it presents preservice physics teachers a complex and authentic teaching situation where many different noticing-relevant general and subject-specific issues could be identified. Teachers could describe mere surface-level, general issues such as that the students were noisy at several occasions, or more deep-level, subject-specific issues such as that several students raised concerns with the experimental setup (e.g., missing control of variables) or conceptual difficulties (e.g., whether an ever-accelerating object reaches the speed of light). Following the classification rubric for noticing research in science education by Chan et al. ( 2021 ), our approach was meant to characterize teacher noticing (purpose) as assessed through observation of other teachers’ teaching (teaching context), where the observing teachers could not control what happened (role of teacher) and the noticing-relevant events were pre-determined (what to notice) and selected by the researchers (selection of probes) with open-ended prompts (nature of prompt) and divergent answers without correct answer (type of teacher responses).

The video is about 17 minutes long. The preservice physics teachers were allowed to watch the video only once, without rewinding the recording, in order to simulate in-the-moment pressures of decision-making (Chan et al., 2021 ). It was an authentic lesson that was recorded in a German grade 9 high school physics classroom as part of a post-university physics teacher preparation program. In Germany, after the university-based teacher training teacher trainees are required to pass a one- to two-year program, run by federal states, that will approve if they are finally allowed to teach in public schools. Using a recorded lesson from this post-university teacher preparation program presents a lesson that is proximal to what the preservice teachers will do in their future careers. Overall, N =75 preservice physics teachers participated in the study who produced 86 written descriptions (sometimes preservice teachers produced two texts, pre and post to a seminar). The teachers varied in their teaching experience and came from three different universities throughout Germany (see Table 1 ). Preservice teachers spent approximately one hour on the entire questionnaire of the online video-vignette. The text production took approximately 20 minutes (independently of another 17 minutes video observation and another 20 minutes answering further questions). Preservice physics teachers were instructed to first describe what happened in the teaching situation. Afterwards, they should evaluate the situation, devise alternative modes of action, and formulate consequences for their own teaching.

Given that preservice physics teachers either described, evaluated, and reasoned about the observed teaching situation, the sentences that count as descriptions were extracted through an ML-based classifier. The ML-based classifier automatically retrieved descriptive sentences based on a classification algorithm that was described elsewhere (Wulff et al., 2022 ). This classifier annotated each sentence with one of the following labels: “circumstances”, “description”, “evaluation”, “alternatives”, and “consequences.” Using sentences as the segmentation units was found to be a reasonable strategy in similar contexts of writing analytics (Ullmann, 2019 ). The descriptive sentences were further filtered to a length greater than four words to remove headlines and similar non-informative sentences. 98% of sentences of the original descriptive sentences remained (1537 sentences in total). The preservice teachers wrote on average 16.0 ( SD  = 7.9, min: 4, max: 59) words in a descriptive sentence. In descriptive sentences, the preservice teachers wrote in various ways about the events in the lesson as outlined in Table 2 . A randomly drawn sentence from a preservice physics teacher reads as follows: “The observations [from the students] and differences [to the hypotheses] were collected and summarized by the teacher as free falling movement is independent of the mass.” This sentence and all words and sentences in the following were translated from German to English by the authors who are familiar with English language, in particular specialized vocabulary in physics. Some intricacies emerged with the translations. For example, German language has many specific abbreviations in educational contexts, e.g., “SuS” (“Schülerinnen und Schüler”) for female and male students or “LK” (“Lehrkraft”) as an inclusive word for teacher that have no equivalent in English. We tried to highlight those issues when they occur. Furthermore, German language is well known for its compound nouns that can become very long (e.g., “Fallröhrendemonstrationsexperiment”, which can be translated to “demonstration experiment with drop tube”). In German, compound nouns may count as one word in the vocabulary, whereas in English many different words would be added. Consequently, the German vocabulary in terms of distinct words is larger compared to the English vocabulary. Footnote 1

Clustering Sentences of the Written Descriptions

ML methods that extract patterns in unstructured data such as the constructed responses are categorized as unsupervised ML. Unsupervised ML typically include some form of dimensionality reduction and clustering oftentimes with the purpose to make high-dimensional data human-interpretable. Clustering approaches that were not based on pretrained language models enabled science education scholars to identify emergent topics in conferences or students’ writing (Odden et al., 2020 ; Sherin, 2013 ), however, they also oftentimes require involved preprocessing of the data (Angelov, 2020 ; Odden et al., 2020 ; Zehner et al., 2016 ). Most often, researchers needed to remove frequent words (stopwords), lower-case all words (which might be disadvantageous in German where upper-case letters can differentiate word senses), or transform words into their base form to reduce vocabulary size (Odden et al., 2020 ; Rosenberg & Krist, 2020 ). Furthermore, researchers noted the difficulty in determining the number of clusters that should be extracted in these approaches (Sherin, 2013 ) and these approaches oftentimes assume that word order in the sentences is irrelevant (bag-of-words assumption). Finally, these approaches are ignorant of ambiguous word senses. No prior information on the words is incorporated in these approaches such that the word “bank” in the phrases “river bank” and “bank robbery” might be treated as the same word even though the meaning differs substantially. Recently, however, advances in NLP and ML research provided pretrained language models that provide contextualized embeddings for language data that help to cope with some of the aforementioned challenges. These contextualized embeddings potentially enable researchers to model constructed responses in a more language-sensitive way that is able to preserve word ordering and word sense disambiguation as features.

Pretrained language models can generate contextualized embeddings for language input that enhances modeling of the language data (Mikolov et al., 2013 ; Sherin, 2013 ; Taher Pilehvar & Camacho-Collados, 2020 ). Essentially, words are mapped to a position in high-dimensional vector space, called a distributed representation in the form of embeddings (Taher Pilehvar & Camacho-Collados, 2020 ). Vector space models thus encode word similarity and efficiently represent words. Given the claim that one understands a word by the company it keeps (Jurafsky & Martin, 2014 ), word embeddings can be learned through ML approaches, where model weights are optimized with the goal that a word embedding for a given word predicts the context words (Mikolov et al., 2013 ). More advanced approaches utilize pretrained language models that result in embeddings that also account for the context (contextualized embeddings) and the position in a segment that a word occurs in (Taher Pilehvar & Camacho-Collados, 2020 ). Pretrained language models are typically trained on large unstructured datasets (e.g., the Internet, Wikipedia). Training tasks involve prediction of context words (Devlin et al., 2018 ). For practical purposes the vocabulary is often restricted to some 30,000 tokens, where unknown words can be built from the 30,000 tokens. Linguists have estimated that 30,000 words are sufficient to understand many general English texts well (Mitchell, 2020 ). If a sentence is input into a pretrained language model, typically embeddings for each word in the sentence (given the position and context words) is the output. To generate a contextualized embedding for a sentence, the word embeddings can be pooled.

As an illustrative example for sentence embeddings based on pretrained language models, the following physics-related and general sentences should be considered (some noise data points were added which will be motivated later on): ’Earth exerts a force’, ’The force acts on’, ’The force on earth’, ’We force her’, ’They force him’, ’How to force him’, ’Grass is green’, ’The sunset can be red’, ’Green is grass’ (called Segment 1 to 9 respectively). Force in the first three sentences relates to the physics meaning (given as a noun). In the following three “force” is included as a verb that encapsulates a certain kind of rather aggressive behavior. The final three sentences are included as sentences that are entirely different in meaning. “Force” in the former sentences has a different word sense compared to the sentences 4 to 6 and should be distinguished in a clustering approach. In Fig. 1 (a) a two-dimensional representation of the sentence embeddings gleaned from a pretrained language model is depicted. As can be seen from the separation of datapoints in space, pretrained language model’s word embeddings can in fact disentangle the senses to a certain degree. To further inspect the embedding space, a clustering approach can now determine which sentences are likely related to each other (Angelov, 2020 ).

figure 1

a  Two-dimensional representation of the example segments and noise. b  Surface plot of probability density of the data points. c  Minimal spanning tree with data points as nodes (colors indicate the mutual reachability distance). d  Dendrogram of clusters for varying density values (colored circles indicate clusters)

Extracting clusters from contextualized embeddings can be done with Hierarchical density-based spatial clustering of applications with noise (HDBSCAN) (Campello et al., 2013 ). HDBSCAN is a way to calculate the number of dense volumes (i.e., clusters) in the embedding space. Density-based clustering methods consider the probability density of a collection of data points (Kriegel et al., 2011 ). In Fig. 1 (b) the probability density distribution for the data points in Fig. 1 (a) is depicted. To extract clusters, an imaginary water level can be introduced into the probability space. The water level represents a threshold for cluster extraction. Emerging islands, i.e., regions above the water level, represent clusters. If water level rises, less probability mass lies above the water level, and thus fewer clusters are extracted. A suitable water level has to be chosen in order to extract an appropriate amount of clusters.

To perform the actual clustering the nearest neighbors for each data point will be determined and the closest distance between nearest neighbors will be highlighted as edges in a graph, i.e., the minimal spanning tree (see Fig. 1 (c)). A threshold parameter (i.e., the minimal distance) is then varied where edges that surpass the threshold are removed from the graph. Finally, the minimal spanning tree is mapped into a condensed tree representation (see Fig. 1 (d)). The condensed tree depicts the number of data points in a cluster (width of the branches) with varying densities ( \(\lambda\) ). A way to extract clusters from the condensed tree is by defining a minimal cluster size and examining the stability of the branches over different density values (moving up and down in Fig. 1 (d)). It is desirable to have clusters that persist over varying density-levels. The stability of a cluster basically relates to the regions of maximum area in the condensed tree Kriegel et al. ( 2011 ), Campello et al. ( 2013 ). The algorithm thus determines a number of clusters by examining properties of the clusters. From the illustrative example, the resulting clusters based on this clustering approach (HDBSCAN combined with pretrained language models) are depicted as blue, orange, and green ovoids in Fig. 1 (d). The red-shaped ovoid cluster could be considered as noise, given the instability over density values in Fig. 1 (d). If the sentence embedding points in Fig. 1 (a) were to be colored, the closely aligned sentences would in fact be colored with the same colors, respectively.

Analysis Procedures

Interpretability of clusters (rq1a).

In order to evaluate if the pretrained language model-based clustering approach Footnote 2 outputs represent interpretable clusters, the most representative words for each cluster were considered, and a definition was derived. Visual inspection of the two-dimensional embedding space and the condensed tree representation helped to determine similarities and differences of the clusters. If the five most representative words could be mapped to distinct sections in the observed teaching situation (see Table 2 ) and were coherent, then we considered this as evidence of a meaningful cluster, because clusters were anticipated to attend to localizable events (e.g., experiments) or actions (e.g., devising hypotheses). We also assessed to what extent the clusters related to physics ideas that were implicitly or explicitly relevant in the observed teaching situation, and what ideas or events were not clustered.

Specificity of Clusters (RQ1b)

Then it was evaluated to what extent physics-savvy human raters could use the extracted clusters to manually annotate the video-recorded teaching situation. If human raters struggled to annotate a certain cluster in the video recording, this would provide evidence of unspecific focus of a cluster. To annotate the teaching situation on the basis of the extracted clusters, three independent raters with physics background (one postdoc, two PhD students) who were familiar with the observed teaching situation annotated the entire video sequence based on 10 second intervals. All the information they received were the five most representative words for the respective clusters (coding 1) with no further instruction. In a second iteration (coding 2), the human raters discussed and agreed on some coding rules, e.g., that the entire process of an experiment should be annotated if relevant words of a cluster occurred only at the beginning. To evaluate the reliability of this annotation, we first examined a graphical representation of the annotations over time to evaluate interrater agreement. We considered each cluster separately. To evaluate interrater agreement, Krippendorff’s \(\alpha\) for each cluster was calculated because Krippendorff’s \(\alpha\) is more appropriate than Cohen’s \(\kappa\) for three raters. A Krippendorff’s \(\alpha\) value of 1 refers to perfect reliability and a value of 0 to absence of reliability. Values between .667 and .800 are usually considered to allow researchers to draw tentative conclusions, i.e., consider the agreements as non-random (Krippendorff, 2004 ).

Robustness of Clusters (RQ1c)

To analyze robustness of clusters, the clustering approach was applied to smaller subsets of the dataset. To test if small sample sizes are enough, subsets of N =43 randomly chosen pre-service teachers and N  = 8 randomly chosen pre-service teachers were considered. The extent to which similar clusters emerge was examined. If meaningful clusters could be identified in these subsets, then we considered the algorithm robust with sample size variations which could be beneficial for science education researchers who oftentimes only have small samples at their disposal. Furthermore, we compared the outputs of the pretrained language model-based clustering algorithms with a clustering approach that was not based on pretrained language models, but was successfully applied in a science education research context before. We therefore adopted the topic modeling approach outlined by Sherin ( 2013 ). He devised an accessible approach for extracting clusters in interview transcripts. He started by segmenting texts into chunks of 100 words (with overlap). Afterwards, a normalized term-document matrix was formed. To circumvent the problem of similar topics (low levels of variability in the data), deviation vectors were calculated. Based on the deviation vectors, hierarchical agglomerative clustering yielded a distribution of topics, depending on the number of topics. Finally, the ten most representative words were found as the highest ranking words in the centroid vectors for the respective topics. With parameter values adapted to our research context, we extracted clusters from our descriptions based on this approach. Based on the comparison from the ten most representative words for each topic, we evaluate to what extent both clustering approaches yield similar topics. This would yield evidence that the pretrained language model-based approach could also be successfully employed in science education research contexts.

Advanced Textual Analytics Based on the Clusters (RQ2)

The applicability of the pretrained language model-based clustering for analytics of the constructed responses was evaluated through exploratory analysis of the textual organization of the constructed responses. Based on episodic memory theory it can be expected that the preservice teachers provide a chronologically ordered text organization. Hence, the temporal progression of the clusters within the teachers’ written descriptions was analyzed. To depict the temporal progression of the clusters within the written descriptions, the sentences were mapped to their relative position in reference to the other descriptive sentences for each teacher (see similar analysis in: Sherin, 2013 ). Mapping the sentences to their relative position was supposed to produce certain peaks where clusters are most prevalent in the descriptions. For example, it could be expected that mentioning the introduction with hedgehog and hare or the teacher experiments precedes other clusters such as the discussion of the type of movement, because these descriptions appeared first in the observed teaching situation and teachers are expected to describe the teaching situation chronologically. Distinctiveness in temporal progression would indicate that the extracted clusters in fact captured different aspects of the teaching situation. To further analyze textual organization, we employed a network-analytical approach to calculate the centrality of different clusters and a vector-field approach where the movements through cluster space can be characterized. In both approaches we will evaluate to what extent the respective empirical distributions, i.e., the directed network of clusters and the vector-field representation, are better captured by random processes or more deterministic processes. If teacher’s written descriptions can be characterized by more deterministic processes, we can conclude that the presented clustering approach can yield insights into textual organization.

Validity of the Clustering Approach (RQ1)

Interpretability of the extracted clusters (rq1a).

To evaluate the interpretability of the extracted clusters, contextualized embeddings of the preservice physics teachers’ descriptive sentences were generated with the pretrained language models and clusters were extracted with the HDBSCAN algorithm. This approach yielded a number of 14 clusters and a noise cluster (cluster -1). The absolute sizes (# of sentences in a cluster) are depicted in Table 3 . We also provided a definition of the clusters based on the most representative words for each cluster, and we determined how many sentences per written description on average were categorized into each cluster (see Table 3 ). The largest share of sentences was coded as -1. Footnote 3 The graphical representation of the embedding space with clusters highlighted in colors can be seen in Fig. 2 . The embedding space can be fundamentally separated into two overarching groups (indicated by the black line): (1) clusters that relate to physics-related events or topics that occurred during the teaching situation and (2) clusters that encapsulate general actions, and specific, non-subject-related events. In group 1, cluster 2 thematizes the central experiment of the lesson where a feather and screw are observed falling in a vacuum tube. Cluster 2 had the second largest share of sentences in the descriptions (see Table 1 ). Relatedly, cluster 10 likely represents the students’ hypotheses that the screw has a higher weight, whereas the feather has a high air resistance. Clusters 0 and 1 represent the other experiment, in which two mass pieces (equal shape, different mass) are dropped simultaneously to deduce that free fall is independent of mass. Clusters 8 and 9 refer to the teacher’s question about which type of movement a free fall is and how this type of movement can be experimentally determined.

On the other hand, in group 2, clusters 6 and 7 represent teachers’ and students’ actions of summarizing and posing hypotheses/claims respectively. Given the similarity of clusters 6 and 7, they were also close in embedding space. Cluster 6 was related to posing hypotheses by the students, whereas cluster 5 was related to the process of summing up the hypotheses by the teacher. In fact, this was a recurrent thread in the lesson: the teacher asked the students to hypothesize about the results in advance of an experiment which is why the cluster was coded at several points. Cluster 3 also refers to the teachers’ responding to students’ answers. Cluster 4 represents the students’ action of raising arms and responding to the teachers’ questions. Cluster 13 captured the beginning of the lesson where the teacher reminds the students of the former lesson regarding the race between hedgehog and hare. Finally, cluster 12 referred to the instruction by the teacher that the students may copy the definition of free fall into their folders.

In sum, the clusters encapsulate both short and rather specific events in the teaching situation (e.g., writing the definition of free fall in the folder) and more abstract ideas such as summarizing hypotheses which occurred more than once in the scene. They also include more general clusters (summarizing students’ hypotheses, e.g., cluster 5) and more physics-related contents (characterization of the type of movement, e.g., clusters 8 and 9). Preservice physics teachers wrote on average 3.4 sentences on cluster 2, which comprised the largest share (after the noise cluster), followed by cluster 1 and 11 with 1.8 sentences on average. Thus, physics-specific clusters were more extensively included in the written descriptions. However, the overall low average counts of one sentence for a cluster could indicate that oftentimes the preservice teachers only briefly elaborated on an event. It is also noteworthy that some important events in the teaching situations are not captured in a cluster. During the lesson the students asked for example: “Why is it called free fall for a parachute jumper?”, “Would an infinitely accelerating mass surpass the speed of light?”, or “Would two plates, one made of cardboard the other made of metal, actually arrive on the floor at the same time?”

figure 2

Two-dimensional representation of clusters. A point represents the projection of a sentence embedding into the two dimensions. Colors represent belonging to a cluster. Gray points represent “noise”, i.e., not belonging to any cluster. Larger points indicate cluster centroids

Specificity of the Extracted Cluster (RQ1b)

To examine to what extent the extracted clusters map to discernible events and topics in the teaching situation, human raters used the clusters as represented through the most informative words to annotate the video recording of the teaching situation (RQ3). Figure 5 depicts all codings from three independent annotators separated by cluster over time. To estimate human interrater agreement, we calculated the Krippendorff \(\alpha\) values for the clusters. After the first round of rating the video-recorded teaching situation (coding 1), the Krippendorff \(\alpha\) ’s indicate that some clusters (e.g., 0, 1, 2, 8, 12, and 13) could be identified with good reliability given only the five most representative words and no annotator training. Cluster 12 related to the introduction of the definition of free fall by the teacher. This, apparently, was a localizable event in the teaching situation. Cluster 0 related to the experiment with two masses (similar reasoning for cluster 1). The teacher used two masses only once as an experiment, hence, this formed a recognizable event for the human raters. Cluster 13 related to the very beginning of the lesson. The words “hedgehog” and “hare” are unique for this event. The human annotators reached poor reliability on clusters with more general words (e.g., 3, 7, and 11). The words “respond”, “feedback”, “summarize”, and “teacher” could be applied to many different events in the teaching situation. They represent high-inferential categories, because the teacher and students did not specifically say that they “responded” or “summarized” ideas.

After coding 1, the three annotators made their coding rules more explicit and discussed them. On this basis, the video-recording of the teaching situation was annotated again by all three annotators (coding 2). Some improvements could be seen after the discussion. Most notably, clusters 1, 2, 4, and 9 substantially improved in interrater agreement (see Table 4 ). Cluster 9 made the most substantial improvement. This cluster related to the measurement and determination of the type of movement. The raters agreed to include all student suggestions at the ending of the teaching situation because this represented a coherent phase, which caused the improvements in agreement. However, other clusters (3, 6, 7, 10, 11) seemed to remain too vague to be annotated based on the five most representative words.

Robustness of the Extracted Cluster (RQ1c)

To evaluate the robustness of the extracted clusters, we probed to what extent the clustering algorithm would still yield interpretable and comparable clusters for smaller sample sizes. The baseline for comparison formed the extracted clusters based on the entire dataset (see Fig. 2 ). As sample sizes in noticing research in science education are typically smaller, subsets of N =43 and N =8 were drawn. The entire clustering approach was performed for these subsets of the data. The resulting cluster embeddings and condensed trees can be seen in Fig.  4 . We particularly mapped the extracted clusters based on the top five words to the baseline clusters as extracted with the entire dataset. It is noteworthy that the spatial outline and the actual extracted clusters can be mapped well onto each other. This is even possible for a sample size of only N =8 teachers. The two overarching groups (general and physics-specific) could be identified for the subsamples as well. Based on the condensed trees, some similarities in cluster evolution over different density values can be inferred as well. For example, clusters 8 and 9 seem related in all condensed trees as they evolve from a common branch. Both clusters comprise sentences on type of movement which are physics-specific. Interestingly, in Fig. 3 , also clusters 10 and 11 fall on the same branch as 8 and 9. This might be attributed to the fact that in clusters 10 and 11 the influence of air resistance on free fall is considered which is closely related to movement as well. While clusters 0 and 1 are linked in Fig. 3 (both include the vacuum tube experiment), this link does not exist in Fig.  4 . For these clusters, probably the five most representative words are not informative enough to allow for clear distinction. Clusters 4, 5, and 6 relate to the students’ and teachers’ actions of posing hypotheses (see Fig. 3 ). While they neatly evolve from one parent branch in Fig. 3 , only one of the respective clusters was present in the smaller samples. However, they also separate early (at low densities) from the other clusters (see Fig. 4 ).

Further evidence for robustness of the presented clustering approach based on pretrained language models can be gleaned by comparison with a formerly successfully employed clustering approach in science education research that was not based on pretrained language models. To implement a clustering approach based on hierarchical agglomerative clustering, a similar protocol as outlined in Sherin ( 2013 ) was followed. However, we did not segment our texts into 100-word chunks, but rather into the sentences that were used as smallest segments. We considered this useful, because we expected the grain size of our clusters (i.e., discernable events in the teaching situation) to be smaller compared to the grain-size of the clusters in Sherin ( 2013 ), i.e., explanations. Our overall vocabulary was 2,786 unique words in German language. 232 stopwords were removed. This enabled us to calculate deviation vectors and apply clustering. A number of 14 clusters were found to be reasonable for our data (see Supplementary Material for detailed Table).

figure 3

Condensed tree representation of the extracted clusters

figure 4

Scatter plots and condensed trees for cluster evaluation of smaller samples ( N  = 8 and N  = 43 teachers)

Table 5 depicts the resulting clusters with the most representative words for each cluster vis-á-vis the clusters from the pretrained language model-based clustering approach. Most of the resulting clusters can be mapped to the clusters that were extracted based on the pretrained language model-based clustering approach. Cluster 0S Footnote 4 thematizes students’ formulating hypotheses and summarization by the teacher. This relates to clusters 3, 5, and 7. Clusters 1S and 2S relate to the vacuum tube experiment, where cluster 1S focusses on the execution and cluster 2S on the observation and results. This maps to cluster 2. Cluster 3S relates to the dependency of air resistance and fall velocity, and possibly relates to clusters 10 and 11. Cluster 4S is not entirely clear, and cluster 5S deals with the teacher repeating the experiment, which has no apparent equivalent cluster. Cluster 6S focusses on students’ raising their arms and responding, which could be mapped to cluster 4. Cluster 7S relates to the writing down of the definition of free fall, which can be linked to cluster 12. Cluster 8S likely mixes the response of one female student and the remark of another male student, to what extent the speed of light would be reached by a falling object. No apparent link can be made to the pretrained language model-based clusters. Cluster 9S relates to the experiment with two masses that would most likely map to clusters 0 and 1. Cluster 10S addresses the transition from introduction of the experiments with no apparent corresponding cluster. Cluster 11S, again, deals with the experiment with two masses and links to clusters 0 and 1. Cluster 12S addresses a students’ answer to the question about what kind of movement the free fall is. The closest resemblance is with cluster 8. Finally, cluster 13S addresses the vacuum tube experiment, in particular the repetition of the same. No apparent equivalent exists in the pretrained language model-based clustering approach. Finally, we calculated the proportion of sentences in each cluster from the approach by Sherin ( 2013 ) that were classified as noise in the pretrained language model-based clustering approach. The respective proportions for each cluster were: 0.46 (0S), 0.28 (1S), 0.45 (2S), 0.40 (3S), 0.60 (4S), 0.60 (5S), 0.48 (6S), 0.38 (7S), 0.62 (8S), 0.35 (9S), 0.61 (10S), 0.32 (11S), 0.34 (12S), and 0.09 (13S). Clusters 4S, 5S, 8S, and 10S had a particularly large shares of noise-clustered sentences. Interestingly, these clusters could not be easily mapped to the clusters from the pretrained language model-based clustering approach (however, cluster 13S with a particularly low proportion could also not be assigned). They also consistently included generic words (e.g., teacher or students), which were attributed with the noise cluster in the pretrained language model-based clustering approach Fig. 5 .

figure 5

Codings of video sequence (coding 2) with identified clusters based for three independent raters

Exploring Textual Organization with the Extracted Clusters (RQ2)

To evaluate to what extent the extracted clusters provide quantifiable information on the textual organization of the written descriptions, we first plot the occurrence of clusters throughout the written descriptions, examine the non-random organization of the clusters, and examine properties of the cluster embeddings. Occurrence of clusters throughout the written descriptions is depicted in Fig. 6 . The vertical bars indicate the textual position for the respective maximum occurrence of a certain cluster. The textual positions of the maxima are equally distributed throughout the written descriptions, so that all parts of the written descriptions are attributed with a cluster. Furthermore, the cluster occur at expected positions, given the events in the teaching situation. For example, cluster 13 addressed the beginning of the lesson and it occurred most frequently at the very beginning of the written descriptions (see Fig. 6 ). In the observed teaching situation, three experiments were carried out one after the other: Free fall of a screw and a spring (cluster 10), free fall of two masses of the same size but different weights (cluster 1) and, finally, free fall in a vacuum tube (cluster 2). Cluster 10 appeared at the beginning of the texts. Cluster 1, in contrast, appeared somewhat later, which maps to the temporal sequence of events in the observed teaching situation, since both experiments that were referenced in these clusters were carried out shortly after each other in the first half of the video. Cluster 2 was addressed frequently and extensively throughout the descriptions. In fact, cluster 2 relates to the most noteworthy experiment (vacuum tube) in the entire teaching situation, which might explain the preponderance in the written descriptions.

A problem (cluster 0) occurred in the second experiment (cluster 1). The shapes of the curves for cluster 0 and 1 match well (as it is also evident in Fig. 3 ). Before the first experiment, the teacher summarized the “main hypotheses”; the corresponding cluster 5 for this event also occurred chronologically at the beginning. The other actions, i.e., the formulation and discussion of hypotheses (clusters 6 and 7), the reaction to pupils’ answers (cluster 3) and the pupils’ answers (cluster 4) occurred throughout the teaching situation, which is reflected in the considerably high frequency throughout the first half of the written descriptions in Fig. 6 . Cluster 11 related to the discussion of the connection between air resistance, mass and fall velocity. This was also related to the experiments seen (observations were described and interpreted; hypotheses regarding the connection were posed and tested). The temporal progression was appropriate, less at the beginning, more towards the middle of the texts. Cluster 12 addressed summarizing the findings of the three experiments. It occurred quite often at the beginning of the descriptions, which does not correspond to the chronological sequence of events. The reason for this could be that some preservice physics teachers began the descriptions with what the goal/result of the sequence was. Otherwise, cluster 12 had its second peak before clusters 8 and 9, which again fits the temporal sequencing of events in the teaching situation. At the end of the sequence, the teacher asked what kind of movement the free fall is. The corresponding clusters were the question itself (cluster 8) and the discussion about it (cluster 9). They occurred most often in the middle of the texts, which corresponded to the end of the written descriptions. The noise cluster (cluster -1) occurred almost equally distributed throughout the written descriptions. The respective counts for each relative position were: 57 (0.0), 71 (0.1), 91 (0.2), 73 (0.3), 79 (0.4), 71 (0.5), 66 (0.6), 68 (0.7), 88 (0.8), 76 (0.9), 20 (1.0). This provides evidence that no particular position in the written descriptions was prone to include more noise sentences compared to other positions. The lower counts at the beginning and end positions resulted from the calculation of the relative position index.

figure 6

Progression of extracted clusters relative to other descriptive sentences in the documents. Top: absolute count of occurrence for a cluster at a given document position. Bottom: relative frequency for a cluster at a given document position. Vertical lines indicate the overall peaks in occurrence for each cluster

To analyze the sequential interdependence of the clusters, directed network graphs were generated based on the incoming and outgoing connections for each cluster (see Fig. 7 ). A connection between clusters was established when one cluster occurred in the preceding or receding sentence of another cluster’s sentence. Edges (i.e., the interconnections between two clusters) in the networks were weighted by the cluster sizes to highlight connections that appeared often irrespective of the cluster size. The edges with the largest values for the connections were labeled with the respective values (see the small numbers on the edges in Fig. 7 (a)). The empirical network graph highlights that certain clusters are central in the network (see Fig. 7 (a)). The greatest importance in the network had clusters -1, 2, 4, 6, and 11. In particular, cluster 2 represents the vacuum tube experiment, and cluster 4 the general cluster that students raise their arms and respond. Hence, both physics-specific and general clusters were highly interconnected in the physics teachers’ written descriptions.

By analyzing interconnections between two nodes, it appears that clusters -1, 2, 9, 3, and 10 were self-referenced particularly often. Except for clusters -1 and 3, these clusters related to physics-specific events such as the vacuum tube experiment, the type of movement, and the weight and air resistance. Moreover, clusters 8 and 9, clusters 0 and 1, and 1 and 6 are interconnected particularly often. The former two connections directly attribute to the close connection of these clusters in meaning. The connection of cluster 1 (experiment with two masses) and cluster 6 (students’ hypotheses) can be explained by the fact that the teacher linked this experiment with posing hypotheses.

Finally, movements of the preservice physics teachers through embedding space by means of addressing specific clusters in their texts should be analyzed with streamline plots (see Fig. 7 (b)-(d)). Streamline plots are vector field representations. We define a connecting vector between two sentences that belong to any of the clusters as a “velocity” vector, indicating the movement through cluster embedding space. The resulting vector field is represented in Fig. 7 (b). A tendency to “move” through cluster embedding space in center direction can be verified, because the streamlines direct toward the center. By comparing Fig. 7 (b) with (c), which represents a vector field where every velocity magnitude and direction were chosen at random, it is evident that Fig. 7 (b) does not represent a random vector field. When positional information is added generate the velocity vector direction (see Fig. 7 (d)), the resulting vector field resembles the empirical vector field. The entropies Footnote 5 for comparing velocities in plots (b) with (c), and (b) with (d) in x - and y -direction, respectively, were .45 and .28, and .03 and .10. This indicates that the vector field in Fig. 7 (d) better approximates the empirical vector field. Thus, the preservice physics teachers do not randomly walk through the cluster embedding space, but rather deliberately compose their texts by attending to the different clusters that were extracted with the pretrained language model-based clustering approach.

figure 7

Directed network graphs of clusters and streamline plots of cluster embeddings: a  Empirical directed network based on the actual connections between clusters present in the written descriptions; b  Streamline plot of actual connections between clusters; c  Streamline plot with randomly distributed directions; d  Streamline plot where directions are sampled from pool of existing connections.

Attention to learning-relevant classroom events and students’ thinking is an important skill for teachers to implement a student-centered pedagogy (van Es & Sherin, 2002b ; Chan et al., 2021 ; Levin et al., 2009 ). However, assessment of teachers’ attention to classroom events is complex, because either the uncertainty of teaching situations is oftentimes related to the inherent complexity of ongoing processes, and describing one’s attention processes is intricately tied to teaching knowledge and other filters (Chan et al., 2021 ). Constructed response formats have been argued to facilitate more authentic assessment of attention processes, and computer-based analytical tools such as ML methods have been found to provide promising means to further our understanding and assessment of complex constructs such as attending to classroom events (Lamb et al., 2021 ; Zhai et al., 2020 ). In this paper we sought to examine potentials and challenges of a pretrained language model-based clustering approach for the purpose of extracting patterns, i.e., clusters, in preservice physics teachers’ written descriptions of an observed teaching situation. We examined the validity of the extracted clusters (RQ1) and explored novel ways in which the clusters enable textual analytics that allow to examine quantitative hypotheses on textual organization (RQ2).

To assess the validity of the extracted clusters, the interpretability (RQ1a), the specificity (RQ1b), and the robustness (RQ1c) of the extracted clusters from the pretrained language model-based clustering approach were evaluated. The clustering approach identified a number of 14 clusters that can be grouped into physics-specific and more general clusters. With regard to the contents of the clusters, all clusters could be related to distinct events in the teaching situation. The clusters encapsulated short, concrete events (recapitulating the last lesson), and more abstract ideas (summarizing hypotheses). We found that more specific, event-related clusters could be reliably coded by the raters. However, the more general clusters (related to posing and summarizing hypotheses) that were applicable to several parts of the teaching situation yielded lower reliability scores, and are thus more inferential. The extracted clusters were also robust to variation in sample size and clustering method. A sample size of only N =8 preservice physics teachers’ written descriptions yielded a similar distribution of clusters. This likely resulted from grounding the clustering with embeddings from the pretrained language model. A further indication of robustness resulted from the comparison with a previously employed clustering approach in science education research (Sherin, 2013 ). We found that many of the extracted clusters from the pretrained language model-based clustering approach mapped to the clusters that resulted from the application of the clustering approach by Sherin ( 2013 ).

Given that the clusters were well interpretable and could be mapped to the teaching situation, we conclude that the algorithm identified meaningful and distinguishable clusters in the preservice teachers’ descriptions. The variety of different foci and abstractness in the extracted clusters is well represented within the different foci of noticing that were summarized by Talanquer et al. ( 2015 ). Moreover, the differentiation of more general clusters and physics-specific clusters resonates with the well-established construct of teachers’ knowledge, in particular the notions of general pedagogical knowledge and content knowledge (Shulman, 1986 ; Carlson et al., 2019 ). The pedagogical content knowledge as an “amalgam of content and pedagogy” (Hume, 2009 ) might be conceptualized as the relevant knowledge to connect the clusters and discuss pedagogical implications of the physics-specific, and more general clusters. The pretrained language model provides the relevant structures to classify sentences along this dimension. The contextualized embeddings from the pretrained language model facilitate science education researchers means to extract robust clusters in their datasets. Furthermore, the pretrained language model-based clustering approach integrates the data preprocessing into the modeling and introduces a novel criterion for cluster extraction (stability of clusters over density variation) that provides the human analyst another important measure of appropriate cluster selection.

The findings in the context of RQ1 also indicate that the preservice physics teachers included very general clusters and a comparably large amount of noise clustered sentences. This observation might relate to the finding that novice teachers tend to include broad and general statements in their observations, merely as placeholders (Mena-Marcos et al., 2013 ). Mena-Marcos et al. ( 2013 ) found that more knowledgeable teachers also include more precise statements in their reflections. Furthermore, the preservice physics teachers tended to include only few sentences on each cluster. This indicates that, on average, not much space is spent to describe an event in detail. This might relate to the finding that novice mathematics and science teachers in particular struggle to attend to the specific contents of what was said (Sherin & Han, 2004 ; Levin et al., 2009 ; Roth et al., 2011 ). Rather than describing the concrete hypotheses that the students uttered, many teachers might abstract from the specific contents and simply note that the students posed hypotheses. Yet, developing noticing skills would require the preservice physics teachers to detail the concrete ideas of the students and teacher in order to make an informed evaluation on the substance of the classroom interactions (Levin et al., 2009 ). However, the unspecific contents might relate to our instructional approach. For example, it should be tested if pre-service teachers can attend to specific events if they can watch the video multiple times and take notes for themselves.

In the context of RQ2 we evaluated to what extent the extracted clusters could be used to assess the textual organization of the written descriptions. The absolute and relative frequency of sentences in certain clusters with regard to their relative position in the written descriptions were analyzed through visual means. We found that the maximum counts for the clusters well matched their expected positions in the teaching situation. This suggests that the preservice physics teachers, on average, compose their written descriptions according to the chronological occurrence of the events in the teaching situation. This finding resonates with episodic memory theory which suggests that free recall of events occurs in temporal order (Conway, 2009 ; Kahana et al., 2008 ). Further evaluation of textual organization of clusters by means of network graphs enabled us to document that certain clusters are cued together more closely as would be expected by chance and cluster size. This means that clusters that were semantically or chronologically related were linked by the preservice physics teachers more often. This relates to the contiguity effect, namely that neighboring items (here: events in a teaching situation) are recalled successively (Kahana et al., 2008 ). Furthermore, streamplot analyses revealed that the preservice physics teachers’ movement through cluster embedding space was non-random and dependent on the position in this space. On a local scale, the position in cluster space thus determines the propensity with which the preservice physics teachers’ move in a certain direction in this space. Analysis of textual organization can extend assessment of analytical chunks as outlined by van Es and Sherin ( 2002 ). van Es and Sherin ( 2002 ) differentiate expertise in noticing in a trajectory where experts include more interconnections among their evidences (here: clusters and interconnections between them in the descriptions). The extracted clusters alongside with the network representation directly would yield a quantification of noted events and thus provide a tool to diagnose expertise levels in noticing.

Limitations

Even though the utilization of a pretrained language model allowed us to integrate data preprocessing into the ML-based modeling, there are assumptions on the pretrained language models that have to be critically examined. For example, the resulting contextualized embeddings are determined by the choice of the pretrained language model and cannot be easily adjusted. Problems with the pretrained embeddings have also been reported. Given that they are trained on the Internet, certain biases related to gender or ethnicity are present in the embeddings (Caliskan et al., 2017 ; Bhardwaj et al., 2020 ). As such, it has to be critically examined to what extent these biases might be propagated into educational assessments which can be disadvantageous.

Another feature of the pretrained language model-based clustering approach was the algorithm-derived extraction of the number of clusters present in the data. Even though the means to extract the clusters based on the stability over density variation might be an additional tool for researchers to use in order to determine a viable number of clusters, there are still many hyperparameters that can be tuned which yield different numbers of clusters. Given the scope of this paper, we did not systematically vary the hyperparameters to find a final number of clusters. We rather sought to establish that the proposed number of clusters was well interpretable in reference to the observed teaching situation. However, the large proportion of noise datapoints also indicates that a large share of the data is not accounted for in the clustering.

With regard to the contents of the clusters, it was noticeable that the clustering approach did not capture some relevant students’ questions from the observed teaching situation into a distinct cluster even though some pre-service teachers included them in their descriptions. Attending to these student questions in the teaching situation required physics knowledge. One student asked whether the different movement of feather and screw (the feather was zig-zagging whereas the screw moved straight to the ground) could explain the differences in falling time. This is a relevant question that hints at the missing control of variables in the experiment. Some preservice physics teachers included this question in their descriptions, however, no separate cluster appeared to capture it. This is a consequence of the instability and scarcity of this observation as represented in the preservice physics teachers’ written descriptions. Omitting contents from the clusters is in fact a goal for unsupervised ML approaches that seek to reduce a complex dataset (Jordan & Mitchell, 2015 ). For the purpose of assessing skills related to attention to classroom events, adjustments in the clustering procedure should be made to allow more clusters to occur, because the identification of this student question demonstrates close attention to student thinking and an understanding of the problematic aspects of the teaching situation and would be considered to correspond to high levels of noticing skills.

Conclusions

Many domains such as physics embraced ML methods to extract information from unstructured data, e.g., to sift through collider data (unfeasible for humans) to detect outliers (i.e., noise-clustered datapoints) with even the same clustering approach that has been applied in this study (Arpaia et al., 2021 ). Given the novel potentials to extract information from unstructured data and the increasing availability of this data, science education researchers should critically examine potentials and challenges of these novel ML-based methods in their research contexts as well. This study could show that a pretrained language model-based clustering approach could be used as an assessment tool to analytically induce what teachers attended to in an observed teaching situation and evaluate the potentials of ML for analyzing open-ended responses. We suggest that the applied pretrained language model-based clustering approach can be enhanced by further fine-tuning the pretrained language model weights to science-specific language. This will enable more involved language analytics such as analogical reasoning or synonym detection (Mikolov et al., 2013 ). It has been shown that pretrained language models capture some knowledge about quantities (e.g., the magnitude of weight of a prototypical dog), or some knowledge graphs about entities (e.g., “Bob Dylan is a songwriter”) (Wang et al., 2020 ; Zhang et al., 2020 ). In fact, representing natural language into vector spaces can enable novel research approaches to answer research questions in science education research (Sherin, 2013 ). Once the pretrained language models are trained and publicly available, advanced analytics of written descriptions will be enabled. The presented clustering approach could be applied as a recommender tool to automatically feedback to the teachers which events and contents they addressed and which they missed to pay attention to.

The pretrained language models enabled an informed contextualized representation through embeddings of the language data. Representation of language data through embeddings will also enable researchers to map language to other modalities such as graphical/visual data or mathematical expressions (see: Krstovski & Blei, 2018 ). Multiple representations and translating between different representations has been considered a constitutive feature for scientific literacy (Brookes & Etkina, 2009 ). However, it will be necessary to develop theoretically grounded ontologies and epistemologies of what preservice science teachers can observe and how they reason about it (Brookes & Etkina, 2009 ). Once pretrained language models are developed and ontologies and epistemologies can guide analyses, the presented clustering approach in conjunction with these models can help to make analyses more comparable, scalable, and robust.

With the help of the clustering approach in this study quantitative hypotheses on text composition could be explored. For example, we suspect that preservice physics teachers include general and specific language statements in their written descriptions, are scarce to describe a particular cluster, and compose their texts in chronological order of the appearance of the events. Writing a sentence that can be classified into a specific cluster, to a certain extent, predisposes the teachers to move through the cluster embedding space in certain directions, and noticing certain events predisposes them to also include temporally related events. These hypotheses need to be more systematically tested, because they can enhance assessment of noticing-related cognitive mechanisms such as careful observation and attention to classroom events. We even wonder to what extent mapping the teachers’ trajectories through the embedding space can be captured by more physics-involved concepts such as movement through a potential where equations of motion and conservation laws determine the teachers’ writing. We are not aware that these hypotheses have been tested. ML-based methods will enable these analyses.

In line with the argument put forth by Singer ( 2019 ), we encourage science education researchers to adopt more observational studies that are grounded in data science, assessment and measurement (Singer, 2019 ). Insights in physics today also come from simulation studies and observational (non-manipulable) experiments. The recent Nobel price of 2021 on complex systems’ behavior or insights in astrophysics are testimony to this. We believe that science education researchers can gain novel insights on studied phenomena through ML-based, computational approaches such as the one presented in this study where an unstructured body of textual data is analyzed. Zhai et al. ( 2020 ) and Lamb et al. ( 2021 ) argued that ML-based computational models can capture the complexity of cognitive processes and “revolutionize” science assessment. We concur with these arguments and emphasize the necessity to develop an understanding in the science education research community for unsupervised ML approaches and pretrained language models in particular, given the preponderance of observational data that is available in educational contexts. Unsupervised ML methods have thus great potentials to bridge the gap between quantitative and qualitative methods in science education. Pretrained language models, more particularly, capture human-like semantics as measured through implicit association tests and thus represent cognitive structures of humans (Caliskan et al., 2017 ). Hence, pretrained language models arguably are most promising candidates to model language-based processes. Given that, in our case, the ML-based approach scaled seamlessly (neither human annotations nor preprocessing of the textual data was necessary to extract clusters) and is publicly available to researchers, it would be desirable to increase efforts to share data and models in order to make the most use of the available resources.

Data Availability

Please send requests to corresponding author.

Code Availability

Researchers who would like to adopt the presented clustering approach with English language data would have to implement the English language model which is readily available (see description of technical implementation in the supplement).

Please find details on the technical implementation in the supplement.

This means that these sentences were not close enough to any of the cluster centroids. Column three of Table 3 indicates that very general words fall into this cluster, such as “SuS” (which is the unisex abbreviation for “students” in German) or “lehrer” (engl.: “teacher”). Seemingly, this cluster encapsulated descriptive sentences that were too general or that might belong to multiple clusters such that they average out.

For comparison purposes all clusters which were extracted from the approach by Sherin ( 2013 ) were appended with an ‘S’, e.g., cluster 0S.

Note that higher entropies indicate higher mismatch of two distributions.

Angelov, D. (2020). Top2Vec: Distributed Representations of Topics. arXiv.

Arpaia, P., Azzopardi, G., Blanc, F., Bregliozzi, G., Buffat, X., Coyle, L., et al. (2021). Machine learning for beam dynamics studies at the CERN Large Hadron Collider. Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, 985, 164652.  https://doi.org/10.1016/j.nima.2020.164652

Baig, M. I., Shuib, L., & Yadegaridehkordi, E. (2020). Big data in education: a state of the art, limitations, and future research directions. International Journal of Educational Technology in Higher Education , 17 (1). https://doi.org/10.1186/s41239-020-00223-0

Barth-Cohen, L. A., Little, A. J., & Abrahamson, D. (2018). Building Reflective Practices in a Pre-service Math and Science Teacher Education Course That Focuses on Qualitative Video Analysis. Journal of Science Teacher Education, 29 (2), 83–101. https://doi.org/10.1080/1046560X.2018.1423837

Article   Google Scholar  

Bhardwaj, R., Majumder, N., & Poria, S. (2020). Investigating Gender Bias in BERT. arXiv.

Breiman, L. (2001). Statistical Modeling: The Two Cultures. Statistical Science, 16 (3), 199–231.

Brookes, D. T., & Etkina, E. (2009). “Force,’’ ontology, and language. Physical Review Special Topics - Physics Education Research, 5 (1), 643. https://doi.org/10.1103/PhysRevSTPER.5.010110

Bruner, J. S. (1985). Child’s talk: Learning to use language . New York, London: W.W. Norton & Company.

Google Scholar  

Caliskan, A., Bryson, J. J., & Narayanan, A. (2017). Semantics derived automatically from language corpora contain human-like biases. Science (New York, NY), 356 (6334), 183–186. https://doi.org/10.1126/science.aal4230

Campello, R. J., Moulavi, D., & Sander, J. (2013). Density-Based Clustering Based on Hierarchical Density Estimates. In J. Pei, V. S. Tseng, L. Cao, H. Motoda, & G. Xu (Eds.), Advances in Knowledge Discovery and Data Mining (pp. 160–172). Heidelberg: Springer, Berlin Heidelberg, Berlin.

Chapter   Google Scholar  

Carlson, J., Daehler, K., Alonzo, A., Barendsen, E., Berry, A., Borowski, A., et al. (2019). The Refined Consensus Model of Pedagogical Content Knowledge. In A. Hume, R. Cooper, & A. Borowski (Eds.), Repositioning Pedagogical Content Knowledge in Teachers’ Professional Knowledge . Singapore: Springer.

Carpenter, D., Geden, M., Rowe, J., Azevedo, R., & Lester, J. (2020). Automated Analysis of Middle School Students’ Written Reflections During Game-Based Learning. In I. I. Bittencourt, M. Cukurova, K. Muldner, R. Luckin, & E. Millán (Eds.), Artificial Intelligence in Education (pp. 67–78). Cham: Springer International Publishing.

Chan, K. K. H., Xu, L., Cooper, R., Berry, A., & van Driel, J. H. (2021). Teacher noticing in science education: do you see what I see? Studies in Science Education, 57 (1), 1–44. https://doi.org/10.1080/03057267.2020.1755803

Clifton, R. A., & Roberts, L. W. (1993). Authority in classrooms . Scarborough, ON: Prentice-Hall.

Conway, M. A.(2009). Episodic memories. Neuropsychologia , 47 (11), 2305–2313. Retrieved from https://www.sciencedirect.com/science/article/pii/S0028393209000645 . https://doi.org/10.1016/j.neuropsychologia.2009.02.003

Crespo, S. (2000). Seeing More Than Right and Wrong Answers: Prospective Teachers’ Interpretations of Students’ Mathematical Work. Journal of Mathematics Teacher Education, 3, 155–181.

Darling-Hammond, L. (2000). Teacher Quality and Student Achievement: A Review of State Policy Evidence. Education Policy Analysis, 8 (1), 1–44.

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2018). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv 1810.04805 .

Fenstermacher, G. (1994). Chapter 1: The Knower and the Known: The Nature of Knowledge in Research on Teaching. Review of Research in Education , 20 .

Furtak, E. M. (2012). Linking a learning progression for natural selection to teachers’ enactment of formative assessment. Journal of Research in Science Teaching, 49 (9), 1181–1210. https://doi.org/10.1002/tea.21054

Goldberg, Y. (2017). Neural Network Methods for Natural Language Processing . Morgan and Claypool: Synthesis Lectures on Human Language Technologies.

Book   Google Scholar  

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning . MIT Press, Cambridge, Massachusetts and London, England. Retrieved from http://www.deeplearningbook.org/

Halevy, A., Norvig, P., & Pereira, F. (2009). The Unreasonable Effectiveness of Data. IEEE Intelligent Systems , pp 8–12.

Hammer, D., & van Zee, E. (2006). Seeing the science in children’s thinking: Case studies of student inquiry in physical science . Portsmouth, NH: Heinemann Educational Books.

Hao, K. (2019). The AI technique that could imbue machines with the ability to reason: Yann LeCun, Facebook’s chief AI scientist, believes unsupervised learning will bring about the next AI revolution: MIT Technology Review.

Hume, A. (2009). Promoting higher levels of reflective writing in student journals. Higher Education Research & Development, 28 (3), 247–260.

Jordan, M. I., & Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science (New York, NY), 349 (6245), 255–260. https://doi.org/10.1126/science.aac4520

Jurafsky, D. (2003). Probabilistic Modeling in Psycholinguistics: Linguistic Comprehension and Production. In J. Hay, R. Bod, & S. Jannedy (Eds.), Probabilistic linguistics (pp. 39–95). Cambridge, MA: MIT Press.

Jurafsky, D., & Martin, J. H. (2014). Speech and language processing (2nd ed.). Pearson Education, Harlow: Always learning.

Kahana, M. J., Howard, M. W., & Polyn, S. M. (2008). Associative Retrieval Processes in Episodic Memory. Psychology , 3 .

Kleinknecht, M., & Gröschner, A. (2016). Fostering preservice teachers’ noticing with structured video feedback: Results of an online- and video-based intervention study. Teaching and Teacher Education, 59, 45–56. https://doi.org/10.1016/j.tate.2016.05.020

Korthagen, F. A. (1999). Linking Reflection and Technical Competence: the logbook as an instrument in teacher education. European Journal of Teacher Education, 22 (2–3), 191–207. https://doi.org/10.1080/0261976899020191

Kriegel, H. P., Kröger, P., Sander, J., & Zimek, A. (2011). Density-based clustering. WIREs Data Mining and Knowledge Discovery, 1 (3), 231–240. https://doi.org/10.1002/widm.30

Krippendorff, K. (2004). Reliability in Content Analysis: Some Common Misconceptions and Recommendations. Human Communication Research, 30 (3), 411–433.

Krstovski, K., & Blei, D. M. (2018). Equation Embeddings. arXiv.

Krüger, D., Parchmann, I., & Schecker, H. (Eds.). (2014). Methoden in der naturwissenschaftsdidaktischen Forschung . Berlin and Heidelberg: Springer Spektrum.

Lamb, R., Hand, B., & Kavner, A. (2021). Computational Modeling of the Effects of the Science Writing Heuristic on Student Critical Thinking in Science Using Machine Learning. Journal of Science Education and Technology, 30 (2), 283–297. https://doi.org/10.1007/s10956-020-09871-3

Levin, D. M., Hammer, D., & Coffey, J. E. (2009). Novice Teachers’ Attention to Student Thinking. Journal of Teacher Education, 60 (2), 142–154. https://doi.org/10.1177/0022487108330245

Luna, M. J., Selmer, S. J., & Rye, J. A. (2018). Teachers’ Noticing of Students’ Thinking in Science Through Classroom Artifacts: In What Ways Are Science and Engineering Practices Evident? Journal of Science Teacher Education , 29 (2), 148–172. https://doi.org/10.1080/1046560X.2018.1427418

Marsland, S. (2015). Machine learning: An algorithmic perspective, second edition edn. Chapman & Hall / CRC machine learning & pattern recognition series, CRC Press, Boca Raton, FL. Retrieved from http://proquest.tech.safaribooksonline.de/9781466583283

Mena-Marcos, J., García-Rodríguez, M. L., & Tillema, H. (2013). Student teacher reflective writing: what does it reveal? European Journal of Teacher Education, 36 (2), 147–163. https://doi.org/10.1080/02619768.2012.713933

Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient Estimation of Word Representations in Vector Space. arXiv (1301.3781v3).

Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed Representations of Words and Phrases and their Compositionality. NIPS, 13, 3111–3119.

Mitchell, M. (2020). Artificial Intelligence: A guide for thinking humans. Pelican Books.

Munoz-Najar Galvez, S., Heiberger, R., & McFarland, D. (2020). Paradigm Wars Revisited: A Cartography of Graduate Research in the Field of Education (1980–2010). American Educational Research Journal, 57 (2), 612–652. https://doi.org/10.3102/0002831219860511

Nehm, R. H., Ha, M., & Mayfield, E. (2012). Transforming Biology Assessment with Machine Learning: Automated Scoring of Written Evolutionary Explanations. Journal of Science Education and Technology, 21 (1), 183–196. https://doi.org/10.1007/s10956-011-9300-9

Odden, T. O. B., Marin, A., & Caballero, M. D. (2020). Thematic analysis of 18 years of physics education research conference proceedings using natural language processing. Physical Review Physics Education Research , 16 (1). https://doi.org/10.1103/PhysRevPhysEducRes.16.010142

Odden, T. O. B., Marin, A., & Rudolph, J. L. (2021). How has Science Education changed over the last 100 years? An analysis using natural language processing. Science Education, 105 (4), 653–680. https://doi.org/10.1002/sce.21623

Putnam, R. T., & Borko, H. (2000). What Do New Views of Knowledge and Thinking Have to Say about Research on Teacher Learning? Educational Researcher, 29 (1), 4–15.

Rauf, I. A. (2021). Physics of Data Science and Machine Learning. CRC Press, Boca Raton, . https://doi.org/10.1201/9781003206743

Rosenberg, J. M., & Krist, C. (2020). Combining Machine Learning and Qualitative Methods to Elaborate Students’ Ideas About the Generality of their Model-Based Explanations. Journal of Science Education and Technology . https://doi.org/10.1007/s10956-020-09862-4

Roth, K. J., Garnier, H. E., Chen, C., Lemmens, M., Schwille, K., & Wickler, N. I. Z. (2011). Videobased lesson analysis: Effective science PD for teacher and student learning. Journal of Research in Science Teaching, 48 (2), 117–148.

Ruder, S. (2019). Neural Transfer Learning for Natural Language Processing: Dissertation . Ireland: National University of Ireland.

Rumelhart, D. E., Hinton, G., & Williams, R. J. (1986). Learning representations by back-propagating errors. Nature, 323, 533–536.

Seidel, T., & Stürmer, K. (2014). Modeling and Measuring the Structure of Professional Vision in Preservice Teachers. American Educational Research Journal, 51 (4), 739–771. https://doi.org/10.3102/0002831214531321

Sherin, B. (2013). A Computational Study of Commonsense Science: An Exploration in the Automated Analysis of Clinical Interview Data. Journal of the Learning Sciences, 22 (4), 600–638. https://doi.org/10.1080/10508406.2013.836654

Sherin, M. G., & Han, S. Y. (2004). Teacher learning in the context of a video club. Teaching and Teacher Education, 20 (2), 163–183. https://doi.org/10.1016/j.tate.2003.08.001

Sherin, M. G., & van Es, E. A. (2009). Effects of Video Club Participation on Teachers’ Professional Vision. Journal of Teacher Education, 60 (1), 20–37. https://doi.org/10.1177/0022487108328155

Shulman, L. S. (1986). Those Who Understand: Knowledge Growth in Teaching. Educational Researcher, 15 (2), 4–14.

Singer, J. D. (2019). Reshaping the Arc of Quantitative Educational Research: It’s Time to Broaden Our Paradigm. Journal of Research on Educational Effectiveness, 12 (4), 570–593. https://doi.org/10.1080/19345747.2019.1658835

Star, J. R., & Strickland, S. K. (2008). Learning to observe: using video to improve preservice mathematics teachers’ ability to notice. Journal of Mathematics Teacher Education, 11 (2), 107–125. https://doi.org/10.1007/s10857-007-9063-7

Taher Pilehvar, M., & Camacho-Collados, J. (2020). Embeddings in Natural Language Processing: Theory and Advances in Vector Representation of Meaning. Morgan and Claypool.

Talanquer, V., Bolger, M., & Tomanek, D. (2015). Exploring prospective teachers’ assessment practices: Noticing and interpreting student understanding in the assessment of written work. Journal of Research in Science Teaching, 52 (5), 585–609. https://doi.org/10.1002/tea.21209

Ullmann, T. D. (2019). Automated Analysis of Reflection in Writing: Validating Machine Learning Approaches. International Journal of Artificial Intelligence in Education, 29 (2), 217–257. https://doi.org/10.1007/s40593-019-00174-2

van Es, E., & Sherin, M. G. (2002a). Learning to notice: scaffolding new teachers’ interpretations of classroom interactions. Journal of Technology and Teacher Education , 10 (4), 571–596.

van Es, E., & Sherin, M. G. (2002b). Learning to notice: Scaffolding new teachers’ interpretations of classroom interactions. Journal of Technology and Teacher Education , 10 (4), 571–596.

von Aufschnaiter, C., Fraij, A., & Kost, D. (2019). Reflexion und Reflexivität in der Lehrerbildung: 144-159 Seiten / Herausforderung Lehrer\_innenbildung - Zeitschrift zur Konzeption, Gestaltung und Diskussion, Bd. 2 Nr. 1 (2019): Herausforderung Lehrer\_innenbildung - Ausgabe 2. https://doi.org/10.4119/UNIBI/HLZ-144

Wang, C., Liu, X., & Song, D. (2020). Language Models are Open Knowledge Graphs. arXiv.

Wigner, E. P. (1960). The unreasonable effectiveness of mathematics in the natural sciences. Richard courant lecture in mathematical sciences delivered at New York University, May 11, 1959. Communications on Pure and Applied Mathematics , 13 (1),1–14. https://doi.org/10.1002/cpa.3160130102

Wilson, C. D., Borowski, A., & van Driel, J. H. (2019). Perspectives on the Future of PCK Research in Science Education and Beyond. In A. Hume, R. Cooper, & A. Borowski (Eds.), Repositioning Pedagogical Content Knowledge in Teachers’ Professional Knowledge (pp. 289–300). Singapore: Springer.

Wulff, P., Buschhüter, D., Nowak, A., Westphal, A., Becker, L., Robalino, H., et al. (2020). Computer-Based Classification of Preservice Physics Teachers’ Written Reflections. Journal of Science Education and Technology . https://doi.org/10.1007/s10956-020-09865-1

Wulff, P., Mientus, L., Nowak, A., & Borowski, A. (2022). Utilizing a Pretrained Language Model (BERT) to Classify Preservice Physics Teachers’ Written Reflections. International Journal of Artificial Intelligence in Education . https://doi.org/10.1007/s40593-022-00290-6

Xing, W., Lee, H. S., & Shibani, A. (2020). Identifying patterns in students’ scientific argumentation: content analysis through text mining using Latent Dirichlet Allocation. Educational Technology Research and Development . https://doi.org/10.1007/s11423-020-09761-w

Zehner, F., Sälzer, C., & Goldhammer, F. (2016). Automatic Coding of Short Text Responses via Clustering in Educational Assessment. Educational and Psychological Measurement, 76 (2), 280–303. https://doi.org/10.1177/0013164415590022

Zhai, X. (2021). Practices and Theories: How Can Machine Learning Assist in Innovative Assessment Practices in Science Education. Journal of Science Education and Technology . https://doi.org/10.1007/s10956-021-09901-8

Zhai, X., Haudek, K., Shi, L., Nehm, R., & Urban-Lurain, M. (2020). From substitution to redefinition: A framework of machine learning-based science assessment. Journal of Research in Science Teaching, 57 (9), 1430–1459. https://doi.org/10.1002/tea.21658

Zhai, X., Yin, Y., Pellegrino, J. W., Haudek, K. C., & Shi, L. (2020). Applying machine learning in science assessment: a systematic review. Studies in Science Education, 56 (1), 111–151. https://doi.org/10.1080/03057267.2020.1735757

Zhang, X., Ramachandran, D., Tenney, I., Elazar, Y., & Roth, D. (2020). Do Language Embeddings Capture Scales? arXiv.

Download references

Open Access funding enabled and organized by Projekt DEAL. This project is part of the “Qualitätsoffensive Lehrerbildung”, a joint initiative of the Federal Government and the Länder which aims to improve the quality of teacher training. The program is funded by the Federal Ministry of Education and Research. The authors are responsible for the content of this publication.

Author information

Authors and affiliations.

Physics Education Research Group, Heidelberg University of Education, Heidelberg, Germany

Peter Wulff

Physics Education Research Group, University of Potsdam, Potsdam, Germany

David Buschhüter, Lukas Mientus, Anna Nowak & Andreas Borowski

Department of Educational Research, University of Greifswald, Greifswald, Germany

Andrea Westphal

You can also search for this author in PubMed   Google Scholar

Corresponding author

Correspondence to Peter Wulff .

Ethics declarations

Ethical statement.

All procedures followed were in accordance with ethical standards for research with human subjects, as outlined for example by the American Psychological Association.

Consent Statement

Informed consent with all participants was assured.

Disclosure of Potential Conflicts of Interest

The authors are not aware of any potential conflict of interests.

Additional information

Publisher’s note.

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (PDF 137 KB)

Rights and permissions.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .

Reprints and permissions

About this article

Wulff, P., Buschhüter, D., Westphal, A. et al. Bridging the Gap Between Qualitative and Quantitative Assessment in Science Education Research with Machine Learning — A Case for Pretrained Language Models-Based Clustering. J Sci Educ Technol 31 , 490–513 (2022). https://doi.org/10.1007/s10956-022-09969-w

Download citation

Accepted : 02 May 2022

Published : 01 June 2022

Issue Date : August 2022

DOI : https://doi.org/10.1007/s10956-022-09969-w

Share this article

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Attention to classroom events
  • Find a journal
  • Publish with us
  • Track your research

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • J Korean Med Sci
  • v.37(16); 2022 Apr 25

Logo of jkms

A Practical Guide to Writing Quantitative and Qualitative Research Questions and Hypotheses in Scholarly Articles

Edward barroga.

1 Department of General Education, Graduate School of Nursing Science, St. Luke’s International University, Tokyo, Japan.

Glafera Janet Matanguihan

2 Department of Biological Sciences, Messiah University, Mechanicsburg, PA, USA.

The development of research questions and the subsequent hypotheses are prerequisites to defining the main research purpose and specific objectives of a study. Consequently, these objectives determine the study design and research outcome. The development of research questions is a process based on knowledge of current trends, cutting-edge studies, and technological advances in the research field. Excellent research questions are focused and require a comprehensive literature search and in-depth understanding of the problem being investigated. Initially, research questions may be written as descriptive questions which could be developed into inferential questions. These questions must be specific and concise to provide a clear foundation for developing hypotheses. Hypotheses are more formal predictions about the research outcomes. These specify the possible results that may or may not be expected regarding the relationship between groups. Thus, research questions and hypotheses clarify the main purpose and specific objectives of the study, which in turn dictate the design of the study, its direction, and outcome. Studies developed from good research questions and hypotheses will have trustworthy outcomes with wide-ranging social and health implications.

INTRODUCTION

Scientific research is usually initiated by posing evidenced-based research questions which are then explicitly restated as hypotheses. 1 , 2 The hypotheses provide directions to guide the study, solutions, explanations, and expected results. 3 , 4 Both research questions and hypotheses are essentially formulated based on conventional theories and real-world processes, which allow the inception of novel studies and the ethical testing of ideas. 5 , 6

It is crucial to have knowledge of both quantitative and qualitative research 2 as both types of research involve writing research questions and hypotheses. 7 However, these crucial elements of research are sometimes overlooked; if not overlooked, then framed without the forethought and meticulous attention it needs. Planning and careful consideration are needed when developing quantitative or qualitative research, particularly when conceptualizing research questions and hypotheses. 4

There is a continuing need to support researchers in the creation of innovative research questions and hypotheses, as well as for journal articles that carefully review these elements. 1 When research questions and hypotheses are not carefully thought of, unethical studies and poor outcomes usually ensue. Carefully formulated research questions and hypotheses define well-founded objectives, which in turn determine the appropriate design, course, and outcome of the study. This article then aims to discuss in detail the various aspects of crafting research questions and hypotheses, with the goal of guiding researchers as they develop their own. Examples from the authors and peer-reviewed scientific articles in the healthcare field are provided to illustrate key points.

DEFINITIONS AND RELATIONSHIP OF RESEARCH QUESTIONS AND HYPOTHESES

A research question is what a study aims to answer after data analysis and interpretation. The answer is written in length in the discussion section of the paper. Thus, the research question gives a preview of the different parts and variables of the study meant to address the problem posed in the research question. 1 An excellent research question clarifies the research writing while facilitating understanding of the research topic, objective, scope, and limitations of the study. 5

On the other hand, a research hypothesis is an educated statement of an expected outcome. This statement is based on background research and current knowledge. 8 , 9 The research hypothesis makes a specific prediction about a new phenomenon 10 or a formal statement on the expected relationship between an independent variable and a dependent variable. 3 , 11 It provides a tentative answer to the research question to be tested or explored. 4

Hypotheses employ reasoning to predict a theory-based outcome. 10 These can also be developed from theories by focusing on components of theories that have not yet been observed. 10 The validity of hypotheses is often based on the testability of the prediction made in a reproducible experiment. 8

Conversely, hypotheses can also be rephrased as research questions. Several hypotheses based on existing theories and knowledge may be needed to answer a research question. Developing ethical research questions and hypotheses creates a research design that has logical relationships among variables. These relationships serve as a solid foundation for the conduct of the study. 4 , 11 Haphazardly constructed research questions can result in poorly formulated hypotheses and improper study designs, leading to unreliable results. Thus, the formulations of relevant research questions and verifiable hypotheses are crucial when beginning research. 12

CHARACTERISTICS OF GOOD RESEARCH QUESTIONS AND HYPOTHESES

Excellent research questions are specific and focused. These integrate collective data and observations to confirm or refute the subsequent hypotheses. Well-constructed hypotheses are based on previous reports and verify the research context. These are realistic, in-depth, sufficiently complex, and reproducible. More importantly, these hypotheses can be addressed and tested. 13

There are several characteristics of well-developed hypotheses. Good hypotheses are 1) empirically testable 7 , 10 , 11 , 13 ; 2) backed by preliminary evidence 9 ; 3) testable by ethical research 7 , 9 ; 4) based on original ideas 9 ; 5) have evidenced-based logical reasoning 10 ; and 6) can be predicted. 11 Good hypotheses can infer ethical and positive implications, indicating the presence of a relationship or effect relevant to the research theme. 7 , 11 These are initially developed from a general theory and branch into specific hypotheses by deductive reasoning. In the absence of a theory to base the hypotheses, inductive reasoning based on specific observations or findings form more general hypotheses. 10

TYPES OF RESEARCH QUESTIONS AND HYPOTHESES

Research questions and hypotheses are developed according to the type of research, which can be broadly classified into quantitative and qualitative research. We provide a summary of the types of research questions and hypotheses under quantitative and qualitative research categories in Table 1 .

Quantitative research questionsQuantitative research hypotheses
Descriptive research questionsSimple hypothesis
Comparative research questionsComplex hypothesis
Relationship research questionsDirectional hypothesis
Non-directional hypothesis
Associative hypothesis
Causal hypothesis
Null hypothesis
Alternative hypothesis
Working hypothesis
Statistical hypothesis
Logical hypothesis
Hypothesis-testing
Qualitative research questionsQualitative research hypotheses
Contextual research questionsHypothesis-generating
Descriptive research questions
Evaluation research questions
Explanatory research questions
Exploratory research questions
Generative research questions
Ideological research questions
Ethnographic research questions
Phenomenological research questions
Grounded theory questions
Qualitative case study questions

Research questions in quantitative research

In quantitative research, research questions inquire about the relationships among variables being investigated and are usually framed at the start of the study. These are precise and typically linked to the subject population, dependent and independent variables, and research design. 1 Research questions may also attempt to describe the behavior of a population in relation to one or more variables, or describe the characteristics of variables to be measured ( descriptive research questions ). 1 , 5 , 14 These questions may also aim to discover differences between groups within the context of an outcome variable ( comparative research questions ), 1 , 5 , 14 or elucidate trends and interactions among variables ( relationship research questions ). 1 , 5 We provide examples of descriptive, comparative, and relationship research questions in quantitative research in Table 2 .

Quantitative research questions
Descriptive research question
- Measures responses of subjects to variables
- Presents variables to measure, analyze, or assess
What is the proportion of resident doctors in the hospital who have mastered ultrasonography (response of subjects to a variable) as a diagnostic technique in their clinical training?
Comparative research question
- Clarifies difference between one group with outcome variable and another group without outcome variable
Is there a difference in the reduction of lung metastasis in osteosarcoma patients who received the vitamin D adjunctive therapy (group with outcome variable) compared with osteosarcoma patients who did not receive the vitamin D adjunctive therapy (group without outcome variable)?
- Compares the effects of variables
How does the vitamin D analogue 22-Oxacalcitriol (variable 1) mimic the antiproliferative activity of 1,25-Dihydroxyvitamin D (variable 2) in osteosarcoma cells?
Relationship research question
- Defines trends, association, relationships, or interactions between dependent variable and independent variable
Is there a relationship between the number of medical student suicide (dependent variable) and the level of medical student stress (independent variable) in Japan during the first wave of the COVID-19 pandemic?

Hypotheses in quantitative research

In quantitative research, hypotheses predict the expected relationships among variables. 15 Relationships among variables that can be predicted include 1) between a single dependent variable and a single independent variable ( simple hypothesis ) or 2) between two or more independent and dependent variables ( complex hypothesis ). 4 , 11 Hypotheses may also specify the expected direction to be followed and imply an intellectual commitment to a particular outcome ( directional hypothesis ) 4 . On the other hand, hypotheses may not predict the exact direction and are used in the absence of a theory, or when findings contradict previous studies ( non-directional hypothesis ). 4 In addition, hypotheses can 1) define interdependency between variables ( associative hypothesis ), 4 2) propose an effect on the dependent variable from manipulation of the independent variable ( causal hypothesis ), 4 3) state a negative relationship between two variables ( null hypothesis ), 4 , 11 , 15 4) replace the working hypothesis if rejected ( alternative hypothesis ), 15 explain the relationship of phenomena to possibly generate a theory ( working hypothesis ), 11 5) involve quantifiable variables that can be tested statistically ( statistical hypothesis ), 11 6) or express a relationship whose interlinks can be verified logically ( logical hypothesis ). 11 We provide examples of simple, complex, directional, non-directional, associative, causal, null, alternative, working, statistical, and logical hypotheses in quantitative research, as well as the definition of quantitative hypothesis-testing research in Table 3 .

Quantitative research hypotheses
Simple hypothesis
- Predicts relationship between single dependent variable and single independent variable
If the dose of the new medication (single independent variable) is high, blood pressure (single dependent variable) is lowered.
Complex hypothesis
- Foretells relationship between two or more independent and dependent variables
The higher the use of anticancer drugs, radiation therapy, and adjunctive agents (3 independent variables), the higher would be the survival rate (1 dependent variable).
Directional hypothesis
- Identifies study direction based on theory towards particular outcome to clarify relationship between variables
Privately funded research projects will have a larger international scope (study direction) than publicly funded research projects.
Non-directional hypothesis
- Nature of relationship between two variables or exact study direction is not identified
- Does not involve a theory
Women and men are different in terms of helpfulness. (Exact study direction is not identified)
Associative hypothesis
- Describes variable interdependency
- Change in one variable causes change in another variable
A larger number of people vaccinated against COVID-19 in the region (change in independent variable) will reduce the region’s incidence of COVID-19 infection (change in dependent variable).
Causal hypothesis
- An effect on dependent variable is predicted from manipulation of independent variable
A change into a high-fiber diet (independent variable) will reduce the blood sugar level (dependent variable) of the patient.
Null hypothesis
- A negative statement indicating no relationship or difference between 2 variables
There is no significant difference in the severity of pulmonary metastases between the new drug (variable 1) and the current drug (variable 2).
Alternative hypothesis
- Following a null hypothesis, an alternative hypothesis predicts a relationship between 2 study variables
The new drug (variable 1) is better on average in reducing the level of pain from pulmonary metastasis than the current drug (variable 2).
Working hypothesis
- A hypothesis that is initially accepted for further research to produce a feasible theory
Dairy cows fed with concentrates of different formulations will produce different amounts of milk.
Statistical hypothesis
- Assumption about the value of population parameter or relationship among several population characteristics
- Validity tested by a statistical experiment or analysis
The mean recovery rate from COVID-19 infection (value of population parameter) is not significantly different between population 1 and population 2.
There is a positive correlation between the level of stress at the workplace and the number of suicides (population characteristics) among working people in Japan.
Logical hypothesis
- Offers or proposes an explanation with limited or no extensive evidence
If healthcare workers provide more educational programs about contraception methods, the number of adolescent pregnancies will be less.
Hypothesis-testing (Quantitative hypothesis-testing research)
- Quantitative research uses deductive reasoning.
- This involves the formation of a hypothesis, collection of data in the investigation of the problem, analysis and use of the data from the investigation, and drawing of conclusions to validate or nullify the hypotheses.

Research questions in qualitative research

Unlike research questions in quantitative research, research questions in qualitative research are usually continuously reviewed and reformulated. The central question and associated subquestions are stated more than the hypotheses. 15 The central question broadly explores a complex set of factors surrounding the central phenomenon, aiming to present the varied perspectives of participants. 15

There are varied goals for which qualitative research questions are developed. These questions can function in several ways, such as to 1) identify and describe existing conditions ( contextual research question s); 2) describe a phenomenon ( descriptive research questions ); 3) assess the effectiveness of existing methods, protocols, theories, or procedures ( evaluation research questions ); 4) examine a phenomenon or analyze the reasons or relationships between subjects or phenomena ( explanatory research questions ); or 5) focus on unknown aspects of a particular topic ( exploratory research questions ). 5 In addition, some qualitative research questions provide new ideas for the development of theories and actions ( generative research questions ) or advance specific ideologies of a position ( ideological research questions ). 1 Other qualitative research questions may build on a body of existing literature and become working guidelines ( ethnographic research questions ). Research questions may also be broadly stated without specific reference to the existing literature or a typology of questions ( phenomenological research questions ), may be directed towards generating a theory of some process ( grounded theory questions ), or may address a description of the case and the emerging themes ( qualitative case study questions ). 15 We provide examples of contextual, descriptive, evaluation, explanatory, exploratory, generative, ideological, ethnographic, phenomenological, grounded theory, and qualitative case study research questions in qualitative research in Table 4 , and the definition of qualitative hypothesis-generating research in Table 5 .

Qualitative research questions
Contextual research question
- Ask the nature of what already exists
- Individuals or groups function to further clarify and understand the natural context of real-world problems
What are the experiences of nurses working night shifts in healthcare during the COVID-19 pandemic? (natural context of real-world problems)
Descriptive research question
- Aims to describe a phenomenon
What are the different forms of disrespect and abuse (phenomenon) experienced by Tanzanian women when giving birth in healthcare facilities?
Evaluation research question
- Examines the effectiveness of existing practice or accepted frameworks
How effective are decision aids (effectiveness of existing practice) in helping decide whether to give birth at home or in a healthcare facility?
Explanatory research question
- Clarifies a previously studied phenomenon and explains why it occurs
Why is there an increase in teenage pregnancy (phenomenon) in Tanzania?
Exploratory research question
- Explores areas that have not been fully investigated to have a deeper understanding of the research problem
What factors affect the mental health of medical students (areas that have not yet been fully investigated) during the COVID-19 pandemic?
Generative research question
- Develops an in-depth understanding of people’s behavior by asking ‘how would’ or ‘what if’ to identify problems and find solutions
How would the extensive research experience of the behavior of new staff impact the success of the novel drug initiative?
Ideological research question
- Aims to advance specific ideas or ideologies of a position
Are Japanese nurses who volunteer in remote African hospitals able to promote humanized care of patients (specific ideas or ideologies) in the areas of safe patient environment, respect of patient privacy, and provision of accurate information related to health and care?
Ethnographic research question
- Clarifies peoples’ nature, activities, their interactions, and the outcomes of their actions in specific settings
What are the demographic characteristics, rehabilitative treatments, community interactions, and disease outcomes (nature, activities, their interactions, and the outcomes) of people in China who are suffering from pneumoconiosis?
Phenomenological research question
- Knows more about the phenomena that have impacted an individual
What are the lived experiences of parents who have been living with and caring for children with a diagnosis of autism? (phenomena that have impacted an individual)
Grounded theory question
- Focuses on social processes asking about what happens and how people interact, or uncovering social relationships and behaviors of groups
What are the problems that pregnant adolescents face in terms of social and cultural norms (social processes), and how can these be addressed?
Qualitative case study question
- Assesses a phenomenon using different sources of data to answer “why” and “how” questions
- Considers how the phenomenon is influenced by its contextual situation.
How does quitting work and assuming the role of a full-time mother (phenomenon assessed) change the lives of women in Japan?
Qualitative research hypotheses
Hypothesis-generating (Qualitative hypothesis-generating research)
- Qualitative research uses inductive reasoning.
- This involves data collection from study participants or the literature regarding a phenomenon of interest, using the collected data to develop a formal hypothesis, and using the formal hypothesis as a framework for testing the hypothesis.
- Qualitative exploratory studies explore areas deeper, clarifying subjective experience and allowing formulation of a formal hypothesis potentially testable in a future quantitative approach.

Qualitative studies usually pose at least one central research question and several subquestions starting with How or What . These research questions use exploratory verbs such as explore or describe . These also focus on one central phenomenon of interest, and may mention the participants and research site. 15

Hypotheses in qualitative research

Hypotheses in qualitative research are stated in the form of a clear statement concerning the problem to be investigated. Unlike in quantitative research where hypotheses are usually developed to be tested, qualitative research can lead to both hypothesis-testing and hypothesis-generating outcomes. 2 When studies require both quantitative and qualitative research questions, this suggests an integrative process between both research methods wherein a single mixed-methods research question can be developed. 1

FRAMEWORKS FOR DEVELOPING RESEARCH QUESTIONS AND HYPOTHESES

Research questions followed by hypotheses should be developed before the start of the study. 1 , 12 , 14 It is crucial to develop feasible research questions on a topic that is interesting to both the researcher and the scientific community. This can be achieved by a meticulous review of previous and current studies to establish a novel topic. Specific areas are subsequently focused on to generate ethical research questions. The relevance of the research questions is evaluated in terms of clarity of the resulting data, specificity of the methodology, objectivity of the outcome, depth of the research, and impact of the study. 1 , 5 These aspects constitute the FINER criteria (i.e., Feasible, Interesting, Novel, Ethical, and Relevant). 1 Clarity and effectiveness are achieved if research questions meet the FINER criteria. In addition to the FINER criteria, Ratan et al. described focus, complexity, novelty, feasibility, and measurability for evaluating the effectiveness of research questions. 14

The PICOT and PEO frameworks are also used when developing research questions. 1 The following elements are addressed in these frameworks, PICOT: P-population/patients/problem, I-intervention or indicator being studied, C-comparison group, O-outcome of interest, and T-timeframe of the study; PEO: P-population being studied, E-exposure to preexisting conditions, and O-outcome of interest. 1 Research questions are also considered good if these meet the “FINERMAPS” framework: Feasible, Interesting, Novel, Ethical, Relevant, Manageable, Appropriate, Potential value/publishable, and Systematic. 14

As we indicated earlier, research questions and hypotheses that are not carefully formulated result in unethical studies or poor outcomes. To illustrate this, we provide some examples of ambiguous research question and hypotheses that result in unclear and weak research objectives in quantitative research ( Table 6 ) 16 and qualitative research ( Table 7 ) 17 , and how to transform these ambiguous research question(s) and hypothesis(es) into clear and good statements.

VariablesUnclear and weak statement (Statement 1) Clear and good statement (Statement 2) Points to avoid
Research questionWhich is more effective between smoke moxibustion and smokeless moxibustion?“Moreover, regarding smoke moxibustion versus smokeless moxibustion, it remains unclear which is more effective, safe, and acceptable to pregnant women, and whether there is any difference in the amount of heat generated.” 1) Vague and unfocused questions
2) Closed questions simply answerable by yes or no
3) Questions requiring a simple choice
HypothesisThe smoke moxibustion group will have higher cephalic presentation.“Hypothesis 1. The smoke moxibustion stick group (SM group) and smokeless moxibustion stick group (-SLM group) will have higher rates of cephalic presentation after treatment than the control group.1) Unverifiable hypotheses
Hypothesis 2. The SM group and SLM group will have higher rates of cephalic presentation at birth than the control group.2) Incompletely stated groups of comparison
Hypothesis 3. There will be no significant differences in the well-being of the mother and child among the three groups in terms of the following outcomes: premature birth, premature rupture of membranes (PROM) at < 37 weeks, Apgar score < 7 at 5 min, umbilical cord blood pH < 7.1, admission to neonatal intensive care unit (NICU), and intrauterine fetal death.” 3) Insufficiently described variables or outcomes
Research objectiveTo determine which is more effective between smoke moxibustion and smokeless moxibustion.“The specific aims of this pilot study were (a) to compare the effects of smoke moxibustion and smokeless moxibustion treatments with the control group as a possible supplement to ECV for converting breech presentation to cephalic presentation and increasing adherence to the newly obtained cephalic position, and (b) to assess the effects of these treatments on the well-being of the mother and child.” 1) Poor understanding of the research question and hypotheses
2) Insufficient description of population, variables, or study outcomes

a These statements were composed for comparison and illustrative purposes only.

b These statements are direct quotes from Higashihara and Horiuchi. 16

VariablesUnclear and weak statement (Statement 1)Clear and good statement (Statement 2)Points to avoid
Research questionDoes disrespect and abuse (D&A) occur in childbirth in Tanzania?How does disrespect and abuse (D&A) occur and what are the types of physical and psychological abuses observed in midwives’ actual care during facility-based childbirth in urban Tanzania?1) Ambiguous or oversimplistic questions
2) Questions unverifiable by data collection and analysis
HypothesisDisrespect and abuse (D&A) occur in childbirth in Tanzania.Hypothesis 1: Several types of physical and psychological abuse by midwives in actual care occur during facility-based childbirth in urban Tanzania.1) Statements simply expressing facts
Hypothesis 2: Weak nursing and midwifery management contribute to the D&A of women during facility-based childbirth in urban Tanzania.2) Insufficiently described concepts or variables
Research objectiveTo describe disrespect and abuse (D&A) in childbirth in Tanzania.“This study aimed to describe from actual observations the respectful and disrespectful care received by women from midwives during their labor period in two hospitals in urban Tanzania.” 1) Statements unrelated to the research question and hypotheses
2) Unattainable or unexplorable objectives

a This statement is a direct quote from Shimoda et al. 17

The other statements were composed for comparison and illustrative purposes only.

CONSTRUCTING RESEARCH QUESTIONS AND HYPOTHESES

To construct effective research questions and hypotheses, it is very important to 1) clarify the background and 2) identify the research problem at the outset of the research, within a specific timeframe. 9 Then, 3) review or conduct preliminary research to collect all available knowledge about the possible research questions by studying theories and previous studies. 18 Afterwards, 4) construct research questions to investigate the research problem. Identify variables to be accessed from the research questions 4 and make operational definitions of constructs from the research problem and questions. Thereafter, 5) construct specific deductive or inductive predictions in the form of hypotheses. 4 Finally, 6) state the study aims . This general flow for constructing effective research questions and hypotheses prior to conducting research is shown in Fig. 1 .

An external file that holds a picture, illustration, etc.
Object name is jkms-37-e121-g001.jpg

Research questions are used more frequently in qualitative research than objectives or hypotheses. 3 These questions seek to discover, understand, explore or describe experiences by asking “What” or “How.” The questions are open-ended to elicit a description rather than to relate variables or compare groups. The questions are continually reviewed, reformulated, and changed during the qualitative study. 3 Research questions are also used more frequently in survey projects than hypotheses in experiments in quantitative research to compare variables and their relationships.

Hypotheses are constructed based on the variables identified and as an if-then statement, following the template, ‘If a specific action is taken, then a certain outcome is expected.’ At this stage, some ideas regarding expectations from the research to be conducted must be drawn. 18 Then, the variables to be manipulated (independent) and influenced (dependent) are defined. 4 Thereafter, the hypothesis is stated and refined, and reproducible data tailored to the hypothesis are identified, collected, and analyzed. 4 The hypotheses must be testable and specific, 18 and should describe the variables and their relationships, the specific group being studied, and the predicted research outcome. 18 Hypotheses construction involves a testable proposition to be deduced from theory, and independent and dependent variables to be separated and measured separately. 3 Therefore, good hypotheses must be based on good research questions constructed at the start of a study or trial. 12

In summary, research questions are constructed after establishing the background of the study. Hypotheses are then developed based on the research questions. Thus, it is crucial to have excellent research questions to generate superior hypotheses. In turn, these would determine the research objectives and the design of the study, and ultimately, the outcome of the research. 12 Algorithms for building research questions and hypotheses are shown in Fig. 2 for quantitative research and in Fig. 3 for qualitative research.

An external file that holds a picture, illustration, etc.
Object name is jkms-37-e121-g002.jpg

EXAMPLES OF RESEARCH QUESTIONS FROM PUBLISHED ARTICLES

  • EXAMPLE 1. Descriptive research question (quantitative research)
  • - Presents research variables to be assessed (distinct phenotypes and subphenotypes)
  • “BACKGROUND: Since COVID-19 was identified, its clinical and biological heterogeneity has been recognized. Identifying COVID-19 phenotypes might help guide basic, clinical, and translational research efforts.
  • RESEARCH QUESTION: Does the clinical spectrum of patients with COVID-19 contain distinct phenotypes and subphenotypes? ” 19
  • EXAMPLE 2. Relationship research question (quantitative research)
  • - Shows interactions between dependent variable (static postural control) and independent variable (peripheral visual field loss)
  • “Background: Integration of visual, vestibular, and proprioceptive sensations contributes to postural control. People with peripheral visual field loss have serious postural instability. However, the directional specificity of postural stability and sensory reweighting caused by gradual peripheral visual field loss remain unclear.
  • Research question: What are the effects of peripheral visual field loss on static postural control ?” 20
  • EXAMPLE 3. Comparative research question (quantitative research)
  • - Clarifies the difference among groups with an outcome variable (patients enrolled in COMPERA with moderate PH or severe PH in COPD) and another group without the outcome variable (patients with idiopathic pulmonary arterial hypertension (IPAH))
  • “BACKGROUND: Pulmonary hypertension (PH) in COPD is a poorly investigated clinical condition.
  • RESEARCH QUESTION: Which factors determine the outcome of PH in COPD?
  • STUDY DESIGN AND METHODS: We analyzed the characteristics and outcome of patients enrolled in the Comparative, Prospective Registry of Newly Initiated Therapies for Pulmonary Hypertension (COMPERA) with moderate or severe PH in COPD as defined during the 6th PH World Symposium who received medical therapy for PH and compared them with patients with idiopathic pulmonary arterial hypertension (IPAH) .” 21
  • EXAMPLE 4. Exploratory research question (qualitative research)
  • - Explores areas that have not been fully investigated (perspectives of families and children who receive care in clinic-based child obesity treatment) to have a deeper understanding of the research problem
  • “Problem: Interventions for children with obesity lead to only modest improvements in BMI and long-term outcomes, and data are limited on the perspectives of families of children with obesity in clinic-based treatment. This scoping review seeks to answer the question: What is known about the perspectives of families and children who receive care in clinic-based child obesity treatment? This review aims to explore the scope of perspectives reported by families of children with obesity who have received individualized outpatient clinic-based obesity treatment.” 22
  • EXAMPLE 5. Relationship research question (quantitative research)
  • - Defines interactions between dependent variable (use of ankle strategies) and independent variable (changes in muscle tone)
  • “Background: To maintain an upright standing posture against external disturbances, the human body mainly employs two types of postural control strategies: “ankle strategy” and “hip strategy.” While it has been reported that the magnitude of the disturbance alters the use of postural control strategies, it has not been elucidated how the level of muscle tone, one of the crucial parameters of bodily function, determines the use of each strategy. We have previously confirmed using forward dynamics simulations of human musculoskeletal models that an increased muscle tone promotes the use of ankle strategies. The objective of the present study was to experimentally evaluate a hypothesis: an increased muscle tone promotes the use of ankle strategies. Research question: Do changes in the muscle tone affect the use of ankle strategies ?” 23

EXAMPLES OF HYPOTHESES IN PUBLISHED ARTICLES

  • EXAMPLE 1. Working hypothesis (quantitative research)
  • - A hypothesis that is initially accepted for further research to produce a feasible theory
  • “As fever may have benefit in shortening the duration of viral illness, it is plausible to hypothesize that the antipyretic efficacy of ibuprofen may be hindering the benefits of a fever response when taken during the early stages of COVID-19 illness .” 24
  • “In conclusion, it is plausible to hypothesize that the antipyretic efficacy of ibuprofen may be hindering the benefits of a fever response . The difference in perceived safety of these agents in COVID-19 illness could be related to the more potent efficacy to reduce fever with ibuprofen compared to acetaminophen. Compelling data on the benefit of fever warrant further research and review to determine when to treat or withhold ibuprofen for early stage fever for COVID-19 and other related viral illnesses .” 24
  • EXAMPLE 2. Exploratory hypothesis (qualitative research)
  • - Explores particular areas deeper to clarify subjective experience and develop a formal hypothesis potentially testable in a future quantitative approach
  • “We hypothesized that when thinking about a past experience of help-seeking, a self distancing prompt would cause increased help-seeking intentions and more favorable help-seeking outcome expectations .” 25
  • “Conclusion
  • Although a priori hypotheses were not supported, further research is warranted as results indicate the potential for using self-distancing approaches to increasing help-seeking among some people with depressive symptomatology.” 25
  • EXAMPLE 3. Hypothesis-generating research to establish a framework for hypothesis testing (qualitative research)
  • “We hypothesize that compassionate care is beneficial for patients (better outcomes), healthcare systems and payers (lower costs), and healthcare providers (lower burnout). ” 26
  • Compassionomics is the branch of knowledge and scientific study of the effects of compassionate healthcare. Our main hypotheses are that compassionate healthcare is beneficial for (1) patients, by improving clinical outcomes, (2) healthcare systems and payers, by supporting financial sustainability, and (3) HCPs, by lowering burnout and promoting resilience and well-being. The purpose of this paper is to establish a scientific framework for testing the hypotheses above . If these hypotheses are confirmed through rigorous research, compassionomics will belong in the science of evidence-based medicine, with major implications for all healthcare domains.” 26
  • EXAMPLE 4. Statistical hypothesis (quantitative research)
  • - An assumption is made about the relationship among several population characteristics ( gender differences in sociodemographic and clinical characteristics of adults with ADHD ). Validity is tested by statistical experiment or analysis ( chi-square test, Students t-test, and logistic regression analysis)
  • “Our research investigated gender differences in sociodemographic and clinical characteristics of adults with ADHD in a Japanese clinical sample. Due to unique Japanese cultural ideals and expectations of women's behavior that are in opposition to ADHD symptoms, we hypothesized that women with ADHD experience more difficulties and present more dysfunctions than men . We tested the following hypotheses: first, women with ADHD have more comorbidities than men with ADHD; second, women with ADHD experience more social hardships than men, such as having less full-time employment and being more likely to be divorced.” 27
  • “Statistical Analysis
  • ( text omitted ) Between-gender comparisons were made using the chi-squared test for categorical variables and Students t-test for continuous variables…( text omitted ). A logistic regression analysis was performed for employment status, marital status, and comorbidity to evaluate the independent effects of gender on these dependent variables.” 27

EXAMPLES OF HYPOTHESIS AS WRITTEN IN PUBLISHED ARTICLES IN RELATION TO OTHER PARTS

  • EXAMPLE 1. Background, hypotheses, and aims are provided
  • “Pregnant women need skilled care during pregnancy and childbirth, but that skilled care is often delayed in some countries …( text omitted ). The focused antenatal care (FANC) model of WHO recommends that nurses provide information or counseling to all pregnant women …( text omitted ). Job aids are visual support materials that provide the right kind of information using graphics and words in a simple and yet effective manner. When nurses are not highly trained or have many work details to attend to, these job aids can serve as a content reminder for the nurses and can be used for educating their patients (Jennings, Yebadokpo, Affo, & Agbogbe, 2010) ( text omitted ). Importantly, additional evidence is needed to confirm how job aids can further improve the quality of ANC counseling by health workers in maternal care …( text omitted )” 28
  • “ This has led us to hypothesize that the quality of ANC counseling would be better if supported by job aids. Consequently, a better quality of ANC counseling is expected to produce higher levels of awareness concerning the danger signs of pregnancy and a more favorable impression of the caring behavior of nurses .” 28
  • “This study aimed to examine the differences in the responses of pregnant women to a job aid-supported intervention during ANC visit in terms of 1) their understanding of the danger signs of pregnancy and 2) their impression of the caring behaviors of nurses to pregnant women in rural Tanzania.” 28
  • EXAMPLE 2. Background, hypotheses, and aims are provided
  • “We conducted a two-arm randomized controlled trial (RCT) to evaluate and compare changes in salivary cortisol and oxytocin levels of first-time pregnant women between experimental and control groups. The women in the experimental group touched and held an infant for 30 min (experimental intervention protocol), whereas those in the control group watched a DVD movie of an infant (control intervention protocol). The primary outcome was salivary cortisol level and the secondary outcome was salivary oxytocin level.” 29
  • “ We hypothesize that at 30 min after touching and holding an infant, the salivary cortisol level will significantly decrease and the salivary oxytocin level will increase in the experimental group compared with the control group .” 29
  • EXAMPLE 3. Background, aim, and hypothesis are provided
  • “In countries where the maternal mortality ratio remains high, antenatal education to increase Birth Preparedness and Complication Readiness (BPCR) is considered one of the top priorities [1]. BPCR includes birth plans during the antenatal period, such as the birthplace, birth attendant, transportation, health facility for complications, expenses, and birth materials, as well as family coordination to achieve such birth plans. In Tanzania, although increasing, only about half of all pregnant women attend an antenatal clinic more than four times [4]. Moreover, the information provided during antenatal care (ANC) is insufficient. In the resource-poor settings, antenatal group education is a potential approach because of the limited time for individual counseling at antenatal clinics.” 30
  • “This study aimed to evaluate an antenatal group education program among pregnant women and their families with respect to birth-preparedness and maternal and infant outcomes in rural villages of Tanzania.” 30
  • “ The study hypothesis was if Tanzanian pregnant women and their families received a family-oriented antenatal group education, they would (1) have a higher level of BPCR, (2) attend antenatal clinic four or more times, (3) give birth in a health facility, (4) have less complications of women at birth, and (5) have less complications and deaths of infants than those who did not receive the education .” 30

Research questions and hypotheses are crucial components to any type of research, whether quantitative or qualitative. These questions should be developed at the very beginning of the study. Excellent research questions lead to superior hypotheses, which, like a compass, set the direction of research, and can often determine the successful conduct of the study. Many research studies have floundered because the development of research questions and subsequent hypotheses was not given the thought and meticulous attention needed. The development of research questions and hypotheses is an iterative process based on extensive knowledge of the literature and insightful grasp of the knowledge gap. Focused, concise, and specific research questions provide a strong foundation for constructing hypotheses which serve as formal predictions about the research outcomes. Research questions and hypotheses are crucial elements of research that should not be overlooked. They should be carefully thought of and constructed when planning research. This avoids unethical studies and poor outcomes by defining well-founded objectives that determine the design, course, and outcome of the study.

Disclosure: The authors have no potential conflicts of interest to disclose.

Author Contributions:

  • Conceptualization: Barroga E, Matanguihan GJ.
  • Methodology: Barroga E, Matanguihan GJ.
  • Writing - original draft: Barroga E, Matanguihan GJ.
  • Writing - review & editing: Barroga E, Matanguihan GJ.

IMAGES

  1. Qualitative vs Quantitative

    qualitative vs quantitative research in education

  2. Qualitative and Quantitative Research: How different and similar are they?

    qualitative vs quantitative research in education

  3. Qualitative Vs. Quantitative Research

    qualitative vs quantitative research in education

  4. Qualitative Research

    qualitative vs quantitative research in education

  5. Qualitative Vs. Quantitative Methods of Verification and Evaluation

    qualitative vs quantitative research in education

  6. 🏷️ Key differences between qualitative and quantitative research

    qualitative vs quantitative research in education

VIDEO

  1. What is the Difference between Quantitative and Qualitative Research?

  2. Qualitative vs Quantitative Data

  3. Qualitative vs. Quantitative Research Design

  4. Qualitative research vs Quantitative research -2

  5. 10 Difference Between Qualitative and Quantitative Research (With Table)

  6. Difference Between Quantitative and Qualitative Research

COMMENTS

  1. Qualitative vs. Quantitative Research: Comparing the Methods and

    Qualitative vs. Quantitative Research in Education: Definitions Although there are many overlaps in the objectives of qualitative and quantitative research in education, researchers must understand the fundamental functions of each methodology in order to design and carry out an impactful research study.

  2. Qualitative vs Quantitative Research: What's the Difference?

    The main difference between quantitative and qualitative research is the type of data they collect and analyze. Quantitative research collects numerical data and analyzes it using statistical methods. The aim is to produce objective, empirical data that can be measured and expressed numerically. Quantitative research is often used to test ...

  3. Qualitative vs. Quantitative Research

    When collecting and analyzing data, quantitative research deals with numbers and statistics, while qualitative research deals with words and meanings. Both are important for gaining different kinds of knowledge. Quantitative research. Quantitative research is expressed in numbers and graphs. It is used to test or confirm theories and assumptions.

  4. Qualitative vs. Quantitative Data Analysis in Education

    Key difference between qualitative and quantitative data. It's important to understand the differences between qualitative and quantitative data to both determine the appropriate research methods for studies and to gain insights that you can be confident in sharing. Data Types and Nature. Examples of qualitative data types in learning analytics:

  5. Quantitative vs. Qualitative Research Design: Understanding the Differences

    Examples of Quantitative Research Design in Education. Here are just a few examples of how quantitative research design methods may impact education: Example 1: Researchers compile data to understand the connection between class sizes and standardized test scores. Researchers can determine if and what the relationship is between smaller ...

  6. Qualitative vs. Quantitative Research: What's the Difference?

    Because qualitative and quantitative studies collect different types of data, their data collection methods differ considerably. Quantitative studies rely on numerical or measurable data. In contrast, qualitative studies rely on personal accounts or documents that illustrate in detail how people think or respond within society.

  7. What Is Qualitative vs. Quantitative Study?

    What Is Qualitative vs. Quantitative Study? Qualitative research focuses on understanding phenomena through detailed, narrative data. It explores the "how" and "why" of human behavior, using methods like interviews, observations, and content analysis. In contrast, quantitative research is numeric and objective, aiming to quantify ...

  8. Quantitative vs. Qualitative Research

    Qualitative research is based upon data that is gathered by observation. Qualitative research articles will attempt to answer questions that cannot be measured by numbers but rather by perceived meaning. Qualitative research will likely include interviews, case studies, ethnography, or focus groups. Indicators of qualitative research include:

  9. SU Library: Qualitative vs. Quantitative Research: Education

    The following books are available in the Stevenson Library: Research and Evaluation in Education and Psychology by Donna M. Mertens. Call Number: LB1028 .M3964 2010. ISBN: 9781412971904. Publication Date: 2009-08-06. Focused on discussing what is considered to be "good" research, this text explains quantitative, qualitative, and mixed methods ...

  10. Qualitative vs Quantitative Research

    This type of research can be used to establish generalisable facts about a topic. Common quantitative methods include experiments, observations recorded as numbers, and surveys with closed-ended questions. Qualitative research. Qualitative research is expressed in words. It is used to understand concepts, thoughts or experiences.

  11. SU Library: Qualitative vs. Quantitative Research: Overview

    In general, quantitative research seeks to understand the causal or correlational relationship between variables through testing hypotheses, whereas qualitative research seeks to understand a phenomenon within a real-world context through the use of interviews and observation. Both types of research are valid, and certain research topics are better suited to one approach or the other.

  12. PDF The Usefulness of Qualitative and Quantitative Approaches and Methods

    3.0. Advantages and disadvantages of qualitative and quantitative research Over the years, debate and arguments have been going on with regard to the appropriateness of qualitative or quantitative research approaches in conducting social research. Robson (2002, p43) noted that there has been a paradigm war between constructivists and positivists.

  13. Critically Thinking About Qualitative Versus Quantitative Research

    Key points. Neither a quantitative nor a qualitative methodology is the right way to approach every scientific question. Rather, the nature of the question determines which methodology is best ...

  14. LibGuides: New Scholar: Qualitative vs Quantitative

    Quantitative Research is used to quantify the problem by way of generating numerical data or data that can be transformed into useable statistics. It is used to quantify attitudes, opinions, behaviors, and other defined variables - and generalize results from a larger sample population. Qualitative Research is primarily exploratory research.

  15. Qualitative vs. Quantitative Studies in EdD Degrees

    Qualitative vs. quantitative studies in education programs commonly evaluate issues in curriculum, practices and policies. ... Qualitative and quantitative research in education can have many objectives in common. Researchers must understand the fundamental functions of each methodology to produce a successful study with actionable results ...

  16. When Does a Researcher Choose a Quantitative, Qualitative, or Mixed

    In educational studies, the paradigm war over quantitative and qualitative research approaches has raged for more than half a century. The focus in the late twentieth century was on the distinction between the two approaches, and the motivation was to retain one of the approaches' supremacy. Since the early twenty-first century, there has been a growing interest in situating in the middle ...

  17. Difference Between Qualitative and Qualitative Research

    At a Glance. Psychologists rely on quantitative and quantitative research to better understand human thought and behavior. Qualitative research involves collecting and evaluating non-numerical data in order to understand concepts or subjective opinions. Quantitative research involves collecting and evaluating numerical data.

  18. Qualitative Comparative Analysis in Education Research: Its Current

    Review of Research in Education March 2020, Vol. 44, pp. 332-369 DOI: 10.3102/0091732X20907347 ... from those of conventional quantitative as well as qualitative research approaches in the social sciences, some scholars repudiate its assumptions and theoretical basis. For example, Lucas and Szatrowski (2014) claimed that both the theoretical ...

  19. PDF The Advantages and Disadvantages of Using Qualitative and Quantitative

    Keywords: qualitative and quantitative research, advantages, disadvantages, testing and assessment 1. Introduction Qualitative and quantitative research approaches and methods are usually found to be utilised rather frequently in different disciplines of education such as sociology, psychology, history, and so on. Concerning the research

  20. Qualitative vs. Quantitative Data: What's the difference?

    Quantitative Data. Beans. Yes, I said beans. I like to think of quantitative data as something that you can count, or QUANTify, like a handful of dried beans. In our programs, these quantitative data commonly look like district assessments, student attendance, activity participation, student GPA, etc.

  21. Bridging the Gap Between Qualitative and Quantitative ...

    Science education researchers typically face a trade-off between more quantitatively oriented confirmatory testing of hypotheses, or more qualitatively oriented exploration of novel hypotheses. More recently, open-ended, constructed response items were used to combine both approaches and advance assessment of complex science-related skills and competencies. For example, research in assessing ...

  22. A Practical Guide to Writing Quantitative and Qualitative Research

    INTRODUCTION. Scientific research is usually initiated by posing evidenced-based research questions which are then explicitly restated as hypotheses.1,2 The hypotheses provide directions to guide the study, solutions, explanations, and expected results.3,4 Both research questions and hypotheses are essentially formulated based on conventional theories and real-world processes, which allow the ...

  23. Distinguishing Between Quantitative and Qualitative Research: A

    Living within blurry boundaries: The value of distinguishing between qualitative and quantitative research. Journal of Mixed Methods Research , 12(3), 268-279. Crossref