Data Analysis & Interpretation

This section introduces essential AI-powered analytical tools designed to help behavioural researchers extract meaningful insights from diverse datasets across research methodologies.

For qualitative analyses:

  • AI can extract key themes and trends from large textual datasets.
  • NLP models can analyse sentiment in qualitative responses.

For quantitative analyses:

  • AI can generate visualisations for complex data trends.
  • AI can provide quick solutions to coding errors.
  • AI-powered tools like Julius AI and GenAI platforms may help analyse statistical data.

 

Tools

  • Julius AI (free & paid): https://julius.ai/chat

    Julius AI is a comprehensive data analysis platform that processes multiple file formats, provides visualizations, and executes code in Python and R. The tool offers automatic data correction, PDF information extraction, and structured "Workflows" for common analytical procedures, while employing "Advanced Reasoning" to break complex requests into manageable steps with web search capabilities to enhance analysis. More information is available in this video: https://www.youtube.com/watch?v=OdHLOqxM_XQ&t=41s.

  • Ailyze (paid): https://www.ailyze.com/

    Ailyze can be used for assisted coding, document analysis, and thematic/content analysis. There is also an AI interviewer option for collecting qualitative data, but the free version only offers limited features for qualitative data analysis.

  • “Hugging Face (free & paid tiers): https://huggingface.co

    Hugging Face hosts thousands of AI models and datasets, many open-source and useful for research. Researchers can access pre-trained models for sentiment analysis, emotion detection, text classification, and named entity recognition to analyse interview transcripts, survey responses, and social media data. Models can be tested via web interface, downloaded for local use, or accessed through the Inference API (free tier with limits; paid for extensive use). The platform includes emotion datasets like GoEmotions and Twitter emotion data for training or validation. It is particularly valuable for analysing qualitative data at scale and automating coding of open-ended responses, though researchers should evaluate model quality and relevance for their specific needs.

  • GenAI platforms such as ChatGPT, Claude, Gemini and Deepseek can be helpful in qualitative analyses, as well as generating R or Python codes for quantitative analyses.
    • Quantitative analyses in R, Stata, or Python: it can help generate and debug efficient code for complex statistical procedures, creating customised data visualisations, and translating technical outputs into clear interpretations. You can prompt it to recommend appropriate statistical methods based on research questions, generate code for data preprocessing tasks, and write comprehensive documentation (e.g. ReadMe files) for reproducibility. AI assistants can also convert analysis scripts between languages while maintaining analytical integrity if you would like to switch analysis software.
    • More sophisticated AI-powered code editors include GitHub’s Copilot, Cursor and Windsurf. They are also helpful in building web and smartphone applications for your study design

Examples

Research shows that it can be risky for researchers with little expertise in quantitative analyses to use GenAI for such quantitative methods: Prandner and colleagues (2025) found for analysis procedures were often general and not clearly illustrated, making it difficult for a naïve user to make informed decisions. It also shows difficulties with proprietary software (SPSS) but more promising results with open-source software (R). The AI can act as a starting point or provide partial solutions, but without a foundational understanding of statistics and software, the naïve user may struggle to evaluate the AI’s output, make informed decisions, and correct errors.

  • Prandner, D., Wetzelhütter, D., & Hese, S. (2025). ChatGPT as a data analyst: An exploratory study on AI-supported quantitative data analysis in empirical research. Frontiers in Education, 9, 1417900. https://doi.org/10.3389/feduc.2024.1417900  

For sentiment analysis, Lossio-Ventura and colleagues (2024)found that LLMs, particularly ChatGPT and fine-tuned OPT, represent a significant advancement in sentiment analysis, especially for health-related survey data. These models offer superior performance and efficiency gains. However, they caution that users should be aware of the limitations, particularly concerning PHI, potential errors with linguistic nuances and specialized terminology, and the necessity for human oversight. Open-source LLMs like OPT provide a promising alternative

  • Lossio-Ventura, J. A., Weger, R., Lee, A. Y., Guinee, E. P., Chung, J., Atlas, L., Linos, E., & Pereira, F. (2024). A Comparison of ChatGPT and Fine-Tuned Open Pre-Trained Transformers (OPT) Against Widely Used Sentiment Analysis Tools: Sentiment Analysis of COVID-19 Survey Data. JMIR Mental Health, 11(1), e50150. https://doi.org/10.2196/50150  

Wheeler (2025) provides an excellent introductory how-to guide on using Generative AI to assist in Qualitative Data analysis

  • Wheeler, Kathryn (2025) How to use Generative AI to Assist the Analysis of Qualitative Data [How-to Guide]. Sage Research Methods: Data and Research Literacy. DOI https://doi.org/10.4135/9781036217471 (In Press)”

In a qualitative study on engagement with research, Floris and colleagues (2024) employed Ailyze for rapid automated identification of preliminary themes from the dataset. This technological approach was then complemented by manual thematic analysis to refine the findings. According to the paper, the combined analyses allowed for thorough verification of themes, ensuring they accurately reflected the complexities and nuances present in participants' experiences.

  • Floris, F. D., Widiati, U., Renandya, W. A., & Basthomi, Y. (2024). Engagement with research: A qualitative study of English department teachers’ experiences and insights. Social Sciences & Humanities Open, 9, 100846. https://doi.org/10.1016/j.ssaho.2024.100846  

Cameron (2025) at MIT also compared the performance of thematic analysis conducted by trained graduate students and Ailyze, and found that the AI-powered tool was 75% faster and assessed as producing higher quality output.

Smirnov (2025) used simulations with ChatGPT and Claude to perform content analyses and narrative analyses and compared them with real human output. Their findings suggest that LLMs hold great promise with the right prompts in coding textual data and generating insightful themes with increased credibility and reduced time.

  • Smirnov, E. (2025). Enhancing qualitative research in psychology with large language models: A methodological exploration and examples of simulations. Qualitative Research in Psychology, 22(2), 482–512. https://doi.org/10.1080/14780887.2024.2428255  

Researchers also highlight the potential of LLMs as collaborative analytical partners to enhance qualitative by enabling researchers to “converse” with textual data through targeted questioning, revealing patterns and connections that might otherwise remain hidden. AI tools such as Claude can dramatically accelerate traditional tasks like transcription, coding, and theme identification while maintaining the researcher's control over interpretation and conceptual framing. 

  • Hayes, A. S. (2025). “Conversing” With Qualitative Data: Enhancing Qualitative Research Through Large Language Models (LLMs). International Journal of Qualitative Methods, 24, 16094069251322346. https://doi.org/10.1177/16094069251322346