Qualitative Research Data Analysis: How Researchers Study Text, Images, and Audio


When researchers start collecting qualitative data, they work with non-number-based information such as interview scripts, images, or audio recordings. Analysing this kind of data might seem tough, however, qualitative data analysis gives us organised ways to understand detailed descriptive info. Let's look at how researchers address analytics in academic research for texts, images, and audio, while also talking about why research data analysis matters overall.
What is Qualitative Data Analysis?
Qualitative data analysis examines and interprets non-numerical data to understand underlying themes, patterns, or stories. Unlike quantitative analysis, which focuses on numbers and statistical relationships, qualitative analysis emphasises meaning, context, and subjective interpretation. This type of research data analysis is often used in fields like sociology, anthropology, psychology, and education.
Steps in Qualitative Research Data Analysis
1. Data Familiarisation
Before jumping into research data analysis, researchers take time to really get familiar with the data. They might read transcripts, examine images, or listen to audio recordings several times. The goal is to fully understand the content and its context.
For example, a researcher looking at workplace communication might listen to recordings of team meetings to get a sense of the tone, how the conversation flows, and the main topics being discussed.
2. Coding the Data
Coding is a fundamental step in research data analysis. Researchers break the data into smaller parts by assigning labels or codes to segments of text, images, or audio. These codes represent themes, ideas, or categories that emerge from the data.
3. Identifying Themes
After coding the data, researchers start grouping similar codes into bigger themes. These themes are the main ideas or patterns that help answer the research question.
4. Interpreting the Data
Interpretation in research data analysis is about making sense of the themes in light of the research goals. Researchers dig into what the data is showing and how it connects to their questions or hypotheses.
5. Presenting Findings
The final step in research data analysis is putting all the insights into a clear, engaging story. This often involves using quotes from participants, highlighted images, or excerpts from audio transcripts to back up the findings.

Tools for Qualitative Data Analysis
While many researchers still analyze data manually, software tools can make the process a lot easier. Softwares like NVivo, ATLAS.ti, and MAXQDA help with coding, organising, and visualising the data. These tools are especially helpful in research data analysis when working with large datasets.
Applications of Qualitative Data Analysis
1. Text Analysis
Textual data includes interview transcripts, written surveys, and documents. Researchers examine word choice, sentence structure, and overall content to uncover insights.
2. Image Analysis
Analysing images in research data analysis involves looking at visual elements like colour, composition, and symbolism. This is often used in media studies, art history, and cultural research.
3. Audio Analysis
Audio data analysis in research, such as recorded interviews or podcasts, requires careful listening to capture nuances like tone, emphasis, and pauses.
The Importance of Data Analysis in Research
Qualitative data analysis is a vital part of research, it helps to uncover the stories and meanings behind the numbers. It gives context and depth to numerical data. By working with non-numerical data researchers can:
- Understand how people think and behave in different situations
- Explore cultural and social trends to see how they shape communities.
- Build theories based on real-life experiences and observations
Qualitative Research Best Practices
1. Be Clear About Your Purpose
Start with a straightforward question or goal. Why are you conducting this research? Knowing what you're looking for helps you stay focused and avoid getting lost in the details when conducting research data analysis.
2. Choose the Right People
Who can give you the best insights? Look for a mix of people with different experiences or perspectives. That makes it more valuable.
3. Build a Comfortable Environment
Imagine yourself as one of the participants. Would you be at ease expressing your opinions in this setting? People are more open in an informal, welcoming environment.
4. Keep an Open Mind
The unexpected may lead to the most insightful discoveries. Be adaptable and curious; go with the flow of the discussion.
5. Pay Attention to the Details
Make thorough notes or, with consent, record the conversation. A person's tone, pauses, and body language can all give away a lot about their intentions.
6. Treat People with Care
Be mindful of participants' boundaries, privacy, and time. Make sure they understand how their contributions will be used and that their contributions are valued.
7. Organise Your Findings
Sort your data into themes or patterns once you have it. Look for frequent arguments people give when answering your question.
8. Share What You Learn
Use actual quotes or cases as you write up your findings so that readers may see what others are saying in their own words.
9. Keep Learning
Each project is an opportunity to develop your abilities. To improve even more over time, take note of what went well and what didn't.

In conclusion
Research data analysis in qualitative studies turns raw data into insights. Whether it’s text, images or audio this process helps researchers explore the personal and cultural aspects of their work, to gain a deeper understanding of the experiences and views behind the data. It combines structure with interpretation to make rich descriptive data meaningful.

From Boolean to Intelligent Search: A Librarian’s Guide to Smarter Information Retrieval
For decades, librarians have been the trusted guides in the vast world of information. But today, that world has grown into something far more complex. Databases multiply, metadata standards evolve, and users expect instant answers. Traditional search still relies on structured logic, keywords, operators, and carefully crafted queries. AI enhances this by interpreting intent rather than just words. Instead of matching text, AI tools for librarians analyse meaning. A researcher looking for “climate change effects on migration” won’t just get papers containing those words, but research exploring environmental displacement, socioeconomic factors, and regional studies. This shift from keyword to context means librarians can spend less time teaching a researcher how to “speak database” and more time helping them evaluate and use the results effectively. The Evolution of Library Search Traditional search engines focus on keywords and often return long lists of potential matches. With AI, libraries can now benefit from search engines that employ natural language processing (NLP) and machine learning (ML) to understand user queries and map them to the most relevant resources, even when key terms are missing or imprecise. Semantic search, embedding-based retrieval, and vector databases allow AI to find conceptually similar resources and suggest new directions for research. Examples of AI Tools for Librarians AI ToolMain FunctionLibrarian BenefitZendyAI-powered platform offering literature discovery, summarisation, keyphrase highlighting, and PDF analysisSupports researchers with instant insights, simplifies literature reviews, and improves discovery across 40M+ publicationsConsensusAI-powered academic search enginemanaging citation libraries, efficient literature reviewEx Libris PrimoIntegrates AI for discovery and metadata managementImproves record accuracy and user experienceMeilisearchFast, scalable vector search with NLPEnhanced search for large content databases The Ethics of Intelligent Search AI doesn’t just retrieve; it prioritises. AI tools for librarians determine which results appear first, whose research receives visibility, and what remains hidden. This creates ethical questions around transparency and bias. Librarians are uniquely positioned to question those algorithms, advocate for equitable access, and ensure users understand how results are ranked. In an AI-driven world, digital literacy extends beyond knowing how to search—it’s about learning how machines think. In conclusion AI tools for librarians are becoming more accessible. Platforms now integrate summarisation, concept mapping, and citation analysis directly into search. helping librarians and users avoid unreliable content. For libraries, experimenting with these tools can mean faster reference responses, smarter cataloguing, and better support for researchers drowning in information overload. .wp-block-image img { max-width: 85% !important; margin-left: auto !important; margin-right: auto !important; }

Why AI like ChatGPT still quotes retracted papers?
AI models like ChatGPT are trained on massive datasets collected at specific moments in time, which means they lack awareness of papers retracted after their training cutoff. When a scientific paper gets retracted, whether due to errors, fraud, or ethical violations, most AI systems continue referencing it as if nothing happened. This creates a troubling scenario where researchers using AI assistants might unknowingly build their work on discredited foundations. In other words: retracted papers are the academic world's way of saying "we got this wrong, please disregard." Yet the AI tools designed to help us navigate research faster often can't tell the difference between solid science and work that's been officially debunked. ChatGPT and other assistants tested Recent studies examined how popular AI research tools handle retracted papers, and the results were concerning. Researchers tested ChatGPT, Google's Gemini, and similar language models by asking them about known retracted papers. In many cases, they not only failed to flag the retractions but actively praised the withdrawn studies. One investigation found that ChatGPT referenced retracted cancer imaging research without any warning to users, presenting the flawed findings as credible. The problem extends beyond chatbots to AI-powered literature review tools that researchers increasingly rely on for efficiency. Common failure scenarios The risks show up across different domains, each with its own consequences: Medical guidance: Healthcare professionals consulting AI for clinical information might receive recommendations based on studies withdrawn for data fabrication or patient safety concerns Literature reviews: Academic researchers face citation issues when AI assistants suggest retracted papers, damaging credibility and delaying peer review Policy decisions: Institutional leaders making evidence-based choices might rely on AI-summarised research without realising the underlying studies have been retracted A doctor asking about treatment protocols could unknowingly follow advice rooted in discredited research. Meanwhile, detecting retracted citations manually across hundreds of references proves nearly impossible for most researchers. How Often Retractions Slip Into AI Training Data The scale of retracted papers entering AI systems is larger than most people realise. Crossref, the scholarly metadata registry that tracks digital object identifiers (DOIs) for academic publications, reports thousands of retraction notices annually. Yet many AI models were trained on datasets harvested years ago, capturing papers before retraction notices appeared. Here's where timing becomes critical. A paper published in 2020 and included in an AI training dataset that same year might get retracted in 2023. If the model hasn't been retrained with updated data, it remains oblivious to the retraction. Some popular language models go years between major training updates, meaning their knowledge of the research landscape grows increasingly outdated. Lag between retraction and model update Training Large Language Models requires enormous computational resources and time, which explains why most AI companies don't continuously update their systems. Even when retraining occurs, the process of identifying and removing retracted papers from massive datasets presents technical challenges that many organisations haven't prioritised solving. The result is a growing gap between the current state of scientific knowledge and what AI assistants "know." You might think AI systems could simply check retraction databases in real-time before responding, but most don't. Instead, they generate responses based solely on their static training data, unaware that some information has been invalidated. Risks of Citing Retracted Papers in Practice The consequences of AI-recommended retracted papers extend beyond embarrassment. When flawed research influences decisions, the ripple effects can be substantial and long-lasting. Clinical decision errors Healthcare providers increasingly turn to AI tools for quick access to medical literature, especially when facing unfamiliar conditions or emerging treatments. If an AI assistant recommends a retracted study on drug efficacy or surgical techniques, clinicians might implement approaches that have been proven harmful or ineffective. The 2020 hydroxychloroquine controversy illustrated how quickly questionable research spreads. Imagine that dynamic accelerated by AI systems that can't distinguish between valid and retracted papers. Policy and funding implications Government agencies and research institutions often use AI tools to synthesise large bodies of literature when making funding decisions or setting research priorities. Basing these high-stakes choices on retracted work wastes resources and potentially misdirects entire fields of inquiry. A withdrawn climate study or economic analysis could influence policy for years before anyone discovers the AI-assisted review included discredited research. Academic reputation damage For individual researchers, citing retracted papers carries professional consequences. Journals may reject manuscripts, tenure committees question research rigour, and collaborators lose confidence. While honest mistakes happen, the frequency of such errors increases when researchers rely on AI tools that lack retraction awareness, and the responsibility still falls on the researcher, not the AI. Why Language Models Miss Retraction Signals The technical architecture of most AI research assistants makes them inherently vulnerable to the retraction problem. Understanding why helps explain what solutions might actually work. Corpus quality controls lacking AI models learn from their training corpus, the massive collection of text they analyse during development. Most organisations building these models prioritise breadth over curation, scraping academic databases, preprint servers, and publisher websites without rigorous quality checks. The assumption is that more data produces better models, but this approach treats all papers equally regardless of retraction status. Even when training data includes retraction notices, the AI might not recognise them as signals to discount the paper's content. A retraction notice is just another piece of text unless the model has been specifically trained to understand its significance. Sparse or inconsistent metadata Publishers handle retractions differently, creating inconsistencies that confuse automated systems: Some journals add "RETRACTED" to article titles Others publish separate retraction notices A few quietly remove papers entirely This lack of standardisation means AI systems trained to recognise one retraction format might miss others completely. Metadata، the structured information describing each paper, often fails to consistently flag retraction status across databases. A paper retracted in PubMed might still appear without warning in other indexes that AI training pipelines access. Hallucination and overconfidence AI hallucination occurs when models generate plausible-sounding but false information, and it exacerbates the retraction problem. Even if a model has no information about a topic, it might confidently fabricate citations or misremember details from its training data. This overconfidence means AI assistants rarely express uncertainty about the papers they recommend, leaving users with no indication that additional verification is needed. Real-Time Retraction Data Sources Researchers Should Trust While AI tools struggle with retractions, several authoritative databases exist for manual verification. Researchers concerned about citation integrity can cross-reference their sources against these resources. Retraction Watch Database Retraction Watch operates as an independent watchdog, tracking retractions across all academic disciplines and publishers. Their freely accessible database includes detailed explanations of why papers were withdrawn, from honest error to fraud. The organisation's blog also provides context about patterns in retractions and systemic issues in scholarly publishing. Crossref metadata service Crossref maintains the infrastructure that assigns DOIs to scholarly works, and publishers report retractions through this system. While coverage depends on publishers properly flagging retractions, Crossref offers a comprehensive view across multiple disciplines and publication types. Their API allows developers to build tools that automatically check retraction status, a capability that forward-thinking platforms are beginning to implement. PubMed retracted publication tag For medical and life sciences research, PubMed provides reliable retraction flagging with daily updates. The National Library of Medicine maintains this database with rigorous quality control, ensuring retracted papers receive prominent warning labels. However, this coverage is limited to biomedical literature, leaving researchers in other fields without equivalent resources. DatabaseCoverageUpdate SpeedAccessRetraction WatchAll disciplinesReal-timeFreeCrossrefPublisher-reportedVariableFree APIPubMedMedical/life sciencesDailyFree Responsible AI Starts with Licensing When AI systems access research papers, articles, or datasets, authors and publishers have legal and ethical rights that need protection. Ignoring these rights can undermine the sustainability of the research ecosystem and diminish trust between researchers and technology providers. One of the biggest reasons AI tools get it wrong is that they often cite retracted papers as if they’re still valid. When an article is retracted, e.g. due to peer review process not being conducted properly or failing to meet established standards, most AI systems don’t know, it simply remains part of their training data. This is where licensing plays a crucial role. Licensed data ensures that AI systems are connected to the right sources, continuously updated with accurate, publisher-verified information. It’s the foundation for what platforms like Zendy aim to achieve: making sure the content is clean and trustworthy. Licensing ensures that content is used responsibly. Proper agreements between AI companies and copyright holders allow AI systems to access material legally while providing attribution and, when appropriate, compensation. This is especially important when AI tools generate insights or summaries that are distributed at scale, potentially creating value for commercial platforms without benefiting the sources of the content. in conclusion, consent-driven licensing helps build trust. Publishers and authors can choose whether and how their work is incorporated into AI systems, ensuring that content is included only when rights are respected. Advanced AI platforms, such as Zendy, can even track which licensed sources contributed to a particular output, providing accountability and a foundation for equitable revenue sharing. .wp-block-image img { max-width: 85% !important; margin-left: auto !important; margin-right: auto !important; }

5 Tools Every Librarian Should Know in 2025
The role of librarians has always been about connecting people with knowledge. But in 2025, with so much information floating around online, the challenge isn’t access, it’s sorting through the noise and finding what really matters. This is where AI for libraries is starting to make a difference. Here are five that are worth keeping in your back pocket this year. 1. Zendy Zendy is a one-stop AI-powered research library that blends open access with subscription-based resources. Instead of juggling multiple platforms, librarians can point students and researchers to one place where they’ll find academic articles, reports, and AI tools to help with research discovery and literature review. With its growing use of AI for libraries, Zendy makes it easier to summarise research, highlight key ideas, and support literature reviews without adding to the librarian’s workload. 2. LibGuides Still one of the most practical tools for librarians, LibGuides makes it easy to create tailored resource guides for courses, programs, or specific assignments. Whether you’re curating resources for first-year students or putting together a subject guide for advanced research, it helps librarians stay organised while keeping information accessible to learners. 3. OpenRefine Cleaning up messy data is nobody’s favourite job, but it’s a reality when working with bibliographic records or digital archives. OpenRefine is like a spreadsheet, but with superpowers, it can quickly detect duplicates, fix formatting issues, and make large datasets more manageable. For librarians working in cataloguing or digital collections, it saves hours of tedious work. 4. PressReader Library patrons aren’t just looking for academic content; they often want newspapers, magazines, and general reading material too. PressReader gives libraries a simple way to provide access to thousands of publications from around the world. It’s especially valuable in public libraries or institutions with international communities. 5. OCLC WorldShare Managing collections and sharing resources across institutions is a constant task. OCLC WorldShare helps libraries handle cataloguing, interlibrary loans, and metadata management. It’s not flashy, but it makes collaboration between libraries smoother and ensures that resources don’t sit unused when another community could benefit from them. Final thought The tools above aren’t just about technology, they’re about making everyday library work more practical. Whether it’s curating resources with Zendy, cleaning data with OpenRefine, or sharing collections through WorldShare, these platforms help librarians do what they do best: guide people toward knowledge that matters. .wp-block-image img { max-width: 85% !important; margin-left: auto !important; margin-right: auto !important; }
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom