Translating scientific evidence into effective policies for health and technology requires care
Acknowledgments
We are particularly thankful to Yeeun Archer Lee, Felix Cheung, and Lara Aknin for their review of this chapter. We thank the WHR team, and the other authors in this edition for their helpful comments and discussions. We also thank Dr Victoria Nash for her expertise and discussion early on in this work.

Key insights
Professional science organisations that have examined social media and adolescent mental health have reached different conclusions and policy recommendations despite examining similar research. Given their substantial influence on policy and public understanding, it is important to investigate their evidence synthesis practices.
Our analysis of three high-profile reports on social media and adolescent mental health finds that they cited broadly similar types of research, yet showed little overlap (<1%) in their sources.
We also found considerable variation in how the reports synthesise, communicate, and simplify evidence, including differences in citation accuracy, contextual detail, limitation acknowledgement, and conclusion strength.
The stakes of getting these syntheses right are substantial. Poor synthesis quality risks producing policies that are ineffective or cause unintended harm, and may contribute to the erosion of public trust in scientific institutions more broadly.
When communicating the state of a complex scientific field, it is crucial to be honest about shortcomings and uncertainties, and to maximise fidelity to the underlying research. As scientists committed to rigorous, transparent, and replicable approaches to understanding complex phenomena, we have a responsibility to consistently uphold standards that justify claims to scientific authority and to identify opportunities for improving practices within our community.
Introduction
The question of how social media engagement relates to adolescent mental health attracts intense public concern and demand for scientific guidance.[1] Indeed, with adolescents spending substantial time on social media platforms and rising adolescent mental health concerns, policymakers, parents, educators, and practitioners are increasingly seeking evidence-based guidance on appropriate responses.[2] The stakes are high: policymaking in this domain directly affects vulnerable and young populations, involves significant resources, and shapes how technology companies design platforms used by billions of young people worldwide. This is hard to get right because balancing the potential benefits and unintended consequences of new policies requires rigorous synthesis and communication of the available research evidence.
Indeed, research indicates that reviewing and translating research into policy guidance is a challenging process.[3] Individual studies, however well-designed, cannot alone constitute what we know about a social phenomenon. Scientific understanding emerges from integrating findings across multiple investigations, navigating often contradictory results, weighing methodological differences, and accurately characterising what is known and what remains uncertain.[4] This synthesis work is fundamental to how research contributes to evidence-based policy. Professional science organisations, such as the American Psychological Association and the National Academies of Sciences, serve as crucial intermediaries in this process by translating complex research into accessible reports with policy recommendations designed to be read by diverse audiences.[5]
Because these organisations represent distinct academic disciplines, they may produce and synthesise studies investigating the same question, but their distinct methodological and disciplinary emphases mean they might reach markedly different conclusions. For instance, while the US Surgeon General’s Office advocates to “pursue policies that further limit access to social media for all children,”[6] the National Academies found that current evidence does not support population-level causal conclusions and noted that “the committee sympathizes with some parents’ desire for authoritative prescriptions on teenagers’ social media use but [are] also mindful of overreaching the data”,[7] cautioning against strict age limits. What should readers and policymakers make of this difference?
When authoritative bodies reach different conclusions from scientific evidence, this raises questions about the practices governing evidence synthesis in these settings. Despite the substantial influence these reports wield,[8] the processes by which organisations synthesise and communicate evidence rarely receive systematic evaluation. And yet, these synthesis practices powerfully shape both policy development[9] and public understanding. Practitioners use them to stay current with rapidly evolving research fields,[10] policymakers draw on them to inform legislative and regulatory deliberation,[11] and the general public often encounters them as accessible explanations of “what the science shows” on complex questions.[12] Given this broad influence, the quality of evidence synthesis in these documents matters considerably for downstream policy development and public understanding.
The stakes of getting synthesis reports right are substantial. Poor evidence translation risks policy ineffectiveness when interventions are poorly calibrated to what research actually demonstrates,[13] potential for unintended harms when policies proceed on overstated evidence,[14] and risk the erosion of public trust in scientific institutions more broadly.[15] In a domain where scientific understanding remains genuinely uncertain, intellectual honesty about what we do and do not know with confidence serves evidence-based policy far better than premature certainty constructed through selective emphasis or strong rhetoric.
When authoritative bodies reach different conclusions from scientific evidence, this raises questions about the practices governing evidence synthesis in these settings.
In this chapter, we take a step back from examining what makes empirical research good[16] and focus instead on how empirical research is translated into influential policy recommendations. More specifically, we investigated how professional science organisations synthesise research into clear policy guidance by analysing three high-profile US-based reports published between 2023 and 2024 by the National Academies of Sciences, Engineering, and Medicine (NASEM),[17] the American Psychological Association (APA),[18] and the US Surgeon General’s Office (OSG).[19]
We focus on three US reports for several reasons. First, these documents are highly prominent and frequently referenced in policy debates internationally, not just domestically — reflecting the intensity of US attention to social media and adolescent mental health during 2023–2024. Second, their production within an 18-month window means the organisations were theoretically drawing from a largely overlapping literature base, enabling comparison of synthesis practices while controlling for both national policy context and evidence availability. This temporal proximity is particularly important given how rapidly the social media effects literature evolves.
We acknowledge that similar evidence syntheses have been produced in other regions, including reports from the WHO,[20] OECD,[21] and various national governments.[22] A comprehensive cross-national comparison was beyond our scope, but we encourage readers to consider how the evaluative framework applied here might illuminate evidence translation practices in these diverse global contexts.
We classified the peer-reviewed research on social media and mental health cited across the reports by methodological characteristics, study design features, and thematic content to determine how organisations identified evidence bases. We also conducted a qualitative analysis of how organisations synthesised and communicated their selected evidence, examining citation accuracy, evidence integration practices, acknowledgment of limitations and contradictory findings, and rhetorical construction of conclusions. This mixed-methods approach enabled us to distinguish between differences stemming from evidence selection versus differences in synthesis and communication practices. In this chapter, we present our analysis, discuss what our findings reveal about evidence synthesis practices, consider implications for evidence-based policy in contested domains, and offer recommendations for improving synthesis quality based on our observations.
We retain confidence that scientific research can meaningfully inform policy deliberation on complex social questions, like the relationship between social media and adolescent wellbeing. However, realising this potential requires greater attention to the standards and practices governing how scientific research is synthesised and translated for policy guidance.

Background
The challenge of evidence-based policy in the social sciences
The evidence-based policy (EBP) movement argues that systematic use of research evidence can improve policy effectiveness, reduce unintended consequences, and promote democratic accountability by providing more transparency in the policy decision process.[23] The movement emerged from parallel developments in evidence-based medicine and the broader “what works” and “modernising government” agendas in social policy in the early 2000s.[24] While EBP has seen success in medicine, the translation of scientific research into policy decisions in the social sciences is rarely straightforward.[25] Social interventions often involve complex relationships difficult to isolate experimentally, ethical constraints limiting experimental manipulation, and contextual factors affecting generalisability.[26] Early EBP frameworks emphasised selecting high-quality evidence through hierarchies privileging randomised trials and systematic reviews,[27] but scholars increasingly recognise that directly applying medical approaches fails to account for social complexity.[28]
Furthermore, even with clear causal evidence, social policies are guided by values, societal goals, and political factors beyond empirical findings alone.[29] Parkhurst makes an important distinction between selecting “good evidence” and ensuring “good governance of evidence” which recognises that high-quality evidence is necessary but insufficient without faithful stewardship within complex decision-making processes.[30] Contemporary scholarship has moved beyond simple linear models to recognise evidence-based policy as fundamentally political, with research representing one input among many competing considerations.[31] IJzerman and colleagues propose “evidence-readiness levels” for the social sciences considering replication status, theoretical grounding, and policy applicability beyond methodological rigour alone.[32] While such frameworks represent crucial advances in evidence quality assessment, they should be coupled with increased attention to how evidence is synthesised and communicated for policy audiences.
The current state of social media and adolescent mental health science
This domain presents particularly acute translation challenges: (a) methodologically limited evidence, (b) high public stakes, (c) intense political pressure for guidance, and (d) recent proliferation of competing overlapping professional statements addressing similar questions.
Despite demand for definitive guidance, experts in adolescent mental health have raised significant concerns about methodological limitations in the evidence base that constrain the ability to establish causal relationships or generalise findings to broader policy-relevant contexts.[33] One fundamental challenge is the difficulty in defining, testing, and demonstrating convincing evidence of causal relationships. Much evidence relies on correlational designs, with limited longitudinal research examining effects over time.[34] Additionally, it is incredibly challenging to conduct true experimental work in this domain given the near ubiquity of social media in adolescent lives. Even experimental studies employing social media “detox” strategies face significant limitations, as participants remain embedded in social environments where social media presence is pervasive.[35]
Conceptually, screen time research faces challenges: time is finite, so increased screen time necessarily entails decreased time spent on other activities. Displacement theory highlights that even when studies demonstrate associations between screen time and poorer outcomes, it remains unclear whether effects stem from the presence of screen time itself or from the absence of displaced activities — sleep, exercise, socialising — or some combination.[36] Most research measures only screen time without capturing broader time-use patterns, limiting our ability to identify mechanisms.
Experts in adolescent mental health have raised significant concerns about methodological limitations in the evidence base that constrain the ability to establish causal relationships.
Correlational studies are subject to additional confounding factors, including generational differences (in both social media usage and in mental health outcomes) which make it difficult to isolate the specific contribution of social media use from other contemporary influences on mental health outcomes such as climate anxiety,[37] economic factors,[38] or increased mental health awareness and resultant clinical diagnoses.[39] Additional methodological challenges include self-report measurement limitations concerning both time spent on social media platforms and mental health indicators, which compromise the ability to validate findings or conduct meaningful comparisons across studies.[40] The literature consistently indicates that users, particularly adolescents, are poor estimators of their true screen time use.[41] Earlier research treated social media as homogeneous, focusing primarily on aggregate device screen time. Following critiques,[42] recent work adopts more sophisticated approaches, disaggregating platforms and considering specific features.
Longitudinal studies generally report small or mixed effects, with meta-analyses highlighting limited practical significance despite statistical significance.[43] Considering publication bias favouring significant findings, some experts argue effects are likely very small at population levels.[44] The research base also suffers from limited sample diversity, predominantly featuring educated, affluent adolescents despite being used to support nationwide frameworks,[45] with substantial portions of research conducted with adult rather than adolescent samples.[46]
These limitations exist within the broader context of psychology’s “replication crisis” and subsequent reforms.[47] The scientific community has increasingly emphasised improving research production through open science practices including pre-registration of study designs, data and code sharing, comprehensive conflict of interest reporting, and systematic replication efforts.[48] Despite the substantial attention on improving research production, less systematic scrutiny addresses how research synthesis might also be improved.

Professional science organisations and evidence synthesis
Professional science organisations occupy a distinctive position in this landscape.[49] Unlike advocacy groups, commercial interests, or individual commentators, these organisations explicitly claim scientific authority for their conclusions, positioning themselves as representing “what the research shows” or articulating “scientific consensus.” These organisations are frequently called on to provide authoritative scientific guidance on topics including environmental and climate change research, public health issues such as COVID-19, and education research.[50] Such reports are influential in the policymaking process and also contribute to broader public understanding of important social topics.[51] When organisations with similar mandates reach different policy conclusions, it raises questions about the quality of synthesis processes.
Standards for evidence synthesis vary considerably across contexts. Systematic reviews and meta-analyses in scientific publishing follow established protocols including PRISMA and Cochrane guidelines specifying transparent search strategies, explicit inclusion criteria, and structured quality assessment.[52] However, policy-facing evidence syntheses take many forms, and operate without industry-wide standardised methodological frameworks. In some domains, organisations have worked to establish their own frameworks to promote clear and accurate syntheses of evidence.[53]
When organisations with similar mandates reach different policy conclusions, it raises questions about the quality of synthesis processes.
Few studies have examined evidence use in the social media and adolescent mental health domain specifically. Richards and colleagues[54] analysed evidence cited in US pre-trial filings against social media companies, revealing selective referencing including reliance on outdated research, limited population samples, and under-specified health outcomes. Their descriptive mapping approach quantified evidence characteristics related to scientific rigour, including methodology, thematic appropriateness, and population specificity. Examining evidence communication across policy documents, Elson and colleagues[55] analysed professional organisation policies on a wide range of media effects produced before 2018, finding systematic translation issues including overstatement of causal claims, selective citation, and inadequate representation of methodological limitations. Their work established approaches for evaluating translation fidelity, though it did not examine how individual evidence pieces are represented.
The present study
This chapter presents a systematic analysis of three major policy documents addressing social media and adolescent mental health, issued between 2023 and 2024 by the American Psychological Association (APA), the National Academies of Sciences, Engineering, and Medicine (NASEM), and the US Surgeon General’s Office (OSG). These organisations are well-resourced, scientifically sophisticated actors with explicit mandates to provide authoritative evidence synthesis for policy purposes, making their work especially informative for understanding how evidence syntheses are conducted in practice.

However, it is important to acknowledge that these organisations differ in institutional character and stated aims. Their reports also varied in scope: the OSG advisory aimed to “call attention to growing concerns” and provide urgent, actionable recommendations; the APA advisories sought to “summarise psychological science for stakeholders”; and NASEM explicitly aimed to “comprehensively examine current research” through systematic review, which partly explains its substantially greater length. All three focused broadly on adolescent health outcomes in relation to social media, encompassing both clinical mental health indicators and broader wellbeing measures, though the OSG report focuses more narrowly on mental health outcomes. Table 4.1 outlines some key differences in the organisation types, stated goals, and policy conclusions of these three reports.
We employ a mixed-methods approach building on prior frameworks for critically appraising policy statement quality. Firstly, we characterise what evidence the reports cite. Our quantitative analysis adapts Richards et al.'s citation mapping methodology[56] to characterise the evidence base underlying each report, systematically coding all 617 unique academic sources for methodological characteristics, study design features, and thematic content. This enables comparison of whether organisations drew upon fundamentally different evidence or selected from similar research pools. Secondly, our qualitative analysis extends Elson and colleagues’ framework[57] for identifying common problems in evidence communication, examining how organisations synthesised and communicated their selected evidence, including citation accuracy, evidence integration practices, and rhetorical construction of conclusions.
Table 4.1 presents the policy documents analysed, highlighting key organisational differences and divergent policy positions. Despite all three organisations examining scientific understanding of social media and mental health during the same period, they reached notably different conclusions with correspondingly different policy recommendations ranging from urgent calls for population-level age restrictions to cautions against interventions that outpace empirical support. Please refer to Online Appendix 4A for a detailed comparison of the conclusions and policy positions of the three reports.
| Organisation | Org type | Report title | Date | Length | Stated goals | Key conclusions and recommendations |
|---|---|---|---|---|---|---|
| APA | Professional scientific association | Health advisory on social media use in adolescents | May 2023 | 6pg | Science-informed recommendations for stakeholders | Concludes the use of social media is not inherently beneficial or harmful to young people. Recommends industry standards, parental monitoring, and platform design changes to prioritise youth safety. |
| | | Potential risks of content, features, and functions | Apr 2024 | 11pg | Elaborate on science relevant to policy solutions | Focuses on specific platform features requiring modification; emphasises design-level interventions. |
| NASEM | Congressionally chartered advisory body | Social media and adolescent health | Dec 2024 | 287pg | Comprehensive systematic examination of current research | Finds the literature did not support the conclusion that social media causes changes in adolescent health at the population level. Concludes social media can both harm and improve adolescent health. Cautions against population-level interventions; emphasises individual differences and potential benefits alongside risks. |
| OSG | Federal government office | Social media and youth mental health | May 2023 | 25pg | Call urgent attention; provide actionable recommendations | Advocates pursuing policies to further limit access, including strengthening age minimums, and strengthening protections to ensure greater safety for children interacting on social media platforms. Characterises the evidence as insufficient to conclude platforms are sufficiently safe for youth. |
What types of evidence are included?
Methods
This study was exploratory rather than confirmatory, aimed at characterising patterns in evidence use across policy documents rather than testing pre-specified hypotheses. Accordingly, it was not pre-registered. We systematically analysed all citations from the three policy documents, extracting 617 unique peer-reviewed scientific articles from 1,063 total citations, to examine whether organisations’ different policy conclusions reflected systematic differences in evidence selection patterns. Given our focus on social media effects research in particular, we only retained articles whose primary focus was on social media effects (n = 355). This criterion excluded articles written primarily about non-media-related neurological development, sleep, media literacy, and education technology. We defined social media very broadly, deferring to whether each article positioned itself as investigating social media. This means articles with varying definitions, some encompassing social networking, smartphone use, or general screen time, were included, whereas articles focusing only on video games were excluded. We acknowledge the difficulty of drawing a line in the sand here. These criteria were constructed to enable our analysis to focus as far as possible on research that underlies much of the debate surrounding social media’s effects, which has historically drawn on the broader literature on screen time and smartphone use.
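As a concrete illustration of this screening step, a minimal sketch is shown below. The file name and column names (`citations.csv`, `item_type`, `doi`, `primary_focus`) are hypothetical stand-ins rather than a reproduction of our actual pipeline.

```python
import pandas as pd

# Load one row per citation; file and column names are illustrative.
citations = pd.read_csv("citations.csv")  # 1,063 citations in our data

# Keep peer-reviewed journal articles and deduplicate across reports.
articles = citations[citations["item_type"] == "journal_article"]
unique_articles = articles.drop_duplicates(subset="doi")  # 617 unique works

# Retain articles whose primary focus is social media effects, excluding
# e.g. non-media neurological development, sleep, or education technology.
in_scope = unique_articles[unique_articles["primary_focus"] == "social_media_effects"]
print(len(in_scope))  # 355 in our data
```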
Following Richards et al.'s[58] framework for evaluating social media evidence use in policy statements, we categorised each cited article along three dimensions: study methodology, thematic focus, and sample characteristics.[59] This mapping enabled systematic comparison of what types of social media research each organisation drew upon to support policy recommendations. We also assessed whether each study’s methodology could plausibly support causal inferences,[60] given the centrality of causal claims in policy debates about social media effects.
Given the large citation volume, we employed an AI-assisted classification system with validation protocols to promote reliability.[61] The system processed article abstracts using structured prompts, with explicit instructions to respond with “inconclusive” when information was unclear or required inference beyond explicit abstract content. We implemented formal validation testing to ensure classification accuracy before proceeding with full analysis. Due to resource constraints, articles classified as “inconclusive” remained coded as such rather than conducting full-text reviews. Full reporting on inconclusive classifications appears in the results.
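The classification behaviour described above can be sketched as follows. This is an illustrative outline only, assuming a generic `call_model` interface to whichever language model is used; the label set abbreviates the methodology categories reported in Table 4.3.

```python
# Sketch of a structured classification prompt with an explicit
# "inconclusive" fallback; call_model is a placeholder, not our system.
METHODS = [
    "meta-analysis or systematic review",
    "experimental",
    "cross-sectional or cohort",
    "longitudinal",
    "mixed-methods",
    "ethnographic or qualitative",
]

def build_prompt(abstract: str) -> str:
    return (
        "Classify the study in this abstract by methodology.\n"
        f"Allowed labels: {', '.join(METHODS)}.\n"
        "Respond with 'inconclusive' if the methodology is unclear or "
        "would require inference beyond the explicit abstract content.\n\n"
        f"Abstract: {abstract}"
    )

def classify_methodology(abstract: str, call_model) -> str:
    label = call_model(build_prompt(abstract)).strip().lower()
    # Out-of-vocabulary responses are coded as inconclusive rather than
    # being coerced to the nearest valid label.
    return label if label in METHODS else "inconclusive"
```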
Chi-square tests of independence compared categorical distributions across organisations,[62] with appropriate corrections for multiple testing.[63] Given substantial sample size differences between reports, we conducted sensitivity analyses to assess whether findings remained robust to these imbalances. Statistical significance was assessed using both raw and corrected p-values at conventional α = 0.05, with effect sizes reported to distinguish statistical from practical significance. More methodological detail is available in Online Appendix 4B.
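To make this inferential step concrete, the sketch below runs a chi-square test on the methodology counts from Table 4.3, applies a Benjamini-Hochberg correction, and reports Cramér's V as an effect size. The choice of correction here is illustrative; the procedures we actually used are described in Online Appendix 4B.

```python
import numpy as np
from scipy.stats import chi2_contingency
from statsmodels.stats.multitest import multipletests

def cramers_v(table: np.ndarray) -> float:
    # Effect size for an r x c table: V = sqrt(chi2 / (n * (min(r, c) - 1))).
    chi2 = chi2_contingency(table)[0]
    n = table.sum()
    return float(np.sqrt(chi2 / (n * (min(table.shape) - 1))))

# Rows: APA, NASEM, OSG; columns: the six methodology categories of
# Table 4.3 (inconclusive codes omitted for this illustration).
tables = {
    "methods": np.array([
        [13, 5, 14, 7, 1, 1],
        [104, 49, 94, 9, 7, 22],
        [18, 12, 14, 1, 1, 0],
    ]),
}

raw_p = [chi2_contingency(t)[1] for t in tables.values()]
_, corrected_p, _, _ = multipletests(raw_p, alpha=0.05, method="fdr_bh")

for name, p_raw, p_adj in zip(tables, raw_p, corrected_p):
    print(f"{name}: raw p = {p_raw:.3f}, corrected p = {p_adj:.3f}, "
          f"Cramér's V = {cramers_v(tables[name]):.2f}")
```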

Results
What types of evidence did organisations cite?
We analysed 1,063 total citations across the three reports to understand what types of sources they drew upon. The reports showed small but statistically significant differences in their source patterns,[64] though these differences may reflect varying approaches to supplementing core academic evidence rather than fundamental distinctions in evidential foundations.
The APA reports relied most heavily on peer-reviewed research (92%, k = 77), while the OSG and NASEM reports incorporated more diverse source types, with journal articles comprising 62% (k = 64) and 60% (k = 527) of their citations, respectively. However, these proportional differences mask important absolute numbers: the NASEM report’s 60% still represented 527 peer-reviewed studies, nearly seven times more academic research than the APA reports and over eight times more than the OSG report. The OSG and NASEM reports’ incorporation of diverse sources should be understood as expanding rather than substituting for scientific research. Table 4.2 shows the complete distribution of citation types.
| Item Type | APA n (%) | NAS n (%) | OSG n (%) | Total n (%) |
|---|---|---|---|---|
| Journal articles | 77 (91.7) | 527 (60.2) | 64 (62.1) | 668 (62.8) |
| Reports | 2 (2.4) | 86 (9.8) | 16 (15.5) | 104 (9.8) |
| Documents | 2 (2.4) | 83 (9.5) | 14 (13.6) | 99 (9.3) |
| News articlesa | 0 (0.0) | 78 (8.9) | 4 (3.9) | 82 (7.7) |
| Booksb | 2 (2.4) | 45 (5.1) | 3 (2.9) | 50 (4.7) |
| Other academic publicationsc | 0 (0.0) | 40 (4.6) | 0 (0.0) | 40 (3.8) |
| Other online mediad | 1 (1.2) | 17 (1.9) | 2 (1.9) | 20 (1.9) |
| Total | 84 (100.0) | 876 (100.0) | 103 (100.0) | 1,063 (100.0) |
| a Includes online newspaper or magazine articles. b Includes books and book sections. c Includes conference papers, preprints, and theses. d Includes blog posts, miscellaneous webpages, and video recordings. | ||||
The OSG report showed greater proportional reliance on professional reports (15.5%) and published documents (13.6%), while the NASEM report uniquely incorporated news articles (8.9%) and other academic publications including preprints and conference papers (4.6%). Notably, 20% of the APA reports’ citations referenced professional reports from organisations including itself, the AAP, and similar bodies.
These differences in source type distribution, while statistically significant, primarily reflect organisational approaches to incorporating supplementary materials alongside substantial cores of peer-reviewed literature. The characteristics of the academic literature selected — which formed the majority of citations across all organisations — showed considerably more convergence, as examined in the subsequent analysis.
How did citations overlap across organisations?
Despite the broadly similar characteristics of the scientific evidence across organisations, analysis of citation overlap revealed very few shared pieces of literature. Of the 668 total cited journal articles, there were 617 unique works, with 24 articles present in multiple sources. Only four citations appeared across all three documents, representing less than 1% of the total unique academic literature.[65] Pairwise overlaps reveal a further 20 articles found in two of the three reports. Figure 4.2 visualises these shared citation patterns. Online Appendix 4C contains details of the overlapping articles.
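The overlap quantities reported here reduce to simple set arithmetic over citation identifiers, as the minimal sketch below illustrates with toy identifiers standing in for real DOIs.

```python
def overlap_summary(apa: set, nasem: set, osg: set) -> dict:
    """Count works cited by all three reports, by exactly two, and overall."""
    all_three = apa & nasem & osg
    exactly_two = ((apa & nasem) | (apa & osg) | (nasem & osg)) - all_three
    unique = apa | nasem | osg
    return {
        "all_three": len(all_three),          # 4 in our data
        "exactly_two": len(exactly_two),      # 20 in our data
        "unique_total": len(unique),          # 617 in our data
        "share_all_three": len(all_three) / len(unique),  # < 1% in our data
    }

# Toy identifiers in place of real DOIs:
print(overlap_summary({"a", "b", "c"}, {"b", "c", "d"}, {"c", "e"}))
```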

This low overlap raised questions about whether organisations were drawing from different but equally impactful segments of the research landscape, or whether some organisations might be systematically selecting less influential or peripheral studies. Of particular concern was the observation that 20% (k = 17) of the APA report’s academic citations were authored by members of the report’s own advisory panel, suggesting potential bias toward self-citation rather than field-representative selection. The NASEM report also included several studies written by its committee members, but these made up a considerably smaller share of its overall scientific evidence base (3%, k = 27).
Methodological characteristics
To understand the types of research organisations selected, we systematically categorised all 617 unique peer-reviewed articles. Our analysis focused on the 355 articles whose primary topic was social media and adolescent health, which comprised 53% of the APA reports’ journal articles, 55% of the NASEM report’s, and 77% of the OSG report’s.[66] The academic literature cited across all organisations showed similar methodological distributions. The most frequently cited study types were meta-analyses or systematic reviews (35.8%, k = 127), cross-sectional studies (31.3%, k = 111), experimental studies (17.5%, k = 62), ethnographic or qualitative studies (6.5%, k = 23), and longitudinal designs (4.8%, k = 17). Notably, only 16.6% (k = 59) of cited articles employed methodologies that could plausibly support causal inferences.[67] Table 4.3 shows methodological characteristics by report.[68]
| Study characteristics | APA n (%) | NAS n (%) | OSG n (%) | TOTAL n (%) |
|---|---|---|---|---|
| Methods | ||||
| Meta-analyses, systematic, or narrative reviews | 13 (31.7) | 104 (36.2) | 18 (36.7) | 127 (35.8) |
| Experimental studies | 5 (12.2) | 49 (17.1) | 12 (24.5) | 62 (17.5) |
| Cross-sectional or cohort studies | 14 (34.1) | 94 (32.8) | 14 (28.6) | 111 (31.3) |
| Longitudinal studies | 7 (17.1) | 9 (3.1) | 1 (2.0) | 17 (4.8) |
| Mixed-methods studies | 1 (2.4) | 7 (2.4) | 1 (2.0) | 9 (2.5) |
| Ethnographic or qualitative studies | 1 (2.4) | 22 (7.7) | 0 (0.0) | 23 (6.5) |
| Inconclusive | 0 (0.0) | 2 (0.7) | 3 (6.1) | 6 (1.7) |
| Causality | ||||
| Plausible | 7 (17.1) | 44 (15.3) | 15 (30.6) | 59 (16.6) |
| Unable to be determined by method | 34 (82.9) | 243 (84.7) | 34 (69.4) | 296 (83.4) |
| Inconclusive | 0 (0.0) | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Total Documents | 41 (100.0) | 287 (100.0) | 49 (100.0) | 355 (100.0) |
Statistical testing revealed no significant organisational differences in methodological distributions after appropriate corrections for multiple comparisons.[69] Figure 4.3 indicates absolute counts and proportions of methodological characteristics in each report’s cited evidence.

Thematic characteristics
Thematic analysis revealed broad similarity in research focus across organisations. Most articles (82%, k = 292) studied general social media use or screen time without specifying platforms, potentially inclusive of TV or gaming. When platforms were specified, Facebook was most common (9%, k = 32), followed by Instagram (3%, k = 12). The most common health outcomes studied were generalised or unspecified mental health (39%, k = 137), depression (15%, k = 53), and body dissatisfaction (6%, k = 21). This is consistent with the historical focus on the negative impacts of media use on mental health, rather than the potential positive outcomes. Approximately 63% (k = 223) focused specifically on adolescent populations. Table 4.4 shows thematic characteristics by report.
| Study characteristics | APA n (%) | NAS n (%) | OSG n (%) | TOTAL n (%) |
|---|---|---|---|---|
| Platforms studied | ||||
| General social media or screen time | 36 (87.8) | 235 (81.9) | 38 (77.6) | 292 (82.3) |
| Facebook | 0 (0.0) | 26 (9.1) | 8 (16.3) | 32 (9.0) |
| Instagram | 3 (7.3) | 9 (3.1) | 1 (2.0) | 12 (3.4) |
| Combination of platforms | 1 (2.4) | 6 (2.1) | 2 (4.1) | 6 (1.7) |
| TikTok | 1 (2.4) | 3 (1.0) | 0 (0.0) | 4 (1.1) |
| | 0 (0.0) | 4 (1.4) | 0 (0.0) | 4 (1.1) |
| Inconclusive or unspecified | 0 (0.0) | 3 (1.0) | 0 (0.0) | 4 (1.1) |
| YouTube | 0 (0.0) | 1 (0.3) | 0 (0.0) | 1 (0.3) |
| Health outcomes measured | ||||
| Generalised mental health | 15 (36.6) | 106 (36.9) | 24 (49.0) | 137 (38.6) |
| Depression | 2 (4.9) | 28 (9.8) | 7 (14.3) | 53 (14.9) |
| Body dissatisfaction | 2 (4.9) | 17 (5.9) | 3 (6.1) | 21 (5.9) |
| Wellbeing or life satisfaction | 3 (7.3) | 17 (5.9) | 3 (6.1) | 20 (5.6) |
| Sleep outcomes | 2 (4.9) | 13 (4.5) | 0 (0.0) | 15 (4.2) |
| Eating disorders or body dysmorphia | 1 (2.4) | 6 (2.1) | 3 (6.1) | 10 (2.8) |
| Generalised physical health | 1 (2.4) | 7 (2.4) | 1 (2.0) | 8 (2.3) |
| Suicidal ideation, attempts, suicide | 2 (4.9) | 6 (2.1) | 1 (2.0) | 8 (2.3) |
| Isolation and loneliness | 2 (4.9) | 3 (1.0) | 1 (2.0) | 6 (1.7) |
| Child sexual abuse | 0 (0.0) | 5 (1.7) | 1 (2.0) | 5 (1.4) |
| Anxiety | 1 (2.4) | 3 (1.0) | 0 (0.0) | 4 (1.1) |
| Drug or alcohol addiction | 1 (2.4) | 2 (0.7) | 1 (2.0) | 4 (1.1) |
| Neurological development | 1 (2.4) | 2 (0.7) | 1 (2.0) | 2 (0.6) |
| Social connections | 0 (0.0) | 1 (0.3) | 0 (0.0) | 1 (0.3) |
| Inconclusive or unspecified[70] | 8 (19.5) | 71 (24.7) | 3 (6.1) | 61 (17.2) |
| Focused on adolescent populations | ||||
| Yes | 34 (82.9) | 175 (61.0) | 32 (65.3) | 223 (62.8) |
| No mention | 5 (12.2) | 90 (31.4) | 12 (24.5) | 105 (29.6) |
| Inconclusive | 2 (4.9) | 22 (7.7) | 5 (10.2) | 27 (7.6) |
| Total documents | 41 (100.0) | 287 (100.0) | 49 (100.0) | 355 (100.0) |
Statistical tests revealed no significant organisational differences in platforms studied, health outcomes investigated, or focus on adolescent populations.[71]
Summary
The three reports appeared to select broadly similar types of scientific evidence, but contained little overlap with each other. Citation mapping revealed minimal differences in the methodological and thematic characteristics of cited evidence after robust statistical testing. However, only four publications (less than 1% of the unique academic literature) appeared in all three reports, with a further 20 shared between two reports. This fragmentation might reflect methodological differences, distinct disciplinary foci, or the inherently fragmented nature of a rapidly growing research area.
The patterns identified through this analysis reflect broader methodological and thematic constraints well documented by experts in the social media field, including reliance on correlational methodologies and under-specification of both platforms and outcomes. We found that 75–90% of cited work on social media and mental health did not specify which platforms were studied, while over 50% examined general or unspecified mental health outcomes. Only 17% of cited work employed methodologies that could plausibly support causal inferences. These findings echo calls from field experts for future research to prioritise specificity and avoid treating social media use as homogeneous.
75–90% of cited work on social media and mental health did not specify which platforms were studied, while over 50% examined general or unspecified mental health outcomes.
The “evidence profiles” of each report can best be described as follows: the OSG report citations appear highly focused and selective of influential work, the NASEM report citations demonstrate comprehensive breadth across diverse research areas, while the APA reports’ cited work shows lower overall engagement with the literature and is highly self-referential. This raises important questions about evidence readiness for strong policy recommendations from this literature, and suggests that divergences emerge instead from how organisations synthesised, contextualised, and communicated similar bodies of research. This constitutes the focus of our subsequent qualitative analysis.
How is evidence synthesised and communicated?
Methods
Having established that these reports selected broadly similar types of research evidence, we now examine how they synthesised and communicated it differently, using qualitative analysis. The analysis operates at two levels: (1) individual evidence use (how specific studies are cited and interpreted) and (2) broader evidence engagement patterns across each report. Building on Elson et al.'s[72] framework identifying common problems in evidence translation, we developed a coding scheme addressing citation accuracy, evidence integration practices, and rhetorical construction of conclusions. Two researchers initially worked collaboratively to establish protocols and refine categories before systematic coding. Following calibration, the primary coder (SLH) completed analysis using NVivo qualitative software, with themes refined through iterative rounds of coding. The complete coding schema and list of included examples are given in Online Appendix 4D. Given the qualitative nature of this analysis and substantial variation in document length (APA: 6–11 pages; NASEM: 287 pages; OSG: 25 pages), quantification of observed instances should be interpreted cautiously. Our findings identify practices that appeared more frequently in certain reports, with examples provided to illustrate these patterns, rather than providing a comprehensive catalogue of all instances, or a full citation audit. Complete methodological details are available in Online Appendix 4D.
Results
Thematic analysis identified three distinct dimensions along which the reports varied in their evidence translation practices: (1) citation accuracy and contextual detail, (2) engaging with complexity in evidence integration, and (3) calibrating certainty to conclusion strength. These patterns emerged through systematic application of the coding framework to all reports, examining both individual citation fidelity and broader organisational approaches to evidence synthesis. Variations ranged from rigorous scientific communication practices that maintained research fidelity to concerning patterns that, in our view, at times misrepresented scientific evidence.
Some of the most concerning instances involved citations that appeared to be unrelated to the claim they were used to substantiate.
Citation accuracy and detail
Analysis revealed substantial variation in how the reports handled individual citations. Some of the most concerning instances involved citations that appeared to be unrelated to the claim they were used to substantiate. For example, the APA report asserted that “infinite scroll is particularly risky for youth since their ability to monitor and stop engagement on social media is more limited than among adults” (APA 2024, p. 2). The citation supporting this claim investigated factors influencing adolescents’ development of effortful control among Mexican-American teenagers, finding that those who experience more hostility from their parents, live in more violent neighbourhoods, or experience more ethnic discrimination tend to exhibit an exacerbated dip in effortful control in their mid-teen years.[73] While the most generous interpretation might connect this research to claims about adolescent self-regulation abilities, the cited study made no conclusions about social media, infinite scroll features, or comparative abilities between youth and adults. Similarly, the claim that “lack of time limits on social media use” (APA 2024, p. 2) is challenging for youth cites studies investigating laptop distraction in classrooms that did not examine social media or mention time limits.
However, most forms of citation inaccuracy involved subtler misrepresentations of findings. For example, the OSG report claimed that graphic self-harm content “can normalise such behaviors,” (OSG, p. 8) citing research that also found social media platforms “commonly supportive and provided a sense of community among users” offering “suggestions for formal treatment, advice on stopping self-harming behavior, and encouragement.”[74] Presenting only the negative findings while omitting supportive aspects of what is clearly a highly nuanced piece of research reflects an oversimplification of the cited work, leaving readers with a misleading impression of its conclusions.
In terms of broader simplification patterns, we found considerable differences in contextual detail given when citing research. The difference between organisational approaches is best exemplified through varying levels of specificity conveying the same piece of evidence. Discussing a large UK study,[75] the NASEM report specified that “the power of social media to influence well-being depended on developmental stage, with girls between ages 11 and 13 and boys between ages 14 and 15 to be in particularly sensitive window” (NASEM, p. 95), compared to the APA report’s less-detailed approach to the same study: “potential risks are likely to be greater in early adolescence” (APA 2023, p. 3), omitting the developmental specificity and gender differences identified in the original research.
A common pattern in simplification involved generalising studies across distinct contexts to make broad claims. For example, the APA report writes “data suggest that youths’ psychological development may benefit from this type of online social interaction [social support and online companionship], particularly during periods of social isolation” (APA 2023, p. 4), citing four studies that investigated: positive and negative online experiences during COVID-19, experiences of Peruvian youth online, TikTok-based treatment adherence for young adults with chronic conditions, and social media self-disclosure among users with Type 1 Diabetes. While individual aspects of these studies support some aspects of the broader claim, this conflation obscures specific findings and inappropriately generalises across distinct contexts to make a broad claim about social support benefits for development.
Engaging with complexity
Moving beyond individual citations, organisations also demonstrated markedly different approaches to synthesising and evaluating the overall evidence base. The NASEM report showed the highest engagement with limitations and contradictory evidence, compared to moderate engagement in the OSG report and the least engagement in APA reports. All reports acknowledged broader constraints in the social media research field, for instance noting that “relatively few studies have been conducted with marginalized populations of youth” (APA 2023, p. 3). However, limitation acknowledgment for individual studies varied substantially. The NASEM report regularly qualified findings: "A 2018 meta-analysis found mobile phone use in the classroom to modestly interfere with student learning and academic performance, although this small effect was driven more by undergraduates than K through 12 students” (p. 101), acknowledging both limited effect size and population specificity affecting generalisation.
Engagement with disconfirmatory evidence also varied considerably. The NASEM report consistently presented contradictory findings even when they complicated organisational positions. After discussing the potential impacts of laptop use in college classes on neighbours learning, the NASEM report noted: “That said, an experiment banning laptops from undergraduate classrooms found no benefit, even a possible detriment to learning (as students in the no-laptop group simply did not come to class)” (p. 101). This demonstrated tolerance for presenting genuinely mixed evidence rather than constructing apparent consensus through selective emphasis. Other reports emphasised supportive evidence while minimising contradictory findings, occasionally acknowledging claims as held by “some researchers” (OSG, p. 9) without fully engaging with contested positions.
Calibrating certainty to conclusion strength
The culmination of these different evidence translation choices shaped how each report navigated communicating scientific uncertainty into policy guidance, resulting in clear differences between the reports’ overall rhetorical angle and the strength of their conclusions. Reports varied substantially in the language modality signalling confidence levels. The claim that engagement metrics “likely lead to problematic use” (APA 2024, p. 2) was not linked to any cited literature. Vague quantification such as “significant negative effects” or “substantial data” (APA 2024, p. 3) limited readers’ ability to evaluate conclusions independently. Frequent references to undefined “harms” further obscured the specific risks under discussion.
These choices culminated in different overall positions. The NASEM report concluded that evidence “did not support the conclusion that social media causes changes in adolescent health at the population level” (p. 94) and noted they “sympathise with some parents’ desire for authoritative prescriptions[…] but [are] mindful of overreaching the data” (p. 120). In contrast, OSG concluded there was insufficient evidence “to conclude that [social media] is sufficiently safe” (p. 11), characterised children as “unknowing participants in a decades-long experiment” (p. 11) and advocated that “we must urgently take action,” (p. 4) concluding that “the evidence noted throughout this Surgeon General’s Advisory necessitates significant concern with the way it is currently designed, deployed, and utilised” (p. 13).
Summary
The aggregation of these practices creates distinct organisational profiles for evidence translation. The NASEM report’s approach, characterised by high limitation acknowledgement, detailed contextual explanations, and consistent engagement with disconfirmatory evidence, reflected extensive methodological transparency. The APA reports’ pattern of citation clumping, minimal limitation acknowledgement, and occasional inappropriate definitiveness suggested a prioritisation of clear, actionable guidance over methodological detail. The OSG report demonstrated more moderate limitation acknowledgement, but occasionally gave selective presentations of findings. Together, these dimensions reveal distinct organisational approaches to balancing scientific rigour with public accessibility and policy urgency, at times veering into concerning citation inaccuracies warranting critical attention.
Discussion
Adolescent mental health and social media engagement is an area of active scientific study[76] and it is critical that policymakers benefit from evidence syntheses which present the clearest and highest quality analysis of what is and is not empirically known. To that end, our analysis investigated how three key US professional reports on social media and adolescent mental health used scientific evidence to formulate policy recommendations. We found that divergent policy conclusions did not stem primarily from systematic differences in the types of evidence cited — citation mapping revealed all three reports cited broadly similar types of peer-reviewed research with comparable methodological and thematic characteristics. However, we found considerable heterogeneity in how organisations framed evidence and integrated studies. These findings suggest that synthesis quality among professional organisations warrants greater scrutiny than it typically receives, with implications for evidence-based policy in domains characterised by scientific uncertainty and high public stakes.

Balancing nuance, uncertainty, and utility
The patterns we observed reflect different approaches to handling the challenge of translating complex, uncertain research into clear policy guidance. This translation work involves important trade-offs that organisations must balance between policy utility and scientific specificity.
Evidence synthesis efforts can be understood as existing along a continuum from rigour and comprehensiveness to simplicity and accessibility. Simplified presentations are significantly easier for policymakers and the public to digest and act upon, though as documented throughout this work, this accessibility can sometimes come at the cost of inappropriate overgeneralisation of research findings. NASEM’s comprehensive 287-page report reflects a preference for scientific precision, conveying extensive methodological transparency, but its length also creates substantial barriers to practical engagement. APA and OSG’s briefer formats (6–25 pages) prioritise clear, actionable recommendations but risk oversimplifying a complex and developing research area.
Evidence-based policy reviews take a wide range of forms, serving different audiences and institutional contexts. Strategic simplification is understandable given the practical constraints organisations face: policymakers require guidance they can act upon, public audiences need accessible communication, and lengthy technical reports risk being ignored regardless of scientific merit. Some variation in communication approach therefore reflects legitimate choices about format, audience needs, and the balance between comprehensiveness and utility. In most cases, the variation we observed reflects different organisational tolerance for expressing uncertainty and ambiguity versus preferences for clarity and simplicity in translational choices. This is a hard balance to strike, and much of the variation we documented represents reasonable differences in how organisations navigate these competing demands.
A central aspect of engaging with complexity involves how organisations navigate scientific uncertainty when providing policy guidance. It is important to acknowledge that scientific uncertainty represents intellectual honesty rather than inadequacy. Complex social phenomena — including the relationship between social media use and adolescent wellbeing — often yield findings that are nuanced, context-dependent, and resistant to simple characterisation.[77] When evidence is genuinely mixed or methodologically limited, intellectual honesty about what we do and do not know with confidence serves evidence-based policy better than premature consensus forced through selective emphasis or rhetorical construction of certainty.
This stance does not render scientific evidence unhelpful for policy: policy decisions necessarily proceed despite incomplete information. The question is not whether to act under uncertainty, but how to characterise the evidentiary foundation for different courses of action. Honest synthesis that acknowledges what remains uncertain or contested provides essential information for policy deliberation about appropriate responses under conditions of imperfect knowledge. In contrast, synthesis that obscures complexity or transforms tentative findings into definitive conclusions can misrepresent the state of knowledge and undermine the foundation for informed deliberation. The different approaches to conveying contextual detail and study specifics that we observed reflect varying organisational judgments about how to handle this uncertainty in policy-relevant communication.
However, regardless of format or audience, it is important to promote rigorous quality standards when dealing with complex high-stakes evidence. The differences we observed, particularly in terms of citation accuracy and contextualisation, affect whether readers can independently assess the evidentiary basis for recommendations or must accept claims on organisational authority alone. Our findings suggest considerable variation in how different organisations conducted this synthesis work. Given that all three organisations examined here are well-resourced, scientifically oriented, and operate with explicit mandates to provide authoritative evidence synthesis, it is perhaps reasonable to expect greater consistency in the quality and fidelity of scientific communication. This is important research addressing complex policy questions that affect vulnerable populations, where quality ought to be consistently high.
Some of our observations here are particularly worrying, even if rare. Inaccurate citations where claims are not supported by referenced evidence cannot be justified as a legitimate communication choice. Nor can selectively omitting contradictory findings or major limitations that would fundamentally alter interpretation of research conclusions. Such practices are better characterised as using evidence as “ammunition”, a term coined by Weiss to describe invoking the authority of science to legitimise conclusions without substantively engaging with their complexity.[78] This is distinguished from genuinely rigorous evidence-based syntheses, which require maintaining fidelity to source material, honest acknowledgment of uncertainty, and thoughtful treatment of contradictory findings.
Researchers and other end users should have more or less confidence in these reports based on whether they demonstrate citation accuracy, balanced presentation of evidence, and appropriate qualification of conclusions.
When we systematically compared these reports, differences in translation quality became evident. It is clear that researchers and other end users should have more or less confidence in these reports based on whether they demonstrate citation accuracy, balanced presentation of evidence, and appropriate qualification of conclusions. All reports have their limitations, but for high-stakes research in domains affecting adolescent wellbeing, the standards ought to be consistently higher. The substantial variation in approaches to balancing nuance and practical utility suggests that greater care should be taken when translating scientific evidence into policy recommendations. This is particularly important in contested domains characterised by methodological limitations and genuine scientific uncertainty, as is the case for social media and adolescent mental health.
Implications
Effective evidence-based policy development requires clear and accurate information on what science can and cannot tell us about key questions of societal importance. Scientific evidence represents one input among many in democratic policy deliberations: it will be weighed alongside political considerations, stakeholder perspectives, and value judgments about social priorities. But the knowledge we derive from the scientific method can only be as strong as an honest reading of the evidence we have.[79]
Low-quality evidence synthesis carries material costs for policymaking. If evidence is mischaracterised or selectively presented, policies may be poorly calibrated to what is scientifically known. For example, some experts have warned that strict age-based access restrictions for social media use risk disadvantaging young people who benefit from online connection while failing to address actual mechanisms of potential harm.[80] If syntheses misrepresent the state of knowledge, even unintentionally, the resulting policies may target the wrong outcomes or lack strong empirical foundation for their likely effectiveness.
Beyond immediate policy impacts, synthesis quality matters for maintaining public confidence in scientific institutions and the broader credibility of evidence-informed policymaking. When synthesis processes are revealed to mischaracterise research findings — such as through citation inaccuracies, selective omission of contradictory evidence, or inappropriate certainty about contested findings — this can affect public perception of scientific reliability more broadly.[81][82]
Many types of input inform democratic policymaking, including advocacy, anecdote, and values-based arguments about social priorities.[83] In this chapter, we have focused specifically on scientific research evidence as it is translated into policy guidance. As scientists committed to rigorous, transparent, and replicable approaches to understanding complex phenomena, we have a responsibility to consistently uphold standards that justify claims to scientific authority and to identify opportunities for improving practices within our community.
Good governance of evidence
The value of high-quality evidence synthesis has been recognised in many domains. Indeed, scholars have increasingly emphasised the need for improved rigour in the primary social and psychological sciences through reproducibility, transparency, and robust methodological practices.[84] Different fields have developed distinct approaches to evaluating evidence quality: systematic reviews and meta-analyses follow formal protocols (such as PRISMA and the Cochrane guidelines), while medicine employs established hierarchies for assessing evidence strength. The social sciences face different methodological constraints, prompting calls for field-appropriate frameworks such as “evidence-readiness levels” that consider replication status and theoretical grounding alongside study design.[85] However, as our findings highlight, selecting high-quality evidence is only part of the challenge. As Parkhurst notes, “good evidence” must be paired with the “good governance of evidence”.[86] Some domains have developed best-practice guidance for authoritative bodies conducting science syntheses.[87]
Our analysis identified six practices which, where present, indicated good governance of the scientific literature.
First, citation accuracy and fidelity, the clear correspondence between specific claims and the cited studies, was the most fundamental standard of good evidence governance. Less rigorous synthesis contained citation inaccuracies or subtler misrepresentations of evidence. Even though strictly inaccurate citations were rare, their presence at all was worrying. Better reports avoided using evidence as “ammunition” and engaged precisely with every piece of cited literature.
Secondly, strong synthesis provided suitable contextualisation of cited studies, including information about methodology, sample characteristics, and specific findings. Less detailed work cited multiple studies for a single claim without substantive discussion of their content, seeming to use citations decoratively rather than to support a specific knowledge claim. At times, research was generalised very broadly across diverse national, cultural, ethnic, or other contexts. Better instances of evidence integration were highly specific about the contexts in which studies were conducted, enabling an accurate representation of the underlying research.
Thirdly, good governance was characterised by systematic acknowledgement of limitations that affect how a study, or a field of studies, should be interpreted, including practical effect sizes, constraints on generalisability, and methodological weaknesses. Reports that convey field-wide limitations can help their readers qualify the findings of individual studies and gain a more accurate picture of the state of scientific knowledge as it grows and develops over time.
Fourthly, reports that engaged with disconfirmatory evidence and tolerated presenting contradictory findings enacted better evidence governance than those that sought to construct apparent consensus through selective emphasis. When research yielded mixed results, rigorous work acknowledged this complexity rather than highlighting only supportive evidence.
Fifthly, the best syntheses used language appropriately calibrated to confidence in the underlying research. Weaker approaches demonstrated inappropriate definitiveness, using vague terms such as “significant effects” or discussing unspecified “harms”. In general, better reports erred on the more cautious end of the spectrum, demonstrating intellectual humility and avoiding unduly certain statements.
Finally, high-quality reviews documented how evidence was identified, selected, and evaluated, displaying process transparency that covered committee composition, funding sources, any conflicts of interest, and the scope of literature reviewed. Readers should be able to understand the aims and goals of the authoring organisations so that they can critically calibrate their level of trust in these types of reports.
Table 4.5 summarises these practices and tentatively suggests some practical ways in which they might be better achieved. Some of these approaches are simple and feasible, whereas others are ideals requiring some level of systemic change.
| Practice observed | Indicator of quality | Possible implementation approaches |
|---|---|---|
| Citation accuracy and fidelity | Clear correspondence between specific claims and cited studies; absence of subtle misrepresentation | Dedicated verification staff (cf. legal scholarship)*; external citation audits; citation styles requiring authors to specify how sources support claims** |
| Suitable contextualisation | Substantive discussion of methodology, sample characteristics, and specific findings for cited studies; citations used to support specific knowledge claims rather than decoratively | Structured presentation moving from most to least relevant context (e.g., national context first, then international evidence)*; required methodological summaries for key cited studies** |
| Systematic limitation acknowledgement | Explicit discussion of practical effect sizes, generalisability constraints, and methodological limitations that affect interpretation | Standardised limitation reporting requirements*; consideration of publication bias analogous to systematic review protocols** |
| Engagement with disconfirmatory evidence | Acknowledgment of mixed findings and contradictory evidence; tolerance for complexity rather than construction of apparent consensus | Explicit statement of what evidence would change authors' conclusions*; adversarial collaboration or intentional recruitment of authors with opposing views** |
| Language is appropriately calibrated to confidence | Precision in communicating uncertain findings; avoidance of inappropriate definitiveness and vague terms like "significant effects" | Editorial review specifically for overclaiming*; standardised confidence language (cf. IPCC uncertainty guidance)** |
| Process transparency | Documentation of how evidence was identified, selected, and evaluated; disclosure of committee composition, conflicts of interest, and scope | Development of reporting checklists for reports analogous to Cochrane protocols or PRISMA*; public pre-registration of synthesis scope and methods** |
Note: * indicates lower-resource approaches; ** indicates more resource-intensive or systemic changes.
These six practices are consistent with those proposed by other scholars almost a decade ago, who recommended that professional organisations acknowledge disconfirmatory data, focus on effect sizes, acknowledge methodological limitations, solicit balanced views, avoid secondary sources, distinguish scientific statements from advocacy statements, be mindful of unintended harms, and prioritise open science practices.[88] The persistence of these issues, first documented by Elson and colleagues, suggests that translation quality in policy statements still requires attention.

Limitations and future directions
We would be remiss not to discuss our own limitations thoroughly.[89] Further methodological particulars are available in the online appendices.
Scope constraints
Our analysis examined the reports of three US-based organisations during a specific period (2023–2024) when social media regulation received heightened political attention, potentially creating unique pressures that may not apply during routine policy development. Findings may not generalise to non-US organisations, other types of scientific advisory bodies, or policy domains with different characteristics. The focus on total cited evidence, rather than distinguishing citations that directly supported organisational arguments from those cited for critique or context, represents another constraint. This approach cannot distinguish between confirmatory and disconfirmatory treatment of evidence (e.g., a report might include studies primarily to critique them, affecting quality metrics in ways that do not reflect actual selection patterns).
Further, we did not systematically examine how organisations reasoned from evidence to specific policy recommendations. In other words, we did not judge whether proposed interventions were themselves supported by empirical evidence. Identifying that a problem exists — whether rightly or wrongly — does not necessitate or justify any particular policy action. Intervention science examines such questions systematically, comparing policy approaches and their likely effects. Future research examining this dimension would provide important insights into the complete evidence-to-policy chain, evaluating not only how organisations synthesised evidence about problems but also how they justified specific interventions as appropriate responses.
Methodological limitations
Our classification approach for coding evidence characteristics, despite achieving high overall accuracy in validation testing, produced a number of “inconclusive” classifications that restricted the precision of statistical analysis for the “effects” category.[90] Future studies might employ hybrid approaches combining automated classification with comprehensive human verification to improve reliability. More fundamentally, document-based analysis cannot capture deliberative processes, such as committee discussions, internal organisational pressures, or external political factors, that may have significantly influenced synthesis choices but are invisible in final outputs.
Our qualitative analysis reflects analytical decisions about how to characterise evidence translation practices. As widely noted, qualitative research is poorly suited to direct replication efforts, but our reporting nonetheless aims for systematic transparency in our analytical choices.[91] Different researchers might construct alternative categorisation schemes revealing different patterns. Our qualitative analysis is not a comprehensive catalogue of every instance in which a theme was identified and is not intended as a complete citation audit. As a result, there may be other instances of each category within the reports that other researchers might identify.
Additionally, resource constraints limited full comparative analysis to a single coder following collaborative protocol development. We welcome future research appraising quality patterns in evidence use for professional science organisations reviewing social media and mental health research, whether using our framework or developing alternative approaches (see Online Appendix 4D for more detailed qualitative methodology).
Future directions
These limitations suggest several promising research directions. First, investigating the deliberative processes through ethnographic observation of evidence synthesis committees or interviews with committee members could reveal how organisational context, political pressures, and interpersonal dynamics shape synthesis choices in ways not visible in final documents. Second, examining how different stakeholders, including policymakers, practitioners, advocacy groups, and journalists, actually use and interpret translated evidence in policy processes would illuminate how much the synthesis quality differences we identified matter for downstream policy development and public understanding. Third, systematically comparing evidence bases invoked to support specific policy interventions would extend our focus beyond problem characterisation to intervention justification. Fourth, applying similar analytical frameworks to other policy domains (climate science, public health, education policy) would assess whether patterns we identified in social media and adolescent mental health research reflect broader challenges in evidence synthesis or domain-specific issues. Additionally, longitudinal analysis tracking how synthesis practices evolve as research fields mature and evidence accumulates could identify factors that promote improvement in synthesis quality over time. Finally, it would be beneficial to track whether differences in synthesis report styles or qualities predict uptake by government or policy leaders. This might enable a better understanding of whether highly detailed and balanced reports are leveraged by policymakers more or less than shorter, simplified, or perhaps over-confident, reports.
Conclusions
Making the best use of scientific knowledge to inform effective policy is challenging. Today, the question of social media engagement and adolescent mental health is framed mainly in terms of producing high-quality original research,[92] but less emphasis has been placed on how to rigorously translate this literature into coherent policy recommendations. Our study represents a step towards developing higher standards for this process. We demonstrated that there are considerable qualitative differences in how well organisations synthesise and communicate evidence. Indeed, many of the practices we identified, like fidelity to source material, precision in citations, and intellectual humility in the face of uncertainty, distinguish scientific synthesis from other aspects of policymaking such as lobbying and advocacy. If history is any guide, today’s focus on young people and social media will, in time, give way to new hopes and concerns about augmented reality and artificial intelligence. It is our sincere hope that when this time comes, those producing and using scientific research will understand that taking these questions of societal importance seriously requires as much rigour in the interpretation and communication of our science as in its creation.
References
Adjei, N. K., Schlüter, D. K., Straatmann, V. S., Melis, G., Fleming, K. M., McGovern, R., Howard, L. M., Kaner, E., Wolfe, I., & Taylor-Robinson, D. C. (2021). Impact of poverty and family adversity on adolescent health: A multi-trajectory analysis using the UK Millennium Cohort Study. The Lancet Regional Health - Europe, 13, 100279. https://doi.org/10.1016/j.lanepe.2021.100279
Alonzo, R., Hussain, J., Stranges, S., & Anderson, K. K. (2021). Interplay between social media use, sleep quality, and mental health in youth: A systematic review. Sleep Medicine Reviews, 56, 101414. https://doi.org/10.1016/j.smrv.2020.101414
Anvari, F., & Lakens, D. (2018). The replicability crisis and public trust in psychological science. Comprehensive Results in Social Psychology, 3(3), 266–286. https://doi.org/10.1080/23743603.2019.1684822
APA (American Psychological Association). (2021). APA guidelines on evidence-based psychological practice in health care. https://www.apa.org/about/policy/psychological-practice-health-care.pdf
APA (American Psychological Association). (2023). Health advisory on social media use in adolescence. https://www.apa.org/topics/social-media-internet/health-advisory-adolescent-social-media-use.pdf
APA (American Psychological Association). (2024). Potential risks of content, features, and functions: The science of how social media affects youth. https://www.apa.org/topics/social-media-internet/youth-social-media-2024
Appel, M., Marker, C., & Gnambs, T. (2020). Are social media ruining our lives? A review of meta-analytic evidence. Review of General Psychology, 24(1), 60–74. https://doi.org/10.1177/1089268019880891
Atherton, O. E., Lawson, K. M., & Robins, R. W. (2020). The development of effortful control from late childhood to young adulthood. Journal of Personality and Social Psychology, 119(2), 417–456. https://doi.org/10.1037/pspp0000283
Bastow, S., Dunleavy, P., & Tinkler, J. (2014). Social Science for a Digital Era. In The Impact of the Social Sciences: How Academics and Their Research Make a Difference (pp. 271–294). SAGE Publications Ltd. https://doi.org/10.4135/9781473921511
Bogenschneider, K., & Corbett, T. (2021). Evidence-Based Policymaking: Envisioning a New Era of Theory, Research, and Practice (2nd ed.). Routledge. https://doi.org/10.4324/9781003057666
Boniel-Nissim, M., Marino, C., Galeotti, T., Blinka, L., Ozoliņa, K., Craig, W., Lahti, H., Wong, S. L., Brown, J., Wilson, M., & Inchley, J. (2024). A focus on adolescent social media use and gaming in Europe, central Asia and Canada. World Health Organization. https://iris.who.int/handle/10665/378982
Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology, 3(2), 77–101. https://www.tandfonline.com/doi/abs/10.1191/1478088706qp063oa
Cairney, P. (2016). The Politics of Evidence-Based Policy Making. Palgrave Macmillan UK. https://doi.org/10.1057/978-1-137-51781-4
Cairney, P. (2022). The myth of ‘evidence-based policymaking’ in a decentred state. Public Policy and Administration, 37(1), 46–66. https://doi.org/10.1177/0952076720905016
Cairney, P., & Oliver, K. (2017). Evidence-based policymaking is not like evidence-based medicine, so how far should you go to bridge the divide between evidence and policy? Health Research Policy and Systems, 15(1), 35. https://doi.org/10.1186/s12961-017-0192-x
Calvin, K., Dasgupta, D., Krinner, G., Mukherji, A., Thorne, P. W., Trisos, C., Romero, J., Aldunce, P., Barrett, K., Blanco, G., Cheung, W. W. L., Connors, S., Denton, F., Diongue-Niang, A., Dodman, D., Garschagen, M., Geden, O., Hayward, B., Jones, C., … Péan, C. (2023). IPCC, 2023: Climate Change 2023: Synthesis Report. Contribution of Working Groups I, II and III to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change [Core Writing Team, H. Lee and J. Romero (eds.)]. Intergovernmental Panel on Climate Change (IPCC). https://doi.org/10.59327/IPCC/AR6-9789291691647
Clarke, B., Alley, L. J., Ghai, S., Flake, J. K., Rohrer, J. M., Simmons, J. P., Schiavone, S. R., & Vazire, S. (2024). Looking our limitations in the eye: A call for more thorough and honest reporting of study limitations. Social and Personality Psychology Compass, 18(7), e12979. https://doi.org/10.1111/spc3.12979
Cologna, V., Mede, N. G., Berger, S., Besley, J., Brick, C., Joubert, M., Maibach, E. W., Mihelj, S., Oreskes, N., Schäfer, M. S., van der Linden, S., Abdul Aziz, N. I., Abdulsalam, S., Shamsi, N. A., Aczel, B., Adinugroho, I., Alabrese, E., Aldoh, A., Alfano, M., … Zwaan, R. A. (2025). Trust in scientists and their role in society across 68 countries. Nature Human Behaviour, 9(4), 713–730. https://doi.org/10.1038/s41562-024-02090-5
Coyne, P., Voth, J., & Woodruff, S. J. (2023). A comparison of self-report and objective measurements of smartphone and social media usage. Telematics and Informatics Reports, 10, 100061. https://doi.org/10.1016/j.teler.2023.100061
Davies, H. T. O., Nutley, S. M., & Smith, P. C. (Eds.). (2000). What works?: Evidence-based policy and practice in public services (1st ed.). Bristol University Press. https://doi.org/10.2307/j.ctt1t892t3
Dunnwald, M., DeLeon, V. B., & Burrows, A. M. (2025). The importance of science communication and public engagement to professional associations. Anatomical Sciences Education, 18(12), 1440–1446. https://doi.org/10.1002/ase.70017
Dyson, M. P., Hartling, L., Shulhan, J., Chisholm, A., Milne, A., Sundar, P., Scott, S. D., & Newton, A. S. (2016). A systematic review of social media use to discuss and view deliberate self-harm acts. PLOS ONE, 11(5), e0155813. https://doi.org/10.1371/journal.pone.0155813
Elson, M., Ferguson, C. J., Gregerson, M., Hogg, J. L., Ivory, J., Klisanin, D., Markey, P. M., Nichols, D., Siddiqui, S., Wilson, J., & (Division 46 News Media, Public Education and Public Policy Committee). (2019). Do Policy Statements on Media Effects Faithfully Represent the Science? Advances in Methods and Practices in Psychological Science, 2(1), 12–25. https://doi.org/10.1177/2515245918811301
European Commission. (2024). Living guidelines on the responsible use of generative AI in research. EU. https://research-and-innovation.ec.europa.eu/document/2b6cf7e5-36ac-41cb-aab5-0d32050143dc_en
Feng, G. C. (2024). Best practices for responsibly using AI tools in social sciences research. Cogent Social Sciences, 10(1), 2420484. https://doi.org/10.1080/23311886.2024.2420484
Ferguson, C. J. (2020). Aggressive video games research emerges from its replication crisis (Sort of). Current Opinion in Psychology, 36, 1–6. https://doi.org/10.1016/j.copsyc.2020.01.002
Ferguson, C. J., Kaye, L. K., Branley-Bell, D., & Markey, P. (2025). There is no evidence that time spent on social media is correlated with adolescent mental health problems: Findings from a meta-analysis. Professional Psychology: Research and Practice, 56(1), 73–83. https://doi.org/10.1037/pro0000589
Foulkes, L., & Andrews, J. L. (2023). Are mental health awareness efforts contributing to the rise in reported mental health problems? A call to test the prevalence inflation hypothesis. New Ideas in Psychology, 69, 101010. https://doi.org/10.1016/j.newideapsych.2023.101010
Galea, S., & Buckley, G. J. (2024). Social media and adolescent mental health: A consensus report of the National Academies of Sciences, Engineering, and Medicine. PNAS Nexus, 3(2), pgae037. https://doi.org/10.1093/pnasnexus/pgae037
Ghai, S., Magis-Weinberg, L., Stoilova, M., Livingstone, S., & Orben, A. (2022). Social media and adolescent well-being in the Global South. Current Opinion in Psychology, 46, 101318. https://doi.org/10.1016/j.copsyc.2022.101318
Ghai, S., Thériault, R., Forscher, P., Shoda, Y., Syed, M., Puthillam, A., Peng, H. C., Basnight-Brown, D., Majid, A., Azevedo, F., & Singh, L. (2025). A manifesto for a globally diverse, equitable, and inclusive open science. Communications Psychology, 3(1), 16. https://doi.org/10.1038/s44271-024-00179-1
Graham, I. D., Logan, J., Harrison, M. B., Straus, S. E., Tetroe, J., Caswell, W., & Robinson, N. (2006). Lost in knowledge translation: Time for a map? Journal of Continuing Education in the Health Professions, 26(1), 13. https://doi.org/10.1002/chp.47
Granic, I., Morita, H., & Scholten, H. (2020). Beyond Screen Time: Identity Development in the Digital Age. Psychological Inquiry, 31(3), 195–223. https://doi.org/10.1080/1047840X.2020.1820214
Gundersen, T. (2024). Trustworthy Science Advice: The Case of Policy Recommendations. Res Publica, 30(1), 125–143. https://doi.org/10.1007/s11158-023-09625-z
Hall, J. A., & Liu, D. (2022). Social media use, social displacement, and well-being. Current Opinion in Psychology, 46, 101339. https://doi.org/10.1016/j.copsyc.2022.101339
Hansard UK Parliament. (1999). Modernising Government White Paper. Hansard UK Parliament. https://hansard.parliament.uk/commons/1999-12-09/debates/d1ba4663-4d85-4d5d-8037-59ecc9e8619a/ModernisingGovernmentWhitePaper
Head, B. W. (2010). Reconsidering evidence-based policy: Key issues and challenges. Policy and Society, 29(2), 77–94. https://doi.org/10.1016/j.polsoc.2010.03.001
Head, B. W. (2016). Toward More “Evidence-Informed” Policy Making? Public Administration Review, 76(3), 472–484. https://doi.org/10.1111/puar.12475
HHS (U.S. Department of Health and Human Services). (2006, July 19). Reports and publications. https://www.hhs.gov/surgeongeneral/reports-and-publications/index.html
HHS (U.S. Department of Health and Human Services). (2023). Social media and youth mental health: The U.S. Surgeon General’s advisory. https://www.hhs.gov/surgeongeneral/priorities/youth-mental-health/social-media/index.html
Hickman, C., Marks, E., Pihkala, P., Clayton, S., Lewandowski, R. E., Mayall, E. E., Wray, B., Mellor, C., & Van Susteren, L. (2021). Climate anxiety in children and young people and their beliefs about government responses to climate change: A global survey. The Lancet Planetary Health, 5(12), e863–e873. https://doi.org/10.1016/S2542-5196(21)00278-3
Higgins, J., Thomas, J., Chandler, J., Cumpston, M., Li, T., & Page, M. (2024). Cochrane Handbook for Systematic Reviews of Interventions (version 6.5). www.cochrane.org/handbook
IJzerman, H., Lewis, N. A., Przybylski, A. K., Weinstein, N., DeBruine, L., Ritchie, S. J., Vazire, S., Forscher, P. S., Morey, R. D., Ivory, J. D., & Anvari, F. (2020). Use caution when applying behavioural science to policy. Nature Human Behaviour, 4(11), 1092–1094. https://doi.org/10.1038/s41562-020-00990-w
Joint Select Committee on Social Media and Australian Society. (2024). Social media: The good, the bad, and the ugly [Joint Select Committee Final Report]. Australian Parliament House. https://www.aph.gov.au/Parliamentary_Business/Committees/Joint/Social_Media_and_Australian_Society/SocialMedia/Final_report
Júdice, P. B., Sousa-Sá, E., & Palmeira, A. L. (2023). Discrepancies Between Self-reported and Objectively Measured Smartphone Screen Time: Before and During Lockdown. Journal of Prevention (2022), 44(3), 291–307. https://doi.org/10.1007/s10935-023-00724-4
Korbmacher, M., Azevedo, F., Pennington, C. R., Hartmann, H., Pownall, M., Schmidt, K., Elsherif, M., Breznau, N., Robertson, O., Kalandadze, T., Yu, S., Baker, B. J., O’Mahony, A., Olsnes, J. Ø.-S., Shaw, J. J., Gjoneska, B., Yamada, Y., Röer, J. P., Murphy, J., … Evans, T. (2023). The replication crisis has led to positive structural, procedural, and community changes. Communications Psychology, 1(1), 3. https://doi.org/10.1038/s44271-023-00003-2
Lakoff, S. (1991). The Fifth Branch: Science Advisers as Policymakers, by Sheila Jasanoff. Political Science Quarterly, 106(1), 138–140. https://doi.org/10.2307/2152184
Livingstone, S., & Pothong, K. (2022). Beyond screen time: Rethinking children’s play in a digital world. Journal of Health Visiting, 10(1), 32–38. https://doi.org/10.12968/johv.2022.10.1.32
Livingstone, S., & Ringmar Sylwander, K. (2025). There is no right age! The search for age-appropriate ways to support children’s digital lives and rights. Journal of Children and Media, 19(1), 6–12. https://doi.org/10.1080/17482798.2024.2435015
Livingstone, S., & Third, A. (2017). Children and young people’s rights in the digital age: An emerging agenda. New Media & Society, 19(5), 657–670. https://doi.org/10.1177/1461444816686318
Mansfield, K. L., Ghai, S., Hakman, T., Ballou, N., Vuorre, M., & Przybylski, A. K. (2025). From social media to artificial intelligence: Improving research on digital harms in youth. The Lancet Child & Adolescent Health, S2352464224003328. https://doi.org/10.1016/S2352-4642(24)00332-8
Mastrandrea, M. D., Mach, K. J., Plattner, G.-K., Edenhofer, O., Stocker, T. F., Field, C. B., Ebi, K. L., & Matschoss, P. R. (2011). The IPCC AR5 guidance note on consistent treatment of uncertainties: A common approach across the working groups. Climatic Change, 108(4), 675–691. https://doi.org/10.1007/s10584-011-0178-6
Maza, M. T., Fox, K. A., Kwon, S. J., Flannery, J. E., Lindquist, K. A., Prinstein, M. J., & Telzer, E. H. (2023). Association of habitual checking behaviors on social media with longitudinal functional brain development. JAMA Pediatrics, 177(2), 160–167. https://doi.org/10.1001/jamapediatrics.2022.4924
Meier, A., & Reinecke, L. (2021). Computer-Mediated Communication, Social Media, and Mental Health: A Conceptual and Empirical Meta-Review. Communication Research, 48(8), 1182–1209. https://doi.org/10.1177/0093650220958224
Miller, J. D. (2004). Public understanding of, and attitudes toward, scientific research: What we know and what we need to know. Public Understanding of Science, 13(3), 273–294. https://doi.org/10.1177/0963662504044908
Munafò, M. R., Nosek, B. A., Bishop, D. V. M., Button, K. S., Chambers, C. D., Percie du Sert, N., Simonsohn, U., Wagenmakers, E.-J., Ware, J. J., & Ioannidis, J. P. A. (2017). A manifesto for reproducible science. Nature Human Behaviour, 1(1), 0021. https://doi.org/10.1038/s41562-016-0021
National Academies of Sciences, Engineering, & Medicine. (2024). Social media and adolescent health. The National Academies Press. https://doi.org/10.17226/27396
National Coalition Against Censorship. (2013). Why Two Hundred Twenty-Eight Scholars Cautioned the APA. National Coalition Against Censorship. https://ncac.org/resource/why-two-hundred-twenty-eight-scholars-cautioned-the-apa
Nelson, C. A., Bhutta, Z. A., Harris, N. B., Danese, A., & Samara, M. (2020). Adversity in childhood is linked to mental and physical health throughout life. BMJ, 371, m3048. https://doi.org/10.1136/bmj.m3048
Nelson, L. D., Simmons, J., & Simonsohn, U. (2018). Psychology’s Renaissance. Annual Review of Psychology, 69(1), 511–534. https://doi.org/10.1146/annurev-psych-122216-011836
Nesi, J., & Prinstein, M. J. (2015). Using social media for social comparison and feedback-seeking: Gender and popularity moderate associations with depressive symptoms. Journal of Abnormal Child Psychology, 43(8), 1427–1438. https://doi.org/10.1007/s10802-015-0020-0
Neuman, B. (2018). The Role of Professional Associations in Promoting Robust Science. Industrial and Organizational Psychology, 11(1), 71–73. https://doi.org/10.1017/iop.2017.88
Nosek, B. A., Alter, G., Banks, G. C., Borsboom, D., Bowman, S. D., Breckler, S. J., Buck, S., Chambers, C. D., Chin, G., Christensen, G., Contestabile, M., Dafoe, A., Eich, E., Freese, J., Glennerster, R., Goroff, D., Green, D. P., Hesse, B., Humphreys, M., … Yarkoni, T. (2015). Promoting an open research culture. Science, 348(6242), 1422–1425. https://doi.org/10.1126/science.aab2374
Nutley, S., Walter, I., & Davies, H. T. O. (2009). Promoting Evidence-based Practice: Models and Mechanisms From Cross-Sector Review. Research on Social Work Practice, 19(5), 552–559. https://doi.org/10.1177/1049731509335496
Odgers, C. L., & Jensen, M. R. (2020). Annual Research Review: Adolescent mental health in the digital age: facts, fears, and future directions. Journal of Child Psychology and Psychiatry, 61(3), 336–348. https://doi.org/10.1111/jcpp.13190
OECD. (2018). Children and young people’s mental health in the digital age: Shaping the future. OECD Publishing. https://doi.org/10.1787/488b25e0-en
Ohme, J., Araujo, T., de Vreese, C. H., & Piotrowski, J. T. (2021). Mobile data donations: Assessing self-report accuracy and sample biases with the iOS Screen Time function. Mobile Media & Communication, 9(2), 293–313. https://doi.org/10.1177/2050157920959106
Oliver, K., Lorenc, T., & Innvær, S. (2014). New directions in evidence-based policy research: A critical analysis of the literature. Health Research Policy and Systems, 12(1), 34. https://doi.org/10.1186/1478-4505-12-34
Oliver, K., & Pearce, W. (2023). Under what conditions is science considered relevant and authoritative in policymaking? (Report on Science, Trust and Policy for the British Academy). Transforming Evidence for Policy and Practice. https://www.thebritishacademy.ac.uk/documents/5247/Oliver-and-Pearce-Science-Trust-and-Policy-Report.pdf
Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251), aac4716. https://doi.org/10.1126/science.aac4716
Orben, A., & Matias, J. N. (2025). Fixing the science of digital technology harms. Science, 388(6743), 152–155. https://doi.org/10.1126/science.adt6807
Orben, A., & Przybylski, A. K. (2019). The association between adolescent well-being and digital technology use. Nature Human Behaviour, 3(2), 173–182. https://doi.org/10.1038/s41562-018-0506-1
Orben, A., Przybylski, A. K., Blakemore, S.-J., & Kievit, R. A. (2022). Windows of developmental sensitivity to social media. Nature Communications, 13(1), 1649. https://doi.org/10.1038/s41467-022-29296-3
Oreskes, N. (2004). Science and public policy: What’s proof got to do with it? Environmental Science & Policy, 7(5), 369–383. https://doi.org/10.1016/j.envsci.2004.06.002
Oxford Communications. (2025). Guidelines on the use of generative AI. Oxford Communications. https://communications.admin.ox.ac.uk/communications-resources/ai-guidance
Page, M. J., McKenzie, J. E., Bossuyt, P. M., Boutron, I., Hoffmann, T. C., Mulrow, C. D., Shamseer, L., Tetzlaff, J. M., Akl, E. A., Brennan, S. E., Chou, R., Glanville, J., Grimshaw, J. M., Hróbjartsson, A., Lalu, M. M., Li, T., Loder, E. W., Mayo-Wilson, E., McDonald, S., … Moher, D. (2021). The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ, 372, n71. https://doi.org/10.1136/bmj.n71
Parkhurst, J. (2016). The Politics of Evidence: From evidence-based policy to the good governance of evidence. Routledge. https://doi.org/10.4324/9781315675008
Pew Research Center. (2025). Teens, Social Media and Mental Health. Pew Research Center. https://www.pewresearch.org/internet/2025/04/22/teens-social-media-and-mental-health/
Przybylski, A. K., & Vuorre, M. (2025). Where Science Meets Discourse: What a Flawed Commentary of Three Papers Can Teach Us About Research on Well-Being in the Digital Age (No. v54tp_v1). PsyArXiv. https://doi.org/10.31234/osf.io/v54tp_v1
Rapport, F., Clay‐Williams, R., Churruca, K., Shih, P., Hogden, A., & Braithwaite, J. (2018). The struggle of translating science into action: Foundational concepts of implementation science. Journal of Evaluation in Clinical Practice, 24(1), 117–126. https://doi.org/10.1111/jep.12741
Richards, J., Niitsu, K., & Kenworthy, N. (2025). Mental Health v. Social Media: How US pretrial filings against social media platforms frame and leverage evidence for claims of youth mental health harms. SSM - Mental Health, 7, 100378. https://doi.org/10.1016/j.ssmmh.2024.100378
Sanderson, I. (2002). Evaluation, Policy Learning and Evidence-Based Policy Making. Public Administration, 80(1), 1–22. https://doi.org/10.1111/1467-9299.00292
Science and Technology Committee. (2019). Impact of social media and screen-use on young people’s health. House of Commons, U.K. Parliament. https://publications.parliament.uk/pa/cm201719/cmselect/cmsctech/822/82202.htm
See, B. H. (2018). Evaluating the evidence in evidence-based policy and practice: Examples from systematic reviews of literature. Research in Education, 102(1), 37–61. https://doi.org/10.1177/0034523717741915
Sewall, C. J. R., & Parry, D. A. (2024). Social media and youth mental health: Simple narratives produce biased interpretations. Journal of Psychopathology and Clinical Science, 133(7), 507–514. https://doi.org/10.1037/abn0000950
Sigaud, L., Rausch, Z., McClean, A., & Haidt, J. (2025). How Three Studies by Vuorre and Przybylski May Have Obscured The Impact of Social Media on Youth Mental Health (SSRN Scholarly Paper No. 5196540). Social Science Research Network. https://doi.org/10.2139/ssrn.5196540
Simons, D. J., Shoda, Y., & Lindsay, D. S. (2017). Constraints on Generality (COG): A proposed addition to all empirical papers. Perspectives on Psychological Science, 12(6), 1123–1128. https://doi.org/10.1177/1745691617708630
Straus, S. E., Tetroe, J. M., & Graham, I. D. (2011). Knowledge translation is the use of knowledge in health care decision making. Journal of Clinical Epidemiology, 64(1), 6–10. https://doi.org/10.1016/j.jclinepi.2009.08.016
Sutton, T. (2020). Digital harm and addiction: An anthropological view. Anthropology Today, 36(1), 17–22. https://doi.org/10.1111/1467-8322.12553
University of York. (2025). Responsible AI Use in Research. University of York. https://www.york.ac.uk/staff/research/governance/research-policies/generative-ai/
Valkenburg, P. M., van Driel, I. I., & Beyens, I. (2021). The associations of active and passive social media use with well-being: A critical scoping review. New Media & Society, 24(2), 530–549. https://doi.org/10.1177/14614448211065425
Verbeij, T., Pouwels, J. L., Beyens, I., & Valkenburg, P. M. (2021). The accuracy and validity of self-reported social media use measures among adolescents. Computers in Human Behavior Reports, 3, 100090. https://doi.org/10.1016/j.chbr.2021.100090
Vuorre, M., Johannes, N., & Przybylski, A. (2022). Three objections to a novel paradigm in social media effects research. PsyArXiv. https://doi.org/10.31234/osf.io/dpuya
Vuorre, M., Orben, A., & Przybylski, A. K. (2021). There Is No Evidence That Associations Between Adolescents’ Digital Technology Engagement and Mental Health Problems Have Increased. Clinical Psychological Science, 9(5), 823–835. https://doi.org/10.1177/2167702621994549
Weiss, C. H. (1979). The Many Meanings of Research Utilization. Public Administration Review, 39(5), 426–431. https://doi.org/10.2307/3109916
Young, D., & Borland, R. (2011). Conceptual challenges in the translation of research into practice: It’s not just a matter of “communication.” Translational Behavioral Medicine, 1(2), 256–269. https://doi.org/10.1007/s13142-011-0035-1
Young, K., Ashby, D., Boaz, A., & Grayson, L. (2002). Social Science and the Evidence-based Policy Movement. Social Policy and Society, 1(3), 215–224. https://doi.org/10.1017/S1474746402003068
Zhao, Y., Han, X., Bagot, K. S., Tapert, S. F., Potenza, M. N., & Paulus, M. P. (2025). Examining measurement discrepancies in adolescent screen media activity with insights from the ABCD study. Npj Mental Health Research, 4(1), 15. https://doi.org/10.1038/s44184-025-00131-z
Zozaya, N., & Vallejo, L. (2020). The Effect of the Economic Crisis on Adolescents’ Perceived Health and Risk Behaviors: A Multilevel Analysis. International Journal of Environmental Research and Public Health, 17(2), 643. https://doi.org/10.3390/ijerph17020643
Endnotes
For example, see Science and Technology Committee (2019). ↩︎
Cairney and Oliver (2017); Rapport et al. (2018); Young and Borland (2011). ↩︎
HHS (U.S. Department of Health and Human Services) (2023, p. 15). ↩︎
HHS (U.S. Department of Health and Human Services) (2006). ↩︎
Przybylski and Vuorre (2025); Sigaud et al. (2025); Sewall and Parry (2024). ↩︎
Ferguson (2020); Livingstone and Ringmar Sylwander (2025). ↩︎
Cologna et al. (2025); Miller (2004); National Coalition Against Censorship (2013); Oreskes (2004). ↩︎
For reviews, see IJzerman et al. (2020); Mansfield et al. (2025); Vuorre et al. (2022). ↩︎
HHS (U.S. Department of Health and Human Services) (2023). ↩︎
For example, see, Joint Select Committee on Social Media and Australian Society (2024). ↩︎
Davies et al. (2000); Head (2010); Nutley et al. (2009); Sanderson (2002). ↩︎
Davies et al. (2000); Graham et al. (2006); See (2018); Straus et al. (2011). ↩︎
Cairney (2022); Oliver et al. (2014); Oliver and Pearce (2023). ↩︎
See Mansfield et al. (2025); Orben and Matias (2025); Vuorre et al. (2021). ↩︎
Adjei et al. (2021); Nelson et al. (2020); Zozaya and Vallejo (2020). ↩︎
Coyne et al. (2023); Júdice et al. (2023); Ohme et al. (2021). ↩︎
E.g., Granic et al. (2020); Livingstone and Pothong (2022); Orben and Przybylski (2019). ↩︎
Appel et al. (2020); Meier and Reinecke (2021); Orben and Przybylski (2019); Valkenburg et al. (2021); Vuorre et al. (2021). ↩︎
Clarke et al. (2024); Ghai et al. (2025); Munafò et al. (2017); Nelson et al. (2018); Nosek et al. (2015); Open Science Collaboration (2015); Simons et al. (2017). ↩︎
For instance, the Intergovernmental Panel on Climate Change (IPCC) established calibrated language frameworks specifying how to characterise and communicate uncertainty in assessment processes (Mastrandrea et al., 2011), providing a model for how organisations might standardise synthesis practices. ↩︎
Classification framework details: articles were classified by study methodology (experimental designs, observational studies, systematic reviews, meta-analyses, narrative reviews, qualitative studies), thematic focus (health outcomes studied including depression, anxiety, body dissatisfaction, wellbeing; social media platforms examined; population age characteristics), and whether methodology could support causal inferences. We focused on articles whose primary research question concerned social media and adolescent mental health, excluding tangentially related work. Complete coding schema available in Online Appendix 4B. ↩︎
Operationalisation of causality was a coarse measure used to determine whether it was plausible that the research article could make causal claims based on its overarching methodology. For example, study designs that might support causal reasoning include experimental studies, A/B testing, some longitudinal designs, randomised controlled trials, or other causal modelling techniques. Study designs such as cross-sectional, surveys, observational studies, and narrative reviews were considered unable to support causal reasoning. ↩︎
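To make the coarseness of this operationalisation concrete, a classification rule of this kind could be sketched as a simple lookup, as below. This is a minimal illustration only: the design labels, category names, and helper function are hypothetical, and the actual coding schema is the one documented in Online Appendix 4B.

```python
# Illustrative sketch of a coarse design-to-causality lookup; the labels and
# helper name are hypothetical, not the coding schema in Online Appendix 4B.
CAUSAL_DESIGNS = {
    "experiment", "randomised controlled trial", "a/b test",
    "longitudinal with causal modelling",
}
NON_CAUSAL_DESIGNS = {
    "cross-sectional", "survey", "observational", "narrative review",
}

def supports_causal_inference(design: str) -> str:
    """Coarsely judge whether a study design could plausibly support causal claims."""
    design = design.lower().strip()
    if design in CAUSAL_DESIGNS:
        return "plausibly causal"
    if design in NON_CAUSAL_DESIGNS:
        return "not causal"
    return "inconclusive"

print(supports_causal_inference("Cross-sectional"))  # -> not causal
```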
AI-assisted classification with validation: we used GPT-4o-mini via the OpenAI API to process abstracts. Following emerging best practices (European Commission, 2024; Feng, 2024; Oxford Communications, 2025; University of York, 2025), we conducted pilot testing with 5 citations, followed by formal reliability testing on 25 randomly selected citations coded both manually (by SLH) and via AI. Analysis proceeded only after achieving >90% agreement across all categories. Complete procedures are documented in Online Appendix 4B. ↩︎
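As a rough illustration of this validation workflow, a minimal sketch is shown below, assuming the OpenAI Python SDK (v1+) with an API key in the environment; the prompt wording, category labels, and helper names are illustrative assumptions, not our exact protocol (see Online Appendix 4B).

```python
# Minimal sketch of AI-assisted abstract classification with human validation.
# The prompt, labels, and threshold are illustrative, not the study protocol.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

def classify_abstract(abstract: str) -> str:
    """Ask the model for a single methodology label for one abstract."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        temperature=0,  # favour stable outputs for coding tasks
        messages=[
            {"role": "system", "content": (
                "Classify the study methodology of the abstract as exactly one of: "
                "experimental, observational, systematic review, meta-analysis, "
                "narrative review, qualitative, inconclusive. Reply with the label only."
            )},
            {"role": "user", "content": abstract},
        ],
    )
    return response.choices[0].message.content.strip().lower()

def agreement_rate(manual: list[str], automated: list[str]) -> float:
    """Share of validation citations where the human and AI codes match."""
    return sum(m == a for m, a in zip(manual, automated)) / len(manual)

# Full analysis proceeds only if validation agreement exceeds the threshold,
# e.g.: assert agreement_rate(manual_codes, ai_codes) > 0.90
```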
We note that standard assumptions of independence are imperfectly met here: organisations drew from overlapping literature, later reports may have been influenced by earlier ones, and citations within each report were strategically curated rather than independently sampled. We therefore present inferential statistics primarily as heuristics for identifying notable distributional differences, with effect sizes reported to contextualise practical significance. Our goal is descriptive comparison of these three documents rather than generalisation to a broader population of evidence syntheses. ↩︎
Statistical testing and corrections: Cramér’s V calculated effect size magnitudes. We applied Benjamini-Hochberg false discovery rate correction (α = 0.05) to address multiple testing. Sample sizes: APA (84 citations), NASEM (876), OSG (103). Sensitivity analyses proportionally down-sampled NASEM to assess robustness to sample size imbalances. Detailed statistical procedures in Online Appendix 4B. ↩︎
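The main quantities in this endnote can be reproduced with standard scientific Python tooling; the sketch below uses made-up contingency data purely for illustration, not our actual citation counts.

```python
# Sketch of the testing pipeline: chi-square test, Cramér's V effect size,
# and Benjamini-Hochberg FDR correction. All data below are illustrative only.
import numpy as np
from scipy.stats import chi2_contingency
from statsmodels.stats.multitest import multipletests

def cramers_v(table: np.ndarray) -> float:
    """Cramér's V for an r x c contingency table."""
    chi2, _, _, _ = chi2_contingency(table)
    n = table.sum()
    r, c = table.shape
    return float(np.sqrt(chi2 / (n * (min(r, c) - 1))))

# Rows: organisations (APA, NASEM, OSG); columns: hypothetical citation categories.
table = np.array([[30, 40, 14],
                  [300, 420, 156],
                  [45, 40, 18]])

chi2, p, dof, _ = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, dof = {dof}, p = {p:.3f}, V = {cramers_v(table):.2f}")

# Pool p-values across all comparisons, then control the false discovery rate.
p_values = [p, 0.03, 0.20]  # placeholders for the other tests
rejected, p_corrected, _, _ = multipletests(p_values, alpha=0.05, method="fdr_bh")
print(list(zip(p_values, p_corrected.round(3), rejected)))
```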
(χ² = 47.77, p < 0.001, Cramér’s V = 0.15) ↩︎
These were: Alonzo et al. (2021); Maza et al. (2023); Nesi and Prinstein (2015); Orben et al. (2022). Online Appendix 4C contains the details of all overlapping citations. ↩︎
Excluded articles included research primarily focused on non-media-related neurological development, sleep, media literacy, or education technology. Full procedural account in Online Appendix 4B. ↩︎
See Online Appendix 4B for details on how this was operationalised. ↩︎
We attempted to assess how many of these articles were pre-registered, but this information is rarely included in abstract text (typically appearing elsewhere on the article landing page). Given our reliance on abstracts for classification, we could not reliably estimate pre-registration rates. ↩︎
Initial chi-square analysis suggested small but significant differences in methodological distributions (χ² = 23.73, df = 10, p < 0.01, Cramér’s V = 0.18, small effect). Initial analysis also suggested OSG cited more studies supporting causal inferences (χ² = 6.77, df = 2, p = 0.03, Cramér’s V = 0.13, small effect). However, these differences did not survive statistical correction. The causal inference difference was not robust to FDR correction for multiple testing (p_corrected = 0.19). While methodological differences remained significant through FDR correction, they were not robust to sensitivity analysis correcting for sample size imbalance (p_corrected = 0.10), indicating that apparent differences were likely artifacts of unequal citation sample sizes rather than systematic selection biases. ↩︎
These inconclusive results included studies which investigated non-health related outcomes such as identity development, parental perspectives, or were focused on characterising social media behaviours rather than investigating a specific outcome. ↩︎
Chi-square tests revealed no significant organisational differences in platforms studied, health outcomes investigated, or focus on adolescent populations, both before and after FDR correction for multiple comparisons. The persistence of null findings across multiple analytical approaches strengthens confidence that organisations broadly selected similar types of scientific evidence. ↩︎
Bastow et al. (2014); Cairney and Oliver (2017); Young et al. (2002). ↩︎
Cologna et al. (2025); Gundersen (2024); Miller (2004); Oreskes (2004). ↩︎
For example, in 2005, the APA issued a policy statement on violent video games that was widely criticised by researchers for mischaracterising evidence (National Coalition Against Censorship, 2013). Over 200 scholars signed an open letter expressing concerns about the report’s quality, noting problems with selective citation and overstated conclusions (Ferguson, 2020). See also Elson et al. (2019) for further discussion. ↩︎
Clarke et al. (2024); Ghai et al. (2025); Simons et al. (2017). ↩︎
For example, the Intergovernmental Panel on Climate Change (IPCC) has developed systematic guidance for characterising uncertainty in synthesis reports, including calibrated language frameworks that specify “a common approach and calibrated language that can be used broadly for developing expert judgments and for evaluating and communicating the degree of certainty in findings of the assessment process” (Calvin et al., 2023). Such approaches provide helpful models for how professional scientific bodies might standardise synthesis practices in other domains. ↩︎
For an excellent paper calling for more thorough treatment of the limitations section in psychological science, see Clarke et al. (2024). ↩︎
We therefore did not include the effects category in our analysis. Methodological and thematic characteristics were unaffected by this issue. See Online Appendix 4B for more details. ↩︎
For example, see Ghai et al. (2025); Mansfield et al. (2025). ↩︎