Will AI replace Data Analysts?

Based on Based on BLS Occupational Outlook 2025, Tableau AI adoption survey 2025, McKinsey Analytics Benchmark 2025, and Snowflake Cortex AI feature release data., Data Analysts face an automation risk of 62/100 with a 2–4 Years runway. This analysis covers 20 specific tasks with 177 pieces of capability evidence. AI is unlikely to replace the role wholesale in the short term — but specific tasks within it are being automated at different rates.

Which Data Analyst tasks are most at risk of AI automation?

The tasks with the highest AI capability thresholds for Data Analysts are: SQL Query Writing, Exploratory Data Analysis, Data Model Documentation, Tool & Pipeline Maintenance, ETL & Data Pipeline Development. Each has published capability evidence and deployment signals indicating AI systems can perform them at meaningful quality thresholds today.

Which Data Analyst tasks are most defensible against AI?

The most defensible tasks for Data Analysts — those rated high on embodiment, organisational specificity, relationship value, creative synthesis, or consequence stakes — are: Stakeholder Requirement Gathering, Presentation of Findings, Forecast Modelling, Metric Definition & Alignment, Dashboard & Report Building. These have structural moats that current AI systems do not close easily.

How fast is AI being deployed for Data Analyst work?

Across 20 Data Analyst tasks tracked, the average deployment rate (percentage of comparable organisations actually using AI in production) is 5%. Deployment rates vary significantly by industry and company size — enterprise deployment lags smaller, more experimental organisations by 1–3 quarters on most task categories.

How does Runway calculate Data Analyst AI exposure?

Exposure is a weighted sum of task-level capability × adoption × (1 − defensibility), modified by environment factors (regulation, proprietary data, relationships, consequence stakes), with a confidence interval based on archetype deviation, adoption data quality, and task-map completeness. See /methodology for full computation details.

Role IntelligenceUpdated 2026-05-25T20:00:50.513+00:00

Data Analyst: AI Automation Risk Assessment

Evidence-backed analysis across 20 specific tasks. Capability claims sourced from peer-reviewed research, independent benchmarks, and industry data. Adoption rates tracked by industry and company size.

technologyfinanceretailhealthcare

marketing

any

junior

mid

senior

20 tasks analysed

Share

At a glance

Early Signal intelligence

Tasks tracked

20

Signals in database

0

Intelligence confidence

Early Signal

Last updated

May 25, 2026

Download / Share this intelligence pack ↓

AI Exposure

24/100

Defensibility

57%

Avg Capability

53%

20/20 tasks with evidence

Avg Deployment

5%

177 evidence sources

What's changing for Data Analysts

Early Signal

Data Analyst hiring has softened from its 2021–2022 peak but remains structurally healthy. The role is bifurcating: pure report-pulling positions are contracting as self-serve BI tools mature and LLM-assisted querying lowers the floor for business users. Simultaneously, demand is growing for analysts who combine SQL and Python proficiency with sharp business judgment — roles that effectively function as embedded decision-support partners to product, commercial, or ops teams. Compensation premiums are concentrating in tech, fintech, and healthcare; mid-market firms are increasingly hiring one or two senior analysts rather than tiered teams. Looker and dbt knowledge now appear in a majority of senior job postings in tech verticals, signalling a shift toward analysts owning transformation logic rather than relying on engineering pipelines. Generative AI tools (GitHub Copilot, ChatGPT Code Interpreter) are accelerating output on SQL generation and EDA, raising the expected throughput per analyst rather than eliminating the role. Analysts who cannot translate findings into business recommendations — and defend them in the room — are losing ground to those who can.

Synthesised by claude-sonnet-4-6 · refreshed May 21, 2026

Capability dimensions

How the dimensions of this role are being reshaped by AI · top 8 by weight

Quantitative Reasoning

Accelerant→ Stable100%

Insight Generation

Accelerant↑ Growing100%

Data Quality Judgment

Neutral→ Stable90%

Metric Definition

Neutral↑ Growing90%

Data Modelling & Transformation

Accelerant→ Stable90%

Data Storytelling

Emerging↑ Growing90%

Structured Analysis

Accelerant→ Stable80%

Data Acquisition Judgment

Neutral→ Stable80%

Market Context

ChatGPT Advanced Data Analysis (Code Interpreter), Tableau AI, Snowflake Cortex AI, and Databricks Genie now handle natural language querying, automated EDA, dashboard generation, and standard reporting. 'What happened' descriptive analytics is near-fully automatable. Agentic data loops (evaluate → adjust → re-run) make the traditional analyst bottleneck largely avoidable for standard business questions. The 'so what' layer — connecting data to strategic decisions — remains human-critical. BLS projects BI analyst roles declining while ML engineer roles grow 23% through 2032.

Source: Based on BLS Occupational Outlook 2025, Tableau AI adoption survey 2025, McKinsey Analytics Benchmark 2025, and Snowflake Cortex AI feature release data.

Task breakdown

Top 3 per pressure tier · expand for the full list

Medium automation pressure · 9

SQL Query Writing

Medium pressure→ StableConfirmed

Demonstrates agentic approach to SQL generation that improves upon standard LLM capabilities through iterative refinement and error correction.

Exploratory Data Analysis

Medium pressure→ StablePlausible

DeepSeek V4 can perform data analysis tasks at a level competitive with leading US AI systems

Data Model Documentation

Medium pressure→ StableConfirmed

Integration of real Street View data into world models improves robotic environment understanding and generalization to real-world spaces.

Show all 9 — 6 more

ETL & Data Pipeline Development

Medium pressure→ StableConfirmed

Embedded agentic AI managing distributed data pipeline failures autonomously indicates incremental improvement in computer_use capability for complex system management.

Data Cleaning & Preparation

Medium pressure→ StableConfirmed

LLMs demonstrate strong capability in generating pandas and data transformation code for cleaning tasks including type conversion, missing value imputation, deduplication, and format standardisation. The Anthropic Econom…

Low automation pressure · 11

Data Quality Monitoring

Low pressure→ StablePlausible

LLMs and AI systems demonstrate strong capability in rule-based data quality checks including schema validation, null detection, type checking, range validation, and statistical anomaly detection. Traditional ML anomaly …

Data Source Evaluation

Low pressure→ StableConfirmed

Self-service analytics agent enables autonomous query generation and analysis across distributed data sources, improving reasoning capability in data interpretation workflows.

Role Defensibility Profile

Higher = harder to automate

Physical Context37%

Org-Specific Knowledge63%

Error Consequence65%

Creative Synthesis51%

Relationship Value46%

Output Verifiability81%

Task-Level Analysis — 20 Tasks

ConfirmedPlausibleEarly Signal

8%

Dashboard & Report Building

Plausible60%

6% deployed61% def.

Design and build interactive dashboards and recurring reports in tools like Tableau, Power BI, or Looker that surface key metrics and enable self-service data exploration by stakeholders.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence:

Highest Exposure Areas

Analysis / Reporting

Standard analysis and reporting is already being absorbed by AI at the enterprise level. McKinsey notes analysis tasks among the sharpest automation increases. The defensible remainder is interpretation requiring proprietary context — that window is closing.

Hands-On Technical Execution

41% of code written in 2025 is AI-generated. The defensible technical work is system architecture, novel problem-solving, and integration of AI tools — not execution of known patterns. Standard technical execution is being absorbed at an accelerating rate.

Writing / Summarising / Documentation

GPT-5 Deep Research and Claude already produce publication-quality reports, emails, and documentation. By 2027, AI writing assistants will handle first-draft creation for virtually all standard business documents with minimal human input.

Strongest Defenses

Analysis / Reporting

Standard analysis and reporting is already being absorbed by AI at the enterprise level. McKinsey notes analysis tasks among the sharpest automation increases. The defensible remainder is interpretation requiring proprietary context — that window is closing.

Hands-On Technical Execution

41% of code written in 2025 is AI-generated. The defensible technical work is system architecture, novel problem-solving, and integration of AI tools — not execution of known patterns. Standard technical execution is being absorbed at an accelerating rate.

Customer / Stakeholder Communication

AI agents are now handling routine customer communication autonomously. The protection in this task comes from novel relationship context and trust — which erodes when your client interactions become standardised or when AI gains sufficient context to replicate the pattern.

See data analysts by industry

Same role, different industry-specific exposure profiles.

Technology Financial Services Healthcare Professional Services Retail & E-commerce Manufacturing Government & Education

How does Data Analyst compare?

Pick another role to see a side-by-side AI disruption comparison. The URL you land on is shareable.

Compare with

Live signals

Real-time AI signals affecting this role

View Signals →

Compare roles

See how other roles compare

All Roles →

What this means for data analysts

See how YOUR task allocation compares.
Start your free assessment →

The role-average exposure profile above is built on early signals — directionally useful but not yet corroborated across independent sources. Your specific task mix and tooling matter more than the role average here. Get a personal task-level breakdown rather than relying on the headline number.

Map my exposure →View full methodology

How we build role intelligence

Runway maintains an atomic task taxonomy (20 tasks tracked for Data Analyst) anchored to O*NET occupational data. Per-task signals enter through tier-graded connectors (peer-reviewed papers, statutory labour data, vendor benchmarks, preprints) and pass through the Sentinel auditor — every claim is rubric-scored, cross-checked, and confidence-graded before it can affect a role page. The narrative and task breakdown above are computed from that ledger; nothing is synthesised from first principles. See /methodology for the full pipeline.

Confidence level: Early Signal — based on 0 validated signals for this role across the Sentinel-graded sources we track.

← View all roles

Statistical Analysis

Medium pressure→ StableConfirmed

LLMs can perform standard statistical analyses including regression, hypothesis testing, ANOVA, and correlation analysis by generating correct code in Python/R. The Stanford HAI AI Index 2024 documents strong LLM perform…

Peer Methodology Review

Medium pressure→ StablePlausible

GitHub's updated impact study shows 46% of all code is now AI-generated among Copilot users, with 82% developer satisfaction. For tasks like Peer Methodology Review, AI coding assistants demonstrate 66% quality on routin…

Tool & Pipeline Maintenance

Medium pressure→ StableConfirmed

Demonstrates practical tool integration for domain-specific data access but no deployment scale metrics provided.

Ad-Hoc Analysis Requests

Medium pressure→ StablePlausible

Gemini in Google Sheets achieved state-of-the-art performance for analyzing complex data and could automate many spreadsheet-based workflows

A/B Test Analysis

Low pressure→ StablePlausible

GitHub's updated impact study shows 46% of all code is now AI-generated among Copilot users, with 82% developer satisfaction. For tasks like A/B Test Analysis, AI coding assistants demonstrate 69% quality on routine impl…

Show all 11 — 8 more

Insight Narrative Writing

Low pressure→ StableConfirmed

Codex shows capability to structure analytical findings and generate data-driven memos at scale, representing minor incremental improvement in analytical reasoning workflows.

Dashboard & Report Building

Low pressure→ StablePlausible

AI agents can compromise software supply chains through tools like LiteLLM

Cross-Functional Data Support

Low pressure→ StableConfirmed

Google Finance expansion demonstrates incremental improvement in multimodal financial reasoning across European markets with localization support.

Forecast Modelling

Low pressure→ StableConfirmed

LLMs can generate code for standard time series forecasting methods (ARIMA, Prophet, exponential smoothing) and assist with feature engineering for predictive models. The Stanford HAI AI Index 2024 documents improving AI…

Data Governance & Compliance

Low pressure→ StablePlausible

AI agent can perform executive-level governance tasks such as policy authoring and management

Metric Definition & Alignment

Low pressure→ StablePlausible

AI can analyze and align evaluation metrics to better reflect authentic model capabilities rather than benchmark gaming

Presentation of Findings

Low pressure→ StablePlausible

Enhanced language modeling could improve AI-assisted generation of clear, structured presentations of analytical findings

Stakeholder Requirement Gathering

Low pressure→ StablePlausible

AI agents can perform routine data gathering tasks autonomously in business contexts

medium

Creative Synthesis: medium

Relationship Value: low

Output Verifiability: high

Capability Evidence

Perform simulation and optimization tasks in building automation and energy management

Confirmed55% qualityAutoB2GPeer-Reviewed Research2026-03-30

Anthropic's study of real-world Claude usage across millions of professional conversations found that tasks related to Dashboard & Report Building represent a significant category of AI-augmented work...

Confirmed38% qualityClaudeIndependent Benchmark2025-03-01

Eloundou et al. classify report and document creation as high-exposure tasks (E2 category), where LLMs with tool access can reduce time by at least 50%. However, dashboard building involves iterative ...

Confirmed58% qualityGPT-4Peer-Reviewed Research2023-03-17

Deployment by Industry

25%tech saas

20%consumer retail

11%education

11%government

10%technology

0%financial services

8%

SQL Query Writing

Confirmed87%

10% deployed44% def.

Write and optimise SQL queries to extract, aggregate, and join data from relational databases and data warehouses for analysis, reporting, and ad-hoc investigations.

Physical Context: lowOrg-Specific Knowledge: lowError Consequence: lowCreative Synthesis: lowRelationship Value: lowOutput Verifiability: high

Capability Evidence

Demonstrates agentic approach to SQL generation that improves upon standard LLM capabilities through iterative refinement and error correction.

Confirmed87% qualityIndependent Benchmark2026-05-20

Neural-symbolic logic query answering could improve reasoning over incomplete knowledge graphs, potentially enhancing complex SQL query construction and optimization

Confirmed55% qualityPeer-Reviewed Research2026-03-19

The Anthropic Economic Index identifies SQL and database query generation as among the most frequent coding tasks performed by Claude in professional settings. Programming and code generation — includ...

Confirmed82% qualityClaudePeer-Reviewed Research2025-02-06

Deployment by Industry

40%tech saas

29%consumer retail

27%education

14%government

6%data analytics

0%financial services

8%

Data Cleaning & Preparation

Confirmed75%

9% deployed56% def.

Clean, transform, and standardise raw data from multiple sources — handling missing values, deduplication, format inconsistencies, and schema alignment to produce analysis-ready datasets.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: lowOutput Verifiability: high

Capability Evidence

LLMs demonstrate strong capability in generating pandas and data transformation code for cleaning tasks including type conversion, missing value imputation, deduplication, and format standardisation. ...

Confirmed75% qualityClaudePeer-Reviewed Research2025-02-06

ChatGPT can create data visualizations from datasets

Plausible40% qualityChatGPTVendor Disclosure2026-04-10

AltimateAI assists with data cleaning and preparation tasks as part of its comprehensive data engineering harness

Plausible60% qualityAltimateAI/altimate-codeVendor Disclosure2026-03-26

Deployment by Industry

35%tech saas

26%consumer retail

17%education

16%government

0%financial services

0%manufacturing

7%

Exploratory Data Analysis

Plausible85%

11% deployed44% def.

Investigate datasets to identify patterns, anomalies, distributions, and correlations — forming initial hypotheses and identifying promising directions for deeper analysis.

Physical Context: lowOrg-Specific Knowledge: lowError Consequence: lowCreative Synthesis: mediumRelationship Value: lowOutput Verifiability: medium

Capability Evidence

Perform geospatial analysis beyond vector-only limitations

Confirmed55% qualityGISclawPeer-Reviewed Research2026-03-31

Enhanced reasoning on incomplete knowledge graphs could improve AI's ability to explore and discover patterns in incomplete datasets

Confirmed52% qualityPeer-Reviewed Research2026-03-19

LLMs can generate standard exploratory data analysis workflows including summary statistics, distribution plots, correlation matrices, and outlier detection. However, identifying genuinely novel or bu...

Confirmed60% qualityClaudePeer-Reviewed Research2025-02-06

Deployment by Industry

55%advertising

30%technology

22%tech saas

18%consumer retail

14%consulting

11%education

7%

Ad-Hoc Analysis Requests

Plausible75%

7% deployed56% def.

Respond to time-sensitive, one-off analytical requests from stakeholders — quickly pulling data, running calculations, and delivering concise answers to specific business questions.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: lowOutput Verifiability: high

Capability Evidence

LLMs can handle well-specified ad hoc data questions by generating appropriate SQL queries, running calculations, and producing summary results. The Anthropic Economic Index shows data analysis and qu...

Confirmed65% qualityClaudePeer-Reviewed Research2025-02-06

Eloundou et al. classify data analysis tasks as having moderate-to-high LLM exposure (E1/E2), noting that LLMs can reduce time on structured analytical tasks but that ad hoc requests often require und...

Confirmed62% qualityGPT-4Peer-Reviewed Research2023-03-17

Amazon Bedrock multimodal models enable automated video insights extraction for specific business questions that previously required human reviewers

Plausible60% qualityAmazon BedrockVendor Disclosure2026-03-26

Deployment by Industry

28%tech saas

22%consumer retail

14%education

10%government

0%financial services

0%healthcare

6%

Presentation of Findings

Plausible67%

2% deployed67% def.

Present analytical results to stakeholders and leadership — creating slide decks, leading walkthroughs, answering questions, and defending methodology and conclusions in real time.

Physical Context: mediumOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: mediumRelationship Value: highOutput Verifiability: low

Capability Evidence

The Anthropic Economic Index shows minimal professional AI usage for tasks requiring physical presence, live interaction, and social persuasion. Presentation delivery combines embodied communication, ...

Confirmed22% qualityClaudePeer-Reviewed Research2025-02-06

AI agent can autonomously perform multi-step presentation tasks in PowerPoint

Plausible40% qualityMicrosoft Copilot Agent ModeVendor Disclosure2026-04-23

Enhanced language modeling could improve AI-assisted generation of clear, structured presentations of analytical findings

Plausible67% qualityMamba 3Vendor Disclosure2026-03-19

Deployment by Industry

8%tech saas

6%consumer retail

5%education

3%government

0%financial services

0%manufacturing

6%

Stakeholder Requirement Gathering

Plausible40%

2% deployed72% def.

Meet with business stakeholders to understand their analytical needs — translating vague business questions into specific, answerable data questions with defined scope and success criteria.

Physical Context: mediumOrg-Specific Knowledge: highError Consequence: mediumCreative Synthesis: mediumRelationship Value: highOutput Verifiability: low

Capability Evidence

Anthropic's study of real-world Claude usage across millions of professional conversations found that tasks related to Stakeholder Requirement Gathering represent a significant category of AI-augmente...

Confirmed17% qualityClaudeIndependent Benchmark2025-03-01

The Anthropic Economic Index shows that interpersonal, relationship-dependent professional tasks represent a minimal share of AI usage. Requirement gathering involves trust-building, reading implicit ...

Confirmed18% qualityClaudePeer-Reviewed Research2025-02-06

LLMs can assist with structuring requirement documents and generating question templates for stakeholder interviews, but the core task of eliciting unstated needs, navigating organisational politics, ...

Confirmed20% qualityGPT-4Peer-Reviewed Research2023-03-17

Deployment by Industry

5%tech saas

5%enterprise software

4%consumer retail

4%general

2%education

2%government

6%

Insight Narrative Writing

Confirmed72%

6% deployed61% def.

Translate analytical findings into clear, written narratives with business context — explaining what the data shows, why it matters, and what actions it suggests, for non-technical audiences.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: highRelationship Value: lowOutput Verifiability: medium

Capability Evidence

Codex shows capability to structure analytical findings and generate data-driven memos at scale, representing minor incremental improvement in analytical reasoning workflows.

Confirmed72% qualityIndependent Benchmark2026-05-16

Anthropic's study of real-world Claude usage across millions of professional conversations found that tasks related to Insight Narrative Writing represent a significant category of AI-augmented work. ...

Confirmed37% qualityClaudeIndependent Benchmark2025-03-01

LLMs demonstrate strong capability in drafting structured analytical narratives from data findings, producing well-organised executive summaries, key takeaway sections, and recommendation frameworks. ...

Confirmed70% qualityClaudePeer-Reviewed Research2025-02-06

Deployment by Industry

18%tech saas

13%consumer retail

12%finance

11%education

10%business intelligence

10%business operations

6%

Statistical Analysis

Confirmed70%

5% deployed56% def.

Apply statistical methods — hypothesis testing, regression analysis, significance testing, confidence intervals — to validate findings and quantify relationships in data.

Physical Context: lowOrg-Specific Knowledge: lowError Consequence: mediumCreative Synthesis: mediumRelationship Value: lowOutput Verifiability: high

Capability Evidence

Perform geospatial analysis beyond vector-only limitations

Confirmed55% qualityGISclawPeer-Reviewed Research2026-03-31

LLMs can perform standard statistical analyses including regression, hypothesis testing, ANOVA, and correlation analysis by generating correct code in Python/R. The Stanford HAI AI Index 2024 document...

Confirmed70% qualityGPT-4, ClaudeIndependent Benchmark2024-04-15

Can perform genomics analysis

Plausible62% qualityGPT-RosalindVendor Disclosure2026-04-17

Deployment by Industry

20%tech saas

15%consumer retail

12%education

7%government

0%financial services

0%manufacturing

5%

A/B Test Analysis

Plausible69%

6% deployed61% def.

Design, monitor, and analyse A/B tests and experiments — calculating sample sizes, checking statistical significance, identifying segment-level effects, and recommending ship/no-ship decisions.

Physical Context: lowOrg-Specific Knowledge: lowError Consequence: highCreative Synthesis: mediumRelationship Value: lowOutput Verifiability: high

Capability Evidence

LLMs can correctly perform standard A/B test significance calculations, compute confidence intervals, and generate analysis code for common experimental designs. The Stanford HAI AI Index 2024 documen...

Confirmed65% qualityGPT-4, ClaudeIndependent Benchmark2024-04-15

Eloundou et al. classify quantitative analysis tasks including experimental analysis as having moderate-to-high LLM exposure. Standard statistical test execution is well within LLM capability, but the...

Confirmed62% qualityGPT-4Peer-Reviewed Research2023-03-17

GitHub's updated impact study shows 46% of all code is now AI-generated among Copilot users, with 82% developer satisfaction. For tasks like A/B Test Analysis, AI coding assistants demonstrate 69% qua...

Plausible69% qualityGitHub CopilotVendor Disclosure2025-02-01

Deployment by Industry

25%tech saas

20%consumer retail

12%education

9%government

0%financial services

0%healthcare

5%

Data Quality Monitoring

Plausible72%

7% deployed56% def.

Monitor data pipelines and sources for quality issues — detecting schema changes, missing data, unexpected nulls, anomalous values — and escalating or fixing problems before they affect downstream analysis.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: lowOutput Verifiability: high

Capability Evidence

Real-time verification system for RAG systems can automatically verify document-based responses and citations, reducing manual verification work for data quality monitoring

Confirmed70% qualityReal-Time Verification for Long-Document RAG SystemsPeer-Reviewed Research2026-03-26

AIDABench provides evaluation standards for document understanding and processing that could improve assessment of data quality in document-based datasets

Confirmed45% qualityAIDABenchPeer-Reviewed Research2026-03-19

A systematic literature review of LLMs for code review found that AI detects 30-60% of code defects identified by human reviewers. For tasks like Data Quality Monitoring, AI-assisted review achieves a...

Confirmed35% qualityPeer-Reviewed Research2024-07-01

Deployment by Industry

30%tech saas

22%consumer retail

15%education

14%government

0%financial services

0%healthcare

4%

Cross-Functional Data Support

Confirmed72%

4% deployed56% def.

Provide analytical support across multiple teams — helping marketing, product, finance, and operations answer data questions, validate assumptions, and make data-informed decisions.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: highOutput Verifiability: low

Capability Evidence

Google Finance expansion demonstrates incremental improvement in multimodal financial reasoning across European markets with localization support.

Confirmed72% qualityIndependent Benchmark2026-05-11

Multi-agent LLM systems can provide training support for behavioral health professionals

Confirmed30% qualityPeer-Reviewed Research2026-04-02

Anthropic's study of real-world Claude usage across millions of professional conversations found that tasks related to Cross-Functional Data Support represent a significant category of AI-augmented wo...

Confirmed44% qualityClaudeIndependent Benchmark2025-03-01

Deployment by Industry

12%cloud infrastructure

12%financial services

8%legal services

8%hospitality

7%healthcare

5%tech saas

4%

Data Model Documentation

Confirmed83%

6% deployed50% def.

Document data models, table definitions, field mappings, and data lineage — maintaining a shared understanding of what data exists, where it comes from, and how it should be used.

Physical Context: lowOrg-Specific Knowledge: highError Consequence: lowCreative Synthesis: lowRelationship Value: lowOutput Verifiability: medium

Capability Evidence

Integration of real Street View data into world models improves robotic environment understanding and generalization to real-world spaces.

Confirmed83% qualityIndependent Benchmark2026-05-19

AI systems can generate code for complex, multi-panel visualizations from real-world data using vision-language models

Confirmed50% qualityRealChart2CodePeer-Reviewed Research2026-03-30

Fine-tuned large language model can automate systematic review screening by reviewing titles and abstracts for inclusion decisions

Confirmed50% qualityPeer-Reviewed Research2026-03-27

Deployment by Industry

22%tech saas

16%consumer retail

11%education

10%government

9%healthcare

6%healthcare

4%

Forecast Modelling

Confirmed60%

5% deployed67% def.

Build and maintain forecasting models for business metrics — revenue projections, demand forecasting, churn prediction — using time series analysis and regression techniques.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: highCreative Synthesis: mediumRelationship Value: lowOutput Verifiability: high

Capability Evidence

LLMs can generate code for standard time series forecasting methods (ARIMA, Prophet, exponential smoothing) and assist with feature engineering for predictive models. The Stanford HAI AI Index 2024 do...

Confirmed60% qualityGPT-4, ClaudeIndependent Benchmark2024-04-15

The Claude system card reports near-expert performance on graduate-level reasoning (GPQA), professional coding (SWE-bench), and document analysis tasks. For Forecast Modelling, Claude demonstrates app...

Plausible48% qualityClaude 3.5 SonnetVendor Disclosure2025-06-01

OpenAI's o1 system card demonstrates significant advancement in complex reasoning tasks, achieving 83rd percentile on Codeforces and 93rd percentile on AMC math competitions. For analytical aspects of...

Plausible51% qualityOpenAI o1Vendor Disclosure2024-09-01

Deployment by Industry

20%tech saas

16%consumer retail

9%education

7%government

0%financial services

0%healthcare

4%

Tool & Pipeline Maintenance

Confirmed81%

4% deployed56% def.

Maintain and troubleshoot analytical tools, data pipelines, and automated reporting systems — updating configurations, fixing broken jobs, and ensuring reliable data delivery.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: lowOutput Verifiability: high

Capability Evidence

Demonstrates practical tool integration for domain-specific data access but no deployment scale metrics provided.

Confirmed81% qualityIndependent Benchmark2026-05-22

Anthropic's study of real-world Claude usage across millions of professional conversations found that tasks related to Tool & Pipeline Maintenance represent a significant category of AI-augmented work...

Confirmed35% qualityClaudeIndependent Benchmark2025-03-01

LLMs demonstrate capability in debugging data pipeline code, generating configuration files, and diagnosing common failure modes in ETL and analytics tooling. The Anthropic Economic Index shows that d...

Confirmed60% qualityClaudePeer-Reviewed Research2025-02-06

Deployment by Industry

18%tech saas

13%consumer retail

8%education

5%software development

3%government

0%financial services

4%

Metric Definition & Alignment

Plausible58%

1% deployed67% def.

Define, standardise, and document business metrics — ensuring consistent calculation methods, resolving conflicting definitions across teams, and maintaining a shared metric dictionary.

Physical Context: lowOrg-Specific Knowledge: highError Consequence: highCreative Synthesis: lowRelationship Value: mediumOutput Verifiability: medium

Capability Evidence

The Anthropic Economic Index shows minimal professional AI usage for tasks requiring organisational consensus-building and cross-functional alignment. Metric definition alignment requires understandin...

Confirmed22% qualityClaudePeer-Reviewed Research2025-02-06

LLMs can suggest standard metric definitions and KPI frameworks for common business contexts, but the core task of aligning stakeholders on what metrics mean, resolving conflicting definitions across ...

Confirmed25% qualityGPT-4Peer-Reviewed Research2023-03-17

AI can analyze and align evaluation metrics to better reflect authentic model capabilities rather than benchmark gaming

Plausible58% qualityVendor Disclosure2026-03-26

Deployment by Industry

6%tech saas

4%consumer retail

3%education

2%government

0%financial services

0%healthcare

3%

ETL & Data Pipeline Development

Confirmed79%

7% deployed56% def.

Build and maintain ETL pipelines that extract data from source systems, transform it into analytical models, and load it into data warehouses for reporting and analysis.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: lowOutput Verifiability: high

Capability Evidence

Embedded agentic AI managing distributed data pipeline failures autonomously indicates incremental improvement in computer_use capability for complex system management.

Confirmed79% qualityIndependent Benchmark2026-04-29

AI systems can generate code for complex, multi-panel visualizations from real-world data using vision-language models

Confirmed50% qualityRealChart2CodePeer-Reviewed Research2026-03-30

kRAIG can automate the generation of ETL workflows through natural language instructions, reducing manual work required from data engineers

Confirmed70% qualitykRAIGPeer-Reviewed Research2026-03-24

Deployment by Industry

28%tech saas

20%consumer retail

14%education

13%government

10%data infrastructure

0%financial services

3%

Data Source Evaluation

Confirmed73%

3% deployed50% def.

Evaluate new data sources for reliability, completeness, and analytical value — assessing vendor data, API feeds, and internal instrumentation to determine whether they meet quality standards.

Physical Context: lowOrg-Specific Knowledge: mediumError Consequence: mediumCreative Synthesis: lowRelationship Value: lowOutput Verifiability: medium

Capability Evidence

Self-service analytics agent enables autonomous query generation and analysis across distributed data sources, improving reasoning capability in data interpretation workflows.

Confirmed73% qualityIndependent Benchmark2026-04-30

A systematic literature review of LLMs for code review found that AI detects 30-60% of code defects identified by human reviewers. For tasks like Data Source Evaluation, AI-assisted review achieves ap...

Confirmed30% qualityPeer-Reviewed Research2024-07-01

Eloundou et al. classify data assessment tasks as having moderate LLM exposure, noting that structural and statistical evaluation of data sources can be automated, but judgment about vendor reliabilit...

Confirmed38% qualityGPT-4Peer-Reviewed Research2023-03-17

Deployment by Industry

12%tech saas

9%consumer retail

6%education

5%government

0%financial services

0%healthcare

1%

Data Governance & Compliance

Plausible65%

3% deployed61% def.

Ensure data handling practices comply with privacy regulations and internal governance policies — managing access controls, anonymisation, retention schedules, and audit trails.

Physical Context: lowOrg-Specific Knowledge: highError Consequence: highCreative Synthesis: lowRelationship Value: lowOutput Verifiability: medium

Capability Evidence

Anthropic's study of real-world Claude usage across millions of professional conversations found that tasks related to Data Governance & Compliance represent a significant category of AI-augmented wor...

Confirmed39% qualityClaudeIndependent Benchmark2025-03-01

Eloundou et al. classify regulatory compliance tasks as having moderate LLM exposure, noting that AI can assist with knowledge retrieval and documentation but that the judgment, risk assessment, and o...

Confirmed32% qualityGPT-4Peer-Reviewed Research2023-03-17

AI agent can perform executive-level governance tasks such as policy authoring and management

Plausible65% qualityVendor Disclosure2026-05-08

Deployment by Industry

12%enterprise software

10%tech saas

8%it operations

7%consumer retail

5%education

3%government

1%

Peer Methodology Review

Plausible66%

2% deployed50% def.

Review analytical work from teammates — checking methodology, statistical validity, query correctness, and interpretation accuracy before findings are shared with stakeholders.

Physical Context: lowOrg-Specific Knowledge: lowError Consequence: mediumCreative Synthesis: lowRelationship Value: mediumOutput Verifiability: medium

Capability Evidence

Fine-tuned large language model can automate systematic review screening by reviewing titles and abstracts for inclusion decisions

Confirmed50% qualityPeer-Reviewed Research2026-03-27

While LLMs can flag obvious statistical errors and code bugs, the deeper aspects of methodology review — assessing whether the analytical approach fits the business question, evaluating unstated assum...

Confirmed48% qualityClaudePeer-Reviewed Research2025-02-06

LLMs can check statistical code for common errors, verify formula correctness, and identify standard methodological issues such as multiple comparison problems, inappropriate test selection, and sampl...

Confirmed50% qualityGPT-4, ClaudeIndependent Benchmark2024-04-15

Deployment by Industry

8%tech saas

6%consumer retail

4%education

3%government

0%retail ecommerce

0%technology

Data Analyst: AI Automation Risk Assessment

What's changing for Data Analysts

Task breakdown

See how YOUR task allocation compares.Start your free assessment →

See how YOUR task allocation compares.
Start your free assessment →