Software

AI + ML

After Copilot trial, government staff rated Microsoft's AI less useful than expected

Not all bad news for Redmond as Australian agency also found strong ROI and some unexpected upsides


Australia’s Department of the Treasury has found that Microsoft’s Copilot can easily deliver return on investment, but staff exposed to the AI assistant came away from the experience less confident it will help them at work.

The Department conducted a 14-week trial of Microsoft 365 Copilot during 2024 and asked for volunteers to participate. 218 put up their hands and then submitted to surveys about their experiences using Microsoft’s AI helpers. Those surveys are the basis of an evaluation report published on Tuesday.

The report reveals that after the trial participants rated Copilot less useful than they hoped it would be, as it was applicable to fewer workloads than they hoped would be the case.

Expected and actual proportion of workload participants felt Copilot could/did support: - Click to enlarge

Workers’ views on Copilot’s ability to improve their work also fell.

Participant ratings of Copilot’s impact on work quality - Click to enlarge

Usage of Copilot was lower than expected, with most participants using it two or three times a week, or less. Treasury thinks it probably set unrealistically high expectations before the trial, and noted that participants often suggested extra training would be valuable.

The trial proposed four use cases for Copilot - generating structured content, supporting knowledge management, synthesising and prioritising information, and undertaking process tasks - and participants agreed they were appropriate. But the report also found they also emerged with the belief that “Copilot was not appropriate for more complex tasks, mostly due to the limitations of the product itself.”

The tasks participants felt Copilot handled best were “finding and summarising information, generating meeting minutes, knowledge management and drafting content”. The report describes those as “basic administrative tasks”.

But saving even a little time on such tasks can pay off: the report finds that if Copilot saves 13 minutes a week for mid-level workers, it will pay for itself.

Other findings Microsoft will likely appreciate include the unanticipated benefit that Copilot displayed helped “to contribute to accessibility and inclusion for neurodivergent and part-time staff, or those experiencing medical conditions that require time off work.”

The AI assistant did so by producing automatic summaries of missed meetings and “levelling the playing field for those who struggle to navigate workplace norms or culture.” Some staff therefore reported “a small increase in work confidence”, with junior or recent hires more likely to express such sentiments.

Treasury’s learnings from the pilot include more careful selection of staff who use Copilot, the need for more consideration of necessary training on how to use AI and the risks of doing so, and the desirability of ongoing monitoring to test AI’s impact in the workplace.

Another finding suggests as-a-service AI might not be appropriate for agencies like Treasury.

“While security of protected government data and advice is of upmost importance, ideally the core functions of a generative AI product should work alongside security requirements,” the report states. “It is not clear whether products are likely to evolve over time to meet Treasury’s strict security needs, or whether Copilot itself will continue to evolve to incorporate external information into its outputs without feeding the algorithm with internal Treasury data.”

That opinion suggests orgs that handle sensitive information will likely do better with on-prem AI infrastructure. ®

Send us news
54 Comments

Microsoft expands Copilot bug bounty targets, adds payouts for even moderate messes

Said bugs 'can have significant implications' – glad to hear that from Redmond

Microsoft's drawback on datacenter investment may signal AI demand concerns

Investment bank claims software giant ditched 'at least' 5 land parcels due to potential 'oversupply'

Microsoft warns Trump: Where the US won't sell AI tech, China will

Rule hamstringing our datacenters is 'gift' to Middle Kingdom, vice chair argues

Under Trump 2.0, Europe's dependence on US clouds back under the spotlight

Technologist Bert Hubert tells The Reg Microsoft Outlook is a huge source of geopolitical risk

Some workers already let AI do the thinking for them, Microsoft researchers find

Dammit, that was our job here at The Reg. Now if you get a task you don't understand, you may assume AI has the answers

Microsoft names alleged credential-snatching 'Azure Abuse Enterprise' operators

Crew helped lowlifes generate X-rated celeb deepfakes using Redmond's OpenAI-powered cloud – claim

Despite Wall Street jitters, AI hopefuls keep spending billions on AI infrastructure

Sunk cost fallacy? No, I just need a little more cash for this AGI thing I’ve been working on

Satya Nadella says AI is yet to find a killer app that matches the combined impact of email and Excel

Microsoft CEO is more interested in neural nets boosting GDP than delivering superhuman intelligence

Microsoft boffins promise entire game worlds made from AI slop

WHAM, bam, no thank you, ma'am?

We meet the protesters who want to ban Artificial General Intelligence before it even exists

STOP AI warns of doomsday scenario, demands governments pull the plug on advanced models

Microsoft 365 price rises are coming – pay up or opt out (if you can find the button)

It's not auto-enrollment. It's just your current plan with extra Copilot for more money. Completely different

How nice that state-of-the-art LLMs reveal their reasoning ... for miscreants to exploit

Blueprints shared for jail-breaking models that expose their chain-of-thought process