Alaska's court system built an AI chatbot. It didn’t go smoothly.

The AI Justice Gap: Why Building Trustworthy AI for Government is Harder Than It Looks

Dealing with life’s complexities is difficult enough. But when those complexities intersect with legal processes, and are then mediated by artificial intelligence, things can quickly become fraught with challenges. Alaska’s ambitious attempt to build an AI-powered probate assistant, dubbed AVA, offers a stark lesson: deploying AI in high-stakes government services demands far more than just technological prowess. It requires a relentless focus on accuracy, reliability, and a healthy dose of skepticism.

The Alaska Experiment: A Year of False Starts

For over a year, the Alaska Court System has been wrestling with AVA, designed to guide residents through the often-intimidating probate process. The goal was simple: increase access to justice by providing a readily available, AI-driven resource. However, as consultant Aubrie Souza of the National Center for State Courts (NCSC) explained, the project quickly ballooned from a projected three months to over a year, largely due to the painstaking effort required to ensure accuracy. “With a project like this, we need to be 100% accurate, and that’s really difficult with this technology,” stated Stacey Marz, Administrative Director of the Alaska Court System.

AVA’s struggles aren’t unique. Across the US, government agencies are cautiously exploring AI, from New York City’s chatbot for driver’s license applications to initiatives aimed at streamlining housing benefit processing. Yet, a recent Deloitte report reveals that less than 6% of local governments prioritize AI for service delivery, hinting at a widespread hesitancy.

The Hallucination Problem and the Pursuit of ‘Rule-Following’ AI

A core challenge lies in the inherent fallibility of current AI models. “Hallucinations” – instances where AI confidently presents false information – proved a significant hurdle for AVA. Early versions, according to Souza, even provided incorrect information about legal resources, claiming the existence of a law school in Alaska where none exists. This underscores a critical point: AI isn’t simply a knowledge repository; it’s a pattern-matching engine prone to errors.

Tom Martin, founder of LawDroid, the company developing AVA, emphasizes the importance of building “rule-following” AI for legal applications. Unlike models designed to be creative or engaging, legal AI must prioritize accuracy and adherence to established rules. Martin notes that different AI models exhibit distinct “personalities,” some more prone to straying from factual information than others. Interestingly, the team even had to dial down AVA’s empathy, finding that grieving users simply wanted direct answers, not condolences.

Did you know? The cost of AI usage is decreasing rapidly. Martin estimates that 20 AVA queries cost only around 11 cents, making AI a potentially cost-effective solution for government services – provided accuracy can be guaranteed.

Beyond Accuracy: The Human Oversight Imperative

Even as AI models improve, the need for human oversight remains paramount. The AVA team initially designed a 91-question test to evaluate the chatbot’s accuracy, but found it too time-consuming to thoroughly review. They ultimately refined the test to 16 questions, highlighting the practical challenges of validating AI-generated information in complex domains.

This raises a broader question: how much human review is sufficient? The answer likely varies depending on the application. For low-risk tasks, automated monitoring may suffice. But for critical services like legal guidance, a robust system of human oversight is essential to catch errors and prevent harm.

The AI Investment Disconnect: Hype vs. Reality

The struggles with AVA reflect a growing disconnect between the hype surrounding AI and its actual adoption. Despite booming investment in AI technologies, real-world implementation remains surprisingly limited. This discrepancy is partly due to concerns about reliability and trustworthiness, particularly in high-stakes contexts. The promise of AI-driven efficiency clashes with the reality of ongoing maintenance, validation, and the need for continuous adaptation as AI models evolve.

Pro Tip: When evaluating AI solutions for government services, prioritize transparency and explainability. Understanding *how* an AI arrives at a particular answer is just as important as the answer itself.

Future Trends: Towards More Reliable and Responsible AI

Several trends are emerging that could address the challenges highlighted by the AVA experience:

Retrieval-Augmented Generation (RAG): This technique focuses on grounding AI responses in verified knowledge sources, reducing the risk of hallucinations. Instead of relying solely on its internal knowledge, the AI retrieves relevant information from a trusted database before generating a response.
AI Verification Layers: Companies like Manus are building AI applications with multiple layers of AI-powered verification to ensure accuracy.
Specialized AI Models: Developing AI models specifically tailored to narrow domains, like probate law, can improve accuracy and reduce the risk of errors.
Continuous Monitoring and Feedback Loops: Regularly monitoring AI performance and incorporating user feedback is crucial for identifying and correcting errors.

FAQ: AI and Government Services

Q: Is AI ready to replace human legal professionals?
A: Not yet. AI can assist legal professionals and provide basic guidance, but it cannot replace the nuanced judgment and expertise of a qualified attorney.
Q: What are the biggest risks of using AI in government?
A: Inaccuracy, bias, lack of transparency, and potential for harm to citizens are key concerns.
Q: How can governments ensure AI systems are fair and unbiased?
A: Careful data selection, algorithmic auditing, and ongoing monitoring are essential.

The AVA project serves as a valuable case study, demonstrating that the path to AI-powered government services is not a straight line. It’s a journey marked by challenges, setbacks, and the constant need for vigilance. While the potential benefits of AI are undeniable, realizing those benefits requires a commitment to building trustworthy, reliable, and responsible AI systems.

Reader Question: What other areas of government do you think could benefit from AI, and what safeguards should be in place?

Explore further: Read our article on the ethical considerations of AI in public services and subscribe to our newsletter for the latest updates on AI and government technology.

Worth a look

Alaska’s court system built an AI chatbot. It didn’t go smoothly.

The AI Justice Gap: Why Building Trustworthy AI for Government is Harder Than It Looks

The Alaska Experiment: A Year of False Starts

The Hallucination Problem and the Pursuit of ‘Rule-Following’ AI

Beyond Accuracy: The Human Oversight Imperative

The AI Investment Disconnect: Hype vs. Reality

Future Trends: Towards More Reliable and Responsible AI

FAQ: AI and Government Services

Related

Leave a Comment Cancel reply

The AI Justice Gap: Why Building Trustworthy AI for Government is Harder Than It Looks

The Alaska Experiment: A Year of False Starts

The Hallucination Problem and the Pursuit of ‘Rule-Following’ AI

Beyond Accuracy: The Human Oversight Imperative

The AI Investment Disconnect: Hype vs. Reality

Future Trends: Towards More Reliable and Responsible AI

FAQ: AI and Government Services

Share this:

Related

Leave a Comment Cancel reply

Latest

Popular