Counselling

AI self-help: Benefits, risks, and boundaries

Written by Olena Baeva MA | BPsych | PgDip | MBACP | Neurodiversity affirming

Published 24th September 2025

Here is a composite sketch: I used to freeze in emotional conversations. When people asked, “How do you feel?” my mind went blank, no images, no words, so I withdrew. I was called “robot-like” more than once, and I couldn’t argue. I had nothing to say.

AI was the first crack in that wall. I’d type, “Something’s off,” and it handed me stems I could finish: “It sounds like disappointment and fear of being misunderstood.” For the first time, I had language I could bring into the room. It wasn’t perfect, but it was a start.

A human made it make sense. In therapy, I learned my neurodivergence, alexithymia (naming feelings is hard), and aphantasia (no mental pictures) shape how empathy shows up. Mine is more cognitive than affective: I understand and care in action, even if I don’t feel emotions in my body the way others expect. My therapist didn’t pathologise; she taught me reality checks, pacing and anchors, and AI gave me a feelings word bank that fit me.

We used AI as a journaling co-pilot and therapy as the place to test and repair. I still don’t cry easily, but I don’t abandon myself. I scan for real signals, choose honest words, and let empathy be practical, ex. showing up, remembering, following through. Oddly enough, AI (and human-in-the-loop) is what made me feel fully human.

Handrails, not staircases: The role of AI in self-help

It’s 1am, the house is quiet, and your thoughts are anything but. You open a blank note and then a chat window. A few questions later, you’re calmer, a plan is sketched, and sleep feels possible. That tiny assist is why millions are trying AI for self-help.

But recent cautionary tales show how easily 'helpful' can become 'harmful' if we confuse a handrail for the staircase.

Why people turn to AI for mental health support

Across social feeds and therapy waiting rooms, an emerging pattern is the same – people use chatbots and large language models (LLMs) to:

journal and check their mood
challenge a thought
plan small actions
access support without judgment

Early research backs the lightweight, skills-based use case: a 2023 meta-analysis in the npj Digital Medicine journal found small-to-moderate improvements in anxiety, depression and distress from conversational agents that deliver short, structured interventions (often CBT-style). In college students, a randomised trial of a rules-based chatbot (Woebot) showed symptom reductions in just two weeks versus an information control. Observational studies of Youper reported decreased symptoms over time.

But none of this is a substitute for therapy; it’s evidence that practice and structure help.

The generative-AI wave adds power and risk. A 2024-2025 line of reviews finds promise for psychoeducation and light support, but flags hallucinations, uneven safety behaviour, and thin clinical outcome trials relative to classic digital CBT. One 2025 randomised trial of a generative therapy chatbot reported symptom reductions in the short term, but this evidence is new, debated, and nowhere near the maturity of face-to-face treatments. Translation: helpful as a coach/notebook, not a therapist.

Cautionary tales (and what they actually teach)

Recent headlines are not a verdict against AI; they are reminders of where guardrails failed.

Product shutdowns

In mid-2025, a pioneer of CBT-style chatbots closed its consumer app, Woebot. Reporting suggests the causes were less about “clinical failure” and more about regulatory limbo for fast-moving AI, unclear reimbursement, and the difficulty of sustaining an unregulated, direct-to-consumer mental-health product. Good intentions aren’t enough; if a tool touches care, it inherits clinical and regulatory obligations.

Harmful advice in high-risk domains

In 2023, the National Eating Disorders Association paused its Tessa chatbot after it gave advice that could worsen eating problems. Even when a bot is built on prevention content, real-world prompts can drift it into unsafe territory. This is precisely the type of scenario where human oversight and escalation pathways must be non-negotiable.

Privacy and minors

Italy’s data-protection authority fined the maker of the companion chatbot Replika €5 million for GDPR violations, citing age-verification and legal-basis failures. Privacy law has teeth; if a tool can be accessed by young people or handles sensitive data, compliance isn’t optional.

Deception and consent

A peer-support platform quietly tested GPT-assisted replies on users seeking mental-health help. When the experiment emerged, the backlash was immediate. Simulated empathy without transparency erodes trust, even if the text sounds helpful.

Impersonation risks

Lawmakers have begun drafting rules against AI that pretends to be a clinician. If a chatbot blurs that line, you’re not just courting ethical trouble; you may be entering a legal minefield.

Each episode points to the same lesson: AI should scaffold self-help, not impersonate care. When products cross that line, the harms (and liabilities) spike.

The new guardrails: What health bodies expect

Regulators and professional organisations have moved quickly:

WHO now has ethics/governance guidance for large multimodal models in health, emphasising transparency, bias monitoring, clear labelling (AI ≠ therapist), and robust safety management. This is baseline, not bureaucracy.
APA resources and ethics code highlight privacy, data minimisation, competence and informed consent. If AI is used in health, practitioners must explain limits and safeguard client information.
NHS England points to digital clinical-safety standards DCB0129/0160. If you manufacture or deploy software that influences care, you are expected to present a clinical safety case and documented risk controls. That bar is sensible and high.

Bottom line: the grown-up way forward is hybrid; AI for structure and reach, humans for nuance and risk.

A human story (because this is where it lives)

Think about Maya. She’s juggling two jobs, an ageing parent, and a nervous system that spikes at 2am. A chatbot helps her run a quick CBT thought record: name the worry, check the evidence, pick one doable action for the morning. It’s not therapy. But it’s something she can do, a handrail when the stairs feel steep.

Then consider Eli. He starts asking the bot deeper questions, trauma memories, appetite changes he is ashamed to mention, “Would you tell me if this is dangerous?” The bot answers politely, confidently, and wrong. Eli feels seen in the moment and worse a week later. His partner, a nurse, spots the slide and nudges him to urgent care.

Both are real-world outcomes. The difference isn’t AI quality alone; it’s scope. Maya used a coach. Eli reached for a therapist and got a simulator.

What the evidence supports

If you boil the literature down to its spine:

Works best for: Brief, repeatable skills practice, CBT thought records, behavioural activation, mindfulness prompts, values-to-action planning. These interventions are constrained, specific, and measurable.
Useful adjuncts: Journaling, psychoeducation (with sources), light goal-setting, weekly reviews.
Not for: Diagnosis, crisis management, complex trauma processing, or medical directives. Generative systems can be confident and convincing while being off-base. Reviews repeatedly warn that clinical-grade evidence is limited and mistakes matter.

That’s why the safest pattern is “skills now, humans for risk and depth.” A 2025 randomised controlled trial of a supervised, generative therapy chatbot is encouraging but early; debate is ongoing. Good science takes time.

A practical playbook you can use today

Treat AI like a guided notebook with strong boundaries.

Frame the chat (once per conversation)

Role and limits: “Act as a self-help coach for reflective journaling and skills practice (CBT/DBT/ACT). Do not diagnose, treat, or give medical advice.”
Safety rule: “If I mention self-harm, harming others, abuse, psychosis, or medical emergency, stop and give UK resources (Samaritans 116 123; NHS 111/999), and tell me to contact a human now.”
Method: “Ask brief clarifying questions, keep replies ≤200 words, prefer questions to lectures, summarise and propose one tiny next step. Use uncertainty language; cite key psychoeducation sources.”

(That language mirrors WHO/APA expectations: transparency, guardrails, and humility.)

Use evidence-aligned micro-exercises (10-15 minutes)

CBT thought record: Situation, automatic thought, emotion 0-100%, distortions, evidence for/against, balanced alternative, re-rate, one micro-experiment.
Behavioural activation: Pick one 10-minute, values-linked task today; add an if-then for the biggest obstacle.
DBT distress tolerance: When you type “distress high,” the bot should offer exactly one do-now skill (TIP/self-soothe/check-the-facts/brief distraction) with steps.
ACT values micro-commitment: Name a value, take a 10-minute action in 24 hours, write one line on why it matters.

Keep boundaries that protect you

No crisis use, escalate to people.

Time-box: 10-15 minutes, three to five days/week; do a weekly review (wins, setbacks, triggers, skills used, learning). It keeps AI adjunctive, not addictive, and curbs “creative drift,” the longer a thread runs, the likelier the model is to improvise beyond evidence, sounding insightful while sliding off-track. The “like a person on psychedelics” analogy is rhetorically useful only in one sense: extended exchanges can become hyper-associative, with looser links and weaker reality-testing. But it’s technically inaccurate if taken literally. An LLM has no perception, affect, or altered state; “hallucination” here means invented content, not sensory distortion. Drift arises from cumulative small errors, context dilution (earlier constraints fade), inadvertent user reinforcement (“yes” signals approval), and sampling variance over long runs.

Mitigations: Reset or start a new thread for fresh subtopics; anchor key facts and constraints up front; require sources or explicit uncertainty for claims; reduce temperature (0 means single correct answer and two is more random and creative) and tighten instructions when precision matters; use structured prompts (e.g. CBT steps, checklists) and cap responses at ~200 words; periodically restate goals and boundaries (“stay within peer-reviewed evidence; no speculation”). In short, brief, bounded sessions plus clear guardrails keep the AI a handrail, not a hallucination machine.

Data minimisation: Avoid names/locations/third-party details; export notes locally; read the tool’s data policy first. (Ethics codes and fines make this point the hard way.)

For therapists: How to integrate without losing the plot

Position AI as a practice scaffold between sessions (CBT homework, mood check-ins, brief prompts), not a therapist.

Consent and clarity: Explain limits, data handling, and escalation pathways; invite clients to bring the “AI homework” into the room.

Myths to retire

“If it sounds empathic, it is.” Polite and personal is not the same as safe or accurate.
“It’s safer than nothing.” Not if it displaces human help in crisis, mishandles data, or normalises bad advice, as eating-disorder and privacy cases showed.
“Regulation kills innovation.” The opposite: clear rules separate journaling tools from clinical devices and make hybrid care viable. WHO and NHS guidance are enablers, not handbrakes.

Where we land

The message from the last two years is not “AI can’t help.” It’s “Don’t ask it to be what it isn’t.” Used with discipline, AI is a stabiliser: a nudge toward a 10-minute task you have been avoiding, a structured way to challenge a thought at midnight, a place to sketch next steps before you forget them. Used without boundaries, it impersonates care, stumbles on safety, and leaks more than it heals.

So keep your hand on the rail:

Use AI for skills and structure; verify anything consequential.
Let humans hold risk, nuance and relationship.
Expect and demand transparency: no impersonation, clear limits, proper data handling.

A metaphor to keep in mind

You know the warning: “Stay well clear. Vehicle reversing.” AI is a bit like a heavy lorry backing down a narrow lane, useful, powerful, and full of blind spots. When the way ahead isn’t clear, it leans on patterns and fills the gaps with confident guesses. That’s why the safest journey keeps a human as the spotter. Someone who can say, “Hold on, something feels off,” and help you brake, steer, or stop. Use AI as a handrail or journalling co-pilot if it helps; let people hold the risk and the nuance. And if you’d like company on that road, whether or not AI is in the loop, we can plan it together.

When we do that, the tools don’t replace therapy; they help you reach it, practise it, and remember what mattered when the day gets loud again.

A new handrail worth naming

ChatGPT now has parental controls that set boundaries without turning parents into spies. A parent links accounts with a teen, then can reduce sensitive content (e.g. sexual/violent role-play, extreme beauty ideals), set quiet hours, switch off voice, image generation, and memory, and opt out of model-training on the teen's chats. No one gets access to the teen's chat history; these are guardrails, not surveillance. In high-risk moments, OpenAI can notify a parent if systems and trained reviewers detect signs of acute self-harm risk, with resources attached; if the teen unlinks, the parent is told. It's a sensible step toward age-appropriate use-still best paired with clear family rules and a plan for what to do when the bot hits its limits.

Check for yourself:

Li, H. et al. (2023). Systematic review & meta-analysis of AI conversational agents for mental health. npj Digital Medicine. https://www.nature.com/articles/s41746-023-00979-5?utm_source=chatgpt.com

Fitzpatrick, K.K., Darcy, A., Vierhile, M. (2017). RCT of a CBT-oriented agent (Woebot) vs. info control in students. JMIR Mental Health. https://mental.jmir.org/2017/2/e19/?utm_source=chatgpt.com

Mehta, A. et al. (2021). Longitudinal observational study of Youper for anxiety/depression. JMIR. https://www.jmir.org/2021/6/e26771/?utm_source=chatgpt.com

Heinz, M.V. et al. (2025). Randomized trial of a generative AI therapy chatbot (Therabot). NEJM AI (+ summary). https://ai.nejm.org/doi/full/10.1056/AIoa2400802?utm_source=chatgpt.com

Balan, R. et al. (2024). Systematic review of automated conversational agents for youth. npj Digital Medicine. https://www.nature.com/articles/s41746-024-01072-1?utm_source=chatgpt.com

WHO (2024/2025). Ethics and governance of AI for health (incl. LMM guidance + news release). https://www.who.int/publications/i/item/9789240084759?utm_source=chatgpt.com

APA (2024/2025). Equity/ethics in AI; ethical guidance for AI in health service psychology; trend overview. https://www.apa.org/monitor/2024/04/addressing-equity-ethics-artificial-intelligence?utm_source=chatgpt.com

NHS England clinical-safety standards: DCB0129, DCB0160, and digital clinical safety assurance. https://digital.nhs.uk/data-and-information/information-standards/information-standards-and-data-collections-including-extractions/publications-and-notifications/standards-and-collections/dcb0129-clinical-risk-management-its-application-in-the-manufacture-of-health-it-systems?utm_source=chatgpt.com

The views expressed in this article are those of the author and do not necessarily reflect the views of Counselling Directory. Articles are reviewed by our editorial team and offer professionals a space to share their ideas with respect and care.

Article topics

Counselling

London W1G & Oxfordshire OX1

I specialise in neurodiversity because I am multiply neurodivergent myself and creating a good life for my fellow neurodivergent people is my passion. Understanding what happens in the brain helps replace moral judgement with compassion.

Saved Save profile

Email me

View profile