Back to Blog

AI Subtitles for Online Courses: Accessibility, Completion Rates, and ADA Compliance

By Picute Team··4 min read
online-courseseducationaccessibilitysubtitleslectures

The Completion-Rate Case for Captions

Across EdTech research:

  • 15-20% completion rate lift for captioned vs uncaptioned courses (controlled for content type)
  • ESL students prefer captioned video 71% of the time per Verbit 2024 survey
  • 65% of non-native speakers find learning significantly easier with captions
  • Search within video becomes possible when transcripts are indexed — a cited feature in course retention

Captions are not just accessibility; they're a quality signal.

The Compliance Case for Captions

US — Public-facing courses

  • ADA Title III — Businesses providing public services, including MOOCs and bootcamps
  • Section 504 & IDEA — Schools receiving federal funding (most universities, K-12)
  • Unruh Civil Rights Act (CA) — Active enforcement has driven numerous EdTech lawsuits

EU — All digital services

  • European Accessibility Act (June 2025) — Includes e-learning platforms
  • Fines up to 4% of global revenue for covered entities

Technical standard

  • WCAG 2.1 AA — De facto baseline
  • Success Criterion 1.2.2 — Captions required for prerecorded media
  • Auto-captions alone are not compliant (DOJ guidance)
  • AI captions + human review = compliant

The Economics of Captioning at Scale

Manual captioning (professional transcriptionist): $60-120/hour. A 100-hour course library: $6,000-$12,000.

Platform auto-captions (YouTube, Vimeo): Free, but non-compliant and accuracy-limited.

AI + human review: $1-3/hour for AI; 6-9 minutes review per hour of content. 100-hour library: ~$200 in tooling + 10-15 hours of review. 95%+ cost reduction vs manual while meeting compliance.

Caption an online course with PicuteBuilt for universities and training teams · unlimited lecture length · 85+ languages · LMS-ready SRT

Workflow for Captioning a Course Library

Step 1 — Batch Upload

Most AI transcription tools accept folder/URL-batch upload. A semester's worth of lectures (40-80 hours) can upload overnight.

Step 2 — Language and Glossary Setup

  • Specify source language (don't auto-detect for non-English)
  • Upload glossary of domain terms if your tool supports it
  • Reuse glossary across subsequent lectures

Step 3 — Review Pass

For each lecture:

  • Correct proper nouns (lecturer names, guest speakers, institutions)
  • Fix domain terminology (formulas, citations, jargon)
  • Verify numbers and dates
  • Skip filler unless publishing transcript externally

Budget 6-9 minutes per hour of lecture.

Step 4 — Export and Upload

  • Export SRT per lecture (universal LMS compatibility)
  • Optional: VTT for HTML5-based players with positioning needs
  • Bulk-upload to your LMS via native subtitle manager

Step 5 — Quality Check

Spot-check 3-5 lectures in the final player:

  • Timing accuracy (captions match speech)
  • Font readability (default LMS styling works for most)
  • Special characters rendering correctly (formulas, non-English names)

Language Considerations for Education

English courses

Most AI tools handle well. Domain vocabulary is the main review focus.

Asian-language courses (Korean, Japanese, Chinese, Thai, Vietnamese)

Accuracy varies dramatically by tool. English-first tools often underperform. Use a tool specifically optimized for the language, or check accuracy on a sample lecture before committing to a library-wide run.

Multilingual courses (English + native language)

Code-switching is handled well by modern models for brief switches. For full multilingual lectures (40%+ second language), transcribe in segments by language for best accuracy.

Special Content Types

Math / formulas

AI transcribes spoken formulas phonetically ("x squared plus y squared equals r squared"). For published transcripts, replace with LaTeX or inline math notation during review.

Lab demonstrations

Include [screen shows X] or [demonstrates technique] tags for visual context. AI won't add these automatically; add during review for accessibility.

Q&A with audience

Multi-speaker segments benefit from diarization. For students asking questions off-mic, expect to flag [inaudible] during review.

Guest lecturers

Add their name and affiliation to the glossary before transcription if possible; avoids "Speaker 2" labels that require manual renaming.

LMS-Specific Upload Tips

  • Canvas — Studio or MediaGallery; SRT native
  • Coursera — Partner dashboard; requires SRT with standard formatting
  • edX — Studio; accepts SRT and SBV
  • Udemy — Course manager; one language per track
  • Teachable / Thinkific — Video manager; SRT upload per lesson
  • Moodle / Blackboard — Media plugin; SRT + VTT

Related Reading

See the lecture subtitles hubLecture subtitle hub — batch uploads, glossary support, LMS exports

Frequently asked questions

What's the actual accessibility requirement for online courses in 2026?

Depends on jurisdiction and institution. In the US: ADA Title III for public-facing courses (MOOCs, bootcamps); Rehabilitation Act Section 504 + IDEA for any school receiving federal funding; state-level enforcement is active (California's Unruh Civil Rights Act has driven numerous EdTech lawsuits). In EU: European Accessibility Act (June 2025). WCAG 2.1 AA is the de facto technical standard — Success Criterion 1.2.2 requires captions for prerecorded media. Auto-captions alone don't meet it; reviewed captions do.

How much does completion rate actually improve with captions?

EdTech studies converge on 15-20% lift in completion rate for captioned vs uncaptioned courses, controlled for content type. Mechanism: (1) accessibility for deaf/hard-of-hearing students; (2) ESL students learn faster with captions in their second language; (3) noisy-environment viewing (commute, kids, office) becomes feasible; (4) comprehension retention is higher when reading + listening. The 15-20% figure compounds with playback speed controls and chapter markers.

Do I need to caption every minute, or can I skip intros/outros?

Caption everything that has speech. Intros with spoken content, Q&A at the end, student questions from the audience — all should be captioned for compliance and accessibility. Non-speech segments (silent title cards, music-only transitions) don't require captions but benefit from [music] tags. In practice, AI captions the whole recording automatically, and 'everything' is easier than curating what to skip.

Can AI handle lecturer-specific vocabulary — chemistry formulas, legal terms, medical names?

AI gets the general vocabulary right; domain-specific terms need a glossary. Practical workflow: (1) transcribe the first lecture in a series with your tool; (2) review and build a glossary of terms the AI got wrong (reagent names, case citations, medical jargon); (3) feed the glossary to subsequent lectures. Tools like Picute support glossary upload that pre-trains the model on your domain. After 2-3 lectures of glossary tuning, accuracy on domain terms reaches the same level as general vocabulary.

Which LMS platforms accept uploaded SRT files?

Almost all. Canvas, Coursera, edX, Udemy, Teachable, Thinkific, LearnWorlds, Moodle, Blackboard — all accept SRT via native subtitle upload. Some also accept VTT for HTML5 player compatibility. Workflow: generate SRT per lecture, upload via LMS bulk-import tool, test rendering on one video before batch-processing the library. SRT is the safest format for broad LMS compatibility in 2026.