The Situation
Health Connect was a content marketing initiative developed by Viatris to deliver current medical research findings to practicing physicians in an accessible, time-efficient format. The program produced approximately 56 monthly videos (28 in English and 28 in Spanish), each approximately 90 seconds in length, summarizing recent scientific articles from prestigious medical journals.
Initially, the program focused primarily on text-based content with supporting visuals. However, as the initiative evolved, it transitioned to a more sophisticated audiovisual format incorporating AI-generated voice-overs, on-screen text, and supporting imagery — creating a more engaging and comprehensive educational experience for busy healthcare professionals.
The Challenge
The transition to fully integrated audiovisual content presented several interconnected challenges:
- Terminology inconsistency: Medical terms varied across videos, creating potential confusion and undermining the professional credibility of the content.
- Voice-over optimization: AI text-to-speech systems required specialized text formatting to properly pronounce complex medical terminology, acronyms, and numerical data.
- Multi-element synchronization: Voice-over narration, on-screen text, and visual elements needed precise coordination to create a cohesive viewing experience.
- Dual-language production: Maintaining consistent quality across parallel English and Spanish productions added complexity to the review process.
- High-volume production schedule: The substantial monthly output (56 videos) required efficient quality assurance processes to meet deadlines without sacrificing standards.
- Technical-educational balance: Content needed to maintain scientific accuracy while remaining accessible and engaging in a brief format.
My Solution
I developed a comprehensive quality assurance framework to address these challenges:
- Standardization manual: I created a comprehensive reference guide establishing consistent terminology, information hierarchy principles, and tone guidelines for all videos.
- Script optimization: I implemented specialized script formatting protocols to ensure AI voice-over systems could accurately pronounce medical terminology, including phonetic spelling of complex terms and proper formatting of numerical data.
- Three-stage review process: I developed a structured quality assurance workflow:
- Stage 1: Verification of on-screen text accuracy and compliance with style guidelines
- Stage 2: Cross-checking voice-over narration against approved script content
- Stage 3: Comprehensive review of timing, flow, and synchronization between all elements
- Branding integration: I ensured consistent visual branding appropriate to each video’s medical specialty category.
- Pronunciation guidelines: I developed specialized instructions for the AI voice-over system to correctly handle medical terminology in both English and Spanish.
Results
The implementation of this structured approach to audiovisual quality assurance delivered several significant benefits:
- Reduced revision cycles: Videos typically achieved error-free status after just two revisions, compared to multiple rounds previously required.
- Enhanced professional credibility: Consistent terminology and pronunciation reinforced the content’s authority for the intended audience.
- Improved viewing experience: Synchronized elements created a more cohesive and engaging educational experience.
- Maintained production schedule: The structured review process supported efficient completion of the high-volume monthly output.
- Cross-language consistency: Parallel videos maintained equivalent quality standards in both English and Spanish versions.
Working with these videos revealed several important insights about quality assurance in audiovisual content. The successful transition from text-focused to fully integrated audiovisual production demonstrated that editorial principles traditionally applied to written content can be effectively adapted to multimedia formats when approached systematically. The implementation highlighted that script quality fundamentally determines final video quality, establishing the critical importance of pre-production editorial standards. The process showed that AI-based voice technology requires specialized text preparation to handle domain-specific terminology accurately — a consideration particularly crucial in medical content where pronunciation errors could undermine credibility.
This experience demonstrated that structured, sequential review processes are more effective than holistic approaches when evaluating complex multi-element content. By breaking quality assurance into discrete stages focused on specific elements (text, audio, synchronization), I achieved more thorough error detection and efficient revision cycles. This methodology not only improved the end product but also streamlined the production process, allowing for sustainable management of high-volume content creation without sacrificing quality standards.