Movie2U Documentation

Overview

Movie2U is a serverless application that makes video content more accessible to blind and visually impaired users by generating comprehensive visual and audio descriptions of each video.

Features

1. Visual Analysis

2. Audio Processing

3. Comprehensive Narrative Generation

4. Audio Narration

Output Files

  1. Original video in S3
  2. Extracted audio in S3
  3. Transcription JSON in S3
  4. Generated narration audio in S3
  5. Analysis data in DynamoDB

Data Structure

interface VideoAnalysis {
    videoId: string;
    timestamp: string;
    labels: Array<{
        label: string;
        confidence: number;
        timestamp: number;
    }>;
    audioPath: string;
    visualNarrative: string;
    transcriptionPath: string;
    transcript: string;
    comprehensiveNarrative: string;
    narrationAudioPath: string;
    status: string;
}
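
As a minimal sketch of how a record of this shape might be checked at runtime (for example, after reading an item back from DynamoDB), the snippet below repeats the interface and adds a type guard. The helper name `isVideoAnalysis`, the `VideoLabel` alias, and every value in `example` (bucket name, paths, status string, label data) are assumptions made for illustration; they are not taken from the Movie2U codebase.

```typescript
// Same fields as the VideoAnalysis interface documented above; the label
// entry is pulled out into a named type purely for readability.
interface VideoLabel {
    label: string;
    confidence: number;
    timestamp: number;
}

interface VideoAnalysis {
    videoId: string;
    timestamp: string;
    labels: VideoLabel[];
    audioPath: string;
    visualNarrative: string;
    transcriptionPath: string;
    transcript: string;
    comprehensiveNarrative: string;
    narrationAudioPath: string;
    status: string;
}

// Names of all top-level fields that must be strings.
const STRING_FIELDS = [
    "videoId", "timestamp", "audioPath", "visualNarrative",
    "transcriptionPath", "transcript", "comprehensiveNarrative",
    "narrationAudioPath", "status",
] as const;

// Hypothetical helper: returns true only if `value` has every documented
// field with the documented primitive type.
function isVideoAnalysis(value: unknown): value is VideoAnalysis {
    if (typeof value !== "object" || value === null) return false;
    const record = value as Record<string, unknown>;

    if (!STRING_FIELDS.every((field) => typeof record[field] === "string")) {
        return false;
    }

    const labels = record["labels"];
    if (!Array.isArray(labels)) return false;

    return labels.every((entry: unknown) => {
        if (typeof entry !== "object" || entry === null) return false;
        const item = entry as Record<string, unknown>;
        return (
            typeof item["label"] === "string" &&
            typeof item["confidence"] === "number" &&
            typeof item["timestamp"] === "number"
        );
    });
}

// Illustrative values only; real paths, units, and status strings may differ.
const example: VideoAnalysis = {
    videoId: "demo-video-001",
    timestamp: "2024-01-01T00:00:00Z",
    labels: [{ label: "Person", confidence: 98.5, timestamp: 1500 }],
    audioPath: "s3://example-bucket/audio/demo-video-001.mp3",
    visualNarrative: "A person walks along a beach.",
    transcriptionPath: "s3://example-bucket/transcripts/demo-video-001.json",
    transcript: "Hello and welcome.",
    comprehensiveNarrative: "A person walks along a beach while greeting the viewer.",
    narrationAudioPath: "s3://example-bucket/narration/demo-video-001.mp3",
    status: "COMPLETED",
};
```

Calling `isVideoAnalysis(example)` accepts the sample record, while a partial object such as `{ videoId: "x" }` is rejected because the remaining fields are missing. A guard like this only checks field presence and primitive types; it does not verify that the S3 paths exist or that `status` holds a recognised value.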

Additional Resources