✓ Verified 💻 Development ✓ Enhanced Data

Video News Downloader

Automated daily news video downloader with AI subtitle proofreading.

Rating
4 (421 reviews)
Downloads
7,433 downloads
Version
1.0.0

Overview

Automated daily news video downloader with AI subtitle proofreading.

Complete Documentation

View Source →

Video News Downloader with AI Subtitle Proofreading

Complete workflow for downloading daily news videos, processing subtitles, and serving them via HTTP with web players.

Overview

This skill automates:

  • Video Download: CBS Evening News + BBC News at Ten from YouTube
  • Subtitle Processing: Extract auto-captions and convert to VTT format
  • AI Proofreading: Use DeepSeek to fix speech recognition errors
  • HTTP Streaming: Serve videos with embedded web players
  • Scheduled Updates: Daily cron jobs at configurable times

Quick Start

1. Download Latest News

bash
python3 scripts/video_download.py --cbs --bbc

2. Proofread Subtitles

bash
python3 scripts/subtitle_proofreader.py /path/to/subtitle.vtt

Or use DeepSeek directly:

"校对字幕文件 /path/to/subtitle.vtt"

3. Start HTTP Servers

bash
bash scripts/setup_server.sh

4. Setup Daily Cron Jobs

bash
bash scripts/setup_cron.sh

Commands

Video Download Script

Download CBS only:

bash
python3 scripts/video_download.py --cbs

Download BBC only:

bash
python3 scripts/video_download.py --bbc

Download both:

bash
python3 scripts/video_download.py --cbs --bbc

With subtitle proofreading:

bash
python3 scripts/video_download.py --cbs --bbc --proofread

Subtitle Proofreading

Proofread single file:

bash
python3 scripts/subtitle_proofreader.py <vtt_file_path>

Auto-proofread all news subtitles:

bash
python3 scripts/subtitle_proofreader.py --all

Server Management

Start servers:

bash
bash scripts/setup_server.sh start

Check status:

bash
bash scripts/setup_server.sh status

Stop servers:

bash
bash scripts/setup_server.sh stop

File Structure

text
/workspace/
├── cbs-live-local/
│   ├── cbs_latest.mp4
│   ├── cbs_latest.en.vtt          # Original subtitle
│   ├── cbs_latest.en.vtt-backup   # Backup
│   ├── cbs_latest-corrected.txt   # DeepSeek corrected text
│   └── cbs_latest-corrections.md  # Error list
│
├── bbc-news-live/
│   ├── bbc_news_latest.mp4
│   ├── bbc_news_latest.en.vtt
│   ├── bbc_news_latest.en.vtt-backup
│   ├── bbc_news_latest-corrected.txt
│   └── bbc_news_latest-corrections.md
│
└── temp/                           # Temporary download files

HTTP Endpoints

EndpointDescription
http://IP:8093/CBS Evening News player
http://IP:8093/cbs_latest.mp4CBS video direct
http://IP:8095/BBC News at Ten player
http://IP:8095/bbc_news_latest.mp4BBC video direct

Cron Jobs

Default Schedule (Beijing Time)

TimeTask
20:00Download latest CBS + BBC videos
20:30DeepSeek proofread subtitles

Manual Cron Setup

See references/cron-setup.md for detailed cron configuration.

DeepSeek Proofreading

What Gets Fixed

  • Speech recognition errors (e.g., "noraster" → "nor'easter")
  • Name errors (e.g., "trunk" → "Trump")
  • Location name errors
  • Professional terminology errors
  • Obvious spelling mistakes

Output Files

For each subtitle file, generates:

  • -backup.vtt - Original subtitle (never modified)
  • -corrected.txt - AI-corrected plain text
  • -corrections.md - List of corrections made

Troubleshooting

Video Download Fails

  • Check yt-dlp is installed: yt-dlp --version
  • Check YouTube URL is accessible
  • Try manual download first

Subtitle Extraction Fails

  • Some videos don't have auto-captions
  • Check if --list-subs shows available languages

Server Won't Start

  • Check ports 8093/8095 are free: lsof -i :8093
  • Check Python http.server is available

Proofreading Issues

  • Ensure DeepSeek model is available
  • Check subtitle file exists and is valid VTT format

See Also

Installation

Terminal bash

openclaw install video-news-downloader
    
Copied!

💻Code Examples

python3 scripts/subtitle_proofreader.py /path/to/subtitle.vtt

python3-scriptssubtitleproofreaderpy-pathtosubtitlevtt.txt
Or use DeepSeek directly:
> "校对字幕文件 /path/to/subtitle.vtt"

### 3. Start HTTP Servers

bash scripts/setup_cron.sh

bash-scriptssetupcronsh.txt
## Commands

### Video Download Script

**Download CBS only:**

python3 scripts/video_download.py --cbs --bbc --proofread

python3-scriptsvideodownloadpy---cbs---bbc---proofread.txt
### Subtitle Proofreading

**Proofread single file:**

python3 scripts/subtitle_proofreader.py --all

python3-scriptssubtitleproofreaderpy---all.txt
### Server Management

**Start servers:**
example.txt
/workspace/
├── cbs-live-local/
│   ├── cbs_latest.mp4
│   ├── cbs_latest.en.vtt          # Original subtitle
│   ├── cbs_latest.en.vtt-backup   # Backup
│   ├── cbs_latest-corrected.txt   # DeepSeek corrected text
│   └── cbs_latest-corrections.md  # Error list
│
├── bbc-news-live/
│   ├── bbc_news_latest.mp4
│   ├── bbc_news_latest.en.vtt
│   ├── bbc_news_latest.en.vtt-backup
│   ├── bbc_news_latest-corrected.txt
│   └── bbc_news_latest-corrections.md
│
└── temp/                           # Temporary download files

Tags

#web_and-frontend-development

Quick Info

Category Development
Model Claude 3.5
Complexity One-Click
Author cyberpsychosissss
Last Updated 3/10/2026
🚀
Optimized for
Claude 3.5
🧠

Ready to Install?

Get started with this skill in seconds

openclaw install video-news-downloader