Commit Graph

6 Commits

Author SHA1 Message Date
hailin 7077ac7129 fix: provide real image files for video key; set load_original_video=True
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-06 03:13:33 -08:00
hailin 2cb9a1f29e fix: add num_frames/height/width/fps/id columns to smoke test CSV
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-06 03:05:01 -08:00
hailin ae3e448c8a feat: add smoke test config and synthetic data generator
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-06 02:21:58 -08:00
hailin bdeb2870d4 feat: add opensora/datasets module and tools/datasets
- Add opensora/datasets (aspect, bucket, dataloader, datasets, parallel,
  pin_memory_cache, read_video, sampler, utils, video_transforms)
- Add tools/datasets pipeline scripts
- Fix .gitignore: scope /datasets to root-level only, whitelist opensora/datasets/

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-06 02:00:19 -08:00
hailin 916ee2126d fix: replace GitHub blob image URLs with raw.githubusercontent.com for Gitea rendering 2026-03-05 22:59:17 -08:00
hailin 410c20d4fa feat: init my-sora — merge Open-Sora v2.0 (11B) + v1.3 data pipeline tools 2026-03-05 22:53:15 -08:00