PRODUCT ROADMAP 2026 - 2028

The Future of Multimodal Lecture RAG

Our strategic timeline to scale TubeRAG into the ultimate study companion for engineers, students, and workspaces. Join us on this journey!

🚀 Accelerate Our Build ProcessWe believe in open learning tools and high-fidelity tech. Your contributions fund backend operations, API calls, and directly fast-tracks upcoming features.
Q1 - Q2 2026

Phase 1: Core Synthesis & Pipeline

Shipped

Public Video Ingestion

Shipped

Asynchronous processing queue powered by serverless Upstash QStash, returning instant 202 responses.

Supabase pgvector Database

Shipped

Chunking and embedding transcript text structures for semantic vector-aware querying and citations.

Interactive Study Studio

Shipped

Real-time generation of structured outlines, concepts mindmaps, and cursive calligraphy study notes.

Q3 - Q4 2026

Phase 2: Security & Session Persistence

Active Build

User Authentication & JWT Verification

In Dev

Secure user registration and token-based sessions utilizing Supabase Auth and JWT state verification.

Row Level Security (RLS) Policies

In Dev

Database layer isolation to secure user nodes, ensuring users can only read and write their own ingested workspaces.

Chat History Persistence

Scheduled

Saving multi-agent chat sessions to the database to allow users to pause, resume, and manage historic queries.

Multi-tenant Shared Workspace Folders

Scheduled

Allowing users to organize ingested videos into folders and share workspaces or playlists with teammates.

H1 2027

Phase 3: Visual & Audio Intelligence

Planned

Multimodal Frame Sampling

Scheduled

Periodic frame extraction using vision models to transcribe texts from slide decks, code editors, and diagrams.

Gemini 3.1 TTS Speech Expressions

Scheduled

Inline voice tags support ([laughs], [whispers]) inside generated podcasts to create human-like audio conversations.

Vector SVG Concept Maps Exporters

Scheduled

Downloading and printing high-definition interactive mindmaps as editable SVG vector assets.

H2 2027 - 2028

Phase 4: Developer Ecosystem & Scales

Planned

Public Developer APIs

Scheduled

Secure API keys and webhooks subscription endpoints to build integrations with external learning management tools.

Batch Transcript Downloads

Scheduled

Exporting semantic lecture transcripts in multiple formats including SRT, VTT, and styled Markdown reports.

Enterprise Multi-modal Search Engine

Scheduled

Global semantic search index crossing transcripts, notes, and visual frames across all user repositories.