Project Case

CourseSnap

An AI learning-material workflow that turns scattered course screenshots, PDFs, and transcripts into structured study notes.

Role

Product definition / MVP design / Prototype design / AI workflow design / Product trade-off

Stage

Runnable MVP / AI product case

Key Outcome

Built a working MVP in about 10 hours, covering auto screenshot capture, PDF generation, transcript detection, and AI-assisted study-note generation.

Background

CourseSnap started from a small but very real learning workflow: when students watch online courses, replayed meetings, or remote lectures, the full slide deck is often unavailable. They can only take screenshots manually, then spend extra time sorting scattered images into something they can actually review.

Problem

The product problem was not simply image capture. The real issue was input quality: screenshots are easy to miss, OCR is unstable on slides, transcripts are separated from visual context, and AI summaries become weak when the upstream materials are incomplete or poorly structured.

Approach

I narrowed the MVP into a clear workflow: detect slide changes, save new screenshots, merge screenshots into a PDF, detect transcript files in the project folder, and use an OpenAI-compatible API call to generate structured study notes. I also packaged the tool for Windows so the PDF workflow could work even when users do not use AI.

Decision Logic

Why I moved away from OCR

The early idea was screenshot to OCR to text to AI summary. In practice, OCR was fragile on slide screenshots and could lose layout, order, and visual context. Keeping the original slides as a PDF made the input more reliable.

Why PDF + transcript became the core input

The PDF preserves the visual structure of the course, while the transcript adds semantic detail. Together they create a higher-quality input for review and AI summarization than OCR text alone.

Why the API key is provided by the user

Embedding a personal API key would create security and maintenance risks. Letting users provide their own key keeps distribution safer and makes the AI layer easier to replace or extend.

Why AI is an enhancement, not the entry barrier

The product still delivers value as an auto-capture and PDF-generation tool. AI improves the workflow, but users can still organize and review materials without model access.

PRD Summary

Feature	User Pain Point	Product Solution	Priority
Auto Capture	Manual screenshots are inefficient	Detect page changes and save screenshots automatically	P0
PDF Merge	Scattered images are hard to review	Merge screenshots into an ordered PDF	P0
Transcript Check	AI summary lacks context	Detect whether TXT/DOCX transcript files exist	P1
AI Summary	Organizing study materials is time-consuming	Generate structured study notes	P1
OCR	Recognition is unstable on slide screenshots	Keep it outside the core workflow for now	P2 / Deferred

User Flow

Start recording

Detect page changes

Auto-save screenshots

Generate PDF

Detect transcript

AI summary

Export study notes

Core Wireframe Prototype

原型图

低保真 Figma 原型

我用低保真原型先梳理主界面布局、按钮位置和关键弹窗，验证用户能否顺利从“开始录制”走到“AI 总结”。

交互重点

状态与异常先行

我优先设计录制中、缺少逐字稿、输出完成等关键状态，确保用户在每一步都知道下一步该做什么。

Core Wireframe Prototype — The prototype follows the core path of recording, PDF generation, transcript supplementation, and AI summarization. I used it to validate the main interface, error prompt, and final output interactions.

Product Interface & Real Outputs

Product Interface & Real Outputs 3 — PDF output: Captured images are merged into a continuous course PDF for reading, archiving, and further AI processing.

Product Interface & Real Outputs 4 — Error handling: When the transcript is missing, the app guides the user to add the required file before summarization.

Product Interface & Real Outputs 5 — AI summary: The tool combines the course PDF and transcript to generate structured study notes, completing the loop from course materials to review-ready outputs.

Outcome

CourseSnap became an end-to-end learning-material MVP: course content capture, PDF organization, transcript pairing, and AI-assisted note generation. The case shows my ability to identify a concrete user pain point, scope an MVP, make product trade-offs, and embed AI into a real workflow instead of treating AI as a standalone feature.

Reflection

This project clarified a product principle I would bring into AI product roles: model capability matters, but the user experience often depends more on input quality, workflow design, privacy expectations, and fallback value when AI is unavailable.

Next Iteration Plan

Auto-capture optimization

For videos with subtitles, subtitle updates can trigger extra screenshots even when the main slide content has not changed. Next, I plan to refine the capture logic so it screenshots only when key page content changes, while still preserving useful subtitle information. This should reduce redundant screenshots and improve review efficiency.

AI-summary flexibility

At the moment, the AI-summary flow is limited when the user does not provide a transcript. Next, I want the tool to warn users that adding a transcript will improve summary quality, but still allow them to continue generating study notes from the PPT/PDF alone. This keeps user control while improving feature availability.

Note

Read the full CourseSnap iteration note

Links

Figma Prototype GitHub Repository MVP Download

Continue Browsing

ViewTraditional Culture AI Visual System ViewShanghai Housing Price Forecast ViewCasa Rossi Real Estate Valuation ViewMonza Esports Hotel Feasibility ViewHedonic Price Regression ViewCultural Asset Digital Commercialization ViewContent-Driven Growth ViewPre-Master Education Service ViewNetEase Product Europe Market