---
title: Derived formats with Gemini
date: 2026-04-18T11:26:48-04:00
categories:
- llms
---
The natural capability of Generative AI is to _generate_ stuff - and Gemini's particularly good with media.
For example, we can take any document, like this Mastercard report on [The State of Open Finance 2026](https://files.s-anand.net/blog/2026-04-18-derived-formats-with-gemini/the-state-of-open-finance-2026/report.pdf), and generate videos, podcasts, sketchnotes, songs, and more from it.
How?
I uploaded the [PDF](https://files.s-anand.net/blog/2026-04-18-derived-formats-with-gemini/the-state-of-open-finance-2026/report.pdf) to [NotebookLM](https://notebooklm.google.com/notebook/26da8a27-fd08-4c98-b0d5-73fefcb9e1dd) and created a 20-minute podcast by clicking on Generate Audio Overview - Deep Dive - English - Default.
It supports multiple languages, so I generated Chinese and Filipino versions as well.
Clicking on Generate Video Overview - Cinematic led to this video overview:
There are other formats in which we can generate videos. The Cinematic format is new, and the list is growing.
It's not just NotebookLM that you can use to generate new formats. [Gemini](https://gemini.google.com/) itself supports a variety of formats.
For example, I used my [Gemini Sketchnote prompt](https://www.s-anand.net/blog/gemini-sketchnotes/) to create a visual summary of the report:
... and used Lyria via the "Create Music" option to generate a [narrative song](https://www.s-anand.net/blog/singing-a-vote-of-thanks/) with this prompt:
```markdown
Create a narrative summarizing this article.
Narrate it rather than sing it.
Use a voice like Bobby McFerrin's, as if he were narrating rather than singing.
Keep the music minimal, focus on the voice.
```
Next, I had [Gemini create a slide deck](https://gemini.google.com/share/5dc1b824ea7b) by uploading the report and prompting:
```markdown
Convert the attached report into a beautiful slide deck that conveys the most important actionable information for the audience.
STYLE:
Write it McKinsey style with action titles. Just reading the titles should give the audience the entire message of the deck.
Follow the pyramid principle. The contents of the slide should prove the title.
Make the slides content rich, i.e. clear and self-explanatory with enough detail to help the audience understand without a narrator.
Use iconography, typography, stock images, etc. as appropriate.
Write as a single page HTML application.
```
[**See the slides**](https://files.s-anand.net/blog/2026-04-18-derived-formats-with-gemini/the-state-of-open-finance-2026/slides.html).
Then, a set of [interactive explainers](https://gemini.google.com/share/7342906e979a) using this prompt:
```markdown
Convert this report into 3 interactive explainers.
Pick the parts of the report that are best conveyed through interactive explanations. Identify the 3 most suitable ones.
Each explainer should, using animations, interactions, and simulations, explain a core point made in the report.
Render this as a single page HTML canvas.
```
[**See the explainers**](https://files.s-anand.net/blog/2026-04-18-derived-formats-with-gemini/the-state-of-open-finance-2026/explainers.html).
Finally, a [narrative data story using Claude](https://claude.ai/share/5d41d995-3658-4a9e-82d4-8ef1fb10cf6d). I could do this with Gemini too, but Claude is better at it.
[**See the story**](https://files.s-anand.net/blog/2026-04-18-derived-formats-with-gemini/the-state-of-open-finance-2026/story.html).
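The slide deck and explainer prompts above share a shape: a one-line task, optional style rules, and an output format. Here is a minimal Python sketch for reusing that shape across documents. `build_prompt` and its parameters are hypothetical names I made up for illustration, not part of any Gemini API, and the resulting text would still be pasted into Gemini by hand.

```python
from typing import List, Optional


def build_prompt(task: str, style: Optional[List[str]] = None, output: str = "") -> str:
    """Assemble a prompt: task line, optional STYLE block, then output format.

    Hypothetical helper: it only joins strings and calls no model.
    """
    lines = [task.strip()]
    if style:
        lines.append("STYLE:")
        lines.extend(rule.strip() for rule in style)
    if output:
        lines.append(output.strip())
    return "\n".join(lines)


# Rebuild a shortened version of the explainer prompt from its parts.
explainer_prompt = build_prompt(
    task="Convert this report into 3 interactive explainers.",
    output="Render this as a single page HTML canvas.",
)
print(explainer_prompt)
```

Swapping the `task` and `output` lines is enough to move from explainers to slides or a data story, which is roughly how the prompts in this post differ from each other.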
---
Where this becomes practical is in:
- **Proposals**. No one pays attention to that company slide or RFP response. A 3-min video or 15-min podcast lets them absorb it during a walk.
- **Reviews**. Skip copy-pasting metrics into PowerPoint. Feed the raw data and ask for a McKinsey-style deck with action titles.
- **Onboarding**. Instead of a 100-page SOP or compliance manual, how about interactive explainers or a localized audio guide in Mandarin or Spanish?
- **Manuals.** How about visual sketchnotes or step-by-step interactive flows from that documentation for call center agents?
- **Case studies.** Text-heavy fails. Maybe a 60-second narrative data story or sketchnote accompanied by an upbeat narrative song?
- **Reports.** No one reads the 10-page competitor analysis. A 5-minute podcast or a single-page visual sketchnote helps the execs.
- **Training.** Create interactive simulations where people make _actual_ decisions. [Simsaram](https://ragzbuilds.com/simsaram/) is my favorite example: family relationship training/simulation based on an [iconic film](https://en.wikipedia.org/wiki/Samsaram_Adhu_Minsaram).
- **Emails.** Why not use illustrations, sketches, flowcharts, etc. to liven up internal / external emails?
When generative AI makes generation easy, why not generate _actually interesting_ stuff?