# Awesome Gemini API examples

Here's a list of really cool Gemini-powered projects, demos and hacks that we liked. We hope they give you some ideas about what to build with Gemini next.

You should also have a look at our Google-maintained [quickstarts](quickstarts/) and [examples](examples/).

## Gemini API-powered projects

These projects are listed in reverse chronological order and are the work of their authors, who are not necessarily affiliated with Google.

### Creating workflow diagrams using Gemini

From [Gitesh Mahadik](https://medium.com/@gitesh08): create amazing workflow diagrams for your next slide deck: https://medium.com/google-cloud/how-to-generate-workflow-diagrams-using-gemini-8d3d538bd2e7

### [Caching] Helping with navigating a lot of documentation

The context caching feature for Gemini is really neat!

As you all know, I write a lot and document AI research progress in lots of places. But I also tend to forget research findings due to the vast amount of papers I read so I wanted to try whether context caching can help me… pic.twitter.com/7Q7G2RapFD

— elvis (@omarsar0) June 18, 2024

### [Vision + Math reasoning] Solving math problems on a whiteboard

Gemini analyzes the math drawn on the canvas, then computes an answer and works out where to display it. It can even recognize variables and functions, and solve brain teasers.

![Solving Math problems in tldraw](https://pbs.twimg.com/media/GP818DAWUAEkZSA?format=jpg&name=900x900)

finally we can do maths pic.twitter.com/SoECzN1Aom

— tldraw (@tldraw) June 11, 2024

a b test pic.twitter.com/kBzxvGL91t

— tldraw (@tldraw) June 12, 2024

Bee maths pic.twitter.com/NPzfWl6UrS

— tldraw (@tldraw) June 15, 2024

maths is solved https://t.co/kgbc1soUhr pic.twitter.com/g9QwcICglB

— tldraw (@tldraw) June 13, 2024

### [Vision] [Gemini spatial example](https://gemini-spatial-example.grantcuster.com/)

Image analysis and labelling using Gemini Flash.

![Gemini spatial example](https://github.com/GrantCuster/gemini-spatial-example/raw/main/gemini-spatial-example.gif)

See also:

Alright Gemini's vision capabilities are really good. All I sent to Gemini was the photo and some instructions — the SVG on top was 100% generated by Gemini, including the coordinates etc. pic.twitter.com/wwv4a6maEa

— Blixt (@blixt) June 7, 2024
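
Demos like the two above rely on the model returning coordinates that can be drawn over the original image. As a minimal sketch, assuming the model was prompted to return each box as `[ymin, xmin, ymax, xmax]` on a 0 to 1000 grid (an assumption about the response format, not something either project states), the values could be scaled to pixel coordinates as follows. The helper name `to_pixel_box` is hypothetical.

```python
from typing import List, Tuple


def to_pixel_box(
    box: List[int], width: int, height: int, scale: int = 1000
) -> Tuple[int, int, int, int]:
    """Scale a normalized box to pixel coordinates.

    Assumes `box` is [ymin, xmin, ymax, xmax] on a 0..scale grid
    (an assumed response format). Returns (left, top, right, bottom).
    """
    ymin, xmin, ymax, xmax = box
    left = round(xmin * width / scale)
    top = round(ymin * height / scale)
    right = round(xmax * width / scale)
    bottom = round(ymax * height / scale)
    return left, top, right, bottom


if __name__ == "__main__":
    # A box covering the middle of a 640x480 image.
    print(to_pixel_box([100, 200, 500, 800], width=640, height=480))
```

The resulting tuple can be handed to whatever drawing layer the app uses (an SVG rectangle, a canvas overlay, and so on).
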

### [Vision] Recognizing actions on a computer

Gemini detects which application is in use and which action is performed in it.

I am building a system to teach Gemini Flash new skills by observing the actions I take on my computer so it can repeat them.

I found a way to achieve 100% accuracy in Gemini's ability to recognize what I am clicking.

The trick?
Draw a blue circle where the click happens. 🔵 pic.twitter.com/sxj6BcXSta

— Pietro Schirano (@skirano) June 6, 2024
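
The trick described in the tweet, marking the click location before the screenshot goes to the model, can be sketched as below. This is only an illustration, not the author's implementation: it assumes the screenshot is held as a nested list of RGB tuples, and `mark_click`, the default radius, and the filled (rather than outlined) circle are all hypothetical choices. A real pipeline would more likely use an image library on the captured frame.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]
BLUE: Pixel = (0, 0, 255)


def mark_click(
    image: List[List[Pixel]], x: int, y: int, radius: int = 3, color: Pixel = BLUE
) -> List[List[Pixel]]:
    """Return a copy of `image` with a filled circle centred on the click at (x, y).

    `image` is indexed as image[row][col]; pixels outside the image are skipped.
    """
    marked = [row[:] for row in image]  # copy rows so the input stays untouched
    for row in range(max(0, y - radius), min(len(marked), y + radius + 1)):
        for col in range(max(0, x - radius), min(len(marked[row]), x + radius + 1)):
            if (col - x) ** 2 + (row - y) ** 2 <= radius ** 2:
                marked[row][col] = color
    return marked


if __name__ == "__main__":
    white: Pixel = (255, 255, 255)
    screenshot = [[white] * 10 for _ in range(10)]
    annotated = mark_click(screenshot, x=5, y=5, radius=2)
    print(annotated[5][5] == BLUE)
```

The annotated frame, rather than the raw screenshot, is then what would be sent to the model along with the question about what was clicked.
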

### [Vision] Drawing labelling in [tldraw](https://tldraw.dev/)

A Gemini API integration that labels your drawings.

![tldraw](https://pbs.twimg.com/tweet_video_thumb/GPOT8XtXAAEHyxh.jpg)

this demo may seem un-flashy but its actually huge for letting models interact better with a canvas, and also potentially massive for accessibility use cases https://t.co/58MFuLQ3kP

— Lu wilson 🏳️‍🌈🏳️‍🌈 (@TodePond) June 4, 2024

### [Vision and Code generation] [Gemini UI to Code Streamlit App](https://github.com/Doriandarko/gemini-ui-to-code)

Generates an HTML page from an image.

![Gemini UI to Code Streamlit App](https://pbs.twimg.com/ext_tw_video_thumb/1797737995325362176/pu/img/sA48r-MQRQouOJnW.jpg)

Introducing Gemini UI to Code 🧑‍💻

A small but powerful app that uses an agentic framework to convert an image to code.

Perfect as a starter template or for learning new tricks.

Thanks to its improved Vision capabilities, Gemini 1.5 Pro is perfect for this task. 🪄 pic.twitter.com/QfNi0q4ygL

— Pietro Schirano (@skirano) June 3, 2024

### [Vision] [Gemini helping to play Super Mario 64](https://t.co/doTngufjFL)

Gemini recognizes the game and its levels, and provides advice when asked.

![Gemini helping to play Super Mario 64](https://pbs.twimg.com/ext_tw_video_thumb/1792945797765431296/pu/img/o4b_hjENB5fRhtSQ.jpg)

I built my own omni assistant using Gemini 1.5 Flash to guide me through Super Mario 64.

Gemini can see what I do on my screen and communicate with me in real time via voice, and thanks to the long 1M context, it has a memory of everything we do together.

Incredible. pic.twitter.com/doTngufjFL

— Pietro Schirano (@skirano) May 21, 2024