---
title: Things I Learned - 02 Jun 2024
date: 2024-06-02T00:00:00+00:00
categories:
  - til
description: I explored Modal.com for GPUs, hybrid search using reciprocal rank fusion for RAG, and tested Undermind.ai for deep research. I also looked into audio embedding models, LLM routers like LiteLLM, and the psychological difficulty of subtraction in innovation.
keywords: [modal.com, reciprocal rank fusion, rag, undermind.ai, audio embeddings, litellm, llm routers]
---

This week, I learned:

- Modal.com seems of offer reasonably priced GPUs
- Combining vector search and keyword search with reciprocal rank fusion seems to work well for RAG. [Ref](https://www.assembled.com/blog/better-rag-results-with-reciprocal-rank-fusion-and-hybrid-search)
- Knowledge Project podcast. Morgan Housel
  - Differences of opinion exist because of different stories arising from origins and experiences. We are not debating facts. We are debating life lessons!
  - Solution: hear their anecdotes. The stories that taught them their lessons.
- AI reporting templates are a trend. Domain expertise comes in via structuring the report template and associated prompts.
- Some audio embedding models:
  [unoti/voice-embeddings](https://github.com/unoti/voice-embeddings),
  [retkowsky/audio_embeddings](https://github.com/retkowsky/audio_embeddings),
  [pyannote/embedding](https://huggingface.co/pyannote/embedding) (for speaker similarity),
  [and more](https://huggingface.co/models?sort=likes&search=wav2vec2).
- Hidden Brain podcast: Innovation 2.0: The power of less
  - Subtraction is hard because we are biologically and economically wired against it. It's also hard because there are fewer markers of subtraction. Additions are natural markers / triggers.
  - Marie Kondo suggests keeping only what sparks joy
- #POST I tried [Undermind.ai](https://undermind.ai/) - an agent that researches for you. It guides you to ask a detailed question, spends 2-3 minutes finding the answer, and provides detailed results. But it's worth the wait. It's a good alternative to quick validations on SciSpace.
- For popular results, search actually makes results worse! [When not to trust language models](https://arxiv.org/abs/2212.10511)
- Perception of fluency and usefulness are NEGATIVELY correlated in LLM! [Evaluating Verifiability in Generative Search Engines](https://arxiv.org/abs/2304.09848)
- GPTs are now available to non paying users. Apparently for a few weeks! Everyone also has limited access to GPT-4o.
- Discussion with Anand
  - Explore BBC Microbit
  - Everyone should get a Raspberry Pi!
  - Watch 2 minutes paper on YouTube
- More LLM routers:
  - [LiteLLM](https://www.litellm.ai/): Open source, OpenAI compatible, 100+ LLMs
  - [RouteLLM](https://github.com/lm-sys/RouteLLM): Open source, OpenAI compatible, automatically routes based on cost
  - [OpenRouter](https://openrouter.ai/): OpenAI compatible API, several models
  - [Unify](https://unify.ai/): Supports many models
  - [Portkey](https://portkey.ai/): Supports popular providers
  - [Martian](https://withmartian.com/): Limited set of models
- [d-id](https://www.d-id.com/) and [Heygen](https://www.heygen.com/) can modify videos of a person.