--- title: Things I Learned - 02 Jun 2024 date: 2024-06-02T00:00:00+00:00 categories: - til description: I explored Modal.com for GPUs, hybrid search using reciprocal rank fusion for RAG, and tested Undermind.ai for deep research. I also looked into audio embedding models, LLM routers like LiteLLM, and the psychological difficulty of subtraction in innovation. keywords: [modal.com, reciprocal rank fusion, rag, undermind.ai, audio embeddings, litellm, llm routers] --- This week, I learned: - Modal.com seems of offer reasonably priced GPUs - Combining vector search and keyword search with reciprocal rank fusion seems to work well for RAG. [Ref](https://www.assembled.com/blog/better-rag-results-with-reciprocal-rank-fusion-and-hybrid-search) - Knowledge Project podcast. Morgan Housel - Differences of opinion exist because of different stories arising from origins and experiences. We are not debating facts. We are debating life lessons! - Solution: hear their anecdotes. The stories that taught them their lessons. - AI reporting templates are a trend. Domain expertise comes in via structuring the report template and associated prompts. - Some audio embedding models: [unoti/voice-embeddings](https://github.com/unoti/voice-embeddings), [retkowsky/audio_embeddings](https://github.com/retkowsky/audio_embeddings), [pyannote/embedding](https://huggingface.co/pyannote/embedding) (for speaker similarity), [and more](https://huggingface.co/models?sort=likes&search=wav2vec2). - Hidden Brain podcast: Innovation 2.0: The power of less - Subtraction is hard because we are biologically and economically wired against it. It's also hard because there are fewer markers of subtraction. Additions are natural markers / triggers. - Marie Kondo suggests keeping only what sparks joy - #POST I tried [Undermind.ai](https://undermind.ai/) - an agent that researches for you. It guides you to ask a detailed question, spends 2-3 minutes finding the answer, and provides detailed results. But it's worth the wait. It's a good alternative to quick validations on SciSpace. - For popular results, search actually makes results worse! [When not to trust language models](https://arxiv.org/abs/2212.10511) - Perception of fluency and usefulness are NEGATIVELY correlated in LLM! [Evaluating Verifiability in Generative Search Engines](https://arxiv.org/abs/2304.09848) - GPTs are now available to non paying users. Apparently for a few weeks! Everyone also has limited access to GPT-4o. - Discussion with Anand - Explore BBC Microbit - Everyone should get a Raspberry Pi! - Watch 2 minutes paper on YouTube - More LLM routers: - [LiteLLM](https://www.litellm.ai/): Open source, OpenAI compatible, 100+ LLMs - [RouteLLM](https://github.com/lm-sys/RouteLLM): Open source, OpenAI compatible, automatically routes based on cost - [OpenRouter](https://openrouter.ai/): OpenAI compatible API, several models - [Unify](https://unify.ai/): Supports many models - [Portkey](https://portkey.ai/): Supports popular providers - [Martian](https://withmartian.com/): Limited set of models - [d-id](https://www.d-id.com/) and [Heygen](https://www.heygen.com/) can modify videos of a person.