Speech models + LLMs = Magic

Speech-to-text models are the magic behind converting spoken language into written text. From dictating documents and emails to transcribing meetings, their applications are vast. Personally, I rely on this feature to send hands-free messages and control my smart home devices. While those everyday uses are great, I recently discovered an even more powerful application for speech-to-text. When you combine speech recognition with the power of LLMs, things get really interesting. We’re talking Q&A documents, meeting summaries, and even enhancing RAG models....

September 9, 2024 · 6 min

Site development

Last updated - 2024-09-05 ToDo ToDo Date ✅ Global images 2024-SEP-03 ✅ Tags 2024-SEP-04 ✅ Categories 2024-SEP-03 ✅ Google analytics 2024-SEP-05 ✅ Comment system - Integrating Giscus 2024-SEP-05 Adding global images Create images folder under content directory. Upload images under contents/images/ foldder Add images using the figure shortcode. e.g. {\{< figure src="/images/jerry.gif" alt="Jerry" width="200" align="center" >}} Adding tags and categories Edit your config.yaml or hugo.yaml and add a taxonomies section as given below,...

September 5, 2024 · 2 min

Solving Sudoku Puzzles with OpenCV

For my next project I wanted to build something that I have been planning to do for a long time. I love solving sudoku and I was really good at it. Yup, I was good. That was three years ago. By the time I got good at solving sudoku, it started to get boring for me and I decided to write my first sudoku solver – goSudoku. CLI Solution goSudoku is written in Go....

September 3, 2023 · 8 min

Learning ML

I’m no expert in AI/ML. I’m trying to self-learn and haven’t built anything significant, until now! To learn something, I need a problem to apply and practice what we learned. I was missing that. I didn’t have a use case for anything related to AI/ML and I wanted a pet project for AI/ML. The beginnings Yeah! Let’s talk about that. I started with this – Basic classification: Classify images of clothing....

July 14, 2023 · 10 min