Data Science at Home
Episodes
Monday Jan 08, 2024
Careers, Skills, and the Evolution of AI (Ep. 248)
Monday Jan 08, 2024
Monday Jan 08, 2024
!!WARNING!!
Due to some technical issues the volume is not always constant during the show. I sincerely apologise for any inconvenienceFrancesco
In this episode, I speak with Richie Cotton, Data Evangelist at DataCamp, as he delves into the dynamic intersection of AI and education. Richie, a seasoned expert in data science and the host of the podcast, brings together a wealth of knowledge and experience to explore the evolving landscape of AI careers, the skills essential for generative AI technologies, and the symbiosis of domain expertise and technical skills in the industry.
References
Become a generative AI developer in this FREE code-along series. Learn to build a chatbot using the OpenAI API, the Pinecone API, and LangChain, and learn to build NLP and image applications with Hugging Face. https://www.datacamp.com/ai-code-alongs
Learn to use ChatGPT and the OpenAI API in the OpenAI Fundamentals skill track. https://www.datacamp.com/tracks/openai-fundamentals
Get started with deep learning using PyTorch in the Introduction to Deep Learning with PyTorch course. https://www.datacamp.com/courses/introduction-to-deep-learning-with-pytorch
Monday Dec 04, 2023
Debunking AGI Hype and Embracing Reality [RB] (Ep. 245)
Monday Dec 04, 2023
Monday Dec 04, 2023
In this thought-provoking episode, we sit down with the renowned AI expert, Filip Piekniewski, Phd, who fearlessly challenges the prevailing narratives surrounding artificial general intelligence (AGI) and the singularity. With a no-nonsense approach and a deep understanding of the field, Filip dismantles the hype and exposes some of the misconceptions about AI, LLMs and AGI. Join us as we delve into the real-world implications of AI, separating fact from fiction, and gaining a firm grasp on the tangible possibilities of AI advancement. If you're seeking a refreshingly pragmatic perspective on the future of AI, this episode is an absolute must-listen.
Filip Piekniewski Bio
Filip Piekniewski is a distinguished computer vision researcher and engineer, specializing in visual object tracking and perception. He approaches machine learning with a pragmatic mindset, recognizing its current limitations. Filip earned his Ph.D. from Warsaw University, where he explored neuroscience and later joined Brain Corporation in San Diego. His extensive study of neuroscience inspired him to develop innovative, bio-inspired machine learning architectures. Filip's unique blend of scientific curiosity and software engineering expertise allows him to quickly prototype and implement new ideas. He is known for his realistic perspective on AI, debunking AGI hype and focusing on tangible advancements.
Sponsors
Finally, a better way to do B2B research. NewtonX The World’s Leading B2B Market Research Company
Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide.Check it out at https://arcticwolf.com/datascience
Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they serve. We provide solutions in AI/ML, Fintech, Defense, Robotics and Predictive maintenance.
References
https://twitter.com/filippie509
http://blog.piekniewski.info/ (On limits of deep learning and where to go next with AI.)
Monday Nov 20, 2023
The AI Chip Chat 🤖💻 (Ep. 243)
Monday Nov 20, 2023
Monday Nov 20, 2023
Dive into the cool world of AI chips with us! 🚀 We're breaking down how these special computer chips for AI have evolved and what makes them different. Think of them like the superheroes of the tech world!
Don't miss out! 🎙️🔍 #AIChips #TechTalk #SimpleScience
Monday Sep 11, 2023
Unlocking Language Models: The Power of Prompt Engineering (Ep. 238)
Monday Sep 11, 2023
Monday Sep 11, 2023
Join me on an enlightening journey through the world of prompt engineering. Explore the multifaceted skills and strategies involved in harnessing the potential of large language models for various applications. From enhancing safety measures to augmenting models with domain knowledge, learn how prompt engineering is shaping the future of AI.
References
https://arxiv.org/pdf/2109.01652.pdf
https://arxiv.org/abs/2201.11903
https://arxiv.org/abs/2302.00923
https://www.mihaileric.com/posts/a-complete-introduction-to-prompt-engineering
https://www.axios.com/2023/02/22/chatgpt-prompt-engineers-ai-job
Thursday Aug 17, 2023
The new dimension of AI: Vector Databases (Ep. 236)
Thursday Aug 17, 2023
Thursday Aug 17, 2023
Let's delve into the emerging trend in database design – or is it really a new trend? The realm of vector databases and their revolutionary influence on AI and ML is making headlines. Come along as we investigate how these groundbreaking databases are revolutionizing the landscape of data storage, retrieval, and processing, ultimately unlocking the complete potential of artificial intelligence and machine learning. But are they genuinely as innovative as they seem?
References
https://partee.io/2022/08/11/vector-embeddings/
https://blog.det.life/why-you-shouldnt-invest-in-vector-databases-c0cd3f59d23c
https://medium.com/@ryanntk/choosing-the-right-embedding-model-a-guide-for-llm-applications-7a60180d28e3
Monday Jun 26, 2023
Debunking AGI Hype and Embracing Reality (Ep. 233)
Monday Jun 26, 2023
Monday Jun 26, 2023
In this thought-provoking episode, we sit down with the renowned AI expert, Filip Piekniewski, Phd, who fearlessly challenges the prevailing narratives surrounding artificial general intelligence (AGI) and the singularity. With a no-nonsense approach and a deep understanding of the field, Filip dismantles the hype and exposes some of the misconceptions about AI, LLMs and AGI. Join us as we delve into the real-world implications of AI, separating fact from fiction, and gaining a firm grasp on the tangible possibilities of AI advancement. If you're seeking a refreshingly pragmatic perspective on the future of AI, this episode is an absolute must-listen.
Filip Piekniewski Bio
Filip Piekniewski is a distinguished computer vision researcher and engineer, specializing in visual object tracking and perception. He approaches machine learning with a pragmatic mindset, recognizing its current limitations. Filip earned his Ph.D. from Warsaw University, where he explored neuroscience and later joined Brain Corporation in San Diego. His extensive study of neuroscience inspired him to develop innovative, bio-inspired machine learning architectures. Filip's unique blend of scientific curiosity and software engineering expertise allows him to quickly prototype and implement new ideas. He is known for his realistic perspective on AI, debunking AGI hype and focusing on tangible advancements.
Sponsors
Finally, a better way to do B2B research. NewtonX The World’s Leading B2B Market Research Company
Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide.Check it out at https://arcticwolf.com/datascience
Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they serve. We provide solutions in AI/ML, Fintech, Defense, Robotics and Predictive maintenance.
References
https://twitter.com/filippie509
http://blog.piekniewski.info/ (On limits of deep learning and where to go next with AI.)
Tuesday May 16, 2023
Tuesday May 16, 2023
Hold on to your calculators and buckle up for a wild mathematical ride in this episode! Brace yourself as we dive into the fascinating realm of Liquid Time-Constant Networks (LTCs), where mathematical content reaches new heights of excitement.
In this mind-bending adventure, we demystify the intricacies of LTCs, from complex equations to mind-boggling mathematical concepts, we break them down into digestible explanations.
References
https://www.science.org/doi/10.1126/scirobotics.adc8892
https://spectrum.ieee.org/liquid-neural-networks#toggle-gdpr
Thursday May 11, 2023
Thursday May 11, 2023
Get ready for an eye-opening episode! 🎙️
In our latest podcast episode, we dive deep into the world of LoRa (Low-Rank Adaptation) for large language models (LLMs). This groundbreaking technique is revolutionizing the way we approach language model training by leveraging low-rank approximations.
Join us as we unravel the mysteries of LoRa and discover how it enables us to retrain LLMs with minimal expenditure of money and resources. We'll explore the ingenious strategies and practical methods that empower you to fine-tune your language models without breaking the bank.
Whether you're a researcher, developer, or language model enthusiast, this episode is packed with invaluable insights. Learn how to unlock the potential of LLMs without draining your resources.
Tune in and join the conversation as we unravel the secrets of LoRa low-rank adaptation and show you how to retrain LLMs on a budget.
Listen to the full episode now on your favorite podcast platform! 🎧✨
References
LoRA: Low-Rank Adaptation of Large Language Models https://arxiv.org/abs/2106.09685
Low-rank approximation https://en.wikipedia.org/wiki/Low-rank_approximation
Attention is all you need https://arxiv.org/pdf/1706.03762.pdf
Tuesday Feb 21, 2023
Deep learning vs tabular models (Ep. 217)
Tuesday Feb 21, 2023
Tuesday Feb 21, 2023
Deep learning methods are not as effective with tabular data. Here is why, and what to do about it.
Sponsors
If you're ready to take your WiFi game to the next level, head over to asus.click/ZenWiFi_XD5 or check out the show notes for this episode. Trust me, with ASUS ZenWiFi XD5, you'll get the best WiFi experience ever!
References
https://paperswithcode.com/methods/category/deep-tabular-learning
https://m-clark.github.io/posts/2022-04-01-more-dl-for-tabular/
Saturday Jan 14, 2023
Accelerating Perception Development with Synthetic Data (Ep. 214)
Saturday Jan 14, 2023
Saturday Jan 14, 2023
In this episode I am with Kevin McNamara, founder and CEO of Parallel Domain. We speak about a very effective method to generate synthetic data that is currently in production at Parallel Domain.
Enjoy the show!
References
Parallel Domain Synthetic Data Improves Cyclist Detection (blog post):
https://paralleldomain.com/parallel-domain-synthetic-data-improves-cyclist-detection/
Beating the State of the Art in Object Tracking with Synthetic Data:
https://paralleldomain.com/beating-the-state-of-the-art-in-object-tracking-with-synthetic-data/
Parallel Domain Open Synthetic Dataset:
https://paralleldomain.com/open-datasets/bicycle-detection
How Toyota Research Institute Trains Better Computer Vision Models with PD Synthetic Data (interview):
https://www.youtube.com/watch?v=QIYttoVxf2w
Career Opportunities:
https://paralleldomain.com/careers