Data Science at Home
Episodes

Tuesday Nov 17, 2020
Distill data and train faster, better, cheaper (Ep. 128)
Come join me in our Discord channel speaking about all things data science.
Follow me on Twitch during my live coding sessions usually in Rust and Python
Our Sponsors
Amethix uses advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domains like finance, healthcare, pharmaceuticals, logistics, and energy. Amethix provides solutions to collect and secure data with higher transparency and disintermediation, and builds the statistical models that will support your business.
References
Dataset distillation (official paper)
GitHub repo

Saturday Nov 07, 2020
Top-3 ways to put machine learning models into production (Ep. 126)
Come join me in our Discord channel speaking about all things data science.
Follow me on Twitch during my live coding sessions usually in Rust and Python
Our Sponsors
physicspodcast.com is not just a physics podcast: interviews with scientists, scholars, and authors, and reflections on the history and future of science and technology, are all in the wheelhouse.
Amethix uses advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domains like finance, healthcare, pharmaceuticals, logistics, and energy. Amethix provides solutions to collect and secure data with higher transparency and disintermediation, and builds the statistical models that will support your business.

Friday Oct 30, 2020
What is contrastive learning and why it is so powerful? (Ep. 124)
Come join me in our Discord channel speaking about all things data science.
Follow me on Twitch during my live coding sessions usually in Rust and Python
Our Sponsors
The Monday Apps Challenge is bringing developers around the world together to compete in order to build apps that can improve the way teams work together on monday.com
Amethix uses advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domains like finance, healthcare, pharmaceuticals, logistics, and energy. Amethix provides solutions to collect and secure data with higher transparency and disintermediation, and builds the statistical models that will support your business.
References
A Simple Framework for Contrastive Learning of Visual Representations

Sunday Oct 18, 2020
Let's talk about federated learning (Ep. 122)
Let's talk about federated learning. Why is it important? Why are large organizations not ready yet?
Come join me in our Discord channel speaking about all things data science.
Follow me on Twitch during my live coding sessions usually in Rust and Python
This episode is supported by Monday.com
The Monday Apps Challenge is bringing developers around the world together to compete in order to build apps that can improve the way teams work together on monday.com.

Sunday Oct 11, 2020
How to test machine learning in production (Ep. 121)
Come join me in our Discord channel speaking about all things data science.
Follow me on Twitch during my live coding sessions usually in Rust and Python
This episode is supported by Monday.com
Monday.com brings teams together so you can plan, manage and track everything your team is working on in one centralized place.
The Monday Apps Challenge is bringing developers around the world together to compete in order to build apps that can improve the way teams work together on monday.com.

Saturday Sep 26, 2020
Why synthetic data cannot boost machine learning (Ep. 120)
Come join me in our Discord channel speaking about all things data science.
Follow me on Twitch during my live coding sessions usually in Rust and Python
This episode is supported by Women in Tech by Manning Conferences
Wednesday Sep 16, 2020
Machine learning in production: best practices [LIVE from twitch.tv] (Ep. 119)
Hey there! Having the best time of my life ;)
This is the first episode I record while I am live on my new Twitch channel :) So much fun!
Feel free to follow me for the next live stream. You can also see me coding machine learning stuff in Rust :))
Don't forget to jump on the usual Discord and have a chat
I'll see you there!

Monday Jun 29, 2020
Rust and machine learning #4: practical tools (Ep. 110)
In this episode I make a non-exhaustive list of machine learning tools and frameworks written in Rust. Not all of them are mature enough for production environments. I believe that community effort can change this very quickly.
To make a comparison with the Python ecosystem, I will cover frameworks for linear algebra (NumPy), dataframes (pandas), off-the-shelf machine learning (scikit-learn), deep learning (TensorFlow) and reinforcement learning (OpenAI).
Rust is the language of the future. Happy coding!
References
BLAS linear algebra https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms
Rust dataframe https://github.com/nevi-me/rust-dataframe
Rustlearn https://github.com/maciejkula/rustlearn
Rusty machine https://github.com/AtheMathmo/rusty-machine
Tensorflow bindings https://lib.rs/crates/tensorflow
Juice (machine learning for hackers) https://lib.rs/crates/juice
Rust reinforcement learning https://lib.rs/crates/rsrl

Friday Jun 19, 2020
Rust and machine learning #2 with Luca Palmieri (Ep. 108)
In the second episode of Rust and Machine Learning I am speaking with Luca Palmieri, who has spent a large part of his career at the intersection of machine learning and data engineering. In addition, Luca has contributed to several projects closer to the machine learning community using the Rust programming language. Linfa is an ambitious project that definitely deserves the attention of the data science community (and it's written in Rust, with Python bindings! How cool??!).
References
Series Announcement - Zero to Production in Rust https://www.lpalmieri.com/posts/2020-05-10-announcement-zero-to-production-in-rust/
Zero To Production #0: Foreword https://www.lpalmieri.com/posts/2020-05-24-zero-to-production-0-foreword/
Taking ML to production with Rust: a 25x speedup https://www.lpalmieri.com/posts/2019-12-01-taking-ml-to-production-with-rust-a-25x-speedup/

Friday Feb 14, 2020
Bridging the gap between data science and data engineering: metrics (Ep. 95)
Data science and data engineering are usually two different departments in organisations. Bridging the gap between the two is essential to success. Many times the brilliant applications created by data scientists don't find a match in production, just because they are not production-ready.
In this episode I talk with Daan Gerits, co-founder and CTO at Pryml.io.