Data Science at Home
Episodes

Monday Apr 19, 2021
Building high-growth data businesses with Lillian Pierson (Ep. 149)
Monday Apr 19, 2021
Monday Apr 19, 2021
In this episode I have an amazing conversation with Lillian Pierson from data-mania.com
This is an action-packed episode on how data professionals can quickly convert their data expertise into high-growth data businesses, all by selecting optimal business models, revenue models, and pricing structures.
If you want to know more or get in touch with Lillian, follow the links below:Weekly Free Trainings: We currently publish 1 free training per week on YouTube! https://www.youtube.com/channel/UCK4MGP0A6lBjnQWAmcWBcKQ
Becoming World-Class Data Leaders and Data Entrepreneurs Facebook Group: https://www.facebook.com/groups/data.leaders.and.entrepreneurs
LinkedIn: https://www.linkedin.com/in/lillianpierson/
The Data Entrepreneur’s Toolkit: A recommendation set for 32 free (or low-cost) tools & processes that'll actually grow your data business (even if you still haven’t put up that website yet!). https://www.data-mania.com/data-entrepreneur-toolkit/

Tuesday Apr 13, 2021
Learning and training in AI times (Ep. 148)
Tuesday Apr 13, 2021
Tuesday Apr 13, 2021
Is there a gap between life sciences and data science?What's the situation when it comes to interdisciplinary research?In this episode I am with Laura Harris, Director of Training for the Institute of Cyber-Enabled Research (ICER) at Michigan State University (MSU), and we try to answer some of those questions.
You can contact Laura at training@msu.edu or on LinkedIn

Friday Mar 26, 2021
Apache Arrow, Ballista and Big Data in Rust with Andy Grove (Ep. 145)
Friday Mar 26, 2021
Friday Mar 26, 2021
Do you want to know the latest in big data analytics frameworks? Have you ever heard of Apache Arrow? Rust? Ballista? In this episode I speak with Andy Grove one of the main authors of Apache Arrow and Ballista compute engine.Andy explains some challenges while he was designing the Arrow and Ballista memory models and he describes some amazing solutions.
Our Sponsors
This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey.To learn more about the innovative tools and collaborative approach that distinguish the Chapman program in Computational and Data Sciences, visit chapman.edu/datascience
If building software is your passion, you’ll love ThoughtWorks Technology Podcast. It’s a podcast for techies by techies. Their team of experienced technologists take a deep dive into a tech topic that’s piqued their interest — it could be how machine learning is being used in astrophysics or maybe how to succeed at continuous delivery.
References
https://arrow.apache.org/
https://ballistacompute.org/
https://github.com/ballista-compute/ballista

Wednesday Mar 10, 2021
Concurrent is not parallel - Part 1 (Ep. 142)
Wednesday Mar 10, 2021
Wednesday Mar 10, 2021
In plain English, concurrent and parallel are synonyms. Not for a CPU. And definitely not for programmers. In this episode I summarize the ways to parallelize on different architectures and operating systems. Rock-star data scientists must know how concurrency works and when to use it IMHO.
Our Sponsors
This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey.To learn more about the innovative tools and collaborative approach that distinguish the Chapman program in Computational and Data Sciences, visit chapman.edu/datascience
Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

Monday Feb 22, 2021
You are the product (Ep. 140)
Monday Feb 22, 2021
Monday Feb 22, 2021
In this episode I am with George Hosu from Cerebralab
and we speak about how dangerous it is not to pay for the services you use, and as a consequence how dangerous it is letting an algorithm decide what you like or not.
Our Sponsors
This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey.To learn more about the innovative tools and collaborative approach that distinguish the Chapman program in Computational and Data Sciences, visit chapman.edu/datascience
If building software is your passion, you’ll love ThoughtWorks Technology Podcast. It’s a podcast for techies by techies. Their team of experienced technologists take a deep dive into a tech topic that’s piqued their interest — it could be how machine learning is being used in astrophysics or maybe how to succeed at continuous delivery.
Links
https://cerebralab.com
https://www.eugenewei.com/blog/2019/2/19/status-as-a-service

Monday Feb 15, 2021
How to reinvent banking and finance with data and technology (Ep. 139)
Monday Feb 15, 2021
Monday Feb 15, 2021
The financial system is changing. It is becoming more efficient and integrated with many more services making our life more... digital. Is the old banking system doomed to fail? Or will it just be disrupted by the smaller players of the fintech industry?In this episode we answer some of these fundamental questions with Alessandro E. Hatami from Pacemakers
Subscribe to the Newsletter and come chat with us on the official Discord channel
Our Sponsors
This episode is supported by Chapman’s Schmid College of Science and Technology, where master’s and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey.To learn more about the innovative tools and collaborative approach that distinguish the Chapman program in Computational and Data Sciences, visit chapman.edu/datascience
Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

Monday Feb 01, 2021
Is Rust flexible enough for a flexible data model? (Ep. 137)
Monday Feb 01, 2021
Monday Feb 01, 2021
In this podcast I get inspired by Paul Done's presentation about The Six Principles for Building Robust Yet Flexible Shared Data Applications, and show how powerful of a language Rust is while still maintaining the flexibility of less strict languages.
Our Sponsor
This episode is supported by Chapman’s Schmid College of Science and Technology, where master's and PhD students join in cutting-edge research as they prepare to take the next big leap in their professional journey. To learn more about the innovative tools and collaborative approach that distinguish the Chapman program in Computational and Data Sciences, visit chapman.edu/datascience

Monday Jan 25, 2021
Is Apple M1 good for machine learning? (Ep.136)
Monday Jan 25, 2021
Monday Jan 25, 2021
In this episode I explain the basics of computer architecture and introduce some features of the Apple M1
Is it good for Machine Learning tasks?
References
Computer architectures book https://www.amazon.com/Computer-Architecture-Quantitative-John-Hennessy/dp/012383872X
Performance https://nod.ai/comparing-apple-m1-with-amx2-m1-with-neon/

Thursday Dec 31, 2020
Scaling machine learning with clusters and GPUs (Ep. 134)
Thursday Dec 31, 2020
Thursday Dec 31, 2020
Let's finish this year with an amazing episode about scaling ML with clusters and GPUs. Kind of as a continuation of Episode 112 I have a terrific conversation with Aaron Richter from Saturn Cloud about, well, making ML faster and scaling it to massive infrastructure.
Aaron can be reached on his website https://rikturr.com and Twitter @rikturr
Our Sponsor
Saturn Cloud is a data science and machine learning platform for scalable Python analytics. Users can jump into cloud-based Jupyter and Dask to scale Python for big data using the libraries they know and love, while leveraging Docker and Kubernetes so that work is reproducible, shareable, and ready for production.
Try Saturn Cloud for free at https://saturncloud.io
Twitter: @saturn_cloud

Saturday Dec 19, 2020
What is data ethics? (Ep. 133)
Saturday Dec 19, 2020
Saturday Dec 19, 2020
What is data ethics? In this episode I have an interesting chat with Denny Wong from FaqBot and Muna.
Our Sponsor
Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.
References
Denny's Twitter profile
The data ethics awareness workshop for AI practitioners