Data Science at Home
Episodes
Tuesday Jan 30, 2024
Is SQream the fastest big data platform? (Ep. 250)
Tuesday Jan 30, 2024
Tuesday Jan 30, 2024
Join us in a dynamic conversation with Yori Lavi, Field CTO at SQream, as we unravel the data analytics landscape. From debunking the data lakehouse hype to SQream's GPU-based magic, discover how extreme data challenges are met with agility. Yori shares success stories, insights into SQream's petabyte-scale capabilities, and a roadmap to breaking down organizational bottlenecks in data science. Dive into the future of data analytics with SQream's commitment to innovation, leaving legacy formats behind and leading the charge in large-scale, cost-effective data projects. Tune in for a dose of GPU-powered revolution!
References
SQream - GPU-based Big Data Platform
Patents Assigned to SQREAM TECHNOLOGIES LTD
Wednesday Aug 30, 2023
Erosion of Software Architecture Quality in the Age of AI Code Generation (Ep. 237)
Wednesday Aug 30, 2023
Wednesday Aug 30, 2023
In this era of AI-powered code generation, software architects are facing a concerning decline in the quality of their creations. The once meticulously crafted software architectures are now being compromised. Should LLMs be responsible?
References
Program Design in the UNIX Environmenthttps://harmful.cat-v.org/cat-v/unix_prog_design.pdf
Thursday May 25, 2023
AI’s Impact on Software Engineering: Killing Old Principles? [RB] (Ep. 229)
Thursday May 25, 2023
Thursday May 25, 2023
In this episode, we dive into the ways in which AI and machine learning are disrupting traditional software engineering principles. With the advent of automation and intelligent systems, developers are increasingly relying on algorithms to create efficient and effective code. However, this reliance on AI can come at a cost to the tried-and-true methods of software engineering. Join us as we explore the pros and cons of this paradigm shift and discuss what it means for the future of software development.
Sponsors
Bloomberg
At Bloomberg, they solve complex, real-world problems for customers across the global capital markets. From real-time market data to sophisticated analytics, powerful trading tools, and more, Bloomberg engineers work with systems that operate at scale. If you're a software engineer looking for an exciting and fulfilling career, head over to bloomberg.com/careers to learn more.
Arctic Wolf
Cybercriminals are evolving. Their techniques and tactics are more advanced, intricate, and dangerous than ever before. Industries and governments around the world are fighting back, unveiling new regulations meant to better protect data against this rising threat. Arctic Wolf — the leader in security operations — is on a mission to end cyber risk by giving organizations the protection, information, and confidence they need to protect their people, technology, and data. Visit arcticwolf.com/datascience to take your first step.
Wednesday Apr 26, 2023
Wednesday Apr 26, 2023
The journey of porting our projects to Rust was intense, but it was a decision we made to improve the quality of our software. The migration was not an easy task, as it required a considerable amount of time and resources. However, it was worth the effort as we have seen significant improvements in code reusability, code cleanliness, and performance.In this episode I will tell you why you should consider taking that journey too.
Tuesday Nov 08, 2022
Evolution of data platforms (Ep. 209)
Tuesday Nov 08, 2022
Tuesday Nov 08, 2022
Let's look at the history of data platforms. How did they evolve? Why? Shall I switch to the latest architecture? Enjoy the show!
Our Sponsors
Explore the Complex World of Regulations. Compliance can be overwhelming. Multiple frameworks. Overlapping requirements. Let Arctic Wolf be your guide.Check it out at https://arcticwolf.com/datascience
Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they serve. We provide solutions in AI/ML, Fintech, Healthcare/RWE, and Predictive maintenance.
Tuesday Oct 25, 2022
Private machine learning done right (Ep. 207)
Tuesday Oct 25, 2022
Tuesday Oct 25, 2022
There are many solutions to private machine learning. I am pretty confident when I say that the one we are speaking in this episode is probably one of the most feasible and reliable.I am with Daniel Huynh, CEO of Mithril Security, a graduate from Ecole Polytechnique with a specialisation in AI and data science. He worked at Microsoft on Privacy Enhancing Technologies under the office of the CTO of Microsoft France. He has written articles on Homomorphic Encryptions with the CKKS explained series (https://blog.openmined.org/ckks-explained-part-1-simple-encoding-and-decoding/). He is now focusing on Confidential Computing at Mithril Security and has written extensive articles on the topic: https://blog.mithrilsecurity.io/.
In this show we speak about confidential computing, SGX and private machine learning
References
Mithril Security: https://www.mithrilsecurity.io/
BindAI GitHub: https://github.com/mithril-security/blindai
Use cases for BlindAI:Deploy Transformers models with confidentiality: https://blog.mithrilsecurity.io/transformers-with-confidentiality/
Confidential medical image analysis with COVID-Net and BlindAI: https://blog.mithrilsecurity.io/confidential-covidnet-with-blindai/
Build a privacy-by-design voice assistant with BlindAI: https://blog.mithrilsecurity.io/privacy-voice-ai-with-blindai/
Confidential Computing Explained: https://blog.mithrilsecurity.io/confidential-computing-explained-part-1-introduction/
Confidential Computing Consortium: https://confidentialcomputing.io/
Confidential Computing White Papers: https://confidentialcomputing.io/white-papers-reports/
List of Intel processors with Intel SGX:https://www.intel.com/content/www/us/en/support/articles/000028173/processors.html
https://github.com/ayeks/SGX-hardware
Azure Confidential Computing VMs with SGX:Azure Docs: https://docs.microsoft.com/en-us/azure/confidential-computing/confidential-computing-enclaves
How to deploy BlindAI on Azure: https://docs.mithrilsecurity.io/getting-started/cloud-deployment/azure-dcsv3
Confidential Computing 101: https://www.youtube.com/watch?v=77U12Ss38Zc
Rust: https://www.rust-lang.org/
ONNX: https://github.com/onnx/onnx
Tract, a Rust inference engine for ONNX models: https://github.com/sonos/tract
Wednesday Sep 28, 2022
LIDAR, cameras and autonomous vehicles (Ep. 204)
Wednesday Sep 28, 2022
Wednesday Sep 28, 2022
How does an autonomous vehicle see? How does it sense the road? They are equipped of many sensors, of course. Are they all powerful enough? Small enough to hide them and make your car look beautiful? In this episode I speak about LIDAR, high resolution cameras and some machine learning methods adapted to a minimal number of sensors.
Our Sponsors
Ready to advance your career in data science? University of Cincinnati Online offers nationally recognized educational programs in business analytics and information systems. Predictive Analytics Today named UC as the No.1 MS Data Science school in the country and is nationally recognized with a proven track record of placing students at high-profile companies such as Google, Amazon and P&G. Discover more about the University of Cincinnati’s 100% online master’s degree programs at online.uc.edu/obais
Amethix works to create and maximize the impact of the world’s leading corporations and startups, so they can create a better future for everyone they serve. We provide solutions in AI/ML, Fintech, Healthcare/RWE, and Predictive maintenance.
References
https://patents.google.com/patent/US20220043449A1/en?oq=20220043449
Friday May 27, 2022
Streaming data with ease. With Chip Kent from Deephaven Data Labs (Ep. 198)
Friday May 27, 2022
Friday May 27, 2022
In this episode, I am with Chip Kent, chief data scientist at Deephaven Data Labs.
We speak about streaming data, real-time, and other powerful tools part of the Deephaven platform.
Links
Deephaven - https://deephaven.io
Deephaven Community Core Documentation - https://deephaven.io/core/docs/
Deephaven Community Slack - https://join.slack.com/t/deephavencommunity/shared_invite/zt-11x3hiufp-DmOMWDAvXv_pNDUlVkagLQ
GitHub:
Deephaven Community Core - https://github.com/deephaven/deephaven-core
Barrage - https://github.com/deephaven/barrage
Deephaven web components - https://github.com/deephaven/web-client-ui
YouTube Channel - https://www.youtube.com/channel/UCoaYOlkX555PSTTJz8ZaI_w
Blog posts
Real-time classification with Deephaven and SciKit-Learn - https://deephaven.io/blog/2022/02/02/learn-scikit/
Display a quadrillion rows of data in the browser - https://deephaven.io/blog/2022/01/24/displaying-a-quadrillion-rows/
A performance comparison between Materialize and Deephaven - https://deephaven.io/blog/2022/03/05/deephaven-materialize-study/
Careers https://deephaven.io/company/careers/
Community Slack http://deephaven.io/slack.
Thursday Apr 21, 2022
Improving your AI by finding issues within data pockets (Ep. 195)
Thursday Apr 21, 2022
Thursday Apr 21, 2022
In this episode I have a conversation with, Itai Bar-Sinai, CPO & Cofounder of Mona.
We speak about several interesting points about data and monitoring.Why is AI monitoring so different from monitoring classic software?How to reduce the gap between data science and business?What is the role of MLOps in the data monitoring field?
With over 10 years of experience with AI and as the CPO and head of customer success at Mona, the leading AI monitoring intelligence company, Itai has a unique view of the AI industry. Working closely with data science and ML teams applying dozens of AI solutions in over 10 industries, Itai encounters the wide variety of business use-cases, organizational structures and cultures, and technologies and tools used in today’s AI world.
References
https://www.monalabs.io
Tuesday Feb 08, 2022
Tuesday Feb 08, 2022
In this episode I speak about AI and cloud automation with Leon Kuperman, co-founder and CTO at CAST AI. Formerly Vice President of Security Products OCI at Oracle, Leon’s professional experience spans across tech companies such as IBM, Truition, and HostedPCI.
Enjoy the episode!
Chat with me
Join us on Discord community chat to discuss the show, suggest new episodes and chat with other listeners!
Sponsored by Amethix Technologies
Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.
Sponsored by NordVPN
NordVPN protects your privacy while you are online. Get secure and private access to the internet by surfing nordvpn.com/DATASCIENCE or use coupon code DATASCIENCE and get a massive discount.
References
https://cast.ai/
Cloud automation https://cast.ai/blog/cloud-automation-in-2021-the-new-normal-in-the-tech-industry/
Cloud cost management https://cast.ai/blog/cloud-cost-management-alone-wont-fix-your-cloud-spend-problem/
Case study on how gross margin could be increased by cloud automation https://cast.ai/blog/the-hidden-shortcut-to-increasing-fintech-gross-margins-cloud-automation/