Archive for the 'Deep Learning' Category

Come join me in our Discord channel speaking about all things data science.

Follow me on Twitch during my live coding sessions usually in Rust and Python

Subscribe to the official Newsletter and never miss an episode

Our Sponsors

  • ProtonMail offers a simple and trusted solution to protect your internet connection and access blocked or restricted websites. All of ProtonMail and ProtonVPN's apps are open source and have been inspected by cybersecurity experts, and Proton is based in Switzerland, home to some of the world’s strongest privacy laws
  • Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

Read Full Post »

Come join me in our Discord channel speaking about all things data science.

Follow me on Twitch during my live coding sessions usually in Rust and Python

Our Sponsors

  • Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

 

References

Dataset distillation (official paper)

GitHub repo

 

Read Full Post »

Come join me in our Discord channel speaking about all things data science.

Follow me on Twitch during my live coding sessions usually in Rust and Python

Our Sponsors

  • ProtonMail is a secure and private email provider that protects yourmessages with end-to-end encryption and zero-access encryption so that besides you, noone can access them.
  • Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

References

Read Full Post »

Come join me in our Discord channel speaking about all things data science.

Follow me on Twitch during my live coding sessions usually in Rust and Python

Our Sponsors

  • The Monday Apps Challenge is bringing developers around the world together to compete in order to build apps that can improve the way teams work together on monday.com
  • Amethix use advanced Artificial Intelligence and Machine Learning to build data platforms and predictive engines in domain like finance, healthcare, pharmaceuticals, logistics, energy. Amethix provide solutions to collect and secure data with higher transparency and disintermediation, and build the statistical models that will support your business.

 

References

A Simple Framework for Contrastive Learning of Visual Representations

 

Read Full Post »

Come join me in our Discord channel speaking about all things data science.

Follow me on Twitch during my live coding sessions usually in Rust and Python

This episode is supported by Monday.com

The Monday Apps Challenge is bringing developers around the world together to compete in order to build apps that can improve the way teams work together on monday.com.

Read Full Post »

Come join me in our Discord channel speaking about all things data science.

Follow me on Twitch during my live coding sessions usually in Rust and Python

 

This episode is supported by Monday.com

Monday.com bring teams together so you can plan, manage and track everything your team is working on in one centralized place

The monday Apps Challenge is bringing developers around the world together to compete in order to build apps that can improve the way teams work together on monday.com.

Read Full Post »

In this episode I speak with Adam Leon Smith, CTO at DragonFly and expert in testing strategies for software and machine learning.
We cover testing with deep learning (neuron coverage, threshold coverage, sign change coverage, layer coverage, etc.), combinatorial testing and their practical aspects.

On September 15th there will be a live@Manning Rust conference. In one Rust-full day you will attend many talks about what's special about rust, building high performance web services or video game, about web assembly and much more.
If you want to meet the tribe, tune in september 15th to the live@manning rust conference.

 

 

Read Full Post »

In this episode I speak with Adam Leon Smith, CTO at DragonFly and expert in testing strategies for software and machine learning.

 

On September 15th there will be a live@Manning Rust conference. In one Rust-full day you will attend many talks about what's special about rust, building high performance web services or video game, about web assembly and much more.
If you want to meet the tribe, tune in september 15th to the live@manning rust conference.

 

 

Read Full Post »

After deep learning, a new entry is about ready to go on stage. The usual journalists are warming up their keyboards for blogs, news feeds, tweets, in one word, hype.
This time it's all about privacy and data confidentiality. The new words, homomorphic encryption.

 

Join and chat with us on the official Discord channel.

 

Sponsors

This episode is supported by Amethix Technologies.

Amethix works to create and maximize the impact of the world’s leading corporations, startups, and nonprofits, so they can create a better future for everyone they serve. They are a consulting firm focused on data science, machine learning, and artificial intelligence.

 

References

Towards a Homomorphic Machine Learning Big Data Pipeline for the Financial Services Sector

IBM Fully Homomorphic Encryption Toolkit for Linux

Read Full Post »

The hype around GPT-3 is alarming and gives and provides us with the awful picture of people misunderstanding artificial intelligence. In response to some comments that claim GPT-3 will take developers' jobs, in this episode I express some personal opinions about the state of AI in generating source code (and in particular GPT-3).

 

If you have comments about this episode or just want to chat, come join us on the official Discord channel.

 

 

This episode is supported by Amethix Technologies.

Amethix works to create and maximize the impact of the world’s leading corporations, startups, and nonprofits, so they can create a better future for everyone they serve. They are a consulting firm focused on data science, machine learning, and artificial intelligence.

Read Full Post »

There is definitely room for improvement in the family of algorithms of stochastic gradient descent. In this episode I explain a relatively simple method that has shown to improve on the Adam optimizer. But, watch out! This approach does not generalize well.

Join our Discord channel and chat with us.

 

References

 

Read Full Post »

In this episode I speak with Filip Piekniewski about some of the most worth noting findings in AI and machine learning in 2019. As a matter of fact, the entire field of AI has been inflated by hype and claims that are hard to believe. A lot of the promises made a few years ago have revealed quite hard to achieve, if not impossible. Let's stay grounded and realistic on the potential of this amazing field of research, not to bring disillusion in the near future.

Join us to our Discord channel to discuss your favorite episode and propose new ones.

 

This episode is brought to you by Protonmail

Click on the link in the description or go to protonmail.com/datascience and get 20% off their annual subscription.

Read Full Post »

In this episode I make a non exhaustive list of machine learning tools and frameworks, written in Rust. Not all of them are mature enough for production environments. I believe that community effort can change this very quickly.

To make a comparison with the Python ecosystem I will cover frameworks for linear algebra (numpy), dataframes (pandas), off-the-shelf machine learning (scikit-learn), deep learning (tensorflow) and reinforcement learning (openAI).

Rust is the language of the future.
Happy coding!
 

Reference

  1. BLAS linear algebra https://en.wikipedia.org/wiki/Basic_Linear_Algebra_Subprograms
  2. Rust dataframe https://github.com/nevi-me/rust-dataframe
  3. Rustlearn https://github.com/maciejkula/rustlearn
  4. Rusty machine https://github.com/AtheMathmo/rusty-machine
  5. Tensorflow bindings https://lib.rs/crates/tensorflow
  6. Juice (machine learning for hackers) https://lib.rs/crates/juice
  7. Rust reinforcement learning https://lib.rs/crates/rsrl

Read Full Post »

As a continuation of the previous episode in this one I cover the topic about compressing deep learning models and explain another simple yet fantastic approach that can lead to much smaller models that still perform as good as the original one.

Don't forget to join our Slack channel and discuss previous episodes or propose new ones.

This episode is supported by Pryml.io
Pryml is an enterprise-scale platform to synthesise data and deploy applications built on that data back to a production environment.

 

References

Comparing Rewinding and Fine-tuning in Neural Network Pruning
https://arxiv.org/abs/2003.02389

 

Read Full Post »

Using large deep learning models on limited hardware or edge devices is definitely prohibitive. There are methods to compress large models by orders of magnitude and maintain similar accuracy during inference.

In this episode I explain one of the first methods: knowledge distillation

 Come join us on Slack

Reference

Read Full Post »

Play this podcast on Podbean App