Machine Learning with R, the tidyverse, and mlr by Hefin Rhys episode artwork

EPISODE · Apr 2, 2020 · 46 MIN

Machine Learning with R, the tidyverse, and mlr by Hefin Rhys

from HumAIn Podcast · host David Yakobovitch

[Audio] Podcast: Play in new window | DownloadSubscribe: Google Podcasts | Spotify | Stitcher | TuneIn | RSSHefin Rhys is a Senior Scientist (flow cytometry) at UCB. He completed his PhD at the William Harvey Research Institute in Queen Mary University of London in 2017, and graduated from my MPharmacol degree from the University of Bath in 2013. His main academic interests are conventional, imaging and small particle flow cytometry, data science and machine learning. Episode Links:  Hefin Rhys’ LinkedIn: https://www.linkedin.com/in/hefin-rhys/ Hefin Rhys’ Twitter:  @HRJ21Hefin Rhys’ Website: https://www.manning.com/books/machine-learning-with-r-the-tidyverse-and-mlr Podcast Details: Podcast website: https://www.humainpodcast.com/ Apple Podcasts:  https://podcasts.apple.com/us/podcast/humain-podcast-artificial-intelligence-data-science/id1452117009 Spotify:  https://open.spotify.com/show/6tXysq5TzHXvttWtJhmRpS RSS: https://feeds.redcircle.com/99113f24-2bd1-4332-8cd0-32e0556c8bc9 YouTube Full Episodes: https://www.youtube.com/channel/UCxvclFvpPvFM9_RxcNg1rag YouTube Clips:  https://www.youtube.com/channel/UCxvclFvpPvFM9_RxcNg1rag/videos Support and Social Media:  – Check out the sponsors above, it’s the best way to support this podcast– Support on Patreon: https://www.patreon.com/humain/creators   – Twitter:  https://twitter.com/dyakobovitch – Instagram: https://www.instagram.com/humainpodcast/ – LinkedIn: https://www.linkedin.com/in/davidyakobovitch/  – Facebook: https://www.facebook.com/HumainPodcast/ – HumAIn Website Articles: https://www.humainpodcast.com/blog/ Outline: Here’s the timestamps for the episode: (00:00) – Introduction(01:44) – My view is not that of someone who is an expert on this virus, but it's clearly something that's very serious and that we need to take seriously and treat with respect. So as much as the virulence of the virus itself is concerning, I particularly consider how viral misinformation and misinformed practices have gone along with it.(08:24) – As a pharmacologist, my PhD was in immunology. The traditional analysis methods that we had been using and that other people in biological fields were using started to not quite suit our needs, not quite answer our questions. In biological life sciences the level of maths left them. I started to teach statistics, R and machine learning during my PhD. Manning wanted a book that was not for computer scientists necessarily, but more for people who were an expert in their own area but who could use and benefit from machine learning, who could benefit from understanding and learning machine learning to make predictions and extract meaningful insights from the data that they have.(14:57) – The answer to the question of whether somebody should learn R or Python is yes, people should use either or both. Python would probably have been a more convenient choice for a lot of people for machine learning. Carat or MLR in R, which were kind of an answer to scikit-learn and create this common interface so that you learn how to use that package and then substituting in a variety of different machine learning techniques and algorithms is extremely simple. Tidyverse is a collection of data science packages, a set of packages that are designed to make common data science tasks extremely easy, clean and reproducible.(22:21) – There's basically no reason for Python and R to compete, we can incorporate code from both languages.(24:11) – R has a phenomenal community of people. You need only to tweet a question or ask for opinions, and hashtag our stats and you get a ton of really nice supportive answers back and a huge amount of support on github or stackoverflow. (25:41) – Submitting a package to CRAN, the Comprehensive R Archive Network, is not a difficult process at all, if you write your package well. But writing a package for it to be submitted on to CRAN has to meet certain criteria. The documentation has to be of a certain quality in data in a certain way. The script files have to be laid out and documented in a certain way. So the whole CRAN submission process selects for good quality packages. (27:30) – People that are asking the really important questions, whether to do with business or science or health or whatever, the people that know how to ask and are asking those important questions are the ones that should be able to harness and implement statistics, data science, and machine learning to get those answers. I don't think that machine learning should be the purview only of mathematicians and computer scientists.(28:13) – As long as you teach people how to do things properly, that they have enough of an understanding of how the techniques work and what they do and what they don't do, then, absolutely, we can democratize machine learning. We can absolutely teach people to be able to use these techniques, to extract the answers or make the predictions that they're looking for in their field of expertise.(29:18) – The MLR package, which stands for machine learning in R. It provides a unified interface to a huge number of, not only actual machine learning algorithms, but also processes and functions like missing value, imputation, hyperparameter tuning, validation techniques. Where MLR particularly shines is, It makes it extremely simple to validate your models, MLR works very nicely with parallelization. MLR helps achieve that because you can do some extremely complicated validation pre-processing with very small amounts of code. (34:49) – Caret has functions that you can use to split your data into train test validation sets. And it has the ability for you to perform data pre-processing steps like missing value, imputation and things like that. MLR has become more popular recently. Caret has been the mainstay.(38:15) – Tidy Models are a set of packages that come from the Tidyverse. And in a similar way in which MLR is trying to create a uniform interface to machine learning, Tidy models are packages that are trying to create a unified approach to modeling in general. So that includes, and it's probably more widely used, as linear modeling. (41:53) – I really do think that Machine Learning with R, the tidyverse, and mlr is an excellent book. And it sounds very braggy of me and I don't mean to be, because although I wrote the content, a huge number of people other than me have made the book very good. So I do think that people will learn a lot and get a lot from it. Advertising Inquiries: https://redcircle.com/brandsPrivacy & Opt-Out: https://redcircle.com/privacy

In this episode: Hefin Rhys, Author of Machine Learning with R, the tidyverse, and mlr Learn more about your ad-choices at www.humainpodcast.com/advertise You can support the HumAIn podcast and receive subscriber-only content at http://humainpodcast.com/newsletter. Advertising Inquiries: https://redcircle.com/brands Privacy & Opt-Out: https://redcircle.com/privacy

NOW PLAYING

Machine Learning with R, the tidyverse, and mlr by Hefin Rhys

0:00 46:04

No transcript for this episode yet

We transcribe on demand. Request one and we'll notify you when it's ready — usually under 10 minutes.

That Hoarder: Overcome Compulsive Hoarding That Hoarder Hoarding disorder is stigmatised and people who hoard feel vast amounts of shame. This podcast began life as an audio diary, an anonymous outlet for somebody with this weird condition. That Hoarder speaks about her experiences living with compulsive hoarding, she interviews therapists, academics, researchers, children of hoarders, professional organisers and influencers, and she shares insight and tips for others with the problem. Listened to by people who hoard as well as those who love them and those who work with them, Overcome Compulsive Hoarding with That Hoarder aims to shatter the stigma, share the truth and speak openly and honestly to improve lives. The Small Business Startup School – Business Notes | Financial Literacy | Retail Psychology – For Professionals & Entrepreneurs The Small Business Startup School Inc. Starting or buying a small business? While personal circumstances may vary, business patterns remain timeless. On The Small Business Startup School, we explore strategies, insights, and practical solutions to help entrepreneurs confidently navigate their journey.Hosted by Ola Williams—a retail entrepreneur, fintech founder, and financial coach with over two decades of experience—this podcast marries financial awareness and retail psychology with optimism to deliver actionable takeaways.Join us to learn, grow, and connect as we uncover the keys to business success.Let’s continue to learn together and be encouraged to keep on connecting! DIOSA. Carolina Sanper This podcast is a sacred space created by Carolina Sanper where you connect with your inner wisdom and embody your magnetic feminine power.It is the realization that the mystical realm is where you plant the seeds of your desired reality.It is a portal to your true essence: awareness, presence, and receiving with ease. Welcome home, DIOSA. 🖤 XXX Tech by SOVRYN Dr. Brian Sovryn The crossroads between technology, sensuality, and metaphysics - and the longest running anarchist podcast in the world! Brought to you by Dr. Brian Sovryn.

Frequently Asked Questions

How long is this episode of HumAIn Podcast?

This episode is 46 minutes long.

When was this HumAIn Podcast episode published?

This episode was published on April 2, 2020.

What is this episode about?

[Audio] Podcast: Play in new window | DownloadSubscribe: Google Podcasts | Spotify | Stitcher | TuneIn | RSSHefin Rhys is a Senior Scientist (flow cytometry) at UCB. He completed his PhD at the William Harvey Research Institute in Queen Mary...

Can I download this HumAIn Podcast episode?

Yes, you can download this episode by clicking the download button on the episode player, or subscribe to the podcast in your preferred podcast app for automatic downloads.
URL copied to clipboard!