EPISODE · Jul 29, 2022 · 52 MIN
Just Fetch the Data and then... // David Bayliss // Coffee Sessions #110
from MLOps.community · host Demetrios
MLOps Coffee Sessions #110 with David Bayliss, Chief Data Scientist of LexisNexis Risk Solutions, Just Fetch the Data and then... co-hosted by Vishnu Rachakonda.Join the Community: https://go.mlops.community/YTJoinInGet the newsletter: https://go.mlops.community/YTNewsletter// AbstractComposing data to extract features can be a significant problem. Key factors are the data size, compliance restrictions, and real-time data. Ethics (and law) can drive extremely complex audit requirements. In the cloud, you can do anything - at a price.// BioOne of the creators of the world's first big data platform (HPCC), David has been tackling big data problems for two decades. A mathematician, compiler writer, and data sponge with more than five dozen patents spanning platforms, linking, and search.Most inventors think outside the box; David can't even remember where the box is. He leads the team that creates their core Data Science methods used by hundreds of data scientists.// MLOps Jobs board MLOps Swag/Merchhttps://mlops-community.myshopify.com/// Related LinksInteresting insight in this post. It would be cool to learn from David about his view on thingshttps://www.google.com/url?q=https://www.linkedin.com/posts/david-bayliss-426556a_datascience-platform-portability-activity-6913448643303759872-2dqq?utm_source%3Dlinkedin_share%26utm_medium%3Dmember_desktop_web&sa=D&source=calendar&ust=1649078059106132&usg=AOvVaw26wAevExeEfW_AdZSA8UhF--------------- ✌️Connect With Us ✌️ -------------Join our Slack community: https://go.mlops.community/slackFollow us on Twitter: @mlopscommunitySign up for the next meetup: https://go.mlops.community/registerCatch all episodes, blogs, newsletters, and more: https://mlops.community/Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/Connect with David on LinkedIn: https://www.linkedin.com/in/david-bayliss-426556a/Timestamps: [00:00] Introduction to David Bayliss [01:03] Takeaways [04:56] LexisNexis and David's role [07:15] Evolution of LexisNexis in 20 years with so many use cases [08:51] Role of David in structuring data for working with data change [14:32] Data management and data access [17:45] Unique challenges of scale, use case, and diversity at LexisNexis [24:47] Tardis Iron Box [30:05] Iron Box translation [32:56] JVM for data science [34:24] Iron Box meaning [36:52] Metadata with PII [39:08] Detrimental privacy / Hairy Kneecap Theory [40:57] Speeding things up and Anonymized linking [46:47] What kept David working at LexisNexis? [50:30] Wrap up
What this episode covers
MLOps Coffee Sessions #110 with David Bayliss, Chief Data Scientist of LexisNexis Risk Solutions, Just Fetch the Data and then... co-hosted by Vishnu Rachakonda.Join the Community: https://go.mlops.community/YTJoinInGet the newsletter: https://go.mlops.community/YTNewsletter// AbstractComposing data to extract features can be a significant problem. Key factors are the data size, compliance restrictions, and real-time data. Ethics (and law) can drive extremely complex audit requirements. In the cloud, you can do anything - at a price.// BioOne of the creators of the world's first big data platform (HPCC), David has been tackling big data problems for two decades. A mathematician, compiler writer, and data sponge with more than five dozen patents spanning platforms, linking, and search.Most inventors think outside the box; David can't even remember where the box is. He leads the team that creates their core Data Science methods used by hundreds of data scientists.// MLOps Jobs board MLOps Swag/Merchhttps://mlops-community.myshopify.com/// Related LinksInteresting insight in this post. It would be cool to learn from David about his view on thingshttps://www.google.com/url?q=https://www.linkedin.com/posts/david-bayliss-426556a_datascience-platform-portability-activity-6913448643303759872-2dqq?utm_source%3Dlinkedin_share%26utm_medium%3Dmember_desktop_web&sa=D&source=calendar&ust=1649078059106132&usg=AOvVaw26wAevExeEfW_AdZSA8UhF--------------- ✌️Connect With Us ✌️ -------------Join our Slack community: https://go.mlops.community/slackFollow us on Twitter: @mlopscommunitySign up for the next meetup: https://go.mlops.community/registerCatch all episodes, blogs, newsletters, and more: https://mlops.community/Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/Connect with David on LinkedIn: https://www.linkedin.com/in/david-bayliss-426556a/Timestamps: [00:00] Introduction to David Bayliss [01:03] Takeaways [04:56] LexisNexis and David's role [07:15] Evolution of LexisNexis in 20 years with so many use cases [08:51] Role of David in structuring data for working with data change [14:32] Data management and data access [17:45] Unique challenges of scale, use case, and diversity at LexisNexis [24:47] Tardis Iron Box [30:05] Iron Box translation [32:56] JVM for data science [34:24] Iron Box meaning [36:52] Metadata with PII [39:08] Detrimental privacy / Hairy Kneecap Theory [40:57] Speeding things up and Anonymized linking [46:47] What kept David working at LexisNexis? [50:30] Wrap up
NOW PLAYING
Just Fetch the Data and then... // David Bayliss // Coffee Sessions #110
No transcript for this episode yet
Similar Episodes
Apr 21, 2026 ·13m
Apr 19, 2026 ·16m
Apr 17, 2026 ·13m
Apr 13, 2026 ·11m
Apr 11, 2026 ·16m