PodParley

How to cluster tabular data with Markov Clustering (Ep. 73)

Episode 69 of the Data Science at Home podcast, "How to cluster tabular data with Markov Clustering (Ep. 73)", hosted by Francesco Gadaleta, was published on August 20, 2019, and runs 20 minutes.

August 20, 2019 · 20m · Data Science at Home

In this episode I explain how a community detection algorithm known as Markov clustering can be constructed by combining simple concepts such as random walks, graphs, and similarity matrices. I also highlight how one can build a similarity graph from tabular data and then run a community detection algorithm on that graph to find clusters.
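As a rough illustration of the pipeline described above (this is not the snippet from the Amethix Blog), the sketch below builds a similarity matrix from the rows of a small table and runs a basic Markov clustering loop on it. Several choices here are assumptions rather than details from the episode: the Gaussian similarity and its `sigma` parameter, the function names, the default `expansion` and `inflation` values, and the thresholds used to detect convergence and read off clusters.

```python
import numpy as np


def similarity_matrix(X, sigma=1.0):
    """Gaussian similarity between every pair of rows of X (an assumed choice).

    The diagonal is exp(0) = 1, so every node gets a self-loop.
    """
    sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    return np.exp(-sq_dists / (2.0 * sigma ** 2))


def normalize_columns(M):
    """Rescale each column to sum to 1 (random-walk transition probabilities)."""
    return M / M.sum(axis=0, keepdims=True)


def markov_clustering(A, expansion=2, inflation=2.0, max_iter=100, tol=1e-8):
    """Minimal Markov clustering sketch on a dense similarity matrix A.

    Returns a sorted list of clusters, each a tuple of row indices.
    """
    M = normalize_columns(np.asarray(A, dtype=float))
    for _ in range(max_iter):
        prev = M
        # Expansion: take several random-walk steps at once.
        M = np.linalg.matrix_power(M, expansion)
        # Inflation: element-wise power, then renormalize, which
        # strengthens strong transitions and weakens weak ones.
        M = normalize_columns(M ** inflation)
        if np.allclose(M, prev, rtol=0.0, atol=tol):
            break
    # Rows that keep non-negligible mass act as attractors; the columns
    # where such a row is non-zero are the members of its cluster.
    clusters = set()
    for row in M:
        members = tuple(np.flatnonzero(row > 1e-6).tolist())
        if members:
            clusters.add(members)
    return sorted(clusters)


if __name__ == "__main__":
    # Toy table: two well-separated groups of three rows each.
    X = np.array([
        [0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
        [5.0, 5.0], [5.1, 5.0], [5.0, 5.1],
    ])
    print(markov_clustering(similarity_matrix(X)))
```

On the toy table, the two groups of rows should come out as two separate clusters. A dense matrix keeps the sketch short; for larger tables one would likely sparsify the similarity graph first (for example by keeping only each row's nearest neighbours), which is again an assumption and not something stated in the episode notes.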

You can find a simple hands-on code snippet to play with on the Amethix Blog.

Enjoy the show!
