Artwork

Content provided by Bio-IT World. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Bio-IT World or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ppacc.player.fm/legal.
Player FM - Podcast App
Go offline with the Player FM app!

Tony Kerlavage on Data Lakes, Data Commons, and Empowering the Research of the Future

28:27
 
Share
 

Manage episode 321006420 series 3319353
Content provided by Bio-IT World. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Bio-IT World or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ppacc.player.fm/legal.

At the National Cancer Institute, Tony Kerlavage knows quite a bit about managing very large pools of data. When NCI launched the Genomic Data Commons, it aimed to democratize access to the genomic data in The Cancer Genome Atlas and other sources. Since then, though, Kerlavage points out that our data types and volumes have only grown. Now NCI is taking a “Commons of Commons” approach to link pools of well-structured data. “The more data we can bring together in a well-structured way, the more value it has in the long run,” he believes. He advocates for sharable Python notebooks and reusable R programming, believing significant investments in data hygiene and interoperability delivers more value than simply mining data lakes with artificial intelligence tools—for now, at least. The challenge for researchers, Kerlavage says, is to view their work with an eye to the future: How might someone else use this data going forward?

Links from this episode:
Bio-IT World
BioTeam
NCI Launches Genomic Data Commons
Bob Grossman’s Vision of the Commons of Commons
BioTeam’s Approach to Collaborative Dictionary Authoring

Trends from the Trenches boiler: Bio-IT World’s Trends from the Trenches podcast delivers your insider’s look at the science, technology, and executive trends driving the life sciences through conversations with industry leaders. BioTeam co-founder Stan Gloss brings years of industry experience in science, data, and technology to conversations exploring what is driving data and discovery, and what’s coming next.

  continue reading

36 episodes

Artwork
iconShare
 
Manage episode 321006420 series 3319353
Content provided by Bio-IT World. All podcast content including episodes, graphics, and podcast descriptions are uploaded and provided directly by Bio-IT World or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here https://ppacc.player.fm/legal.

At the National Cancer Institute, Tony Kerlavage knows quite a bit about managing very large pools of data. When NCI launched the Genomic Data Commons, it aimed to democratize access to the genomic data in The Cancer Genome Atlas and other sources. Since then, though, Kerlavage points out that our data types and volumes have only grown. Now NCI is taking a “Commons of Commons” approach to link pools of well-structured data. “The more data we can bring together in a well-structured way, the more value it has in the long run,” he believes. He advocates for sharable Python notebooks and reusable R programming, believing significant investments in data hygiene and interoperability delivers more value than simply mining data lakes with artificial intelligence tools—for now, at least. The challenge for researchers, Kerlavage says, is to view their work with an eye to the future: How might someone else use this data going forward?

Links from this episode:
Bio-IT World
BioTeam
NCI Launches Genomic Data Commons
Bob Grossman’s Vision of the Commons of Commons
BioTeam’s Approach to Collaborative Dictionary Authoring

Trends from the Trenches boiler: Bio-IT World’s Trends from the Trenches podcast delivers your insider’s look at the science, technology, and executive trends driving the life sciences through conversations with industry leaders. BioTeam co-founder Stan Gloss brings years of industry experience in science, data, and technology to conversations exploring what is driving data and discovery, and what’s coming next.

  continue reading

36 episodes

All episodes

×
 
Loading …

Welcome to Player FM!

Player FM is scanning the web for high-quality podcasts for you to enjoy right now. It's the best podcast app and works on Android, iPhone, and the web. Signup to sync subscriptions across devices.

 

Quick Reference Guide

Copyright 2025 | Privacy Policy | Terms of Service | | Copyright
Listen to this show while you explore
Play