Overview
This 31-minute conference talk shows how Knowledge Graphs and Apache Spark support drug and vaccine discovery. GSK is building the world's largest medical knowledge graph to give scientists access to global medical knowledge and to let machine learning infer links between facts. The talk introduces the open-source libraries codenamed "Project Bellman", which run SPARQL queries over partitioned RDF data in Apache Spark so that querying scales to trillions of RDF triples. GSK's AI/ML team uses these tools for gene-to-disease mapping, and scientists use them to query medical knowledge. The presentation, from Databricks, moves from an introduction to present-day applications and covers data use cases, knowledge graph construction, Spark architecture, and literature search.
Syllabus
Introduction
Background
Data Use Cases
Why a Knowledge Graph
Knowledge Graph Data
Spark Query
Spark Architecture
Demo Setup
Loading the Knowledge Graph
Querying the Knowledge Graph
Exploring the Knowledge Graph
Constructing the Knowledge Graph
Literature Search
Present Day
Taught by
Databricks