Skip to content
START FOR FREE
START FOR FREE
  • SUPPORT
  • COMMUNITY
Menu
  • SUPPORT
  • COMMUNITY
MENUMENU
  • Products
    • The World’s Fastest and Most Scalable Graph Platform

      LEARN MORE

      Watch a TigerGraph Demo

      TIGERGRAPH CLOUD

      • Overview
      • TigerGraph Cloud Suite
      • FAQ
      • Pricing

      USER TOOLS

      • GraphStudio
      • Insights
      • Application Workbenches
      • Connectors and Drivers
      • Starter Kits
      • openCypher Support

      TIGERGRAPH DB

      • Overview
      • GSQL Query Language
      • Compare Editions

      GRAPH DATA SCIENCE

      • Graph Data Science Library
      • Machine Learning Workbench
  • Solutions
    • The World’s Fastest and Most Scalable Graph Platform

      LEARN MORE

      Watch a TigerGraph Demo

      Solutions

      • Solutions Overview

      INCREASE REVENUE

      • Customer Journey/360
      • Product Marketing
      • Entity Resolution
      • Recommendation Engine

      MANAGE RISK

      • Fraud Detection
      • Anti-Money Laundering
      • Threat Detection
      • Risk Monitoring

      IMPROVE OPERATIONS

      • Supply Chain Analysis
      • Energy Management
      • Network Optimization

      By Industry

      • Advertising, Media & Entertainment
      • Financial Services
      • Healthcare & Life Sciences

      FOUNDATIONAL

      • AI & Machine Learning
      • Time Series Analysis
      • Geospatial Analysis
  • Customers
    • The World’s Fastest and Most Scalable Graph Platform

      LEARN MORE

      CUSTOMER SUCCESS STORIES

      • Ford
      • Intuit
      • JPMorgan Chase
      • READ MORE SUCCESS STORIES
      • Jaguar Land Rover
      • United Health Group
      • Xbox
  • Partners
    • The World’s Fastest and Most Scalable Graph Platform

      LEARN MORE

      PARTNER PROGRAM

      • Partner Benefits
      • TigerGraph Partners
      • Sign Up
      TigerGraph partners with organizations that offer complementary technology solutions and services.​
  • Resources
    • The World’s Fastest and Most Scalable Graph Platform

      LEARN MORE

      BLOG

      • TigerGraph Blog

      RESOURCES

      • Resource Library
      • Benchmarks
      • Demos
      • O'Reilly Graph + ML Book

      EVENTS & WEBINARS

      • Graph+AI Summit
      • Graph for All - Million Dollar Challenge
      • Events &Trade Shows
      • Webinars

      DEVELOPERS

      • Documentation
      • Ecosystem
      • Developers Hub
      • Community Forum

      SUPPORT

      • Contact Support
      • Production Guidelines

      EDUCATION

      • Training & Certifications
  • Company
    • Join the World’s Fastest and Most Scalable Graph Platform

      WE ARE HIRING

      COMPANY

      • Company Overview
      • Leadership
      • Legal Terms
      • Patents
      • Security and Compliance

      CAREERS

      • Join Us
      • Open Positions

      AWARDS

      • Awards and Recognition
      • Leader in Forrester Wave
      • Gartner Research

      PRESS RELEASE

      • Read All Press Releases
      TigerGraph Reports Exceptional Customer Growth and Product Leadership as More Market-Leading Companies Tap the Power of Graph
      March 1, 2023
      Read More »

      NEWS

      • Read All News
      The-New-Stack-Logo-square

      Multiple Vendors Make Data and Analytics Ubiquitous

      TigerGraph enhances fundamentals in latest platform update

  • START FREE
    • The World’s Fastest and Most Scalable Graph Platform

      GET STARTED

      • Request a Demo
      • CONTACT US
      • Try TigerGraph
      • START FREE
      • TRY AN ONLINE DEMO

Implementing a Biomedical Knowledge Graph Using BERN2 and TigerGraph

  • Sudha Vijayakumar
  • March 28, 2022
  • blog, Healthcare
  • Blog >
  • Implementing a Biomedical Knowledge Graph Using BERN2 and TigerGraph

This blog is a reprint of an article published by the author on Medium.com on March 15, 2022.

Do you know how to construct a biomedical knowledge graph using TigerGraph and BERN2? Let’s learn together!

Knowledge Graph

Why representing biomedical data as a semantic knowledge graph is value-adding?

One can imagine biomedical data as an interconnected network of biomedical entities like diseases, drugs, genes, proteins, etc. Representing biomedical data as a knowledge graph has the following advantages:

  1. Biomedical data is dynamic, not fully integrated, and requires a flexible schema to represent the entities and interactions. A graph-based data model is more flexible for dynamic data integration and handling evolving schema.
  2. An enormous collection of biomedical ontologies and vocabularies are available online, which model the concepts of a biomedical domain. The amount of biomedical data in research works and papers are exploding. But, there are significantly fewer ways of integrating and connecting related information from different sources to gain valuable medical insights. So, representing biomedical data as a semantic knowledge graph will provide an integrated source to perform complex queries faster with appropriate reasoning.
  3. A graph database like TigerGraph can unleash the power of biomedical data by representing them as distributed semantic knowledge graphs and applying real-time machine learning to solve problems in the biomedical domain.
  4. A biomedical knowledge graph implemented as a distributed TigerGraph can perform real-time disease analytics and used to develop real-life applications like drug discovery, drug-to-drug interactions, drug side-effects, drug repurposing, genomics, and safe drug recommendations, patient diagnosis, rare disease identification.

This article will discuss how to build a biomedical knowledge graph using TigerGraph from scratch and traverse the graph with some sample gsql queries.

1. Understanding a biomedical graph

A biomedical knowledge graph is a systematic way to connect biomedical entities and can act as a critical asset to search biomedical data semantically, helping to make fast clinical decisions and also helping to drive biomedical research towards life-saving applications. The below is a simple construct of a biomedical graph.

Sample Biomedical Graph

2. Understanding BERN2

BERN2 is an ‘Advanced neural Biomedical Named Entity Recognition and Normalization Tool’ implemented by DMIS Lab at Korea University.

BERN2 can,

  1. Recognize nine different biomedical entity types by their normalized identifiers. Identifiers include genes, diseases, DNA, RNA, etc.
  2. Annotate in real-time when provided a plain text or PubMed ID.
  3. Provide access via RESTful API and also support stand-alone installation.
  4. Merge the entities derived into target knowledge bases.

The below is an example of a real-time annotation by BERN2,

BERN2 real-time biomedical entity recognition

Alright, now we have a basic understanding of biomedical knowledge graphs and their potential application use cases.

3. Biomedical Knowledge Graph solution using TigerGraph

Solution Architecture

4. HANDS-ON

Step-0: Create a tgcloud solution

Reference link: https://docs.tigergraph.com/cloud/solutions/create

Python Scripts

Below are the 3-steps to load biomedical data into TigerGraph using python scripts,

Script Flow

Step-1: Pulling data and entity/ relationship extraction.

The below research paper titled ‘Developmental Status and Perspectives for Tissue Engineering in Urology’ will be the biomedical data source for this project.

Download source pdf: https://arxiv.org/abs/2111.09414

Run this Google colab notebook to extract entities and relationships from the biomedical research paper.

Step-2: Data transformation

Run this Google colab notebook to convert extracted into a graph schema.

Step-3: Getting data into TigerGraph

Run this Google colab notebook to load the data into TigerGraph cloud.

After the execution of the above load script, the following graph is created with the respective data mappings as follows,

3.1 Global Schema

Global Schema

3.2 Data Mapping

Biomedical data mappings

3.3 Sample semantic relationship

Sample semantic relationship

Step-4: Sample gsql query

Ask the knowledge graph: Find all the source which mentions ‘thrombosis’.

CREATE QUERY get_disease_relation(/* Parameters here */) FOR GRAPH bio { 
/* Write query logic here */
ListAccum @relation;
start = {Entity.*};entity_typ = SELECT s FROM start:s -(reverse_has_bio_entity:r)- :t
WHERE s.name=="thrombosis"
ACCUM s.@relation+=r;
PRINT entity_typ;
}

5. Conclusion

This article demonstrated how biomedical knowledge graphs could be constructed from scientific text using biomedical entity recognition and linking services like BERN2 and TigerGraph. Representing biomedical data as a semantic knowledge graph can accelerate clinical trials, pave way for hidden knowledge discovery, real-time biomedical analytics, diagnosis, clinical predications, etc., and instigate further research in the domain. I hope this can be a starter project for many such biomedical applications using TigerGraph.

6. Takeaway

A biomedical knowledge graph built using the highly scalable and super-fast TigerGraph can unwrap valuable yet hidden medical knowledge quickly setting a space for real-time machine learning and could lead to many life-saver applications!

Source Code

Please check out the Git Repo for detailed run instructions.

For any questions and clarifications, please reach out to me @ TigerGraph Developer Community Forum.

WATCH OUT FOR 👀

Stay tuned for a project on implementing a giant biomedical graph using the TigerGraph database where we will implement the biomedical knowledge graph on a bigger scale and learn some complex real-time queries.

References

https://bmcmedinformdecismak.biomedcentral.com/articles/10.1186/s12911-020-01341-5

https://het.io/about/#whats-in-hetionet

https://www.researchgate.net/figure/The-metagraph-ie-schema-of-the-knowledge-graph-used-in-the-Rephetio-project-9-The_fig1_341847848

https://githubhelp.com/fractalego/zero-shot-relation-extractor

http://bern2.korea.ac.kr/

https://github.com/MannLabs/CKG

https://ieeexplore.ieee.org/abstract/document/9086146

You Might Also Like

Trillion edges benchmark: new world record beyond 100TB by TigerGraph featuring AMD based Amazon EC2 instances

Trillion edges benchmark: new world record...

March 13, 2023
Graph Databases 101: Your Top 5 Questions with Non-Technical Answers

Graph Databases 101: Your Top 5...

February 7, 2023
It’s Time to Harness the Power of Graph Technology [Infographic]

It’s Time to Harness the Power...

January 25, 2023

Introducing TigerGraph 3.0

July 1, 2020

Everything to Know to Pass your TigerGraph Certification Test

June 24, 2020

Neo4j 4.0 Fabric – A Look Behind the Curtain

February 7, 2020

TigerGraph Blog

  • Categories
    • blogs
      • About TigerGraph
      • Benchmark
      • Business
      • Community
      • Compliance
      • Customer
      • Customer 360
      • Cybersecurity
      • Developers
      • Digital Twin
      • eCommerce
      • Emerging Use Cases
      • Entity Resolution
      • Finance
      • Fraud / Anti-Money Laundering
      • GQL
      • Graph Database Market
      • Graph Databases
      • GSQL
      • Healthcare
      • Machine Learning / AI
      • Podcast
      • Supply Chain
      • TigerGraph
      • TigerGraph Cloud
    • Graph AI On Demand
      • Analysts and Research
      • Customer 360 and Entity Resolution
      • Customer Spotlight
      • Development
      • Finance, Banking, Insurance
      • Keynote
      • Session
    • Video
  • Recent Posts

    • Trillion edges benchmark: new world record beyond 100TB by TigerGraph featuring AMD based Amazon EC2 instances
    • Overview of Graph and Machine Learning with TigerGraph | Mar 8 @ 11am PST
    • Gartner Data & Analytics Summit 2023, London
    • Gartner Data and Analytics Summit, Orlando
    • Transaction Surveillance with Maximum Flow Algorithm
    TigerGraph

    Product

    SOLUTIONS

    customers

    RESOURCES

    start for free

    TIGERGRAPH DB
    • Overview
    • Features
    • GSQL Query Language
    GRAPH DATA SCIENCE
    • Graph Data Science Library
    • Machine Learning Workbench
    TIGERGRAPH CLOUD
    • Overview
    • Cloud Starter Kits
    • Login
    • FAQ
    • Pricing
    • Cloud Marketplaces
    USEr TOOLS
    • GraphStudio
    • TigerGraph Insights
    • Application Workbenches
    • Connectors and Drivers
    • Starter Kits
    • openCypher Support
    SOLUTIONS
    • Why Graph?
    industry
    • Advertising, Media & Entertainment
    • Financial Services
    • Healthcare & Life Sciences
    use cases
    • Benefits
    • Product & Service Marketing
    • Entity Resolution
    • Customer 360/MDM
    • Recommendation Engine
    • Anti-Money Laundering
    • Cybersecurity Threat Detection
    • Fraud Detection
    • Risk Assessment & Monitoring
    • Energy Management
    • Network & IT Management
    • Supply Chain Analysis
    • AI & Machine Learning
    • Geospatial Analysis
    • Time Series Analysis
    success stories
    • Customer Success Stories

    Partners

    Partner program
    • Partner Benefits
    • TigerGraph Partners
    • Sign Up
    LIBRARY
    • Resources
    • Benchmark
    • Webinars
    Events
    • Trade Shows
    • Graph + AI Summit
    • Million Dollar Challenge
    EDUCATION
    • Training & Certifications
    Blog
    • TigerGraph Blog
    DEVELOPERS
    • Developers Hub
    • Community Forum
    • Documentation
    • Ecosystem

    COMPANY

    Company
    • Overview
    • Careers
    • News
    • Press Release
    • Awards
    • Legal
    • Patents
    • Security and Compliance
    • Contact
    Get Started
    • Start Free
    • Compare Editions
    • Online Demo - Test Drive
    • Request a Demo

    Product

    • Overview
    • TigerGraph 3.0
    • TIGERGRAPH DB
    • TIGERGRAPH CLOUD
    • GRAPHSTUDIO
    • TRY NOW

    customers

    • success stories

    RESOURCES

    • LIBRARY
    • Events
    • EDUCATION
    • BLOG
    • DEVELOPERS

    SOLUTIONS

    • SOLUTIONS
    • use cases
    • industry

    Partners

    • partner program

    company

    • Overview
    • news
    • Press Release
    • Awards

    start for free

    • Request Demo
    • take a test drive
    • SUPPORT
    • COMMUNITY
    • CONTACT
    • Copyright © 2023 TigerGraph
    • Privacy Policy
    • Linkedin
    • Facebook
    • Twitter

    Copyright © 2020 TigerGraph | Privacy Policy

    Copyright © 2020 TigerGraph Privacy Policy

    • SUPPORT
    • COMMUNITY
    • COMPANY
    • CONTACT
    • Linkedin
    • Facebook
    • Twitter

    Copyright © 2020 TigerGraph

    Privacy Policy

    • Products
    • Solutions
    • Customers
    • Partners
    • Resources
    • Company
    • START FREE
    START FOR FREE
    START FOR FREE
    TigerGraph
    PRODUCT
    PRODUCT
    • Overview
    • GraphStudio UI
    • Graph Data Science Library
    TIGERGRAPH DB
    • Overview
    • Features
    • GSQL Query Language
    TIGERGRAPH CLOUD
    • Overview
    • Cloud Starter Kits
    TRY TIGERGRAPH
    • Get Started for Free
    • Compare Editions
    SOLUTIONS
    SOLUTIONS
    • Why Graph?
    use cases
    • Benefits
    • Product & Service Marketing
    • Entity Resolution
    • Customer Journey/360
    • Recommendation Engine
    • Anti-Money Laundering (AML)
    • Cybersecurity Threat Detection
    • Fraud Detection
    • Risk Assessment & Monitoring
    • Energy Management
    • Network Resources Optimization
    • Supply Chain Analysis
    • AI & Machine Learning
    • Geospatial Analysis
    • Time Series Analysis
    industry
    • Advertising, Media & Entertainment
    • Financial Services
    • Healthcare & Life Sciences
    CUSTOMERS
    read all success stories

     

    PARTNERS
    Partner program
    • Partner Benefits
    • TigerGraph Partners
    • Sign Up
    RESOURCES
    LIBRARY
    • Resource Library
    • Benchmark
    • Webinars
    Events
    • Trade Shows
    • Graph + AI Summit
    • Graph for All - Million Dollar Challenge
    EDUCATION
    • TigerGraph Academy
    • Certification
    Blog
    • TigerGraph Blog
    DEVELOPERS
    • Developers Hub
    • Community Forum
    • Documentation
    • Ecosystem
    COMPANY
    COMPANY
    • Overview
    • Leadership
    • Careers  
    NEWS
    PRESS RELEASE
    AWARDS
    START FREE
    Start Free
    • Request a Demo
    • SUPPORT
    • COMMUNITY
    • CONTACT
    Dr. Jay Yu

    Dr. Jay Yu | VP of Product and Innovation

    Dr. Jay Yu is the VP of Product and Innovation at TigerGraph, responsible for driving product strategy and roadmap, as well as fostering innovation in graph database engine and graph solutions. He is a proven hands-on full-stack innovator, strategic thinker, leader, and evangelist for new technology and product, with 25+ years of industry experience ranging from highly scalable distributed database engine company (Teradata), B2B e-commerce services startup, to consumer-facing financial applications company (Intuit). He received his PhD from the University of Wisconsin - Madison, where he specialized in large scale parallel database systems

    Todd Blaschka | COO

    Todd Blaschka is a veteran in the enterprise software industry. He is passionate about creating entirely new segments in data, analytics and AI, with the distinction of establishing graph analytics as a Gartner Top 10 Data & Analytics trend two years in a row. By fervently focusing on critical industry and customer challenges, the companies under Todd's leadership have delivered significant quantifiable results to the largest brands in the world through channel and solution sales approach. Prior to TigerGraph, Todd led go to market and customer experience functions at Clustrix (acquired by MariaDB), Dataguise and IBM.