Evaluating the Leading Graph Databases – TigerGraph and Neo4j for Scalability

Blog >
Evaluating the Leading Graph Databases – TigerGraph and Neo4j for Scalability

As the decade of Graph roars on, it’s important to evaluate each available offering based on objective criteria. I mentioned in the last blog that architecture is an important consideration as you are evaluating the available graph technology from leading vendors. Today, I will focus on key criteria for scaling up a graph database based on universally accepted enterprise standards and compare the two leading graph databases, TigerGraph and Neo4j. Here are the criteria to consider for enterprise scalability:

Unified enterprise schema – As graph databases are used for connecting multiple datasets and pipelines, having a unified enterprise schema is an essential requirement for enterprises.
Automatic data partitioning – As the scale of data grows beyond 100 GB, enterprises add multiple physical or virtual machines or nodes to partition the data for a graph database. The ability to partition the data across multiple machine nodes on-premises or in the cloud is a key requirement, especially for DB administrators in an enterprise.
Distributed querying – As data is partitioned into multiple machine nodes, the ability to query across the nodes, with minimal overhead for engineers is a key requirement. An enterprise-scale graph database must support distributed querying with a single query without having to write individual queries for each data partition.
ACID transactions across the cluster – For an enterprise-scale graph database, ACID transactions must be supported across the cluster to ensure operational use of the database. This is not a requirement for the analytical use cases, but a must-have for operational deployment of a graph database such as real-time fraud detection for financial transactions.
Graph algorithm execution across the cluster – Graph algorithms such as PageRank and community detection identify relationships across business entities and are used in most enterprise-scale deployments of Graph databases. As the relationships can span multiple partitions of data, the ability to run graph algorithms is a must-have for an enterprise-scale graph database.

Here’s a comparison of TigerGraph and Neo4j’s 4.0 Fabric architecture based on the above scalability criteria. In order to keep this objective, we have included references to Neo4j documentation outlining specific aspects of the architecture involved in this comparison.