Jim Webber

Chief Scientist

Neo4j

Biography

I am Neo4j’s Chief Scientist and Visiting Professor at Newcastle University, UK. At Neo4j I lead the research group, working on a variety of database topics including query languages and runtimes, temporality, streaming, scale, and fault-tolerance. I have also also co-authored several books on graph technology including Graph Databases - 1st and 2nd Editions (O’Reilly), Graph Databases for Dummies (Wiley), and Building Knowledge Graphs (O’Reilly).

Prior to Neo4j, I worked on fault-tolerant distributed systems. First at Newcastle University startup Arjuna and then for a variety of clients for global consulting firm ThoughtWorks. Along the way I co-authored the distrubuted systems books REST in Practice (O’Reilly) and Developing Enterprise Web Services - An Architect’s Guide (Prentice-Hall).

Interests

Graph Theory
Databases
Distributed Systems
Fault Tolerance

Education

Visting Professor of Practice, 2018-present

Newcastle University
Ph.D. in Programming Languages for High-Performance Computing, 2000

Newcastle University
B.Sc. (1st class Honours) Computing Science, 1996

Newcastle University

Recent & Upcoming Talks

Industry and Research

The Pub-Time Parliament

GOTO Copenhagen

Imagine a busy pub on a Friday night. It’s crowded, lots of people are talking at the same time. They’re all exchanging information with each other which makes tiny changes in their brains. Some folks are taking it easy on the drink, a few are a bit tipsy after one too many beers, and my mate Stevo is plastered, falling off his barstool. Classic Stevo. Now imagine trying to get this crowd to agree on something when they can’t even agree on which footy team is the worst this week and some of them can’t remember their own names.

In comparison you’d think that getting a bunch of computers to agree on something, say a simple number would be pretty easy. Computers can do way smarter things than agreeing upon a number, right? Sadly not. Computers are often in various states of being wrong or crashing. Much like a bunch of drunks they all want to talk at the same time, and they’re confident they have the best opinion.

How have we have built such incredible systems on such a flakey foundation? In this talk we will visit classic consensus algorithms and see how they provide benefits of correctness and fault-tolerance for systems but at the price of reduced scalability. Then we’ll explore some new research which aims to provide both correctness and scalability for distributed systems. The talk will be interactive - you may need a drink yourself afterwards.

Copenhagen, Denmark 28 September 2026

The Pub-Time Parliament

Yow! Australia

Imagine a busy pub on a Friday night. It’s crowded, lots of people are talking at the same time. They’re all exchanging information with each other which makes tiny changes in their brains. Some folks are taking it easy on the drink, a few are a bit tipsy after one too many beers, and my mate Stevo is plastered, falling off his barstool. Classic Stevo. Now imagine trying to get this crowd to agree on something when they can’t even agree on which footy team is the worst this week and some of them can’t remember their own names.

In comparison you’d think that getting a bunch of computers to agree on something, say a simple number would be pretty easy. Computers can do way smarter things than agreeing upon a number, right? Sadly not. Computers are often in various states of being wrong or crashing. Much like a bunch of drunks they all want to talk at the same time, and they’re confident they have the best opinion.

How have we have built such incredible systems on such a flakey foundation? In this talk we will visit classic consensus algorithms and see how they provide benefits of correctness and fault-tolerance for systems but at the price of reduced scalability. Then we’ll explore some new research which aims to provide both correctness and scalability for distributed systems. The talk will be interactive - you may need a drink yourself afterwards.

Australia 4 December 2025

Lies, Damn Lies, and AIs

Great International Developer Summit (GIDS)

Generative AI has taken the world by storm, but it’s not always a reliable helper. It makes up alternative facts, has difficulty with number and logical reasoning, all while exuding the confidence of a used car salesperson. In this talk we’ll see how to use Knowledge Graphs to improve accuracy. The audience will hear several technology patterns where deterministic knowledge graphs complement generative AI to create systems that are compelling and truthful.

Bangalore, India 24 April 2024

See all talks

Books

Jesùs Barrasa, Jim Webber

June 2023

Building Knowledge Graphs - A Practitioner's Guide

A practitioner’s guide to building Knowledge Graphs for the enterprise.

PDF Buy at Amazon UK Buy at Amazon US

Jim Webber, Rik Van Bruggen

September 2020

Graph Databases for Dummies

A practice and humane introduction to graph databases and Neo4j, Graph Databases For Dummies walks you through modeling, querying, and importing graph data, all the way through to your first production system.

PDF

Ian Robinson, Jim Webber, And Emil Eifrem

June 2015

Graph Databases

The first book on graph databases, now in its second edition. Provides in-depth coverage of graph modeling and querying, as well as thorough explanations of the internal workings of Neo4j.

PDF

Jim Webber, Savas Parastatidis, Ian Robinson

September 2010

Rest in Practice

Why don’t typical enterprise projects go as smoothly as projects you develop for the Web? Does the REST architectural style really present a viable alternative for building distributed systems and enterprise-class applications?

In this insightful book, three SOA experts provide a down-to-earth explanation of REST and demonstrate how you can develop simple and elegant distributed hypermedia systems by applying the Web’s guiding principles to common enterprise computing problems. You’ll learn techniques for implementing specific Web technologies and patterns to solve the needs of a typical company as it grows from modest beginnings to become a global enterprise.

Code Project Buy at Amazon UK Buy at Amazon US

Sandeep Chatterjee, Jim Webber

November 2003

Developing Enterprise Web Services

This was one of the first books to demonstrate how to build (WS-*) Web Services with enterprise-class reliability, and performance. This book takes a no-nonsense view of architecting and constructing enterprise-class Web services and applications. The authors assess the state of the art of the Web services platform circa 2004, offering best practices and new architectural patterns for taking advantage of Web Services.

While the architectural patterns in this book generally remain worthwhile today, the protocols and standards covered are now looking somewhat out of date, especially since there is a strong groundswell towards building RESTful systems on the Web rather than tunnelling through HTTP with XML payloads.

Career History

Chief Scientist

Neo4j

Oct 2010 – Present London

I encountered Neo4j while working at ThoughtWorks, and the data model seemed so natural that I became involved as an open source contributor building the first Neo4j Server implementation. As Neo4j gained ground commercially, I moved over to the company full time as Chief Scientist and executive manager. Initially I lead the engineering team delivering the early versions of the database product, then worked for a long time building fault-tolerant clustering for the Neo4j database. I currently lead Neo4j Research, an empirical systems-focussed group that provides optionality for the long-term future of Neo4j. We work alongside our engineering team and academic researchers on the next generation of graph data system.

Responsibilities included:

Research manager
Executive manager
Empirical research on scalable fault-tolerant methods

Director of Professional Services

ThoughtWorks

Jan 2005 – Oct 2010 Sydney, London

I joined ThoughtWorks in Sydney as part of a small group of early employees. My initial responsibilities were to help drive sales of consultancy in finance, media, and telecoms to deliver consulting and software delivery services. While at ThoughtWorks, I created a community of practice around SOA and developed a lightweight, iterative method of building service-oriented systems known as “Guerilla SOA.” After a move to London, I was promoted to Director of Professional services, and continued to provide strategic technology advisory (internally and externally), sales and marketing support, as well as building large-scale software systems for clients.

Responsibilities included:

Leading technlogy delivery
Strategic technology advisory
Office of the CTO

Senior Research Associate

Newcastle University

Jan 2004 – Dec 2004 hosted by University of Sydney

I took a role as a Senior RA at the Newcastle University (UK), working at Sydney University (Australia). My role involved the development of example systems of Web Services that demonstrated the utility of the WS-* protocols for Grid computing, rather than needing to develop a new, competing suite of protocols for that domain.

While at the University of Sydney, I also lectured a Masters degree course in Parallel Computing.

Responsibilities included:

Research on emerging Web services standards and Grid computing
WS-GAF protocol design and empirical validation
Co-author of SSDL
Outreach to Australian academia

Senior Developer

Bluestone/Hewlett-Packard/Arjuna

Oct 2000 – Oct 2003 Newcastle upon Tyne

I joined Bluestone software’s Arjuna lab from my Ph.D. initially to work on transactional workflow middleware. As Web Services rose to prominence, I started a new team around transaction support for systems of Web Services. I lead the development of this middleware through being acquired by HP, and later spun out as Arjuna again. Ultimately the Arjuna IP was sold to JBoss.

Responsibilities included:

Design and implementation of Web Services transaction protocols and platform-specific bindings (Java, .NET)
Web Services transaction protocols standardisation
Co-author of “Developing Enterprise Web Services”
Industry and partner outreach

Publications

Search publication history

Jim Webber, Georgios Theodorakis, Hugo Firth, Natacha Crooks

May 2026 SIGMOD 2026 Graph Database Management Systems, Transactions, Fault-Tolerance, Leaderless Consensus

RIOT: Replicated Independently-Ordered Transactions

Consensus protocols such as Raft and Paxos implement state machine replication through a single leader that enforces a totally ordered log. While this simplifies correctness, it introduces sequential bottlenecks that restrict scalability. We present RIOT, a generalized consensus protocol that eliminates centralized leadership and log replication in favor of decentralized coordination over a directed acyclic graph (DAG) of entries. RIOT guarantees that all servers maintain a logically identical DAG, preserving order where conflicts require it while allowing commutative operations to execute concurrently. RIOT is motivated by our work on distributed graph databases,which must guarantee reciprocal consistency for edges that span shards. Unlike specialized transaction protocols, RIOT makes no assumptions about concurrency control or transaction models. It provides a replicated state machine abstraction that integrates cleanly with transactional databases, treating DAG entries as transaction placeholders. Both single-phase and two-phase variants are supported, ensuring atomic agreement on entries and their ordering constraints. We integrate RIOT with Neo4j and evaluate it against Neo4j’s production Raft implementation. For common workloads, RIOT delivers up to 2.5× higher throughput and 2.3× lower tail latency while matching the strong consistency guarantees of log-based consensus. In doing so, RIOT demonstrates how consensus can be generalized to unlock scalability for transactional databases at scale.

PDF DOI

Paul Ezhilchelvan, Isi Mitrani, Jim Webber

October 2025 MASCOTS 2025 Server Replication, Total Order, Logical Ring, Approximate Analysis, Parallel Queues, Bulk Service

Performance Evaluation of a Multi-Folder Ring Protocol for Total Ordering of Messages

In a system containing several distributed servers, messages of random sizes generated at different locations must be disseminated and processed in the same order by all hosts. A ring protocol is defined, where a number of folders carrying messages circulate in one direction without overtaking each other. A model involving parallel queues is analysed in the steady state and is solved approximately, allowing the computation of performance measures. A number of example systems are evaluated numerically and by simulations, leading to a heuristic for choosing the optimal number of folders.

PDF

Georgios Theodorakis, Hugo Firth, James Clarkson, Natacha Crooks, Jim Webber

September 2025 VLDB 2025 Graph Database Management Systems, Transactions

TuskFlow: An Efficient Graph Database for Long-Running Transactions

Mammoth transactions, which involve long-running operations that access many items, are common in graph workloads. Graph analytics tasks, including pattern matching and graph algorithms, can generate large read-write operations that impact signi!cant portions of data, which makes their execution challenging under strict isolation guarantees. Consequently, we face an apparent trade-off between ensuring high isolation and achieving high performance, forcing users to choose between the two. In this work, we present TuskFlow, an experimental graph database based on Neo4j, designed to e#ciently handle mammoth transactions on graphs (the technique is applicable to other models such as relational) while maintaining existing transactional semantics. TuskFlow employs a deterministic protocol that safely reorders regular transactions around mammoths within an epoch. Our protocol supports parallel mammoth execution inspired by graph-parallel algorithms. To minimize con$icts with regular transactions, TuskFlow introduces query- and workload-aware optimizations, including graph entity tagging and partitioning. Our experiments demonstrate that, unlike traditional protocols like two-phase locking or MVCC, TuskFlow avoids blocking write transactions and improves tail latency by up to 45x.

PDF DOI

Ye Liu, Paul Ezhilchelvan, Yingming Wang, Jim Webber

July 2025 IDEAS 2025 Replication, Ordering, Distributed Systems, Ring Networks

Throughput-Driven Database Replication Using a Ring-Based Order Protocol

We present a database replication architecture that guarantees ACID transaction properties as well as high throughput expected of modern database systems. Higher throughput results due to server replicas processing distinct, non-overlapping subsets of incoming transactions in parallel. Our novel approach addresses all challenges that emerge in ensuring ACID properties across all incoming transactions processed in parallel even when access pattern of transactions is not known a priori. At the core of our approach is a high-throughput, ring-based total order protocol which the database replicas use to reach consensus for resolving conflicts among transactions, ensuring serializability and accomplishing atomic commit. After presenting the architecture, protocol performance is evaluated through implementations when replication degree is two and three, tolerating at most one replica crash. While 2-fold replication requires perfect crash detection, three-fold can do with weak detectors.

PDF

Yingming Wang, Paul Ezhilchelvan, Jack Waudby, Jim Webber

June 2024 EPEW 2024 Transactions, Concurrency Control, Database Management Systems

Implementations Based Evaluation of No-Wait Approach for Resolving Conflicts in Databases

In this paper, we describe No-Wait concurrency control mechanisms to address conflict resolution and then comprehensively evaluate their performance under Read-Committed and Serializability isolation levels using an in-memory database system in various configurations and contention scenarios. Key performance metrics are percentage of transaction aborts and average latency for those who do not abort. Our evaluations affirm that the No-Wait approach indeed offers a cost-effective, practical alternative to traditional conflict resolution mechanisms.

PDF

See all publications

Scientific Peer Review

2026

VLDB

2026

ICDE

2025

VLDB

2024

SEAGRAPH: Search, Exploration, and Analysis in Heterogeneous Datastores - Graph Edition

2019

Communications of the ACM

See all academic activity

Socal Media

Twitter and BlueSky

I have Twitter and BlueSky accounts which are mixture of chatter with friends and colleagues, some computing science things, and a dash of left politics.

Following the example of Jonthan Dowland, my Twitter feed has a sliding window of 90 days worth of tweets. I like Twitter (somewhat) for conversations, but as a system of record much less so.

Facebook, Instagram, Snapchat etc.

I’m not on any other social media sites, I prefer email. If you meet a Jim Webber on any other platforms, it’s not me.

Jim Webber

Chief Scientist

Biography

Interests

Education

Recent & Upcoming Talks

Books

Recent Posts

Career History

Chief Scientist

Director of Professional Services

Senior Research Associate

Senior Developer

Publications

Scientific Peer Review

Socal Media

Twitter and BlueSky

Facebook, Instagram, Snapchat etc.