Beyond the CPU Ceiling: NCBA Group’s Shift to an AI Factory Model

Scalium

>

Insights

>

Customer Story

>

Beyond the CPU Ceiling: NCBA Group’s Shift to an AI Factory Model

Industrializing Mobile Banking Intelligence

Customer: NCBA Group

Industry: Financial Services (Digital Banking)

Impact: 81% reduction in data prep time; 2.7x acceleration in behavioral scoring.


The Challenge:  The Infrastructure Ceiling

As a leader in African mobile banking, NCBA Group manages data-intensive workloads for a customer base exceeding 60 million users. Their legacy infrastructure, built on serial, CPU bound Oracle databases, reached a physical performance wall.

The CPU-era data pipelines were fundamentally incapable of processing transactional volumes at the speed required for modern digital banking. This created an Impedance Incompatibility between their data storage and their operational ambitions, resulting in:

  • The 6 AM Deadline Failure: Critical nightly ETL jobs prepared data for noon instead of the 6 AM operational deadline. This forced loan approval teams to operate on stale data, creating significant credit exposure and business risk.

  • Operational Fragility: High-complexity reports and heavy JOIN queries frequently failed to complete on the legacy architecture, leaving the business without the visibility required for critical decision-making.

Architectural Realization

While working to resolve these bottlenecks, NCBA recognized that the same CPU-bound data preparation layer also constrains future AI initiatives.

AI use cases such as risk modeling, fraud detection, and personalization require high￾throughput feature assembly and complex joins at scale.

Even with GPUs downstream, the data layer would starve any accelerator-based system.

The Solution: The AI Production Layer:

NCBA required a fundamental shift from a "prototype" mindset to an AI Factory model. They bypassed further expansion of their legacy 22-server Hadoop cluster, and implemented SCAILIUM as their AI Production Layer.

SCAILIUM provided a GPU-native dataflow engine that unified their disparate transactional sources into a continuous production environment.

The Industrial Backbone Specs:

  • Infrastructure Consolidation: Consolidated a 22-server Hadoop environment into two SCAILIUM nodes.

  • Compute Power: Each node equipped with 4x A100 80GB GPUs.

  • Physics-Aligned Architecture: Leveraged multi-level GPU parallelization to eliminate the "Serialization Tax" inherent in CPU-only pipelines.

The Results: Intelligence at Silicon Speed

By shifting structured data preparation to a GPU-native architecture, NCBA:

  • Eliminated the ETL bottleneck

  • Restored operational deadlines

  • Reduced infrastructure sprawl

  • Established an AI-ready data foundation

They started by fixing BI. They ended by removing the infrastructure ceiling.

Metric

Legacy Infrastructure

SCAILIUM Layer

Improvement

Data Loading & Prep (All processes aggregated)

37 Hours

7.5 Hours

81% Faster

Operational Reporting (1)

30 Minutes

5 minutes

6x Acceleration

Operational Reporting (2)

Incomplete (high load)

100% completion

 100% completion

Critical Nightly ETL Job

7 hours

4  hours

1.75x Acceleration


The Bottom Line

"It’s all about performance - being able to provide a positive experience while reducing time to insight," says Leslie Chemwolo, Head of Data Engineering and Infrastructure.

By establishing this high-velocity dataflow, NCBA is currently implementing SCAILIUM’s MCP (Model Context Protocol). This will bridge the gap from high-speed reporting to a fully industrial AI Production environment.