Beyond the CPU Ceiling: NCBA Group’s Shift to an AI Factory Model
Scalium
>
Insights
>
Customer Story
>
Beyond the CPU Ceiling: NCBA Group’s Shift to an AI Factory Model
Industrializing Mobile Banking Intelligence
Customer: NCBA Group
Industry: Financial Services (Digital Banking)
Region: East Africa
Impact: 81% reduction in data prep time; 2.7x acceleration in behavioral scoring.
The Challenge: The Infrastructure Ceiling
As a leader in African mobile banking, NCBA Group manages data-intensive workloads for a customer base exceeding 60 million users. Their legacy infrastructure, built on serial, CPU-bound Oracle databases, reached a physical performance wall.
The CPU-era data pipelines were fundamentally incapable of processing transactional volumes at the speed required for modern digital banking. This created an Impedance Incompatibility between their data storage and their operational ambitions, resulting in:
The 6 AM Deadline Failure: Critical nightly ETL jobs prepared data for noon instead of the 6 AM operational deadline. This forced loan approval teams to operate on stale data, creating significant credit exposure and business risk.
Operational Blind Spots: High-complexity reports and heavy JOIN queries frequently failed to complete on the legacy architecture, leaving the business without the visibility required for critical decision-making.
Innovation Stagnation: The engineering team was overloaded with basic data logistics, leaving limited bandwidth for new initiatives such as AI models.
The Solution: The AI Production Layer
NCBA required a fundamental shift from a "prototype" mindset to an AI Factory model. They bypassed further expansion of their legacy 22-server Hadoop cluster —and implemented SCAILIUM as their AI Production Layer.
SCAILIUM provided a GPU-native dataflow engine that unified their disparate transactional sources into a continuous production environment.
The Industrial Backbone Specs:
Infrastructure Consolidation: Consolidated a 22-server Hadoop environment into two SCAILIUM nodes.
Compute Power: Each node equipped with 4x A100 80GB GPUs.
Physics-Aligned Architecture: Leveraged multi-level GPU parallelization to eliminate the "Serialization Tax" inherent in CPU-only pipelines.
The Results: Intelligence at Silicon Speed
By moving ingestion, transformation, and curation directly onto the GPU, NCBA eliminated silicon starvation and achieved industrial-scale performance.
Metric | Legacy Infrastructure | SCAILIUM Layer | Improvement |
Data Loading & Prep (All processes aggregated) | 37 Hours | 7.5 Hours | 81% Faster |
Operational Reporting (1) | 30 Minutes | 5 minutes | 6x Acceleration |
Operational Reporting (2) | Incomplete (high load) | 100% completion | 100% completion |
Critical Nightly ETL Job | 7 hours | 4 hours | 1.75x Acceleration |
BI Workloads | 98 Minutes | 11 Minutes | 88% Faster |
The Bottom Line
"It’s all about performance - being able to provide a positive experience while reducing time to insight," says Leslie Chemwolo, Head of Data Engineering and Infrastructure.
By establishing this high-velocity dataflow, NCBA is currently implementing SCAILIUM’s MCP (Model Context Protocol). This will bridge the gap from high-speed reporting to a fully industrial AI Production environment.
