Data Masking at Scale: Implementing IBM Optim Functions with Spring Boot

Why Data Masking Matters More Than Ever

In the age of AI, your training data is only as safe as your masking layer. Exposing PII in enterprise data pipelines — even internally — is a compliance and security risk. IBM Infosphere Optim solves this, but its script generation was entirely manual.

The Intellisphere Project

On the IBM Optim Modernization project, I rebuilt the Access Definition (AD) Script Generator backend from scratch. The goal: generate valid Optim scripts programmatically so engineers never had to hand-write them.

The Architecture

User configures fields in UI (Next.js)
          ↓
Spring Boot REST API
          ↓
Velocity Engine (template rendering)
          ↓
Generated Optim script
          ↓
Imported into IBM Optim Classic (validated)

7 Data Masking Functions Implemented

Function	Purpose
Age	Randomize age within a realistic range
Seq	Sequential numeric replacement
Shuffle	Shuffle column values across rows
Substring	Partial data extraction/replacement
TRANS_EML	Email address transformation
RAND_LOOKUP	Random value from lookup table
HASH_LOOKUP	Hash-based deterministic lookup

Each function was implemented, merged, and regression-tested against real Optim imports.

Redis Caching for Performance

To speed up repeated script generation requests, I added Spring Boot Redis caching using RedisTemplate. I also led a team knowledge-sharing session on this — turning a personal implementation into team-wide competency.

Testing & Quality

JUnit + Mockito for backend unit tests
Jest for the Next.js LMS application
SonarQube compliance enforced throughout

Key Insight

Velocity Engine is a powerful but underused tool for template-driven code generation. Pairing it with a Spring Boot API and a UI config layer creates a self-service script factory — no Optim expertise required from end users.

Conclusion

Data masking is foundational infrastructure for responsible AI. Building it well means your downstream models, reports, and integrations never touch real PII.

Data Masking at Scale: Implementing IBM Optim Functions with Spring Boot and Velocity Engine