Job Description
We are seeking an experienced MDM Subject Matter Expert with strong Java development skills to support and modernize a high-volume IBM MDM environment. This role focuses on identity matching, linking, and golden record creation, with the goal of improving performance and accuracy while reducing SLA response times. The ideal candidate is a hands-on engineer who can assess existing architecture, tune matching algorithms, embed custom Java logic within MDM, and partner closely with business and vendor teams (including IBM) to drive measurable improvements.
This is a hybrid functional and technical role requiring deep understanding of MDM concepts and the ability to implement solutions in Java at scale.
• Conduct comprehensive reviews of the current MDM ecosystem, including matching and linking algorithm performance, duplicate and suspect processing, and custom extensions.
• Evaluate and enhance logging, monitoring, backup, and recovery processes to ensure system reliability and resiliency.
• Assess and optimize transaction/query SLAs and index performance, identifying and addressing known issues such as bucket overflows and bulk-load mechanism gaps (including DataStage constraints).
• Design, develop, and maintain Java code for MDM algorithms, data processing, and integration components, ensuring code is performant, scalable, and maintainable.
• Refactor and optimize existing Java code, and write unit, integration, and system tests for all Java components.
• Adjust matching algorithm thresholds to minimize false positives/negatives and add/test new matching attributes (e.g., email + last name, phone combinations).
• Assess and implement phonetic matching techniques (e.g., Soundex), and lead feasibility studies to determine the need for single or multi-algorithm approaches for different business lines.
• Perform sample pairing for new data sources, collaborating with business teams to identify edge cases and refine algorithm behavior, including store-identifier-based auto-matching where applicable.
• Review and enhance frequency-based and dynamic bucketing strategies, assess passive execution options to minimize production impact, and validate interactions with STR tables and existing buckets.
• Develop and document bulk load strategies, including ingestion options with and without entity management/event handlers; explore disabling matching for high-volume ingestion scenarios (e.g., 200M+ records) and evaluate alternative ingestion patterns beyond API-only approaches.
• Capture and report on metrics, SLAs, and trade-offs for each ingestion strategy.
• Target a matching response time of under 2 seconds through iterative tuning, must-gather data analysis, and load testing with tools such as JMeter; review reconciliation SLAs as part of ongoing optimization.
• Address deployment challenges such as outages and unpredictable environment behaviors; explore and implement CI/CD or improved deployment automation; review and optimize configuration, code, and deployment steps for efficiency.
• Plan and implement OAuth/PingFed-based authentication, reduce reliance on process IDs and passwords, automate credential rotation, and maintain a dedicated 'Security' category in the scope documentation.
• Ensure all code adheres to Walmart’s coding standards and guidelines.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com. To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
Expert-level knowledge of IBM MDM
Current hands-on Java algorithm development experience
Data matching, linking, and deduplication experience
Performance tuning and system health monitoring for large-scale data systems
Data integration frameworks and ETL tools (e.g., DataStage)
SQL and database optimization (indexing, query tuning, bulk operations)
CI/CD pipeline tools and deployment automation (e.g., Jenkins, GitHub)
Secure coding practices, including OAuth authentication and credential management
Microservices API development and integration (e.g., RESTful services)
Unit, integration, and system testing frameworks (e.g., JUnit, Mockito)
Documentation tools (Confluence, GitHub, DX Guides)
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401(k) retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.