Job Description
Responsibilities:
Build performance scenarios for Import PO workflows including:
-Bulk PO creation (batch → group → child workflows)
-CDC flow throughput
-Mutation API write performance
-OpenSearch index/search latency
Identify bottlenecks across services, API Gateway, Cassandra layers, and Kafka pipelines
Design and execute failover + resiliency tests (node restart, pod eviction, consumer rebalancing)
Produce detailed performance reports with recommendations for tuning
Validate scalability under multi market load (CA, MX, CL, CAM)
Work with Engineering/SRE to implement optimizations and retest
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
6-10 years experience with LoadRunner, JMeter, Gatling, Locust, or similar load testing tools
Ability to design load, stress, endurance, soak, and failover tests
Strong understanding of Kafka throughput, consumer lag, backpressure, and partition strategies
Experience profiling Cassandra read/write performance, compaction impact, and query optimization
Ability to interpret performance metrics from Grafana, Splunk, and distributed tracing tools
Experience building performance benchmarks aligned to SLAs (e.g., 100K records < 1 minute)
Nice to Have Skills & Experience
Chaos Engineering tools (Gremlin, Litmus)
Automated performance testing in CI/CD
Prior experience with large scale retail or supply chain workloads
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.