Job Description
• Design, build, and maintain highly scalable and reliable data pipelines to move and transform data across large-scale, distributed systems at exabyte scale.
• Architect, implement, and deploy new data models and data processing workflows in production, ensuring data quality, integrity, and compliance with data privacy regulations.
• Collaborate with cross-functional teams — including software engineers, data scientists, and product managers — to understand data requirements and deliver effective, high-impact solutions.
• Develop and optimize data-driven systems and solutions to enhance operational scale, efficiency, and generate actionable business insights aligned with evolving product and business needs.
• Work closely with engineering teams to plan and execute on half-yearly roadmaps that align with business priorities and available engineering capacity.
• Perform data quality monitoring, root cause analysis, and proactive resolution of data pipeline issues to ensure high reliability and trust in data assets.
• Develop comprehensive documentation for all data pipelines, models, and related processes, and cross-train team members on new and evolving data systems.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
• • 4+ years of experience in a data engineering role, with a proven track record of designing and managing large-scale data pipelines in a production environment.
Expert-level proficiency in SQL (Presto SQL and/or Spark SQL) and strong programming skills in Python, Java, or Scala.
• Hands-on experience with big data technologies including Apache Spark, Presto, and Hive, including query optimization and performance tuning at scale.
Nice to Have Skills & Experience
• Ability to work independently and collaboratively in a fast-paced, ambiguous environment with a strong bias for action and impact.
• Excellent communication skills with the ability to articulate complex technical concepts clearly to both technical and non-technical stakeholders across engineering, product, and business teams.
• Strong problem-solving and analytical skills, with a deep understanding of business drivers and a focus on delivering measurable impact.
• Experience with real-time data processing and streaming technologies (e.g., Apache Kafka, Flink, or Scuba-equivalent systems).
• Demonstrated experience with data privacy, data governance, and compliance frameworks in a large-scale enterprise environment.
• Excellent communication and interpersonal skills, with the ability to communicate complex technical concepts to both technical and non-technical audiences.
• Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.