Location: United_States , Posted on: January 5, 2022

Position: Principal Big Data Architect–Anywhere in the US or Canada

We are looking for a Lead Data Engineer/Architect for the Professional Services Division at Mastech InfoTrellis.  Your objective is to lead the engineering team in building the next generation of MDM solutions with analytic functionalities, integrating disparate source systems for Fortune 500 clients.  Work will be conducted largely on a remote basis with travel to client sites required on an as-needed basis.


  • Act as the Data Engineering SME for the Professional Services Division of Mastech InfoTrellis.
  • Develop PoCs, strategies, and provide direction around data security, archiving, and retention, high availability, and disaster recovery.
  • Mentor internal technical and business teams in understanding the overall client requirements and proposed Data Engineering Solutions.
  • Conduct code reviews in Java, Javascript, and Python to ensure technical functionality of the solution.
  • Support Sales and Marketing from a technical standpoint to educate clients on Mastech InfoTrellis’ Data engineering capabilities
  • Provide direction to business teams to perform analytics on client data.
  • Work closely with client technical heads, business heads, and business analysts to identify, understand, and document business and technical requirements along with constraints.
  • Build trust with the client and establish self as the single point of contact for all architecture questions.
  • Drive technical architecture meetings and help clients select the right architecture for their solutions that meet both functional and non-functional requirements today and in future.


  • 10+ years of working experience in designing, architecting, and deploying enterprise level solutions.
  • Expertise in a Linux OS environment is preferred.
  • Strong proficiency in Java and JavaScript are mandatory.
  • Experience in big data products and ETL solutions in a consulting capacity.
  • Highly effective communication and collaboration skills working in a distributed team environment.
  • Technical leadership and strong independent problem solving skills.
  • Experience with Big Data Technology, specifically with Python, Streamsets, and Kafka, are preferable.
  • Degree in Computer Science, Statistics, or Mathematics, plus relevant experience.
  • Optional but nice to have skillsets include Sqoop, Pig, PySpark, Hive, Oracle, MS SQL Server, Greenplum, & Tiger Graph.

Company Description:

Mastech InfoTrellis, is a subsidiary of Mastech Digital. We are a premier Data & Analytics Company helping customers realize the full value of their Enterprise Data Assets through the use of our expertise in Master Data Management, Enterprise Data Integration and Big Data Technologies. Services provided cover all aspects of the Information Management lifecycle, ranging from evaluating clients’ needs, capturing detailed business and technical requirements, vendor selection, design, implementation, testing, deployment, and training. Mastech InfoTrellis continues to innovate to offer advanced solutions which deliver Enterprise Customer 360 and Self Service Analytics using Structured, Unstructured, Internal and External Data Assets for several Fortune 500 companies including Lowe’s USA, Dell Inc., JC Penney, Express Scripts, and Ford among others. Mastech InfoTrellis continues to seek the best talent available in the market to enhance our Delivery capabilities.

Job Summary

  • City: N/A
  • Address: N/A
  • Work Hours/Week: 10
  • Work Environment: Remote
  • Employment Type: Full-Time
  • Career Level: Experienced
  • Pay Type: N/A
  • Required Travel(%): N/A
  • Exempt/Non-Exempt: N/A
  • People Manager: N/A
  • Application Deadline: N/A
  • Platform: Operations
  • Req ID: #job7


Our core values

Ambition– We’ve come this far carrying big visions, and with them, we’ve built a team of ambitious individuals who never cease to find new ways to improve data practices.
Innovation – Our mantra is to innovate and modernize products and services constantly. By continuously experimenting, we have improved systems that bring efficiency at scale.
Success-driven – Our projects begin with success as the goal post. We work, collaborate, and implement strategies that derive positive outcomes and settle for nothing less.
Data-first – Data is our religion. Our united effort is to take data management, data science, and data analytics up a notch by implementing best-in-class strategies.

Our core values