Senior Site Reliability Engineer – Trustwise (Austin)
About Trustwise:
At Trustwise, we are deeply committed to building an AI Trust layer that helps companies unlock Generative AI’s full potential. Our software helps enterprises deploy AI systems that are safe, aligned, and production-ready. With modules for model oversight, policy enforcement, and operational optimization, Trustwise enables developers and enterprises to manage risks, meet compliance requirements, and accelerate AI adoption with confidence. We work with leading organizations across finance, healthcare, retail, and consulting, and integrate seamlessly with platforms like Nvidia NIM, Azure, AWS and major LLM providers.
Trustwise, Inc seeks a Senior Site Reliability Engineer in Austin, TX.
Duties:
Ensure high availability and reliability of our AI systems through robust infrastructure development and management. Conduct post-incident reviews and implement preventive measures; Develop tools to automate deployment, monitoring, and scaling processes. Performance Optimization: Monitor systems, identify performance bottlenecks, and implement; Develop solutions to enhance efficiency; Partner with engineering teams to build resilient, reliable and scalable systems and infrastructure for AI applications; Ensure compliance with security best practices and policies; Monitor system performance, identify bottlenecks, and develop solutions to improve overall security system health; Develop and implement automation tools for deployment, monitoring, and operations. Implement alerting mechanisms and health checks for the product and infrastructure stack; and Interface with stakeholders, presenting technical information in a clear, concise manner.
Telecommuting permitted.
Qualifications/Skills:
Master’s degree or foreign equivalent in Computer Science or in related field and 5 years of experience in the job offered or in a computer-related occupation.
Requires 5 years of experience with each of the following:
- Working with Cloud Providers (AWS, Azure and GCP);
- Working with customer Devops and Infosec teams for enterprise air gapped deployment;
- Linux shell scripting, Go, Python and creating automation of pipeline development;
- HELM charts, package management, Kubernetes, CICD;
- Micro services and container orchestration like Docker. Distributed computing, enterprise product deployment; and
- Monitoring and alerting for platforms and API management.
This role is ideal for engineers who value autonomy, ownership, and impact—not for those seeking highly structured environments or narrow scopes. At Trustwise, you’ll be part of a team building secure, reliable systems that underpin the future of AI.
40 hours/week. Must also have authority to work permanently in the U.S. Applicants who are interested in this position may apply at https://www.jobpostingtoday.com/ Ref # 37018.