We are looking for a Solutions Architect with experience in Generative AI pipeline development and deployment. As part of the Solutions Architect organization, we work with the most exciting computing hardware and software, driving the latest breakthroughs in deep learning and machine learning with NVIDIA’s key customers. This role offers an excellent opportunity to build your career in the rapidly growing field of AI while enabling the world's most successful technology companies. Primary responsibilities will be to lead software customer technical engagements with NVIDIA products and technologies. Join us in this exciting endeavor!
What You’ll Be Doing:
Develop and demonstrate software solutions to customers based on NVIDIA’s groundbreaking AI, data science software, and hardware technologies. Develop GenAI model pipelines and perform in-depth analysis and optimization to ensure the best performance on current- and next-generation GPU architectures
Lead and develop proofs of concept (PoCs) for solutions applied to Consumer Internet industry use cases such as NLP/LLM, retrieval, and recommender systems by working closely with customers' AI developers. Build collateral (notebooks/code) for PoCs
Work closely with the business development team through the sales process for GPU/Network hardware/software products. Own the technical relationship and enable customers to build innovative solutions based on NVIDIA technologies
Partner with NVIDIA Engineering, Product, and Sales teams to secure design wins at customers. Enable development and growth of NVIDIA product features through customer feedback and PoC evaluations
What We Need To See:
BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or other Engineering fields or equivalent experience
8+ years of experience as an ML/Software Engineer with a proven track record of coding in Python and/or C++ with popular AI software libraries and GPUs
Experience with GenAI applications and LLM fine-tuning, inference optimization and/or RAG pipelines
Ability to communicate your ideas and code clearly through GitHub and documentation
Great teammate who enjoys collaborating with teams across the organization such as Engineering/Research, Sales, Product, and Marketing
Effective verbal/written communication, and technical presentation skills
Self-starter with a passion for growth and enthusiasm for continuous learning and sharing findings across the team
Ways To Stand Out From The Crowd:
External customer-facing skills and background
Experience with large-scale production data pipelines and AI model training/deployment
Knowledge of MLOps technologies such as containers, Kubernetes, and data center deployments. Experience working with enterprise developers building computer vision, NLP, or data analytics applications
Able to think creatively to debug and solve complex problems
We make extensive use of conferencing tools, but occasional travel is required for local on-site visits to customers and data science conferences. We are open to remote work locations. We look forward to having you join our team!
With highly competitive salaries, a comprehensive benefits package, and an excellent engineering work culture, NVIDIA is widely considered to be one of the technology industry's most desirable employers. NVIDIA has some of the most innovative people working on meaningful problems that are defining the field of ML/DL, data science, robotics, and graphics.
The base salary range is 180,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.