Job Details:

Job Description: 

We are looking for a dynamic software engineer to design, develop and optimize AI frameworks for training and inference on Intel Habana (https://habana.ai/) deep learning accelerators. In this role, you will work with a cross-geo team on enabling and optimizing state of the art deep learning models with a specific focus on the PyTorch framework. The roles and responsibilities that you would need to carry out may include the following:
Design and develop SW techniques for AI frameworks - both HW-agnostic and HW-aware Contribute to enhancing and extending the Training and Inference capabilities in the Software stack. Profile deep learning inference and training workloads and identify optimization opportunities in the software stack.

Qualifications:

BTech, MS or PhD in CS or related fields with an overall experience of 10 to 15 years

Programming skills in Advanced C++, Python and parallel programming skills

Previous exposure to Machine Learning (ML) frameworks such as PyTorch and Tensorflow.

Detailed understanding of machine learning systems optimization and deployment techniques such as quantization

understanding of optimization strategies for deployment of Large Language Models (LLMs)

knowledge of transformers, KV cache , prefill buffer etc optimzation technique for inference.

Working knowledge of operators in Pytorch or Tensorflow and Understanding of low level kernels.

Ability to debug complex issues in multi layered SW systems. Understanding of SW integration across open source framework and internal bridge layers.

Understanding of computer architecture and HW-SW optimization techniques

Practical knowledge of DL topologies for different use cases

Knowledge of compiler algorithms for heterogeneous systems

Experience working on frameworks/platforms that have gone to production

Effective communication skills and experience with working in a cross-geo setup

Preferred knowledge of open source compiler infrastructure like LLVM or gcc

Job Type:

Experienced Hire

Shift:

Shift 1 (India)

Primary Location: 

India, Bangalore

Additional Locations:

Business group:

The Data Center & Artificial Intelligence Group (DCAI) is at the heart of Intel’s transformation from a PC company to a company that runs the cloud and billions of smart, connected computing devices. The data center is the underpinning for every data-driven service, from artificial intelligence to 5G to high-performance computing, and DCG delivers the products and technologies—spanning software, processors, storage, I/O, and networking solutions—that fuel cloud, communications, enterprise, and government data centers around the world.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

N/A

Work Model for this Role

This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. In certain circumstances the work model may change to accommodate business needs.

Location

SRR4 - SRR4 - Sarjapur 4

Job Overview
Job Posted:
7 months ago
Job Expires:
Job Type
Full Time

Share This Job: