High Tech Engineering Center Kft. logó

AI Inference Engineer

Jelentkezés a cégnél

Cég neve

High Tech Engineering Center Kft.
Munkavégzés helye

1052, Budapest, Szervita tér 8.

Munkaidő, foglalkoztatás jellege
- Alkalmazotti jogviszony
- Általános munkarend
Elvárt technológiák
- PYTHON DEBUGGING TESTING C C++ LINUX HARDWARE
Elvárások
- Angol középfok
- 1-3 év tapasztalat
- Főiskola

Állás elmentve

A hirdetést eltávolítottuk a mentett állásai közül.

Responsibilities

Building and optimizing inference pipelines for large-scale model serving
Working with frameworks like PyTorch, TensorRT, and vLLM for efficient model deployment
Implementing and optimizing ML models using quantization, kernel fusion, and efficient batching
Optimizing and implementing core ML operators such as GEMMs, convolutions, and activations
Investigating and resolving issues through system-level debugging and performance analysis
Defining and applying practices for testing, deployment, and scaling AI systems

Requirements

BSc/MSc in Computer Science, Engineering, Mathematics, or related discipline
Strong programming skills in C/C++ or Python in Linux environments using common development tools
Solid knowledge of computer architecture, system software, and data structures
Hands-on experience implementing algorithms in high-level languages (C/C++/Python)
Exposure to specialized hardware (GPUs, FPGAs, DSPs, AI accelerators) and frameworks such as OpenCL or CUDA
Experience designing or working with high-performance software systems
Solid knowledge of ML fundamentals
Experience in model serving frameworks such as Triton Inference Server, DeepSpeed Inference, or vLLM
Experience with ML runtimes such as ONNX Runtime, TVM, IREE, or XLA
Experience deploying ML workloads (LLMs, VLMs, NLP, etc.) across distributed systems
Experience implementing and optimizing ML operators and kernels with focus on vectorization and efficient execution
Experience in hardware-aware optimizations and performance tuning
2+ years of experience developing software targeting AI hardware
Motivated team player with a strong sense of responsibility

Nice-to-have

Contribution to open-source projects such as LLVM/MLIR, PyTorch, TensorFlow, ONNX Runtime, xDSL, or IREE

How to apply

You can submit your application on the company's website, which you can access by clicking the „Apply on company page“ button.

Jelentkezés a cégnél

Állás, munka területe(i)

AI and Automation

Álláshirdetés jelentése

Állás, munka területe(i)

AI and Automation

Álláshirdetés jelentése

Jelentkezés a cégnél

Jelentkezés a cégnél