Állás részletei
-
Cég neve
High Tech Engineering Center Kft.
-
Munkavégzés helye
1052, Budapest, Szervita tér 8. -
Munkaidő, foglalkoztatás jellege
- Alkalmazotti jogviszony
- Általános munkarend
-
Elvárt technológiák
- PYTHON DEBUGGING TESTING C C++ LINUX HARDWARE
-
Elvárások
- Angol középfok
- 1-3 év tapasztalat
- Főiskola
Állás elmentve
A hirdetést eltávolítottuk a mentett állásai közül.
Hasonló állásokról értesítőt állítottunk be!
Válogatást küldünk a hasonló lehetőségekről e-mailben, és app felhasználóinknak push értesítésben is.
Állás leírása
Responsibilities
- Building and optimizing inference pipelines for large-scale model serving
- Working with frameworks like PyTorch, TensorRT, and vLLM for efficient model deployment
- Implementing and optimizing ML models using quantization, kernel fusion, and efficient batching
- Optimizing and implementing core ML operators such as GEMMs, convolutions, and activations
- Investigating and resolving issues through system-level debugging and performance analysis
- Defining and applying practices for testing, deployment, and scaling AI systems
Requirements
- BSc/MSc in Computer Science, Engineering, Mathematics, or related discipline
- Strong programming skills in C/C++ or Python in Linux environments using common development tools
- Solid knowledge of computer architecture, system software, and data structures
- Hands-on experience implementing algorithms in high-level languages (C/C++/Python)
- Exposure to specialized hardware (GPUs, FPGAs, DSPs, AI accelerators) and frameworks such as OpenCL or CUDA
- Experience designing or working with high-performance software systems
- Solid knowledge of ML fundamentals
- Experience in model serving frameworks such as Triton Inference Server, DeepSpeed Inference, or vLLM
- Experience with ML runtimes such as ONNX Runtime, TVM, IREE, or XLA
- Experience deploying ML workloads (LLMs, VLMs, NLP, etc.) across distributed systems
- Experience implementing and optimizing ML operators and kernels with focus on vectorization and efficient execution
- Experience in hardware-aware optimizations and performance tuning
- 2+ years of experience developing software targeting AI hardware
- Motivated team player with a strong sense of responsibility
Nice-to-have
- Contribution to open-source projects such as LLVM/MLIR, PyTorch, TensorFlow, ONNX Runtime, xDSL, or IREE
How to apply
You can submit your application on the company's website, which you can access by clicking the „Apply on company page“ button.
Állás, munka területe(i)
Álláshirdetés jelentése