Állás részletei
-
Cég neve
High Tech Engineering Center Kft.
-
Munkavégzés helye
Országos lefedettség -
Munkaidő, foglalkoztatás jellege
- Alkalmazotti jogviszony
- Általános munkarend
-
Elvárt technológiák
- PYTHON AZURE API DATABASES DEBUGGING CI
-
Elvárások
- Nem kell nyelvtudás
- Nem kell tapasztalat
- Középiskola
Állás elmentve
A hirdetést eltávolítottuk a mentett állásai közül.
Állás leírása
Responsibilities
Build production-grade AI systems with focus on retrieval quality, agent orchestration, and reliability at scale
Take ownership of the full RAG pipeline from document ingestion and chunking strategy through retrieval design, evaluation, model integration, and stateful agent workflows
Design chunking and ingestion architecture with deterministic, validation-gated pipelines handling diverse enterprise document types with indexing and contextual enrichment
Develop retrieval pipeline including vector search, BM25 hybrid retrieval, RRF re-ranking, embedding model selection and optimisation
Design stateful, multi-step agent workflows with LangGraph; build chains and tool-use patterns with LangChain
Implement observability including tracing, debugging, and monitoring pipeline performance end-to-end with LangSmith
Evaluate and tune models using RAGAS-based evaluation frameworks, benchmarking retrieval recall and answer faithfulness
Deploy open-weight LLM models (Llama 4, Qwen3, MiMo-V2) under memory and latency budgets via Amazon Bedrock and self-hosted inference
Design async workflow orchestration pipelines with Temporal.io; containerised deployment on Kubernetes via GitLab CI
Take ownership of the full RAG pipeline from document ingestion and chunking strategy through retrieval design, evaluation, model integration, and stateful agent workflows
Design chunking and ingestion architecture with deterministic, validation-gated pipelines handling diverse enterprise document types with indexing and contextual enrichment
Develop retrieval pipeline including vector search, BM25 hybrid retrieval, RRF re-ranking, embedding model selection and optimisation
Design stateful, multi-step agent workflows with LangGraph; build chains and tool-use patterns with LangChain
Implement observability including tracing, debugging, and monitoring pipeline performance end-to-end with LangSmith
Evaluate and tune models using RAGAS-based evaluation frameworks, benchmarking retrieval recall and answer faithfulness
Deploy open-weight LLM models (Llama 4, Qwen3, MiMo-V2) under memory and latency budgets via Amazon Bedrock and self-hosted inference
Design async workflow orchestration pipelines with Temporal.io; containerised deployment on Kubernetes via GitLab CI
Requirements
Strong software engineering skills in .NET and/or Python with experience building production-grade APIs and services
Applied algorithmic problem solving including search, ranking, clustering, and data extraction/transformation pipelines
Solid understanding of LLM architectures and trade-offs (e.g. GPT-family, Claude, Llama, and similar models)
Experience integrating LLM and agent APIs (e.g. Azure OpenAI, OpenAI API, Claude API, Amazon Bedrock)
Design and implementation of agent-based systems for automation, orchestration, and decision support
Experience working with relational, document, and graph databases including data modeling and querying
Web data ingestion experience including web scraping, schema extraction, and handling changing source structures
Experience building reliable, scalable, maintainable AI systems including logging, error handling, and performance considerations
Applied algorithmic problem solving including search, ranking, clustering, and data extraction/transformation pipelines
Solid understanding of LLM architectures and trade-offs (e.g. GPT-family, Claude, Llama, and similar models)
Experience integrating LLM and agent APIs (e.g. Azure OpenAI, OpenAI API, Claude API, Amazon Bedrock)
Design and implementation of agent-based systems for automation, orchestration, and decision support
Experience working with relational, document, and graph databases including data modeling and querying
Web data ingestion experience including web scraping, schema extraction, and handling changing source structures
Experience building reliable, scalable, maintainable AI systems including logging, error handling, and performance considerations
Nice-to-have
Fine-tuning and preference optimization techniques (e.g. supervised fine-tuning, RL-style approaches via APIs), Designing pipelines that convert unstructured data into reliable structured representations, Exposure to event-driven or background processing architectures for long-running AI workflows, Experience contributing to or operating AI-powered systems in production
How to apply
You can submit your application on the company's website, which you can access by clicking the „Apply on company page“ button.
Állás, munka területe(i)
Álláshirdetés jelentése