Evaluation jobs in Poland. 2326 job offers found.

  • Agentic AI MCP & A2A Specialist

    PORT BLUE SKY Sp. z o.o. Sp.k. PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa one day ago
    ... integration, model abstraction, telemetry, evaluation, and production operations. This role ... tests, observability-driven evaluation, escalation paths, human-in-the- ... servers, workflow engines, and evaluation observability stacks. Work across engineering ...
    www.adzuna.pl
  • Senior AI Engineer with .NET

    YOUR ITEAMS sp. z o.o. PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa, mazowieckie 3 days ago
    ... AI Engineer to own the evaluation and quality layer of a ... services, RAG pipelines, agents, and evaluation tooling. Strong, demonstrable experience integrating ... -technical stakeholders on what model evaluation means in practice. Fluent English ( ...
    www.adzuna.pl
  • Senior AI Engineer

    YOUR ITEAMS sp. z o.o. PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa, mazowieckie 12 days ago
    ... Engineer responsible for ensuring the evaluation and quality assurance of a ... services, RAG pipelines, agents and evaluation tooling. Proven track record integrating ... Engineer focused on quality and evaluation. Significant technical ownership and influence ...
    www.adzuna.pl
  • QA Engineer

    Andersen PL, , , pl, PL, Belgrad one day ago
    ... adapted for AI workflows, including evaluation‑based testing, scenario testing, and ... issues with detailed repro steps, evaluation evidence, logs, and accuracy quality ... . Nice to have: - Exposure to evaluation frameworks (LangFuse evals, Ragas, TruLens, ...
    www.adzuna.pl
  • AI Engineering Team Leader

    XTB PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa one day ago
    ... , enterprise-grade systems where evaluation-driven development and full-stack ... services, data access layers, evaluation, observability, and production operations, Own ... for LLMOps, AgentOps, deployment, evaluation-driven development, prompt and tool ...
    www.adzuna.pl
  • QA Engineer

    Andersen PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa one day ago
    ... adapted for AI workflows, including evaluation‑based testing, scenario testing, and ... issues with detailed repro steps, evaluation evidence, logs, and accuracy quality ... . Nice to have: - Exposure to evaluation frameworks (LangFuse evals, Ragas, TruLens, ...
    www.adzuna.pl
  • QA Engineer

    Andersen PL, 52.65167, 23.04495, boćki, podlaskie, Prague one day ago
    ... adapted for AI workflows, including evaluation‑based testing, scenario testing, and ... issues with detailed repro steps, evaluation evidence, logs, and accuracy quality ... . Nice to have: - Exposure to evaluation frameworks (LangFuse evals, Ragas, TruLens, ...
    www.adzuna.pl
  • Senior Python Engineer (AI evaluations platform)

    ACAISOFT POLAND Sp. z o.o. PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa, mazowieckie one day ago
    ... a leading provider of AI evaluation and optimization solutions, trusted by ... a leading provider of AI evaluation and optimization solutions, directly contributing ... that support large-scale agent evaluation and reinforcement learning experiments Build ...
    www.adzuna.pl
  • Senior AI QA Engineer Automation & Manual AI-based Applications Testing

    EPAM Systems PL, 52.40321, 16.93875, poznań, wielkopolskie, Poznan one day ago
    ... including prompt instruction testing and evaluation of agentic workflows Strong programming ... agent frameworks, prompt engineering and evaluation metrics for LLM-based systems ... knowledge of Gen AI LLM evaluation frameworks and metrics — precision, recall, ...
    www.adzuna.pl
  • Senior AI QA Engineer Automation & Manual AI-based Applications Testing

    EPAM Systems PL, 50.26008, 19.02547, katowice, śląskie, Katowice one day ago
    ... including prompt instruction testing and evaluation of agentic workflows Strong programming ... agent frameworks, prompt engineering and evaluation metrics for LLM-based systems ... knowledge of Gen AI LLM evaluation frameworks and metrics — precision, recall, ...
    www.adzuna.pl
  • Senior AI QA Engineer Automation & Manual AI-based Applications Testing

    EPAM Systems PL, 50.1024, 20.17848, kraków, małopolskie, Krakow one day ago
    ... including prompt instruction testing and evaluation of agentic workflows Strong programming ... agent frameworks, prompt engineering and evaluation metrics for LLM-based systems ... knowledge of Gen AI LLM evaluation frameworks and metrics — precision, recall, ...
    www.adzuna.pl
  • Senior AI QA Engineer Automation & Manual AI-based Applications Testing

    EPAM Systems PL, , , warszawa, mazowieckie, Warsaw one day ago
    ... including prompt instruction testing and evaluation of agentic workflows Strong programming ... agent frameworks, prompt engineering and evaluation metrics for LLM-based systems ... knowledge of Gen AI LLM evaluation frameworks and metrics — precision, recall, ...
    www.adzuna.pl
  • Software Development Engineer, CSAI model training & evaluation

    Amazon Seattle, WA, US 7 minutes ago
    ... functioning dependencies for training and evaluation are met for foundational model ... SDE with the CSAI Training & Evaluation team, you will be responsible ... boundaries in LLM training and evaluation of customer support models. This ...
    www.amazon.jobs
  • Executive Evaluation - F/H

    France 3 days ago
    ... impairment tests * Valuation of Management Packages Beyond the ... development of the Valuation and Financial Modelling department ( ...
    www.iagora.com
  • Senior GenAI Specialist Solutions Architect, Amazon Bedrock GTM

    Amazon Seattle, WA, US 8 minutes ago
    ... fine tuning and retrieval method evaluation approaches. You should understand the ... Augmentation, Responsible AI, and Performance Evaluation frameworks. You should have experience ... Experience with design, deployment, and evaluation of LLM-powered agents and ...
    www.amazon.jobs
  • Safety Evaluation and Risk Management, Safety Scientist

    GSK Belgium 3 days ago
    ... * Responsible for signal detection and evaluation activities for assigned products. * ... , as the preparation of detailed evaluations and reports is a core ... * Pharmacovigilance experience relating to Safety Evaluation and Risk Management, encompassing both ...
    www.iagora.com
  • Intern, Program Evaluation-Remote

    23 USD
    American Heart Association Dallas, United States 3 days ago
    ... with us. The Data Science & Evaluation team is looking for a ... data analysis. * Contributing to written evaluation reports and presentations. Qualifications * Currently ...
    www.iagora.com
  • Principal PMT, Under the Roof Computer Vision, Worldwide Returns & ReCommerce

    Amazon Bellevue, WA, US 9 minutes ago
    ... which drive accurate item evaluation to optimize customer experience and ... for automating returns evaluation and improving evaluation quality using vision tunnel technology ... ) capabilities to reduce evaluation ambiguity- Find application for automation ...
    www.amazon.jobs
  • Software Development Engineer II, SEO (Scheduling, Evaluation, Outcome)

    Amazon Arlington, VA, US 8 minutes ago
    ... Amazon. The SEO team (Scheduling, Evaluation, Outcome) is part of the ... interviewer eligibility and training. The Evaluation and Outcome north star is ...
    www.amazon.jobs
  • Monitoring and Evaluation Intern

    World Vision UK Kenya 3 days ago
    ... synthesis of impact, monitoring and evaluation data across the Field Offices ... review of assessment, baseline and evaluation design documents and related reports * ... the management of monitoring and evaluation systems (Horizon, new Amp Impact, ...
    www.iagora.com