Evaluation jobs in Poland. 2326 job offers found.

  • Infra Software Engineer (AI project)

    ACAISOFT POLAND Sp. z o.o. PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa, mazowieckie one day ago
    ... a leading provider of AI evaluation and optimization solutions, trusted by ... through rigorous, real-world agent evaluation. Due to the client’s time ... production-ready agentic systems (custom evaluation harnesses) Strong hands-on experience ...
    www.adzuna.pl
  • Mid Senior AI engineer with networking experience

    CodiLime PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa one day ago
    ... to-end agent responses Designing evaluation workflows combining deterministic checks, human ... using DeepEval or custom evaluation prompts Refining prompts, tool descriptions, ... LLM-based reasoning LLM evaluation tooling: Experience with frameworks and ...
    www.adzuna.pl
  • Senior AI Quality Analyst

    Svitla Systems PL, 50.1024, 20.17848, kraków, małopolskie, Kraków 22 hours ago
    ... , designing and executing comprehensive evaluation strategies to identify model weaknesses, ... for model testing and evaluation. Conduct data annotation and validation ... environments to ensure reliable evaluation pipelines. Reporting & Insights: Analyze test ...
    www.adzuna.pl
  • Senior AI Quality Analyst

    Svitla Systems PL, 51.10789, 17.03854, wrocław, dolnośląskie, Wrocław 22 hours ago
    ... , designing and executing comprehensive evaluation strategies to identify model weaknesses, ... for model testing and evaluation. Conduct data annotation and validation ... environments to ensure reliable evaluation pipelines. Reporting & Insights: Analyze test ...
    www.adzuna.pl
  • Data Science Lead AI

    Capgemini Polska PL, 51.10789, 17.03854, wrocław, dolnośląskie, Wrocław one day ago
    ... You will guide experimentation, evaluation, and refinement of AI models, ... establish experimentation pipelines and evaluation frameworks. Strong knowledge of LLMs, ... frameworks and AI evaluation tooling. Comfortable working cross‑functionally ...
    www.adzuna.pl
  • Data Science Lead AI

    Capgemini Polska PL, 54.35203, 18.64664, gdańsk, Trójmiasto, Gdańsk one day ago
    ... You will guide experimentation, evaluation, and refinement of AI models, ... establish experimentation pipelines and evaluation frameworks. Strong knowledge of LLMs, ... frameworks and AI evaluation tooling. Comfortable working cross‑functionally ...
    www.adzuna.pl
  • Senior AI Quality Analyst

    Svitla Systems PL, 52.40321, 16.93875, poznań, wielkopolskie, Poznań 22 hours ago
    ... , designing and executing comprehensive evaluation strategies to identify model weaknesses, ... for model testing and evaluation. Conduct data annotation and validation ... environments to ensure reliable evaluation pipelines. Reporting & Insights: Analyze test ...
    www.adzuna.pl
  • Senior AI Developer with LLM Orchestration Intelligent Knowledge Platform

    DataArt PL, 51.24645, 22.56845, lublin, lubelskie, Lublin one day ago
    ... APIs, Docker, Kubernetes, ML evaluation frameworks, observability and tracing tools ... services Create and maintain evaluation frameworks including ground truth datasets ... graph concepts Experience implementing evaluation methodologies for AI systems including ...
    www.adzuna.pl
  • Data Science Lead AI

    Capgemini Polska PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa one day ago
    ... You will guide experimentation, evaluation, and refinement of AI models, ... establish experimentation pipelines and evaluation frameworks. Strong knowledge of LLMs, ... frameworks and AI evaluation tooling. Comfortable working cross‑functionally ...
    www.adzuna.pl
  • Senior AI Quality Analyst

    Svitla Systems PL, 50.04228, 22.00695, rzeszów, podkarpackie, Rzeszów 22 hours ago
    ... , designing and executing comprehensive evaluation strategies to identify model weaknesses, ... for model testing and evaluation. Conduct data annotation and validation ... environments to ensure reliable evaluation pipelines. Reporting & Insights: Analyze test ...
    www.adzuna.pl
  • Senior AI Developer with LLM Orchestration Intelligent Knowledge Platform

    DataArt PL, 50.1024, 20.17848, kraków, małopolskie, Kraków one day ago
    ... APIs, Docker, Kubernetes, ML evaluation frameworks, observability and tracing tools ... services Create and maintain evaluation frameworks including ground truth datasets ... graph concepts Experience implementing evaluation methodologies for AI systems including ...
    www.adzuna.pl
  • Mid Senior AI engineer with networking experience

    CodiLime PL, 50.1024, 20.17848, kraków, małopolskie, Kraków one day ago
    ... to-end agent responses Designing evaluation workflows combining deterministic checks, human ... using DeepEval or custom evaluation prompts Refining prompts, tool descriptions, ... LLM-based reasoning LLM evaluation tooling: Experience with frameworks and ...
    www.adzuna.pl
  • Mid Senior AI engineer with networking experience

    CodiLime PL, 54.35203, 18.64664, gdańsk, Trójmiasto, Gdańsk one day ago
    ... to-end agent responses Designing evaluation workflows combining deterministic checks, human ... using DeepEval or custom evaluation prompts Refining prompts, tool descriptions, ... LLM-based reasoning LLM evaluation tooling: Experience with frameworks and ...
    www.adzuna.pl
  • Senior AI Quality Analyst

    Svitla Systems PL, 51.77497, 19.6198, łódź, łódzkie, Łódź 22 hours ago
    ... , designing and executing comprehensive evaluation strategies to identify model weaknesses, ... for model testing and evaluation. Conduct data annotation and validation ... environments to ensure reliable evaluation pipelines. Reporting & Insights: Analyze test ...
    www.adzuna.pl
  • Mid/Senior AI engineer with networking experience @ CodiLime

    CodiLime PL, Remote, Warszawa 10 days ago
    ... LLM-based reasoning LLM evaluation tooling: Experience with frameworks and ... to-end agent responses, Designing evaluation workflows combining deterministic checks, human ... using DeepEval or custom evaluation prompts, Refining prompts, tool descriptions, ...
    www.adzuna.pl
  • Site Reliability Engineer – AI/LLM & Infra @ Acaisoft Poland Sp. z o.o.

    Acaisoft Poland Sp. z o.o. PL, Remote, Warszawa 16 days ago
    ... a leading provider of AI evaluation and optimization solutions, trusted by ... learning (RL) environments and scalable evaluation systems that guide and shape ... through rigorous, real-world agent evaluation. Due to the client’s time ...
    www.adzuna.pl
  • Data Science Lead (AI) @ Capgemini Polska Sp. z o.o.

    Capgemini Polska Sp. z o.o. PL, Kraków, Katowice, Lublin, Opole, Wrocław, Poznań, Warszawa, Gdańsk 18 days ago
    ... establish experimentation pipelines and evaluation frameworks. Strong knowledge of LLMs, ... frameworks and AI evaluation tooling. Comfortable working cross‑functionally ... You will guide experimentation, evaluation, and refinement of AI models, ...
    www.adzuna.pl
  • Senior Data Scientist @ Bayer

    Bayer PL, 52.21519, 21.2453, warszawa, mazowieckie, Warszawa 18 days ago
    ... forecasting experience is a plus. Evaluation focus: design and run offline/online tests, rubric-based GenAI evaluation, safety checks, and error analysis; ... knowledge use cases, Establish rigorous evaluation: offline metrics, human-in-the- ...
    www.adzuna.pl
  • Language Data Scientist II, AWS AI Data | Transcribe

    Amazon Santa Clara, CA, US 22 minutes ago
    ... human-in-the-loop evaluation tasks to measure the performance ... model fine tuning and evaluation data needs and techniques. A day ... domain experts) for model evaluation and usability testing, proposing the optimal business and evaluation metrics to use. You will ...
    www.amazon.jobs
  • DSP Business Development Manager, Last Mile Account Management

    Amazon Tokyo, 13, JP 22 minutes ago
    ... for the interview, the final evaluation point and extending an offer, ... . Evaluate the candidates based on evaluation standards to assess qualification, cultural ... success beyond the pre-defined evaluation standards. - Responsible for closing the ...
    www.amazon.jobs