Senior AI Engineer - Inference Systems Optimization
Accelerated.Finance
Wrocław, Dolnośląskie, Poland (51.10789, 17.03854)
2 days ago
... cuDNN) Mastery of inference frameworks: SGLang, vLLM, Dynamo, or equivalents ... constrained decoding Knowledge of frameworks: FlashAttention, FlashInfer, xFormers - Experience with ... Key Technologies Inference frameworks: SGLang, vLLM, TensorRT-LLM GPU ...
www.adzuna.pl