Senior AI Engineer - Inference Systems Optimization
Accelerated.Finance
Gdańsk, Trójmiasto, Poland (54.35203, 18.64664)
3 days ago
... cuDNN) Mastery of inference frameworks: SGLang, vLLM, Dynamo, or equivalents ... constrained decoding Knowledge of frameworks: FlashAttention, FlashInfer, xFormers - Experience with ... Key Technologies Inference frameworks: SGLang, vLLM, TensorRT-LLM GPU ...
www.adzuna.pl