Toronto OfficeUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on bringing up and optimizing large language models (LLMs) on specialized hardware. The responsibilities and required skills are heavily centered around AI/ML concepts and implementation.
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…
Details Quelle / Bewerbung öffnen
India OfficeUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on bringing up and optimizing ML frameworks and models on specialized hardware. The responsibilities directly involve core AI/ML tasks like model architecture translation, compiler optimizations, and performance tuning. The requ…
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…
Details Quelle / Bewerbung öffnen
San FranciscoUSAFullTimeashby2026-06-16
Warum echter KI-Job: The role is explicitly focused on building and optimizing infrastructure for large-scale LLM inference, a core AI task. The description heavily emphasizes AI/ML technologies and their application.
About Anyscale At Anyscale https://www.anyscale.com/, we're on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. We’re commercializing Ray https://docs.ray.io/en/latest/, a popular open-source project that's creating an ecosystem of li…
Details Quelle / Bewerbung öffnen
Redwood City, CAUSAFullTimeashby2026-06-16
Warum echter KI-Job: The role is explicitly focused on building and maintaining infrastructure for ML research and serving, with a strong emphasis on large language models and GPU utilization. The requirements and responsibilities directly relate to core AI/ML engineering tasks.
ABOUT THE ROLE We’re looking for seasoned ML Infrastructure engineers with experience designing, building and maintaining training and serving infrastructure for ML research. Responsibilities: - Provide infrastructure support to our ML research and product - Build tooling to diagnose cluster issues…
Details Quelle / Bewerbung öffnen
RemoteUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is entirely focused on the development and optimization of LLM inference frameworks, distributed systems, and related technologies. The responsibilities and requirements clearly indicate a core AI/ML engineering position.
About the Role At Together.ai, we are building state-of-the-art infrastructure to enable efficient and scalable inference for large language models (LLMs). Our mission is to optimize inference frameworks, algorithms, and infrastructure, pushing the boundaries of performance, scalability, and cost-e…
Details Quelle / Bewerbung öffnen
San FranciscoUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on building and optimizing AI inference systems for large language models. The responsibilities and requirements heavily emphasize ML engineering, performance optimization, and working with cutting-edge AI technologies.
About the Role Together AI is seeking a Machine Learning Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference systems. This role involves working with state-of-the-art large language models models and ensuring they run efficiently and…
Details Quelle / Bewerbung öffnen
San FranciscoUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on building and optimizing the model serving layer for voice applications, working with state-of-the-art voice models and inference engines. The responsibilities are heavily centered around ML engineering tasks.
About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a…
Details Quelle / Bewerbung öffnen
San FranciscoUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is entirely focused on building and optimizing the model serving layer for voice applications, including LLMs, STT, and TTS. It requires deep expertise in ML engineering, inference optimization, and GPU utilization. The responsibilities and requireme…
About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a…
Details Quelle / Bewerbung öffnen
ParisFranceFull-timelever2026-06-16
Warum echter KI-Job: The role explicitly focuses on deploying and implementing AI solutions for customers, involving deep ML expertise, LLM fine-tuning, and working with cutting-edge AI research. The job description heavily emphasizes technical AI skills and application.
About The Job Mistral AI is seeking a Applied AI Engineer to facilitate the adoption of its products among customers and collaborate with them to address complex technical challenges. The Applied AI team is Mistral's customer-facing technical organization. We work directly with enterprise clients f…
Details Quelle / Bewerbung öffnen
MontrealFranceFull-timelever2026-06-16
Warum echter KI-Job: The role explicitly focuses on deploying and supporting AI products (LLMs), fine-tuning, and working with customers on complex AI challenges. The required skills and experience are heavily weighted towards AI/ML/LLM expertise.
About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, produ…
Details Quelle / Bewerbung öffnen