Headquarters/Sunnyvale Office, Toronto OfficeUSAgreenhouse2026-06-16
Warum echter KI-Job: The role explicitly focuses on applying and improving machine learning techniques, specifically LLMs, for training, optimization, and deployment. The responsibilities are heavily centered around ML systems and pipelines.
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…
Details Quelle / Bewerbung öffnen
Toronto OfficeUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on bringing up and optimizing large language models (LLMs) on specialized hardware. The responsibilities and required skills are heavily centered around AI/ML concepts and implementation.
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…
Details Quelle / Bewerbung öffnen
Toronto OfficeUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on LLM inference performance, model evaluation, and optimization on specialized hardware. The responsibilities and required skills are deeply rooted in AI/ML concepts and techniques.
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…
Details Quelle / Bewerbung öffnen
India OfficeUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is explicitly focused on bringing up and optimizing ML frameworks and models on specialized hardware. The responsibilities directly involve core AI/ML tasks like model architecture translation, compiler optimizations, and performance tuning. The requ…
Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training…
Details Quelle / Bewerbung öffnen
San FranciscoUSAFullTimeashby2026-06-16
Warum echter KI-Job: The role is explicitly focused on building and optimizing infrastructure for large-scale LLM inference, a core AI task. The description heavily emphasizes AI/ML technologies and their application.
About Anyscale At Anyscale https://www.anyscale.com/, we're on a mission to democratize distributed computing and make it accessible to software developers of all skill levels. We’re commercializing Ray https://docs.ray.io/en/latest/, a popular open-source project that's creating an ecosystem of li…
Details Quelle / Bewerbung öffnen
New York, San MateoUSAgreenhouse2026-06-16
Warum echter KI-Job: The role explicitly focuses on developing, fine-tuning, and operationalizing machine learning models, with a strong emphasis on generative AI and LLM inference. The responsibilities are heavily centered around AI/ML engineering tasks.
About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge in…
Details Quelle / Bewerbung öffnen
New York, San MateoUSAgreenhouse2026-06-16
Warum echter KI-Job: The role is deeply embedded in building and improving a generative AI platform, focusing on core components like inference, fine-tuning, and model deployment. The job description explicitly mentions working with LLMs and AI infrastructure.
About Us: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge in…
Details Quelle / Bewerbung öffnen
San FranciscoUSAFullTimeashby2026-06-16
Warum echter KI-Job: The role is explicitly focused on building, scaling, and optimizing LLM inference workloads. The team is a 'Forward Deployed Engineering' team working directly with customers on AI deployments. The requirements clearly state experience with LLMs and ML infere…
ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the fronti…
Details Quelle / Bewerbung öffnen
San FranciscoUSAFullTimeashby2026-06-16
Warum echter KI-Job: The role is explicitly focused on AI/LLM inference, solution architecture for AI products, and working with customers deploying AI models. The responsibilities heavily involve technical AI concepts and deployments.
ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the fronti…
Details Quelle / Bewerbung öffnen
San FranciscoUSAFullTimeashby2026-06-16
Warum echter KI-Job: The role is centered around deploying, integrating, and teaching the use of an AI-powered software engineering tool (Devin). The tasks directly involve working with LLMs and agent-based systems, and scaling AI enablement programs. The core function is AI-focu…
WE ARE AN APPLIED AI LAB BUILDING END-TO-END SOFTWARE AGENTS. We're the makers of Devin, the first AI software engineer, and Windsurf, the AI-native IDE. Together, they represent our vision for collaborative AI teammates that enable engineers to focus on more interesting problems and empower teams…
Details Quelle / Bewerbung öffnen