AI Inference Engineer Job at Signify Technology, Hayward, CA

emFMMGJzc0Z5U2VyZGJyNUhjdWxxTngrVXc9PQ==
  • Signify Technology
  • Hayward, CA

Job Description

AI Inference Engineer – Stealth Startup | San Fransisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.

Job Tags

Similar Jobs

Insight Global

Contract Administrator Job at Insight Global

 ...The Contracts Associate is responsible for assisting in the development, negotiation, and management of contracts related to procurement activities. This position supports and implements procurement strategies, optimizes the supplier base, and meets savings targets while... 

NeurAbilities Healthcare

Clinical Manager, BCBA Job at NeurAbilities Healthcare

Position Overview The Clinical Manager is responsible for the management of clinical operations and quality assurance within their site location. This role directly supervises BCBAs in the practice and works collaboratively with the on-site Practice Manager to manage ...

West 4th Strategy, LLC

Calibration Technician Job at West 4th Strategy, LLC

 ...Calibration Technician ROLE We need an experienced Calibration Technician at The Naval Dosimetry Center (NDC). The Naval Dosimetry Center (NDC) is the U.S. Navys hub for measuring radiation exposure among servicemembers operating nuclear-powered vessels. As the center... 

Roof Resources, Inc.

Technical Roofing Professional - Work from home Job at Roof Resources, Inc.

 ...position is semi-remote in that you must live within 3 hours of the Greater Nashville Area. When not in the field the position is work from home. Responsibilities Develop detailed design plans and specifications for non-residential roofing projects using a... 

LHH

Real Estate Associate Attorney Job at LHH

 ...LHH, Legal is assisting a full-service national law firm in their search for a Real Estate Associate Attorney to join their team in Jacksonville. Below please find an overview of the position: Full-service law firm with over 200 attorneys Transactional work to...