Back to Jobs
AI & Machine Learning 1d ago

Senior Software Engineer - Developer Tools for Deep Learning

United StatesUnited States
Full-time
$152,000 - $287,500 / year + Equity
Senior

Job Description

Key Skills Required

Master these to land this role

Machine LearningBestseller 🔥
Learn in 42 Hours
Python Scripting

Want to know if you're a match for this job?

Calculate My Match Score

About NVIDIA: NVIDIA is the premier, internationally recognized accelerated computing pioneer, artificial intelligence innovator, and global hardware-software ecosystem leader on an absolute mission to power the world’s most advanced technological transformations. Since inventing the GPU, NVIDIA has completely redefined the boundaries of modern science and enterprise compute, manufacturing the foundational AI architectures, supercomputing networks, and software stacks that drive the global intelligent revolution. Highly regarded as an uncompromised force in technological design, NVIDIA’s systems are trusted across all industries to run state-of-the-art workloads from autonomous driving to hyper-scale language modeling. The organization fosters a versatile, creative, and highly autonomous workforce, providing expert software craftsmen with the ultimate global canvas to solve complex, high-stakes challenges and shape the future of computing safely.

Position Overview

We are seeking a highly analytical, compiler-minded Senior Software Engineer - Developer Tools for Deep Learning to join our core centralized Developer Tools organization in a full-time remote capacity within Massachusetts, United States. In this high-leverage software engineering seat, you will step up to claim individual technical accountability for designing, building, and optimizing the best-in-class developer utilities utilized globally to streamline deep neural network models. Moving entirely away from standard user-space application wrapping or basic visual scripting, you will operate inside a unified environment focused on enhancing tool support, reducing latency, and raising model execution efficiency. This high-ownership role requires a deep learning domain expert with 5+ years of background who handles large-scale frameworks fluidly, collaborates seamlessly across multi-functional distributed cells, and translates complex mathematical requirements into production-grade tools.

Key Responsibilities

  • Developer Tooling Architecture Ownership: Code, design, and optimize robust developer utilities and unified engineering environments natively utilizing Machine Learning infrastructure to enable streamlined design of high-performance neural networks.
  • High-Performance Python Scripting: Implement and scale clean, maintainable, and well-documented optimization codebases, writing high-throughput logic natively leveraging Python Scripting and low-level C++ components.
  • Deep Neural Network Performance Triage: Partner with corporate architects to translate user design requirements into live system features, right-sizing model compiler pipelines to optimize processing efficiency.
  • SOTA Model Optimization Governance: Build deployment configurations for cutting-edge compute layers, evaluating execution characteristics across state-of-the-art computer vision models and large language models (LLMs).
  • Multi-Framework System Integration: Coordinate and test application workflows natively across diverse open-source deep learning platforms, managing compatibility layers for **PyTorch**, TensorFlow, or JAX.
  • NVIDIA Software Stack Calibration: Integrate and compile custom tool hooks into NVIDIA’s proprietary hardware-acceleration libraries, tracking model processing parameters natively within **TensorRT** registries.
  • ONNX and Schema Transformation: Drive robust mathematical transformations across heterogeneous file systems, utilizing **ONNX** formats to guarantee seamless cross-platform model portability and graph optimization.
  • Cross-Functional Agile Synchronization: Collaborate asynchronously alongside geographically distributed engineering and product research cells, providing expert guidance on structural technical trade-offs and deployment risks.

Required Skills & Qualifications

  • 5+ years of verified professional history running advanced software engineering, backend systems programming, machine learning tools development, compiler optimization, or cloud-native algorithm consulting.
  • Deep, authoritative technical command of core deep learning mechanics, showing hands-on experience working with framework runtimes natively inside **PyTorch** or TensorFlow.
  • Expert-tier programming proficiency writing optimized, performant code bases natively using **Python** or C++ within matrixed software engineering frameworks.
  • Proven capability to independently synthesize and translate complex architectural designs into live, production-grade diagnostic tools or developer libraries.
  • Outstanding verbal, written, and interpersonal communication traits, with a demonstrated ability to maintain alignment and cooperate smoothly across distributed, multi-functional tech cells.
  • Academic Baseline Qualifications: A Master’s degree from an accredited institution in Computer Science, Mathematics, or a closely aligned quantitative engineering discipline (or equivalent verified professional experience).
  • Location Context: Parameters open exclusively to qualified systems engineers base-stationed permanently within **Massachusetts, United States** to operate 100% remotely from home.

Preferred Strategic Indicators (Nice to Have)

  • In-depth technical mastery handling open format graph models, with verifiable background designing or debugging **ONNX** data structures.
  • Hands-on operational history developing real-world deep learning applications end to end, spanning from early cluster training arrays through to edge node optimization.
  • Prior commercial background managing execution performance layers natively inside the **NVIDIA software stack (TensorRT, CUDA, cuDNN)**.
  • An outcome-driven, self-motivated approach to software craftsmanship that stays up to date with the latest global machine learning research.

What We Offer

  • High-Yield U.S. Total Rewards Canvas: An attractive full-time baseline salary scale structured transparently between $152,000 – $241,500 USD per year for Level 3, and $184,000 – $287,500 USD per year for Level 4, calibrated precisely to reward your accelerated compute authority, supplemented by corporate equity grants.
  • The exceptional professional canvas to directly direct, code-shape, and deploy the developer tools and compiler layers framing how the global tech industry trains and optimizes neural networks.
  • Profound work-from-home remote parameters offering total location flexibility across Massachusetts, complete schedule trust, and zero physical office geographical commuting friction.
  • Immediate eligibility to access comprehensive premium medical, dental, and vision health insurance protection programs.
  • Access to an ultra-innovative corporate environment that pairs complex technical challenges with an inclusive, highly diverse workplace culture.
  • Excellent lifestyle calibration benefits, including generous vacation configurations, paid company holidays, comprehensive savings options, and structured professional training opportunities.

How would you rate this job post?

See what other professionals think about this role.

Is this company safe?

Ask Hyrizon AI to scan this company for potential red flags before you apply.

Safety First

  • Never pay for a job application.
  • Do not share sensitive bank info.
  • Verify the client before starting work.
Learn More