Kernels - Senior/Staff/Principal SW Engineer Engineering - Santa Clara, CA at Geebo

THE ROLE:
SENIOR/STAFF/PRINCIPAL SOFTWARE ENGINEER
About us
If you are following the evolution of the leading approach in deep-learning-powered AI, the renaissance in NLP, and the next disruption in computer vision, you likely know it's all about Transformer-based models.
They are powering neural nets with billions to trillions of parameters and existing silicon architectures (including the plethora of AI accelerators) are struggling to varying degrees to keep up with exploding model sizes and their performance requirements.
More importantly, TCO considerations for running these models at scale are becoming a bottleneck to meet exploding demand.
Hyperscalers are keen to gain COGS efficiencies on the trillions of AI inferences per day they already serve, and even more so to address the steep demand ramp they anticipate over the next couple of years.
d-Matrix is addressing this problem head-on by developing a fully digital in-memory computing accelerator for AI inference that is highly optimized for the computational patterns in Transformers.
The fully digital approach removes some of the difficulties of analog techniques that are most often touted in pretty much all other in-memory computing AI inference products.
d-Matrix's AI inference accelerator has also been architected as a chiplet, thereby enabling both a scale-up and scale-out solution with flexible packaging options.
The d-Matrix team has a stellar track record in developing and commercializing silicon at scale as senior execs at the likes of Inphi, Broadcom, and Intel.
Notably, they recognized early the extremely important role of programmability and the software stack, and have been thoughtfully building up the team in this area since before their Series A.
The company has raised $44m in funding so far and has 70+ employees across Silicon Valley, Sydney and Bengaluru.
Why d-Matrix
We want to build a company and a culture that stand the test of time.
We offer the candidate a unique opportunity to express themselves and become a future leader in an industry that will have a huge influence globally.
We are striving to build a culture of transparency, inclusiveness and intellectual honesty while ensuring all our team members are always learning and having fun on the journey.
We have built the industry's first highly programmable in-memory computing architecture that applies to a broad class of applications from cloud to edge.
The candidate will get to work on a path-breaking architecture with a highly experienced team that knows what it takes to build a successful business.
The Role:
Senior/Staff/Principal SW Engineer
The role requires you to be part of the team that helps productize the SW stack for our AI compute engine.
As part of the Software team, you will be responsible for the development, enhancement, and maintenance of next-generation AI hardware simulation tools and for developing software kernels for the hardware.
You possess experience building software kernels for HW architectures.
You possess a very strong understanding of various hardware architectures and how to map algorithms to the architecture.
You understand how to map computational graphs generated by AI frameworks to the underlying architecture.
You have experience working across all aspects of the full-stack tool chain and understand the nuances of what it takes to optimize and trade off various aspects of hardware-software co-design.
You are able to build and scale software deliverables in a tight development window.
You will work with a team of compiler experts to build out the compiler infrastructure, working closely with other software (ML, Systems) and hardware (mixed signal, DSP, CPU) experts in the company.
Qualifications
Minimum:
o Computer Science, Engineering, Math, Physics or related degree.
o Strong grasp of computer architecture, data structures, system software, and machine learning fundamentals.
o Proficient in C/C++ and Python development in a Linux environment using standard development tools.
o Experience implementing algorithms in high level languages such as C/C++, Python.
o Experience implementing algorithms for specialized hardware such as FPGAs, DSPs, GPUs, and AI accelerators, using libraries such as CUDA.
o Experience with development for embedded SIMD vector processors such as Tensilica.
o Experience with ML frameworks such as TensorFlow and/or PyTorch.
o Experience working with ML compilers and algorithms, such as MLIR, LLVM, TVM, Glow, etc.
o Self-motivated team player with a strong sense of ownership and leadership.
Desired:
o MS or PhD in Computer Science, Electrical Engineering, or related fields.
o Prior startup, small team or incubation experience.
o Experience with a deep learning framework (such as PyTorch or TensorFlow) and ML models for CV, NLP, or Recommendation.
o Work experience at a cloud provider or AI compute / sub-system company.
Location
Silicon Valley preferred, but open to other locations within the US/Canada.
Recommended Skills
Algorithms, Architecture, Artificial Intelligence, C++ (Programming Language), Compilers, Computer Architectures
Estimated Salary: $20 to $28 per hour based on qualifications.