How do I apply for the Inference Server – Product Software Intern position at tenstorrentuniversity and track my application?

We provide a direct link to the official application portal. You can apply directly here . Additionally, you can easily track this and other applications by clicking the bookmark icon to save it to your YGI Kanban board .

What are the working arrangements and expected duration for the Inference Server – Product Software Intern role?

This position is a not disclosed, structured as a hybrid role.

What educational background or degree level is required for the Inference Server – Product Software Intern at tenstorrentuniversity?

Candidates are generally expected to be holding or pursuing the following degree levels: Bachelor, Master.

Does tenstorrentuniversity provide visa sponsorship or assistance for international applicants?

Regarding visa assistance and sponsorship for this role, the official status is currently listed as: Not disclosed.

Inference Server – Product Software Intern

tenstorrentuniversity

Found 3 weeks ago

Location

Belgrade, Serbia

Time

Not disclosed

Work Mode

Hybrid

Salary

Not disclosed

Visa Help

Not disclosed

Last Verified

3 weeks ago

Education

Bachelor
Master

Skills & Qualifications

Technical Skills

Python
C++

Soft Skills

Strong programming fundamentals
Interested in backend systems, API design, and how ML models are deployed in production environments
Curious about performance optimization techniques
Motivated to learn and contribute in a collaborative engineering environment

Job Description

This role is hybrid based in Belgrade, Serbia. Who You Are * Final-year BSc or MSc student in Computer Science, Software Engineering, Electrical Engineering, or a related technical field * Strong programming fundamentals in Python, with familiarity in C++ considered a plus * Interested in backend systems, API design, and how ML models are deployed in production environments * Curious about performance optimization techniques such as batching, caching, and model parallelism * Motivated to learn and contribute in a collaborative engineering environment What We Need * Contribute to backend features and APIs that support AI inference workloads * Assist in deploying, testing, and benchmarking models running on Tenstorrent hardware * Analyze inference performance and help identify optimization opportunities * Write clean, maintainable code with guidance from senior engineers * Collaborate with the team to improve reliability, usability, and performance of the inference server stack What You Will Learn * How end-to-end ML inference is optimized on custom AI hardware * How scalable backend systems are designed to serve real-world AI applications * How APIs and infrastructure shape the developer experience for AI workloads * Practical performance analysis techniques in production-like environments * How modern AI software stacks integrate models, runtimes, and hardware

Requirements

Final-year BSc or MSc student in Computer Science, Software Engineering, Electrical Engineering, or a related technical field
Strong programming fundamentals in Python, with familiarity in C++ considered a plus
Interested in backend systems, API design, and how ML models are deployed in production environments
Curious about performance optimization techniques such as batching, caching, and model parallelism
Motivated to learn and contribute in a collaborative engineering environment

Software Engineering

Backend Engineering

Nice to Haves

familiarity in C++

Apply Now

22Open Positions

Open Positions

22Open Positions

Inference Server – Product Software Intern

Education

Skills & Qualifications

Technical Skills

Soft Skills

Job Description

Requirements

Related Field

Related Subfield

Nice to Haves