Sciforium
Software Engineer, Fullstack
Sciforium is an AI infrastructure company developing next-generation multimodal AI models and a proprietary high‑efficiency serving platform. Backed by multi‑million‑dollar funding and direct sponsorship from AMD, with hands‑on support from AMD engineers, the team is scaling rapidly to build the full stack powering frontier AI models and real‑time applications. We offer a fast‑moving, collaborative environment where engineers have meaningful impact, learn quickly, and tackle deep technical challenges across the AI systems stack.
Role Overview
This role offers a unique opportunity to work on the core systems that power Sciforium’s multimodal AI models. You’ll help build the model serving platform, working across C++, Python, runtime execution, and distributed infrastructure to create a fast, reliable engine for real‑time AI applications. You’ll gain hands‑on experience with performance engineering, learn how large AI models are optimized and deployed at scale, and collaborate closely with ML researchers and experienced systems engineers. If you enjoy delighting customers with a great developer experience, care deeply about performance, and want exposure to the full AI stack, this role provides both high‑impact work and strong growth potential.
Key Responsibilities
Design and build a low‑latency, chat‑like interface for users to test our LLMs.
Build the mission‑critical UI where users generate API keys, set budget limits, and view real‑time usage graphs.
Integrate Stripe or a similar payment provider to handle complex subscription models.
Create a dynamic API reference section in the Documentation Portal.
Write client‑side wrappers to help users connect to our API endpoints easily (CLI/SDK).
Build secure API gateways and implement low‑latency streaming solutions.
Must‑Haves
Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
3+ years of software engineering experience, with a focus on front‑end development.
Strong proficiency in TypeScript and Python.
Understanding of responsive design and UX fundamentals.
Strong collaboration and communication skills, with the ability to work effectively across engineering and ML teams.
Comfortable working from the office and contributing to a fast‑moving, high‑ownership team culture.
Nice to Have
Experience with ML systems engineering and open‑source inference engines like vLLM, SGLang, or TRT‑LLM.
Streaming expertise: knowledge of WebSockets and Server‑Sent Events and how to handle “jittery” network streams without freezing the UI.
Billing integration experience: integrating Stripe Elements, managing webhooks for payment success/failure, and handling SaaS logic (pro‑rating, tiers).
Documentation mindset: appreciation of good DX and knowledge of how to render an API spec into a readable web page.
Python/Backend knowledge: ability to read a FastAPI or vLLM backend codebase to understand how data is being sent.
Why Join Us
Opportunity to build frontier‑scale AI infrastructure powering next‑generation LLMs and multimodal models.
Work with top‑tier engineers and researchers across systems, GPUs, and ML frameworks.
Tackle high‑impact performance and scalability challenges in training and inference.
Access state‑of‑the‑art GPU clusters, datasets, and tooling.
Opportunity to publish, patent, and push the boundaries of modern AI.
Join a culture of innovation, ownership, and fast execution in a rapidly scaling AI organization.
Benefits Include
Medical, dental, and vision insurance.
401(k) plan.
Daily lunch, snacks, and beverages.
Flexible time off.
Competitive salary and equity.
Equal Opportunity
Sciforium is an equal opportunity employer. All applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability status.