Talent.com
Tato nabídka není k dispozici ve vaší zemi.
Senior AI Research Engineer, Model Inference (100% Remote)

Senior AI Research Engineer, Model Inference (100% Remote)

Tether Operations LimitedPraha, 10, CZ
Před 7 hodinami
Popis pozice

Join Tether and Shape the Future of Digital Finance

At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Innovate with Tether

Tether Finance : Our innovative product suite features the world’s most trusted stablecoin, USDT , relied upon by hundreds of millions worldwide, alongside pioneering digital asset tokenization services.

But that’s just the beginning :

Tether Power : Driving sustainable growth, our energy solutions optimize excess power for Bitcoin mining using eco-friendly practices in state-of-the-art, geo-diverse facilities.

Tether Data : Fueling breakthroughs in AI and peer-to-peer technology, we reduce infrastructure costs and enhance global communications with cutting-edge solutions like KEET , our flagship app that redefines secure and private data sharing.

Tether Education : Democratizing access to top-tier digital learning, we empower individuals to thrive in the digital and gig economies, driving global growth and opportunity.

Tether Evolution : At the intersection of technology and human potential, we are pushing the boundaries of what is possible, crafting a future where innovation and human capabilities merge in powerful, unprecedented ways.

Why Join Us?

Our team is a global talent powerhouse, working remotely from every corner of the world. If you’re passionate about making a mark in the fintech space, this is your opportunity to collaborate with some of the brightest minds, pushing boundaries and setting new standards. We’ve grown fast, stayed lean, and secured our place as a leader in the industry.

If you have excellent English communication skills and are ready to contribute to the most innovative platform on the planet, Tether is the place for you.

Are you ready to be part of the future?

About the job :

We are looking for an experienced AI Model Engineer with deep expertise in kernel development, model optimization, fine-tuning, and GPU acceleration. The engineer will extend the inference framework to support inference and fine-tuning for Language models with a strong focus on mobile and integrated GPU acceleration (Vulkan).

This role requires hands-on experience with quantization techniques, LoRA architectures, Vulkan backend, and mobile GPU debugging. You will play a critical role in pushing the boundaries of desktop and on-device inference and fine-tuning performance for next-generation SLM / LLMs.

Responsibilities :

Implement and optimize custom inference and fine-tuning kernels for small and large language models across multiple hardware backends.

Implement and optimize full and LoRA fine-tuning for small and large language models across multiple hardware backends.

Design and extend datatype and precision support (int, float, mixed precision, ternary QTypes, etc.).

Design, customize, and optimize Vulkan compute shaders for quantized operators and fine-tuning workflows.

Investigate and resolve GPU acceleration issues on Vulkan and integrated / mobile GPUs.

Architect and prepare support for advanced quantization techniques to improve efficiency and memory usage.

Debug and optimize GPU operators (e.g., int8, fp16, fp4, ternary).

Integrate and validate quantization workflows for training and inference.

Conduct evaluation and benchmarking (e.g., perplexity testing, fine-tuned adapter performance).

Conduct GPU testing across desktop and mobile devices.

Collaborate with research and engineering teams to prototype, benchmark, and scale new model optimization methods.

Deliver production-grade, efficient language model deployment for mobile and edge use cases.

Work closely with cross-functional teams to integrate optimized serving and inference frameworks into production pipelines designed for edge and on-device applications. Define clear success metrics such as improved real-world performance, low error rates, robust scalability, optimal memory usage and ensure continuous monitoring and iterative refinements for sustained improvements.

Proficiency in C++ and GPU kernel programming.

Proven Expertise in GPU acceleration with Vulkan framework.

Strong background in quantization and mixed-precision model optimization.

Experience and Expertise in Vulkan compute shader development and customization.

Familiarity with LoRA fine-tuning and parameter-efficient training methods.

Ability to debug GPU-specific performance and stability issues on desktop and mobile devices.

Hands-on experience with mobile GPU acceleration and model inference.

Familiarity with large language model architectures (e.g., Qwen, Gemma, LLaMA, Falcon etc.).

Experience implementing custom backward operators for fine-tuning.

Experience creating and curating custom datasets for style transfer and domain-specific fine-tuning.

Demonstrated ability to apply empirical research to overcome challenges in model

Important information for candidates

Recruitment scams have become increasingly common. To protect yourself, please keep the following in mind when applying for roles :

Apply only through our official channels.  We do not use third-party platforms or agencies for recruitment unless clearly stated. All open roles are listed on our official careers page :  https : / / tether.recruitee.com /

Verify the recruiter’s identity.  All our recruiters have verified LinkedIn profiles. If you’re unsure, you can confirm their identity by checking their profile or contacting us through our website.

Be cautious of unusual communication methods.  We do not conduct interviews over WhatsApp, Telegram, or SMS. All communication is done through official company emails and platforms.

Double-check email addresses.  All communication from us will come from emails ending in  @ tether.to or @ tether.io

We will never request payment or financial details.  If someone asks for personal financial information or payment at any point during the hiring process, it is a scam. Please report it immediately.

When in doubt, feel free to reach out through our official website.

Vytvořit upozornění na toto hledání

Ai Engineer • Praha, 10, CZ

Související práce
Senior FullStack Developer / AI Expert

Senior FullStack Developer / AI Expert

Growmodo GmbHPrague, 10, CZ
Growmodo is a fast-scaling web design and development agency that helps businesses and agencies outsource high-quality design and development work through a global subscription-based service.Our cl...Zobrazit vícePoslední aktualizace: před 30+ dny
Lead Data Science Researcher

Lead Data Science Researcher

Top Remote TalentPrague, Prague, .CZ
Quick Apply
A software development company is looking for a talented, long-term Lead DS Researcher.The company is a team of experts providing analytical services to healthcare clients.You will join an internat...Zobrazit vícePoslední aktualizace: před 30+ dny
  • Propagováno
Designér / ka optických sítí - mzda od 50 000 Kč, plný nebo zkrácený úvazek, auto, flexibilita a možnost home office.

Designér / ka optických sítí - mzda od 50 000 Kč, plný nebo zkrácený úvazek, auto, flexibilita a možnost home office.

SPJ group s.r.o.Kladno, Česko, Česko
Jsme česká technologická firma, která od roku 2018 staví, instaluje a udržuje optické sítě.Naším cílem je přinášet lidem i firmám rychlé a spolehlivé připojení – a tím propojovat domácnosti, firmy ...Zobrazit vícePoslední aktualizace: před 16 dny
Senior UI / UX Designer / AI Expert

Senior UI / UX Designer / AI Expert

Growmodo GmbHPrague, 10, CZ
At Growmodo, we help fast-growing companies by connecting them with global talent while supporting the careers of creative and tech professionals. We're driven by growth, strong relationships, and a...Zobrazit vícePoslední aktualizace: před 30+ dny
  • Propagováno
Technik 3D měření : spolehlivá technika, férové odměny? #Tu4eA

Technik 3D měření : spolehlivá technika, férové odměny? #Tu4eA

INDEX NOSLUŠ s.r.o.Benešov, Czech Republic
Hledáme člověka, kterého baví přesnost a detail.Láká Tě práce v moderní měrně, máš „oči na detail“ a chceš mít prostor tvořit si vlastní programy, pak je to šance pro tebe.Čeká Tě přátelské a moder...Zobrazit vícePoslední aktualizace: před 3 dny
Mid-Level AI / ML Engineer (Data Scientist Background)

Mid-Level AI / ML Engineer (Data Scientist Background)

KolomoloPrague, Prague, .CZ
Quick Apply
To be leaders in digital modernization by helping companies embrace latest cutting edge technologies to optimize their business with the help of our talented experts. AI / ML Engineer (Data Scientist ...Zobrazit vícePoslední aktualizace: před 30+ dny
  • Novinka!
Senior Research Engineer Multimodal & Video Foundation Model (100% Remote)

Senior Research Engineer Multimodal & Video Foundation Model (100% Remote)

Tether Operations LimitedPraha, 10, CZ
Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Zobrazit vícePoslední aktualizace: před 7 hodinami
Software Engineer (AI Platform) - Remote

Software Engineer (AI Platform) - Remote

ReplikaPraha 114, Prague, .CZ
Quick Apply
An AI companion who is eager to learn and would love to see the world through your eyes.Replika is always ready to chat when you need an empathetic friend. Replika is an AI companion loved by 40M+ u...Zobrazit vícePoslední aktualizace: před 6 dny
Senior Research Engineer Multimodal & Video Foundation Model

Senior Research Engineer Multimodal & Video Foundation Model

Tether Operations LimitedPrague, 10, CZ
Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Zobrazit vícePoslední aktualizace: před 30+ dny
  • Propagováno
Technolog, který má u nás slovo! AJ, rozvoj, česká firma, férové peníze? #0jgmf

Technolog, který má u nás slovo! AJ, rozvoj, česká firma, férové peníze? #0jgmf

INDEX NOSLUŠ s.r.o.Benešov, Czech Republic
Máte technické myšlení a chcete pracovat s moderními technologiemi?.Hledáme výrobního technologa do čisté výroby přístrojů, které pomáhají lidem po celém světě. Uplatníte své technické znalosti, vyu...Zobrazit vícePoslední aktualizace: před 3 dny
  • Propagováno
Project Manager : flexibilní pracovní doba, nové technologie a roční bonus #BDvr1

Project Manager : flexibilní pracovní doba, nové technologie a roční bonus #BDvr1

INDEX NOSLUŠ s.r.o.Beroun, Czech Republic
Máte zkušenosti s řízením projektů v automobilovém průmyslu a chcete pracovat na zajímavých projektech pro značky jako BMW, ŠKODA nebo Ford?. Hledáme vedoucího projektu, který zvládne koordinovat sé...Zobrazit vícePoslední aktualizace: před 3 dny
  • Propagováno
Supplier Quality Engineer

Supplier Quality Engineer

ZF LIFETECBrandýs nad Labem-Stará Boleslav, Česko, Česko
Ve společnosti ZF LIFETEC spojujeme výjimečné kariérní příležitosti s posláním zachraňovat životy.Díky inovativním řešením a vysoké kvalitě produktů si společnost získala důvěru mnoha předních auto...Zobrazit vícePoslední aktualizace: před 17 dny
MLOps Engineer (AI Platform)

MLOps Engineer (AI Platform)

OmiliaPrague, Prague, CZ
Quick Apply
Are you ready to move beyond maintaining legacy systems and build something truly new? What if your next role gave you the keys to architect an entire AI platform from the ground up, powering syste...Zobrazit vícePoslední aktualizace: před 30+ dny
AI Agent Engineer (Python & Azure)

AI Agent Engineer (Python & Azure)

CROWDCONSULTANTSPrague, Prague, .CZ
Quick Apply
This job operates on a hybrid model, requiring you to be based and work from within Bulgaria.AI-powered agents and workflows. Python, LLMs, and frameworks such as.Azure ML, Cognitive Services, Azure...Zobrazit vícePoslední aktualizace: před 20 dny
  • Novinka!
Senior AI Research Engineer, Model Inference (100% Remote)

Senior AI Research Engineer, Model Inference (100% Remote)

Tether Operations LimitedPrague, 10, CZ
Join Tether and Shape the Future of Digital Finance.At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from ex...Zobrazit vícePoslední aktualizace: před 7 hodinami
  • Propagováno
Digitální nadšenec do Power Apps

Digitální nadšenec do Power Apps

ZF LIFETECBrandýs nad Labem-Stará Boleslav, Česko, Česko
Ve společnosti ZF LIFETEC spojujeme výjimečné kariérní příležitosti s posláním zachraňovat životy.Díky inovativním řešením a vysoké kvalitě produktů si společnost získala důvěru mnoha předních auto...Zobrazit vícePoslední aktualizace: před 17 dny
  • Propagováno
PLC Specialista

PLC Specialista

ZF LIFETECBrandýs nad Labem-Stará Boleslav, Česko, Česko
Potřebujeme mistra v programování.Zkušeného odborníka co se umí programovat v PLC systémů, nejlépe se zkušeností z velkosériového výrobního podniku. Pokud byste měl(a) znalosti specifik procesů v au...Zobrazit vícePoslední aktualizace: před 14 dny
AI Evangelist

AI Evangelist

CloudtalkPrague, Prague, CZ
Quick Apply
Global SaaS Company | $28M Series B Investment | 100% Remote or Hybrid.AI business communication platform .Powered by a January 2024 $28 million Series B investment from top investors like KPN...Zobrazit vícePoslední aktualizace: před 6 dny
Director, AI & Data Products

Director, AI & Data Products

team.blue GlobalPraha-Praha 8, Hlavní město Praha, .CZ
Quick Apply
The most trusted digital enabler.Europe and has more than 3,000 experts to support them.Its goal is to shape technology and to empower businesses with innovative digital services.Click here to read...Zobrazit vícePoslední aktualizace: před 24 dny
IHCRA

IHCRA

ICONPrague, Czech Republic
IHCRA, office based flex in ICON office in Prague, CZE.ICON plc is a world-leading healthcare intelligence and clinical research organization. We’re proud to foster an inclusive environment driving ...Zobrazit vícePoslední aktualizace: před 14 dny