AI Engineer: NLP & LLMs
- Remote
- Full-Time
Company Description
Zyphe provides a privacy-first identity verification solution that prioritizes user control over personal data while ensuring businesses are protected from fraud and data breaches. Powered by a decentralized platform, Zyphe enables seamless identity verification and retention without storing Personally Identifiable Information (PII) on company servers. With advanced KYC, AML, and KYB modules built on Web3 principles, Zyphe helps organizations meet modern privacy and security requirements. The platform also offers users secure identity vaults and effortless one-click verification for smooth onboarding experiences.
Role Overview
We're looking for an AI Engineer specializing in NLP and Large Language Models to build intelligent language systems that power our compliance automation and document understanding capabilities.
This is not a prompt-engineering role. You will own the full NLP stack, from fine-tuning foundation models to designing production RAG pipelines and evaluation frameworks.
You'll work at the intersection of LLM engineering, information extraction, and regulatory intelligence, building systems that understand complex compliance documents and automate decision-making.
What You'll Do
- Build and fine-tune large language models for document classification, entity extraction, and compliance analysis
- Design and optimize Retrieval-Augmented Generation (RAG) pipelines for regulatory knowledge bases
- Develop prompt engineering frameworks and evaluation harnesses for LLM-powered features
- Implement inference optimization strategies (quantization, distillation, speculative decoding)
- Build structured output extraction from unstructured identity documents and compliance filings
- Create automated evaluation pipelines to measure accuracy, hallucination rates, and latency
- Collaborate with product and compliance teams to translate regulatory requirements into AI capabilities
- Stay current with the fast-moving LLM landscape and evaluate new models and techniques
What We're Looking For
- Strong experience building production NLP systems, not just prototypes
- Deep understanding of transformer architectures, attention mechanisms, and training dynamics
- Hands-on experience with LLM fine-tuning (LoRA, QLoRA, full fine-tuning) and RLHF/DPO
- Proven ability to design and optimize RAG systems at scale
- Experience with inference optimization and model serving (vLLM, TGI, or similar)
- Strong Python skills and familiarity with the HuggingFace ecosystem
- Understanding of evaluation methodologies for generative AI systems
- Experience with information extraction from semi-structured documents is a plus
What Makes You a Great Fit
- You think in systems, not models: you care about the full pipeline, from retrieval to generation
- You're obsessed with measurable quality: hallucination rates, precision, and user impact
- You combine deep ML knowledge with pragmatic engineering
- You don't just follow trends; you evaluate critically and ship what works