About Us
At Thema, we’re pioneering how the world understands companies, markets, and industries with foundational AI research at web scale.
We regularly process hundreds of terabytes of data and develop novel embedding methods to crack the problem of company similarity and track market evolution. Backed by the world’s best investors and the UK government, and working in partnership with Cambridge University and the world’s leading companies, we’re making breakthroughs that push the boundaries of AI.
What the Job Involves
We’re looking for an ML Research Engineer to:
- Design and fine-tune embedding models.
- Build advanced vector-based data representations.
- Innovate with unsupervised learning.
- Develop retrieval systems capable of handling billions of vectors.
- Solve complex challenges in entity resolution and scale systems globally with MLOps.
We’re a small team of exceptional engineers and researchers tackling web-scale problems no one else is approaching. If you’re ready to make breakthrough discoveries about companies and markets at the frontier of AI, we should work together.
Necessary Experience
- MSc with relevant experience, or a PhD, in machine learning or a related field.
- Appreciation for data exploration, cleaning, and rapid prototyping to ensure high-quality outcomes.
- Hands-on experience with vector-based similarity methods.
- Experience in building LLM systems optimised for speed and scalability.
- We value open-source contributions; half our team was discovered through their exceptional open-source work.
Example Projects
- Advanced Embeddings: Develop vector representations of complex data to improve search and similarity.
- Retrieval Systems: Enhance search pipelines using vector search, text search, re-ranking, and retrieval-augmented generation (RAG).