Who are we?
Cohere is the leading security-first enterprise AI company. We build cutting-edge foundation AI models and end-to-end products that are designed to solve real-world business problems.
We’re training and deploying frontier models for enterprises who are building AI systems. We believe that our work is instrumental to the widespread adoption of AI and we are looking for folks that want to be part of that.
We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. Cohere is a team of researchers, engineers, designers, and more, who are all passionate about their craft.
We are a global technology company co-headquartered in Toronto and San Francisco, with key offices in London, New York City, Montreal, Seoul, Germany and Paris. Join us!
Why this role?
To have the opportunity to collaborate with Cohere researchers and tools on designing and implementing novel research ideas and shipping state-of-the-art models to production. We have openings in teams covering base model training, retrieval augmented generation, data and evaluation, safety, and finetuning, to name a few; and we are open to receiving intern applications in any research area relating to LLMs to broaden your research connections while obtaining deep experience in a growing AI startup.
Please Note: To be eligible for a Research Internship, you must be currently pursuing a PhD in Machine Learning, NLP, or a related discipline. You need to be available for a full-time internship that lasts for 4-6 months.
As a Cohere Research Intern, you will
- Conduct cutting-edge machine learning research, building and training large language models.
- Focus on research projects aimed at expanding the frontier of knowledge in language modelling and associate areas such as evaluation, multimodal models, optimisation etc.
- Disseminate your research results through the production of publications, datasets, and code.
- Contribute to research initiatives that have practical applications in Cohere’s product development.
You may be a good fit if you
- Are currently pursuing, or in the process of obtaining, a PhD in Machine Learning, NLP, Artificial Intelligence, or a related discipline. We will also consider exceptional non-PhD candidates.
- Are eligible for work authorization in the country of employment at the time of hire and maintain ongoing work authorization throughout the internship period.
- Have experience using large-scale distributed training strategies, data annotation and evaluation pipelines, or implementing state of the art ML models.
- Are familiar with autoregressive sequence models, such as Transformers.
- Have strong communication and problem-solving skills with the ability to convey complex research findings clearly and succinctly.
- Have knowledge, or are knowledgeable, of programming languages such as Python, C, C++, Lua, or related languages.
- Have knowledge of related ML frameworks such as JAX, Pytorch and Tensorflow.
- Have previous experience in building systems based on machine learning and deep learning techniques.
- Demonstrate passion for applied NLP models and products.
Preferred Qualifications
- Demonstrated expertise through publications in top tier venues in fields such as machine learning, NLP, artificial intelligence, computer vision, optimization, computer science, statistics, applied mathematics, or data science.
- Proven ability to tackle analytical problems using quantitative methodologies.
- Proficiency in handling and analysing complex, high-dimensional data from various sources.
- Experience in applying theoretical and empirical research to real-world problem-solving.
HOW AND WHERE WE WORK
- Cohere is remote-friendly. We have offices in Toronto, San Francisco, New York City, London, Paris, Montreal, and more coming soon.
- For those in the office: a daily lunch program, ple