We are looking for technically exceptional and intellectually curious machine learning interns excited about making large language models smaller, faster, and more efficient. You will work directly with the founding team to research, implement, and validate model-optimization techniques, while building software infrastructure that enables rapid experimentation and iteration.
This is a hands-on, research-meets-engineering role at the intersection of modern ML and next-generation AI compute. Candidates interested in deep learning, model efficiency, and fast-paced startup environments are encouraged to apply. The role is designed with a potential path toward a full-time position for high-performing candidates.