Responsibilities
- Research and implement deep learning algorithms for model training, optimization, and evaluation.
- Support model development in computer vision, 3D perception, and multimodal learning.
- Build and maintain scalable data processing pipelines for image, video, text, and audio datasets.
- Conduct experiments, analyze results, and summarize insights for the research team.
- Collaborate with engineers and researchers to improve model performance and data quality.
- Keep up with the latest research in ML, CV, NLP, and MLOps, and prototype promising ideas.
Requirements
- Currently pursuing a Bachelor’s, Master’s, or PhD degree in Computer Science, Artificial Intelligence, Electrical Engineering, or a related field.
- Solid understanding of machine learning, deep learning, and common model architectures.
- Proficiency in Python and at least one deep learning framework (PyTorch, TensorFlow, or JAX).
- Familiarity with data preprocessing, training workflows, and evaluation metrics.
- Strong analytical and problem-solving skills, and the ability to learn quickly.
- Experience with multimodal data (image/video/text/audio) or academic ML research is a plus.
- Strong communication skills, adaptability, and resilience; startup experience is preferred.