Mirakl, a leader in platform economy, is seeking a Data Scientist NLP/GenAI to join their team in Paris. The role involves prototyping, iterating, and deploying algorithms in collaboration with Product, Data Engineers, and development teams. The focus will be on catalog Marketplace challenges, including NLP, Computer Vision, and Generative AI (LLMs customs) at scale.
The Data Scientist will work on projects related to automated rewriting of marketing content, product attribute extraction, variant product detection, product categorization, automatic onboarding, product sheet merging, and trend prediction. Mirakl is one of the rare French companies to have fine-tuned LLMs in production at scale.
Responsibilities:
- Analyzing and preparing data.
- Prototyping algorithms.
- Collaborating with Data Engineers and development teams for production.
- Creating dashboards to illustrate algorithm performance.
- Presenting results and participating in brainstorming sessions.
- Discussing use cases, user experience, and integration methods with other teams.
Requirements:
- 4+ years of experience as a Data Scientist.
- Significant experience in NLP and ML applied in enterprise.
- Experience in deploying Machine Learning algorithms.
- Knowledge of NLP and Computer Vision algorithms.
- Proficiency in Python, TensorFlow, or PyTorch.
- Experience with Spark development.
Mirakl offers:
- Impact on over 500 e-commerce/marketplace sites in 40 countries.
- Exposure to advanced techniques (multimodal models, LLM fine-tuning).
- Autonomy and ownership of projects.