Data Scientist | Remote
We are seeking a highly skilled Data Scientist to join our AI/LLM Delivery Unit, responsible for designing, developing, and delivering data-driven and AI-powered solutions for enterprise clients. This role combines strong analytical expertise, machine learning experience, and hands-on exposure to Large Language Models (LLMs) to support high-impact client engagements.
The ideal candidate will be both technically strong and client-facing, capable of translating business requirements into scalable AI solutions.
Key Responsibilities
1. AI / Machine Learning Development
Design, develop, and deploy machine learning models and data science solutions
Work on LLM-related tasks such as prompt engineering, evaluation, fine-tuning, and model optimization
Build and optimize pipelines for structured and unstructured data
2. LLM & Generative AI Solutions
Develop and evaluate applications using LLMs (e.g., text generation, classification, summarization, entity extraction)
Conduct prompt engineering and experimentation to improve model outputs
Support model benchmarking, testing, and performance evaluation
3. Data Analysis & Insights
Analyze large datasets to extract actionable insights and patterns
Perform exploratory data analysis (EDA) and feature engineering
Build data visualizations and reports for stakeholders
4. Client Engagement / Delivery
Participate in client-facing technical discussions and presentations
Translate business requirements into AI/ML solutions and solution architectures
Provide consultative insights based on project learnings and industry trends
5. Collaboration & Delivery Excellence
Work closely with cross-functional teams (engineering, QA, annotation, delivery leads)
Support the end-to-end project lifecycle: requirements gathering → development → deployment → monitoring
Ensure the quality, scalability, and reliability of AI deliverables
Qualifications
Education
Bachelor’s degree in Data Science, Computer Science, Mathematics, Statistics, or a related field
Master’s degree or PhD preferred, especially for advanced AI/ML roles
Experience
2–6+ years of experience in data science, machine learning, or AI solutions
Experience working on AI/LLM or NLP-related projects is highly preferred
Exposure to client-facing or delivery environments is a strong advantage
Technical Skills
Strong proficiency in Python (NumPy, Pandas, Scikit-learn)
Experience with ML frameworks (TensorFlow, PyTorch)
Hands-on experience with LLMs / NLP tools (e.g., Hugging Face, OpenAI APIs, embeddings, RAG frameworks)
Solid understanding of machine learning algorithms, data preprocessing and feature engineering, and model evaluation and performance metrics
Experience with SQL and large-scale datasets, cloud platforms (AWS, Azure, or GCP), and version control (Git)
Company Profile
Innodata is a global data engineering company focused on advancing data and AI innovation. Recognizing the inherent link between data and AI, the company aims to enable leading technology companies and enterprises to advance Generative AI and other AI technologies. Innodata delivers a broad range of adaptable solutions, platforms, and services designed to support both builders and adopters of AI. With over 35 years of expertise, the company remains committed to delivering the highest-quality data and driving exceptional outcomes for its clients across all partnerships.