About TCS:
A purpose-led organization that is building a meaningful future through innovation, technology, and collective knowledge. We're #BuildingOnBelief.
Tata Consultancy Services (TCS) is a global leader in IT services, digital and business solutions that partners with its clients to simplify, strengthen and transform their businesses. TCS offers a consulting-led, integrated portfolio of IT, BPS, infrastructure, engineering and assurance services. We ensure the highest levels of certainty and satisfaction through a deep-set commitment to our clients, comprehensive industry expertise and a global network of innovation and delivery centers. For more information, visit us at www.tcs.com.
Job Description:
- Inventory and assess legacy ML models from H2O, R, SAS or AutoML platforms
- Document model inputs, outputs, transformations, feature dependencies and data flows
- Define migration strategies for each model based on complexity and business criticality
- Execute migration using approaches including retraining, non-retraining replication and hybrid methods
- Rebuild models using modern ML stack on Databricks platform
- Translate legacy feature engineering into PySpark-based pipelines
- Implement Databricks workflows, notebooks and job orchestration for migrated pipelines
- Implement MLflow lifecycle including experiment tracking, model registry, versioning and deployment
- Develop batch scoring pipelines and enable real-time scoring where applicable
- Perform model validation using golden datasets, parity testing and statistical comparison
- Execute performance benchmarking covering accuracy, latency and scalability
- Ensure migration acceptance through business validation and audit-ready sign-off
Mandatory Technical Skills
- Programming
- Python, PySpark, SQL, R
- ML Libraries
- numpy, pandas, scikit-learn, tensorflow, pytorch
- Legacy ML Platforms
- H2O, SAS, SPSS, R-based modeling platforms, AutoML
- Databricks and MLOps
- Databricks notebooks, workflows, Delta Lake, MLflow tracking, MLflow model registry, MLflow deployment