About the Company
Work on large-scale structured and unstructured data sets to solve a wide array of challenging problems using analytical, statistical, machine learning or deep learning approaches.
About the Role
Design, develop and maintain data ETL processes, ensure the processes are fulfilling business needs and SLAs.
Responsibilities
- Facilitate technical planning and optimize the data infrastructure, models and pipelines to improve data platform stability.
- Design and develop Data Products to drive business outcomes.
- Collaborate with business stakeholders to understand their data needs and develop robust and scalable data models.
- Develop processes to answer recurring business questions and identify opportunities for improvement.
- Ensure Data Quality through continuous improvement and monitoring.
Qualifications
- 2+ years of experience in data science, data engineering, software engineering, or a related field.
Required Skills
- Production-level Scala, Python and SQL programming knowledge.
- Experience building and scaling batch/streaming data pipelines, with good understanding of size and performance constraints.
- Understanding of ML algorithms such as classification, regression, clustering, neural networks, SVM, decision trees, boosting techniques, reinforcement learning, etc.
- Experience building data products with Cloud Services and Data Warehousing services in AWS or AliCloud.
- Knowledge of SQL and NoSQL databases is advantageous.
Good work life balance
Stable
Competitive salary