Work with technology teams across infrastructure and other divisions to deliver system solutions for the business
Take part in the development and integration of relevant systems with CDP / CDSW
Apply hands-on experience in Big Data solutions and development on a Python technology stack to design and implement secure, scalable and high-performance data processing pipelines
Document the design, build and implementation deliverables you own, and present them when needed
Build strong relationships and manage expectations with users and stakeholders
Requirements
At least 3 years of relevant experience
Strong understanding of the Cloudera framework, including CDP (Cloudera Data Platform) and CDSW (Cloudera Data Science Workbench)
Experience implementing data security and access control using Ranger/Atlas
Must have a working understanding of Hadoop, Hive and Spark/PySpark architecture
Experience with Python web application frameworks (Django)
Experience in shell scripting, automation and troubleshooting technical issues
Other good-to-have skills, including knowledge of global markets products:
Experience working in the financial industry, with relevant experience in core data science on the Cloudera platform using a Python technology stack
Previous experience on data science projects and an understanding of global markets products and their underlying pricing components (market data analytics and identifying risk factors that affect product pricing)
Experience using Cloudera Manager to monitor Hadoop services
Good knowledge of and working experience in Hadoop administration (incl. Hive, Impala, Kafka, ZooKeeper)
Hands-on experience in a techno-functional role, analyzing and proposing solutions for business issues, process changes and functional requirements
Strong team player with excellent communication and interpersonal skills
Strong problem solver who can question and understand proposed solutions and business drivers