About the Role
Senior Data Engineer (Spark Expert)
This role requires for bilingual speakers who speak both Mandarin and English.
A5 Labs is redefining the boundaries of AI-driven security in competitive online environments. We specialize in ensuring fair play, integrity, and trust in high-stakes, strategy-based games such as online poker and real-time competitive gaming platforms.
We are looking for an experienced SeniorData Engineer (5+ years) to join our growing analytics team. You will be responsible for building, expanding, and optimizing our data and pipeline architecture, and supporting cross-functional teams in data flow and collection. This role ensures the delivery of high-quality, scalable, and reliable data infrastructure across all projects.
Key Responsibilities:
- Build and maintain optimal data pipeline architecture.
- Develop large, complex data sets that meet functional and business requirements.
- Identify and implement process improvements: automate manual tasks, optimize data delivery, and redesign infrastructure for scalability.
- Use AWS big data and SQL technologies to design and implement efficient ETL pipelines from multiple data sources.
- Develop data tools and analytics solutions to generate insights on customer behavior, operations, and key business metrics.
- Collaborate with executives, product, data, and design teams to solve technical challenges and support data needs.
- Ensure secure and compliant data management across multiple data centers and AWS regions.
- Build internal tools for analytics and data science teams to improve product innovation.
- Partner with data experts to enhance system performance and functionality.
Requirements:
- 5+ years of hands-on experience with big data tools (e.g., Databricks, Snowflake).
- 5+ years of experience with big data frameworks (e.g., Spark, Hive, Hadoop, EMR).
- 5+ years of experience with SQL/NoSQL databases.
- 3+ years of experience with AWS services (EC2, ECS, MSK, RDS, Redshift).
- 3+ years of Python programming experience.
- Proven experience building and optimizing large-scale data pipelines and architectures.
- Strong knowledge of streaming and batch data processing.
- Solid analytical skills with structured and unstructured datasets.
- Experience deriving value from large, complex, and distributed data sets.
- Strong understanding of data modeling, data access, and storage design.
- Degree in Computer Science, Statistics, Information Systems, or a related quantitative field (Master's preferred).
- Fluent Mandarin required, also can use English as working language.
Nice to Have:
- Experience with Elasticsearch, Solr, or other indexing solutions.
- Familiarity with Spark Streaming, Kafka Streams, or Flink.
- Experience with workflow tools like Airflow or Apache Nifi.
- Understanding of machine learning, numerical analysis, and data analytics.
About the Team:
This role is embedded within AI Engineering team, A5 Labs elite team dedicated to advanced game AI and security. We develop reinforcement learning agents, real-time detection systems, and behavioral analytics tools to tackle cheating, botting, and collusion in poker and other high-skill games.
What We Offer
- Competitive compensation (4.5+ Glassdoor rating; 100% pay satisfaction)
- Fully remote with flexible hours and generous paid leave (45 weeks extra)
- Direct collaboration with world-class AI researchers and engineers
- Product-focused culture we ship real systems, not just white papers
- Multicultural and inclusive team Responsible, Freedom and Collaboration.