Responsibilities
About the team ByteDance and affiliate are developing the next-generation high-performance analytical database, with a mission to enable efficient and real-time data-driven decision-making on PB-level data sets. The initial product was forked from Clickhouse, after which large re-architecture had been taken place. The product now not only improves the efficiency of Clickhouse but also fits into the elastic cloud-native infrastructure with better scalability and resource utilization. With years of polishment in the internal EB-level scenarios, we are now ready to serve our business partners via various cloud vendors. Responsibilities: - Ensure the stability of ByteDance's data platform, including building and maintaining the operations and maintenance system for detection, emergency response, recovery, and to guarantee business continuity. - Manage automated operations and maintenance for ByteDance's in-house big data and open-source products, improving the efficiency of delivery, operations and maintenance, and technical support. - Promote the accumulation of big data operations experience towards documentation, tooling and standardization, enhancing the operational capabilities of multiple operation and maintenance centers.
Qualifications
Minimum Qualifications: - Bachelor's degree or above in a computer-related field. - Experience in Big Data SRE operations or technical support for toB (business-facing) products. - Familiarity with one or more open-source components, such as Hadoop, Spark, Flink, Hive, Presto/Trino, Doris, Kafka, HBase, Hudi, ClickHouse, etc. - Practical experience in troubleshooting big data product issues, with methodology for investigating online big data product problems and the ability to quickly locate issues. - Familiarity with at least one programming language, including but not limited to Shell, Python, Java, Scala, etc. Preferred Qualification: - Possess good communication skills, teamwork abilities, and self-driven capabilities for continuous self-improvement.