Responsibilities:
- Architect, develop, and maintain backend services for RAG and LLM applications, ensuring high performance and scalability.
- Collaborate with research teams to implement and optimize cutting-edge algorithms for language understanding, generation, and manipulation.
- Optimize and tune backend services for performance, scalability, and reliability.
- Implement APIs and interfaces to connect backend services with front-end applications, databases, and external systems.
- Develop and execute testing strategies to ensure robust and error-free software. Automate deployment processes using CI/CD pipelines.
- Implement monitoring, logging, and alerting systems to ensure the health and performance of backend services.
- Create comprehensive documentation for code, architecture, and operational procedures.
- Participate in code reviews, design discussions, and contribute to the overall technical excellence of the engineering team.
- Provide technical guidance and mentorship to junior engineers, fostering a culture of continuous learning and growth within the team.
Requirements:
- Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field. PhD is a plus.
- Minimum of 3 years of experience in software engineering, with a focus on backend development.
- Technical Skills
- Strong experience with cloud platforms (AWS, GCP, Azure) and containerization technologies (Docker, Kubernetes).
- Deep understanding of microservices architecture and RESTful API design.
- Experience with databases and data storage solutions, such as SQL, NoSQL, vector databases, and caching systems. (e.g. PostgreSQL, MongoDB, Milvus, MinIO, Redis)
- Strong programming skills in languages such as Python, Java, C++, or similar.
- Solid grasp of software engineering principles, software design patterns, and architectural best practices.
- Experience with designing and implementing real-time data processing, event-driven architectures using message queuing and publish-subscribe patterns.(e.g. Apache Kafka)
- Proficiency in setting up, maintaining or using CI/CD pipelines using tools (e.g. Jenkins, GitLab CI/CD)
- AI/ML Skills
- Experience with machine learning frameworks (TensorFlow, PyTorch) and deploying models in production.
- Knowledge of Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs) such as GPT-4, Llama, Mistral or similar.
- Soft Skills:
- Excellent problem-solving skills, attention to detail, and a passion for staying at the forefront of technological advancements.
- Effective communication skills, both written and verbal, with the ability to explain complex technical concepts to non-technical stakeholders.
Working Perks & Professional Development
- Great opportunity to get industry and enterprise experience.
- Exposure to the full spectrum of product development.
- Join a young and dynamic team with forward-looking working culture.
- Lots of room to tinker and experiment.