
Search by job, company or skills
Temasek is a global investment company headquartered in Singapore, with a net portfolio value of S$434 billion (US$324 billion, €299 billion, £250 billion, and RMB2.35 trillion) as at 31 March 2025. Marking our unlisted assets to market would provide S$35 billion of value uplift and bring our mark to market net portfolio value to S$469 billion.
Our Purpose So Every Generation Prospers guides us to make a difference for today's and future generations.
Operating on commercial principles, we seek to deliver sustainable returns over the long term.
We have 13 offices in 9 countries around the world: Beijing, Hanoi, Mumbai, Shanghai, Shenzhen, and Singapore in Asia; and Brussels, London, Mexico City, New York, Paris, San Francisco, and Washington, DC outside Asia.
For more information on Temasek, please visit www.temasek.com.sg.
For Temasek Review 2025, please visit www.temasekreview.com.sg.
For Sustainability Report 2025, please visit https://www.temasek.com.sg/content/dam/temasek-corporate/sustainability/2025/Temasek-Sustainability-Report-2025.pdf.
Introduction
AI agents are only as good as the data they can reason over. Poorly structured, stale, or inconsistently governed data is the most common reason enterprise AI products fail to deliver value — not model capability, but data readiness. The AI Data Engineer at Temasek is responsible for building the data foundations that make Temasek's agentic AI systems trustworthy, accurate, and capable of reasoning over the complex, heterogeneous data environment of a global investment institution.
This role sits at the intersection of data engineering and AI systems engineering — responsible for designing and building the data architectures, pipelines, and quality frameworks that allow AI agents to retrieve, reason over, and act on Temasek's investment data. You will work across structured investment data (portfolio positions, financial statements, market data), unstructured data (research reports, company filings, meeting notes, news), and real-time data streams — making all of it accessible, reliable, and AI-readable.
Agent-ready data architecture
Enterprise data quality and governance
Shared, reusable data platform for AI
Experience and background
Technical capabilities
Job ID: 150688315
Skills:
data engineering , Ml, Java, Machine Learning, Natural Language Processing, SAS, data mining, Scala, Big Data, Python, analytics platforms, Ai, data technologies, data lakes, R, ETL processes, cloud computing platforms
Skills:
S3, Lambda, AWS Glue, Redshift, AWS SageMaker, RAG, Flask APIs
Skills:
S3, Lambda, AWS Glue, Redshift, AWS SageMaker, Flask APIs
Skills:
Spark SQL, T-sql, Data Factory, Power Bi, Json, Sql, Pandas, Azure Machine Learning, Python, Parquet, OneLake, Microsoft Entra ID, scikit-learn, Lakehouse Data Warehouse, Azure AI Services, Microsoft Purview, Delta Lake, Microsoft Fabric
Skills:
data engineering , snowflake , Google Cloud Platform, Ffmpeg, Kafka, Hive, Opencv, Spark, Microsoft Azure, AWS, AI data infrastructure, Airflow, big data platforms, cloud platforms, Ray, Flink, video processing technologies, distributed data processing frameworks
We don’t charge any money for job offers