Associate IT Officer, Data and Information Management (Associate Data Engineer)
Build a career with impact. Working at the World Bank Group (WBG) provides a unique opportunity to help countries solve their greatest development challenges. As one of the largest sources of funding and knowledge for developing countries, the WBG is a unique partnership of five global institutions dedicated to ending poverty, increasing shared prosperity, and promoting sustainable development. With 189 member countries and more than 120 offices worldwide, the WBG works with public and private sector partners, investing in groundbreaking projects and using data, research, and technology to develop solutions to the most urgent global challenges. IFC, a member of the World Bank Group, is the largest global development institution focused on the private sector in emerging markets. Working with 2,000 businesses worldwide, we use our six decades of experience to create opportunity where it’s needed most. In FY25, our total exposure in developing countries rose beyond $70 billion, leveraging our capital, expertise and influence to help the private sector end extreme poverty and boost shared prosperity. For more information, visit www.ifc.org. The mission of the Corporate Information Technologies Department (CIT) is to leverage technology to enable IFC’s strategic priorities and to support IFC’s business and operations. By acting as the technology partner and business enabler, the department provides state-of-art information and technology solutions to support IFC’s Investment Advisory Services, Financial Operations, Treasury the Asset Management Company. We work closely with the business to design solutions that automate business processes, help enable business responsiveness to clients, improve operational efficiency and organizational effectiveness, and manage risks. CIT aims to ensure that our solutions meet the business needs of users across a range of functions, including operations and corporate business processes, information, knowledge and learning, data and analytics, and more. We are looking for an Associate Data Engineer to join our growing data and analytics ecosystem. In this role, you will design and build modern data pipelines and platforms that support critical business decisions across IFC. You will work at the intersection of data engineering, cloud technologies, and advanced analytics, helping transform raw data into trusted, actionable insights. This is an exciting opportunity for a hands-on data professional who enjoys solving complex problems and contributing to a global mission. The ideal candidate is an experienced professional operating with a high degree of independence within established frameworks. • Applies advanced technical knowledge to perform in-depth analyses and develop data engineering solutions within their area of specialization • Addresses moderately complex technical challenges, proposing well-reasoned solutions and contributing to design and architectural decisions • Collaborates with internal and external stakeholders to align technical solutions with business needs, facilitating informed decision-making • Provides guidance and knowledge sharing to junior team members, contributing to overall team capability and best practices • Demonstrates the ability to translate business requirements into scalable data solutions, applying professional judgment across a range of assignments Roles and Responsibilities Data Pipeline Engineering and Development: • Design, develop, and maintain scalable and reliable batch and streaming data pipelines using modern data processing frameworks (e.g., PySpark, SQL) • Implement ETL/ELT processes that support ingestion, transformation, and integration of structured and unstructured data from multiple sources, including metadata enrichment and AI-ready preparation for downstream analytics and GenAI use cases • Ensure pipelines are robust, reusable, and aligned with enterprise data architecture standards AI-Enabled Unstructured Data Processing (GenAI/NLP): • Build pipelines to ingest and process unstructured and semi-structured content (e.g., documents, PDFs, presentations, emails, web content), including text extraction and metadata tagging • Prepare high-quality text datasets to support AI use cases such as search, summarization, classification, and question answering • Work with data scientists and AI engineers to productionize AI/GenAI workflows • Apply Responsible AI and security practices (privacy, access control, sensitive data handling, and auditability) Data Modeling and Architecture: • Develop and manage data models aligned with modern data lakehouse architectures, including Medallion (Bronze, Silver, Gold) layers • Build curated datasets optimized for analytics, reporting, and downstream consumption • Contribute to the evolution of IFC’s enterprise data architecture and standards Performance Optimization and Scalability: • Analyze and optimize data processing performance through efficient partitioning, indexing, caching, and storage strategies • Ensure scalability and cost-efficiency of data solutions within cloud environments • Continuously monitor and improve system performance and reliability Data Governance, Security, and Compliance: • Implement and support data governance practices, including data lineage, access control, and auditing mechanisms • Leverage tools such as Unity Catalog to enforce data security and compliance policies • Ensure adherence to enterprise standards for data privacy, security, and risk management Production Support and Operational Excellence: • Deploy, schedule, and monitor data workflows in production using tools such as Databricks Jobs or equivalent orchestration frameworks • Troubleshoot pipeline failures, implement error handling, and ensure high availability of critical data assets • Participate in continuous integration and deployment (CI/CD) processes for data engineering solutions Collaboration and Stakeholder Engagement: • Partner closely with data scientists, analysts, and business stakeholders to understand data requirements and deliver scalable solutions • Support machine learning workflows by providing high-quality, production-ready datasets • Provide technical guidance and mentorship to junior team members where applicable