UNHCR’s Microdata Library

Support to UNHCR to discover, clean, catalogue and anonymize existing data for its microdata library to promote the availability and accessibility of data to external partners such as academics and researchers.

13 Jul, 2023

Overall objectives

The project will continue to discover, clean, document and anonymize the backlog of data in UNHCR’s operations and technical sections for publication on the Raw Internal Data Library (RIDL) and Microdata Library (MDL). It will also assist in institutionalizing the use of the RIDL and will standardize curation procedures and methods.  

It will also familiarize staff members with their roles and the procedures of Data Dissemination. Making data available on the external microdata library will facilitate data sharing with partners and reduce the need to engage in ad hoc data sharing agreements. The dissemination of microdata will also enable analysis and research that can inform and improve future programmes and policies that aim to improve the lives and well-being of persons affected by forced displacement., Moreover, it will reduce duplication in data collection exercises and the burden of forcibly displaced persons to respond to queries. The project will also engage in communication and promotion of activities to encourage the use of the shared data. Overall, it will, improve the quality and reliability of data produced by UNHCR, enhance the credibility of UNHCR as an authoritative source of information on FDPs, and contribute to maximizing the impact of UNHCR’s data collection.

Activity description

The RIDL and MDL platforms are designed to provide a user-friendly format and a secure location for the storage and use of datasets collected by UNHCR and others working on forced displacement issues. These platforms will host a large volumn datasets which UNHCR or its partners have previously collected.

JDC’s support facilitates the work of a Data Curation Team which will ensure that the RIDL and MDL are populated with data that is cleaned and anonymized in a manner consistent with UNHCR’s Data Protection Policy. The Data Curation Team engages operations, partners and technical units to discover, identify, analyze, and prioritize data for storage, curation and dissemination.

The activity will include scoping missions by curators to better understand the current data landscape and existing datasets, and train data producers and users on the use of the platforms. Remote support on the platforms is also being provided to UNHCR field staff, regional offices and Headquarters to ensure safe data management. Regional training workshops will be organized for a target technical and Information Management staff on the use of the platforms and other data-related activities. The Curation Team will also assist in obtaining the necessary approvals to publish suitable data sets on the external MDL platform and will manage the release of data according to protocol.

The Curation Team will also develop and issue technical guidance and training materials for data producers, including best practices for cleaning, anonymization, and creation of metadata (the Data Curation Handbook). Standard language to be included in Project Partnership Agreements to facilitate data sharing from partners (including host governments, NSOs, non-governmental organizations and other organizations) will also be developed. The project will also support the continued technical/IT development of the two platforms.

Engagement with partners

The UNHCR Microdata Library application was set up and will continue to be maintained in close coordination with the World Bank.

Background and Context

While UNHCR routinely collects a wealth of data, both directly and through its partners, the organization as a whole has not been able to capitalize fully on its significant data collection investments. This is due to various reasons:  data are at times lost or forgotten after the data collection and analysis activities are complete; data are not stored in a format that is conducive to further use; or metadata is not documented. In its Data Transformation Strategy, UNHCR has committed “…that by 2025, UNHCR is a trusted leader on data and information related to refugees and other affected populations, thereby enabling actions that protect, include and empower.”

In the interest of promoting efficiency, transparency and the best use of public funds, open access and dissemination of data is increasingly promoted and sometimes even mandated by those funding data collection efforts, and many National Statistical Offices (NSOs) now maintain open data portals. To date, UNHCR has shared mainly aggregated data openly and publicly, and access to microdata has largely been regulated by ad hoc data sharing agreements. 

This project is increasing UNHCR’s commitment to open and responsible data sharing, by discovering, cleaning, cataloguing and anonymizing microdata collected by UNHCR and its partners in both an internal-facing and an external-facing online platform, namely the Raw Internal Data Library (RIDL) and the Microdata Library (MDL). At the internal level, the project will therefore improve data quality, prevent data loss, prevent duplication of data collection efforts, and reduce the burden of trying to find data as well as responding to personalized requests of other staff members. At the external level, it will contribute to the data value chain by promoting, through an ease of access, further analysis by academics and research centers, private sector, development actors and other humanitarian organizations. This analysis can be used to inform programming, policy, and advocacy efforts generating a positive impact on the lives of people affected by forced displacement.

Contact

For further details on this activity, please contact:

Patrick Brock – [email protected]

Additional resources

h

World Bank-UNHCR Data Sharing Agreement

More activities

Enhancing UNHCR's global registration system

UNHCR’s development of its Population Registration and Identity Management Eco-System (PRIMES), which unifies registration, identity management and case management applications.

Microdata from forcibly displaced populations

Support UNHCR’s analysis of comparable socioeconomic data of the forcibly displaced to inform effective strategies, plans, policy and programs.

Intergrating World Bank microdata into the UNHCR microdata library

This project aims to provide technical support from the World Bank so that content on forced displacement from the Bank’s microdata library can be shared on UNHCR’s microdata library.