SEA-ACN OPEN DATA | PEPs, Collaborations & Open Data: A New Era of Transparency

Collaborative Solutions for Tackling Transnational Corruption Through Better Data


In our previous discussion, we emphasized the importance of open data as a powerful tool in the fight against corruption. We also explored the definition of Politically Exposed Persons (PEPs) and highlighted the role of data standards in enhancing information management. You can revisit that discussion here 👉 https://www.facebook.com/share/p/1CNgKJZxHc/

In this final section, we will focus on how fostering collaboration between organizations can help establish effective data standards—ensuring that information related to PEPs is managed more efficiently and used effectively in efforts to combat corruption.

Khairil Yusof from the Sinar Project will discuss three key aspects that are crucial for understanding and applying Politically Exposed Persons (PEPs) data standards

  1. Collaborations and Implementations of PEPs definition
  2. How impact of data standard
  3. The Example of Tools in using PEPs for Journalists

These three topics will give you a clearer understanding of how PEPs data standards work—and why they matter in the global fight against corruption. 

A key factor in strengthening anti-corruption efforts is fostering collaboration between organizations through the development of unified data standards.

 

Khairil Yusof highlights a successful partnership between the Sinar Project and the Centre to Combat Corruption and Cronyism (C4 Centre). By working together to research Politically Exposed Persons (PEPs) using a common data standard, they eliminated redundant data collection efforts. This approach also enabled journalists covering elections or budget processes to easily update and contribute information through their reporting.

Another notable case comes from Myanmar, where Khairil Yusof collaborated with 10 civil society organizations to collect and organize data on members of parliament and politicians. Their goal was to create an accessible and user-friendly system for public use. Despite challenges stemming from language barriers, the adoption of a shared data standard allowed participants to interpret and categorize information consistently through uniform table formats. This streamlined both data collection and usage.

 

However, one lingering challenge remains: the complexity of local dialects, which often differ from the country’s official language and continue to pose difficulties in standardization

One notable tool utilized by Khairil Yusof involves a database from WeVis, which contains approximately 1,400 photos of elected politicians. Each image is tagged with corresponding names and details, enabling the training of facial recognition systems. By linking names with images, the system can accurately identify political figures—offering a powerful tool for verifying identities and detecting potential conflicts of interest.

This demonstrates the value of sharing databases and tools across organizations, both locally and internationally. Having a clear and standardized definition of Politically Exposed Persons (PEPs) enhances investigators’ ability to pinpoint targets and increases the overall effectiveness of the investigative process.

Khairil further highlights his collaboration with the C4 Centre to refine PEP definitions and develop tools for training investigative journalists and civil society groups in Timor-Leste. One case study in the workshop involved mismanagement and corruption within a state-owned waste management enterprise. By applying the PEP framework, they created a “PEPs Map” to identify key individuals requiring investigation. This map could include business partners, government officials on boards of directors, or even indirectly connected individuals such as drivers or spouses—based on PEP criteria used in Malaysia.

In situations where official PEP data is inaccessible, journalists can turn to alternative sources such as personal associates, family members, or close contacts. These individuals might provide critical information for investigations. This method of defining PEPs, combined with investigative techniques, allows journalists to systematically broaden the scope of their inquiries—reducing the likelihood of overlooking vital information.

Many countries continue to face challenges in establishing compatible data standards due to fragmented and inconsistent data across regions. However, the development of universal data standards would allow organizations to exchange and utilize information more effectively, reducing redundant work and significantly speeding up data verification processes.

For example, platforms like WeVis in Thailand and the Sinar Project in Malaysia share a common goal: monitoring parliamentary activities and scrutinizing Politically Exposed Persons (PEPs). By adopting the same data standard, Sinar can seamlessly integrate WeVis data into its system without restructuring it, reducing workload and improving data processing efficiency.

Another promising approach involves using network graphs to visualize connections between PEPs and various organizations. In practice, the Sinar Project processed WeVis’s standardized database to build its own network analysis system without developing a new data structure. This method not only speeds up data processing but also enhances accuracy in identifying corruption risks.

On an international scale, standardized data enables more efficient cross-border comparisons and linkages. Khairil Yusof illustrates this by noting that even if he doesn’t recognize the English name of a Thai political figure, he can use Malaysia’s database as a reference point and cross-check it with data from the International Consortium of Investigative Journalists (ICIJ) to identify connections. This example highlights how shared data standards can enhance cross-border investigative efforts.

Global data standards do more than streamline verification—they empower civil society organizations, investigative journalists, and transparency-focused agencies to scrutinize and analyze individuals involved in corruption with greater accuracy and speed. By reducing barriers related to language and data structures, international collaboration becomes more efficient, strengthening global efforts to combat corruption.

YEAR
2025
AUTHOR

Knowledge hub for Regional Anti-corruption and good governance Collaboration (KRAC)

ORGANIZATION
Related Content

You might also like...

th