How It Works

Our Data Methodology

Transparency is a core principle of our project. This page outlines the automated process we use to collect, structure, and present the data on political prisoners.

Step 01

Automated Data Collection

Our system begins by automatically scraping data from highly reputable human rights organizations. Our primary sources include the OVD-Info API (api.repression.info) and Memorial (memopzk.org). Automated scripts visit these sources periodically to gather raw data, including names, case descriptions, locations, and links to original reports.

Step 02

AI-Powered Data Structuring

Raw text from various sources is often unstructured. We use AI models to read and interpret this raw text, extracting key information into a consistent, structured format. The AI is prompted to identify and categorize data points such as arrest dates, criminal charges, case categories, trial statuses, and potential health concerns. It also generates a concise summary for each case in English.

Step 03

Data Enrichment & Geocoding

Once structured, we enrich the data further. Location names provided in the source material are converted into precise geographic coordinates (latitude and longitude) using the Nominatim API for OpenStreetMap. This geocoding process is what enables the powerful interactive map feature on our platform, allowing for geographic analysis of arrest locations.

Step 04

Storage and Presentation

The final, structured, and enriched data for each prisoner is stored in a PostgreSQL database. This robust database serves as the single source of truth for our entire application. A Node.js API, built with Express, provides the data to our Next.js frontend, which then renders the interactive tables, maps, charts, and detail pages you see on this site.

A Note on Accuracy

This project is heavily reliant on automated processes, including AI-based data extraction. While we strive for the highest level of accuracy, errors in interpretation can occur. The data presented here should be considered a starting point for research, and we always provide a direct link to the original source material on each prisoner's page for verification. If you find a discrepancy, please don't hesitate to contact us.

Explore the Data

See our methodology in action

VIEW ANALYTICS