EUPOL

The European Union Policy-Making (EUPOL) dataset

The EUPOL dataset includes virtually all information contained in the European Commission’s online database PreLex. The mission of PreLex is to monitor the inter-institutional decision-making process of the EU, providing information about various formal events and institutional actors involved in policy-making, as well as cross-references to documents contained in other online databases. In combination with its long-term coverage since the mid 1970s, PreLex is a useful database for studies of EU politics. The EUPOL dataset provides the complete information contained in PreLex in a standardized and machine-readable format. Overall, the latest version of the dataset consists of more than 31,000 decision-making cases, whose features are described by about 2,700 variables. For further information, please see the relevant blog posts.

Dataset description (please cite when using the dataset)

Dataset

EUPOL v03 (1975-2011), raw information downloaded 17-18 January 2012

EUPOL v02 (1975-2010), raw information downloaded 7-10 May 2011

EUPOL v01 (1975-2009), raw information downloaded 8-9 April 2010

Software

The scripts to download, extract, and store the information from PreLex were written in Python 2.6.5, using ActiveState’s PythonWin editor. The scripts rely on the following external Python modules: BeautifulSoup, ClientForm, and Mechanize.

Credits

The computer scripts used to generate the dataset were to a large extent written while I was a postdoctoral fellow in the Department of Public Administration at Leiden University in 2008. The postdoctoral fellowship was co-funded by the Netherlands Institute of Government.

Cross-validation

The table below shows a comparison of the aggregate numbers on the type of legislative procedure coded from the EUPOL v01 and LawLeecher (Kovats 2010) datasets, respectively. While the match is not perfect, the comparison indicates no major discrepancies between the two datasets. The differences are most likely due to slightly different coding decisions rather than differences in the originally extracted information. Therefore, these preliminary results provide us with some confidence in the validity of the two independently programmed extraction procedures.

Legislative procedure EUPOL LawLeecher Difference Absolute Difference
Consultation 6166 6135 31 31
Agreement 1170 1147 23 23
Codecision 1112 1147 -35 35
Cooperation 580 511 69 69
Assent 251 242 9 9
Consultation ECB 16 17 -1 1
Consultation CoA 3 3 0 0
Social Protocol 3 2 1 1
Special Legislative Procedure 2 0 2 2
Total 9303 9204 99 171