Last Updated: 2018-07-05

The classification works on the corpus of all parliamentary questions (oral and written) submitted during the VIII term. The corpus is projected into a vector space whose dimensions are keywords selected with a technique based on Markov chains.
In this space, every text is represented by its TF-IDF (term frequency–inverse document frequency) vector.
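
To make the representation concrete, here is a minimal sketch of how a text could be turned into a TF-IDF vector over a fixed keyword list. It is an illustration only: the keyword list, the toy documents (loosely based on question titles) and the function name `tfidf_vectors` are assumptions, the Markov-chain keyword selection is not reproduced, and the exact TF-IDF weighting used in the project may differ.

```python
import math
from collections import Counter


def tfidf_vectors(docs, keywords):
    """Return one TF-IDF vector per document, with one dimension per keyword."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each keyword.
    df = {kw: sum(1 for toks in tokenized if kw in toks) for kw in keywords}
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1  # avoid division by zero on empty texts
        vec = []
        for kw in keywords:
            tf = counts[kw] / total  # relative term frequency
            # Plain log(N / df); keywords absent from the corpus get weight 0.
            idf = math.log(n_docs / df[kw]) if df[kw] else 0.0
            vec.append(tf * idf)
        vectors.append(vec)
    return vectors


# Toy documents and a hypothetical keyword list (assumptions for illustration).
docs = [
    "vat scheme for the sale of building land",
    "misuse of flat-rate vat scheme",
    "mobility of apprentices in the eu",
]
keywords = ["vat", "scheme", "mobility"]
vectors = tfidf_vectors(docs, keywords)
```

Each row of `vectors` is the position of one text in the keyword space; a keyword that appears in few documents gets a higher weight than one that appears in many.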

On this vector space we trained two different classifiers (an SVM and a random forest). By combining the two classifiers we reach a precision of 81% on our test set.
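
The text does not say how the two classifiers are combined, so the following is only one plausible sketch: it averages the per-class scores of the two models (soft voting) and then measures precision per class. The category labels, the scores and the helper names `combine` and `precision` are all made up for illustration and are not the project's data or results.

```python
def combine(probs_a, probs_b):
    """Average two per-class probability dicts and return the top label."""
    labels = set(probs_a) | set(probs_b)
    avg = {
        lab: (probs_a.get(lab, 0.0) + probs_b.get(lab, 0.0)) / 2
        for lab in labels
    }
    # Sorting first makes ties resolve deterministically (alphabetical order).
    return max(sorted(avg), key=avg.get)


def precision(predicted, actual, label):
    """Fraction of items predicted as `label` whose true class is `label`."""
    truths = [a for p, a in zip(predicted, actual) if p == label]
    if not truths:
        return 0.0
    return sum(1 for a in truths if a == label) / len(truths)


# Hypothetical class scores from the two models for three questions.
svm_out = [
    {"budget": 0.7, "agriculture": 0.3},
    {"budget": 0.4, "agriculture": 0.6},
    {"budget": 0.55, "agriculture": 0.45},
]
rf_out = [
    {"budget": 0.6, "agriculture": 0.4},
    {"budget": 0.3, "agriculture": 0.7},
    {"budget": 0.2, "agriculture": 0.8},
]
actual = ["budget", "agriculture", "budget"]  # hypothetical true labels

predicted = [combine(a, b) for a, b in zip(svm_out, rf_out)]
```

In this toy case the third question is assigned to "agriculture" because the random forest is far more confident than the SVM, which shows why the combination rule needs care: a single overconfident model can outvote a correct one.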

As you may understand, classifying parliamentary texts requires knowledge of the domain, care when combining the classifiers and high-quality training. Even when all these elements are in place, this semi-automatic classification can hardly be perfect, but it is worth continuously trying to improve it.

All feedback and help are therefore more than welcome!

2018/03/22 Fraud perpetrated by Dutch dairy farmers
2016/01/20 Commission inquiry into possible price-fixing in the pork, beef and dairy sectors in France
2015/01/21 Plan to settle unpaid bills: continuity in the Council's work
2015/05/20 Frontex headquarters agreement
2015/04/17 Headquarters agreement - Frontex
2018/01/29 VAT scheme for the sale of building land
2016/07/19 Pig breeding in Europe: misuse by Germany of flat-rate VAT scheme
2015/07/07 Mobility of apprentices in the EU
2015/05/20 Seat Agreements for EU Agencies
2015/02/19 Blocking of EAFRD appropriations
2014/09/30 Impact of bilateral agreements on EU own resources
2015/02/24 Non-payment dispute
2015/01/21 Eurozone budget
2014/12/19 Contracts concerning multimedia actions at threat of not being respected under the current 2015 EU budget
2015/03/05 Unfair competition in the shallot industry
2015/02/06 The traditional shallot - unfair competition
2018/06/15 European Parliament departments' access to the ABAC financial data warehouse
2016/05/04 Alliances between distributors at national and European levels
2017/10/27 Criminal proceedings against the former head of ELSTAT
2017/10/26 Combating the proliferation of the box tree moth
2015/02/03 Prefinancing for the Youth Employment Initiative
2017/02/02 'Modernising' the EU-Turkey Customs Union
2016/12/09 Growth and job creation in Greece
2015/03/19 Application of the CAP and the compensatory allowance for permanent natural handicaps (ICHN)
2018/02/06 The future of EU milk powder stocks
2015/02/24 Difference between countries with regard to outstanding payments
2015/10/21 Allocation of 'Horizon 2020 - SME instrument' funds
2015/01/14 Task force on take-up of cohesion funds