Welcome to the RISIS Datasets Portal!
Access to the datasets and platforms is free of charge and is offered through a two-step process:

     i) Accreditation: researchers need to register and agree to the conditions of use (good use of data, authorship, acknowledgement of the RISIS project, agreement to post results and aggregated datasets produced on the RISIS website) by signing a charter of good use.

     ii) Selection: researchers need to propose a project (a 200-word proposal) based on the use of one or more datasets. Projects are reviewed both by the relevant dataset producers and by the RISIS project review board, which gives the final agreement for access.

All datasets can be accessed via the visit request option (see below), which involves an on-site visit by the applicant to the host organisation of the dataset, with costs covered by RISIS*. Some datasets also offer an online access option, where the dataset (or major parts of it) can be accessed online after registration and a short description of the research purpose.

* Costs for travel and on-site stay (when needed) will be covered by the RISIS project (see Rules for visits and re-imbursement).
Visit Request
40 plus datasets in the SMS Platform
The Semantically Mapping Science (SMS) platform provides access to more than 40 interlinked datasets relevant for science, technology and innovation studies, plus a set of tools to link, disambiguate, enrich and geo-locate the data. Datasets cover R&D intensive organizations (universities, companies, hospitals, NGOs, governmental organizations, etc.), research projects, funding programs, university rankings, patents, etc. Because the datasets are linked with each other, users can retrieve a dataset tailored to the research questions they want to answer. Linking datasets also increases coverage and the number of variables available for analysis. Some, but not all, of the RISIS datasets are also included in the SMS data store; examples are ETER, OrgReg, Leiden Ranking, MORE and, in part, EUPRO.
Visit Request
Cheetah Database (Database on Fast Growing Medium Sized Firms)
The Cheetah Database aims to provide a new data infrastructure that serves as a robust monitoring device for the economic and technological development of Fast Growing Medium Sized Firms, one of the main pillars of the industrial and technological system.
Visit Request
EUPRO – Database on European Framework Programmes
The EUPRO dataset comprises information on R&D projects and all participating organizations funded by the European Framework Programmes (FP).
Visit Request
JoREP 2.0 - Joint Research Programmes database
JoREP 2.0 is a database on joint R&D programmes. It provides a quantitative basis for monitoring investments in joint R&D programmes in the countries belonging to the European Research Area (ERA), pointing out the policy rationales behind them and their impact. The data aim to describe when, how and for what purposes European-level initiatives and bilateral/multilateral joint R&D programmes are combined. JoREP 2.0 also supports the analysis of important ERA dynamics and Europeanization processes through the study of the behaviour of the main national actors (i.e. funding agencies).
Visit Request
PROFILE - The German Doctoral Candidates and Doctorate Holders Study
ProFile is a longitudinal, multi-cohort panel study focusing on the situation of doctoral candidates and their postdoctoral professional careers. The sample consists of doctoral candidates at universities and funding organizations in Germany. The panel perspective covers development throughout the doctoral candidacy, the transition from graduation to subsequent job placement, and the overall trajectory from candidacy through job placement after graduation. Since the beginning of the study in 2009, seven cohorts of doctoral candidates from selected German universities, graduate schools and scholarship providers have been surveyed annually until they completed or dropped out of their doctoral training, and once more after the conferral of their doctorate. Altogether, more than 16,000 doctoral candidates participated at least once in a ProFile survey between 2009 and 2016. The questionnaire covers a number of topics, including supervision, financing of the doctorate, mobility, courses attended, personality traits, career intentions and job placement after conferral of the doctorate, among many more.
Access Request
Science and Innovation Policy Evaluation Repository (SIPER)
The Science and Innovation Policy Evaluation Repository (SIPER) is a database consisting of science and innovation policy evaluations from across the world. These evaluations have been characterised in terms of their methods, topics, timings and other properties. SIPER is run by a team based at the Manchester Institute of Innovation Research. All characterisations are done by evaluation experts and researchers trained by experienced evaluators following strict explicit guidelines and quality controls.
Access Request
Visit Request
The MORE Dataset
MORE (Mobility Survey of the Higher Education Sector) is arguably the most comprehensive empirical study of researcher mobility available. The study targets researchers working in different fields at higher education institutions in the EU and associated countries. It provides internationally comparable data on personal characteristics and education, current employment and working conditions, as well as a range of measures on international (and sector) mobility, including factors that influence mobility and effects that can be linked to researcher mobility. MORE also has a longitudinal dimension. It currently consists of a proto-study (MORE I in 2009) and a fully fledged study (MORE II in 2012), while a third iteration (MORE III) is being planned for 2018. The MORE II dataset (found here) includes 10,500 responses stratified by country and field of science. The MORE I study can also be provided upon request.
Access Request
Visit Request
The RISIS-ETER Dataset
The RISIS-ETER facility is a set of databases providing a register of European Higher Education Institutions (HEIs) and containing basic statistical information on them, including descriptors, geographical information, students and graduates, personnel, finances, and research activities. These databases, created by merging data from national statistical authorities, are the only comprehensive source of information available on European higher education and are thus of fundamental value for analytical purposes. The facility includes data for HEIs in the years 2008, 2011, 2012, 2013 and 2014, derived from the ETER official dataset (www.eter-project.com), from the EUMIDA database, which includes data for the year 2008, and from other RISIS facilities, including publication indicators from the Leiden Ranking, EU-FP participation and coordination from EUPRO, and patent data from IFRIS-PATSTAT (under preparation). Data can be downloaded from orgreg.joanneum.at for research purposes after registration.
Visit Request
The RISIS-FirmReg Dataset
FirmReg is a new facility developed within RISIS to provide an unambiguous identification of firms across the three RISIS firm datasets, i.e. CIB, which complements the IPTS scoreboard; VICO, on European start-up firms; and the dataset on European mid-size fast-growing firms. It will be available in spring 2018.
Access Request
Visit Request
The RISIS-OrgReg Dataset
OrgReg is a public facility that provides a comprehensive register of public-sector research and higher education organizations in European countries. It covers the following types of organizations: Higher Education Institutions (HEI); Public Research Organizations (PRO); Research Hospitals (RH); Public Administration Research Units (PA); Private Non-Profit Research Organizations (PNP). For each entity included in the register, OrgReg provides the following information: an unambiguous and stable (over time) identifier for the entity (ID); demographic information, including foundation and closure year and a tracking of demographic events (for example mergers between entities); the legal and the English name of the entity and its changes over time; a set of characteristics of the entity, which can change over time, including the website, country of establishment, and the entity type and level (group, organization, component); a set of geographical locations for the entity, including postcode, city name, geographical coordinates and NUTS 3-level codes. OrgReg is available remotely at orgreg.joanneum.at through the Cortext authentication service. It is intended to cover all European countries (EU-28 and countries associated with the European Framework Programme) and the time period from the year 2000 onwards. Users can download the list of entities in different formats (either a simple list or a panel list) for further use (for example for matching with other datasets).
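For users planning to match a downloaded entity list with their own data, the kind of record described above can be sketched roughly as follows. This is only an illustration under stated assumptions: the class names, field names and the `active_in` helper are hypothetical and do not reflect the actual OrgReg export format or column names, which should be checked against the downloaded files.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Entity types as listed in the OrgReg description above.
ENTITY_TYPES = {"HEI", "PRO", "RH", "PA", "PNP"}


@dataclass
class Location:
    """One geographical location of an entity (hypothetical field names)."""
    postcode: str
    city: str
    latitude: float
    longitude: float
    nuts3: str  # NUTS 3-level code


@dataclass
class OrgRegEntity:
    """Illustrative record; not the real OrgReg schema."""
    entity_id: str            # unambiguous, stable identifier
    legal_name: str
    english_name: str
    entity_type: str          # one of ENTITY_TYPES
    level: str                # "group", "organization" or "component"
    country: str
    website: Optional[str] = None
    foundation_year: Optional[int] = None
    closure_year: Optional[int] = None
    locations: List[Location] = field(default_factory=list)

    def __post_init__(self) -> None:
        if self.entity_type not in ENTITY_TYPES:
            raise ValueError(f"unknown entity type: {self.entity_type}")

    def active_in(self, year: int) -> bool:
        """True if the entity existed in the given year.

        A missing foundation or closure year is treated as 'no bound'.
        """
        started = self.foundation_year is None or self.foundation_year <= year
        not_closed = self.closure_year is None or self.closure_year >= year
        return started and not_closed
```

A helper such as `active_in` is one way to build a panel (entity by year) from a simple list before matching it with other datasets; the made-up values used to exercise it should be replaced by rows from an actual download.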
Visit Request
VICO Updated (VICO 2.0) database
The VICO Updated (VICO 2.0) database contains geographical, industry and accounting information on companies that were founded on or after 1/1/1988, have received at least one Venture Capital (VC) or angel investment on or after 1/1/1998, and operate in seven European countries (Belgium, Finland, France, Germany, Italy, Spain, and the United Kingdom) or Israel.
Access Request
Visit Request
The CIB Dataset
The CIB (Corporate Invention Board) dataset is a database characterising the patent portfolios of the largest industrial firms worldwide. The CIB combines information extracted from the Industrial R&D Investment Scoreboard (EU Commission), the ORBIS financial database and the PATSTAT-IFRIS patent database, an enriched version of the EPO PATSTAT database.
Access Request
Visit Request
The Leiden Ranking Dataset
The CWTS Leiden Ranking is a university ranking database focusing on the output and impact of research. The underlying data are collected and processed from the CWTS version of Web of Science (WoS). These underlying data are available locally at CWTS only.
Access Request
Visit Request
The NANO Dataset
The Nano S&T dynamics database (Nano) developed by IFRIS collects publications and patents about nano S&T between 1991 and 2011. One central characteristic of emerging S&T is that it does not correspond to pre-existing categorisations and requires the elaboration of semantic-based queries. IFRIS has developed a dynamic query gathering 1.18 million publications and 735,000 priority patents. Four types of enrichment have been carried out: (i) categorisation and harmonisation of institutional affiliations; (ii) geolocalisation of all authors and inventors; (iii) geographical clustering of S&T activities; and (iv) thematic clustering of S&T activities. It offers 14 ‘main units of observation’ for each publication and 11 for each patent.